Statistics

UPSC Statistics 2024 — Paper II

All 8 questions from UPSC Civil Services Mains Statistics 2024 Paper II (400 marks total). Every stem reproduced in full, with directive-word analysis, marks, word limits, and answer-approach pointers.

8Questions
400Total marks
2024Year
Paper IIPaper

Topics covered

Reliability, CUSUM charts, Sampling inspection, Linear programming, Monte Carlo (1)Control charts, Weibull distribution, Simplex method (1)Linear programming, Markov chains, reliability theory (1)Queueing theory, inventory management, statistical quality control (1)Indian Statistical System, AR(1) model, SLSE, Demography, Intelligence tests (1)OLS and GLS estimators, Index Number Tests, Population projection (1)Identification problem and econometric models (1)Item difficulty scaling and fertility measures (1)

A

Q1
50M Compulsory describe Reliability, CUSUM charts, Sampling inspection, Linear programming, Monte Carlo

(a) Consider a system consisting of three identical units connected in parallel. The unit reliability factor is 0·90. If the unit failures are independent of one another, and if the successful operation of the system depends on the satisfactory performance of any one unit, determine the system's reliability. 10 marks (b) Describe the procedure and some of the applications of Cumulative Sum (CUSUM) chart for monitoring process mean. 10 marks (c) Explain the following terms as used in sampling inspection plans : 5+5=10 marks (i) Producer's risk (ii) Average Outgoing Quality Limit (d) A Linear Programming Problem (LPP) in standard form is as given below : Optimize Z = CᵀX subject to AX = B with X ≥ 0 Write down the Dual Simplex form and its iterative procedure. 10 marks (e) What is Monte Carlo Simulation ? State the uses and applications of Monte Carlo Simulation. 10 marks

हिंदी में पढ़ें

(a) एक प्रणाली पर विचार कीजिए जिसमें तीन समान इकाइयाँ हों जो समानान्तर में जुड़ी हों। इकाई विश्वसनीयता कारक 0·90 है। यदि इकाई विफलताएँ एक दूसरे से स्वतंत्र हैं, और यदि प्रणाली का सफल संचालन किसी एक इकाई के संतोषजनक प्रदर्शन पर निर्भर करता है, तो प्रणाली की विश्वसनीयता ज्ञात कीजिए। 10 अंक (b) प्रक्रम माध्य की निगरानी के लिए संचयी योगफल (सी यू एस यू एम) चार्ट की कार्यविधि और उसके कुछ अनुप्रयोगों का वर्णन कीजिए। 10 अंक (c) प्रतिचयन निरीक्षण आयोजनाओं में उपयोग होने वाले निम्नलिखित पदों की व्याख्या कीजिए : 5+5=10 अंक (i) उत्पादक का जोखिम (ii) औसत निर्गमी गुणता सीमा (d) मानक रूप में एक रैखिक प्रोग्रामन समस्या (एल पी पी) नीचे दी गई है : इष्टतमीकरण (ऑप्टिमाइज़) कीजिए Z = CᵀX निम्न प्रतिबन्ध के अन्तर्गत AX = B साथ में X ≥ 0 द्वैत एकल रूप और इसकी पुनरावृत्त प्रक्रिया को लिखिए। 10 अंक (e) मोंटे कार्लो अनुकरण क्या है ? मोंटे कार्लो अनुकरण के उपयोगों तथा अनुप्रयोगों को बताइए। 10 अंक

Answer approach & key points

The directive 'describe' demands systematic exposition of procedures and concepts across all five sub-parts. Allocate approximately 15-18 minutes to part (a) requiring calculation, 12-15 minutes each to descriptive parts (b), (c), and (e), and 10-12 minutes to part (d) on dual simplex. Structure as: direct calculation for (a); stepwise procedural description for (b) with industrial applications; precise definitions with formulas for (c); algorithmic presentation for (d); and conceptual definition followed by domain-specific applications for (e).

  • Part (a): Correct application of parallel system reliability formula R_system = 1 - (1 - 0.90)³ = 1 - (0.10)³ = 0.999 or 99.9%
  • Part (b): CUSUM chart construction using V-mask or decision interval, cumulative sum calculation S_i = Σ(x_j - μ₀), and applications in pharmaceutical quality control or ISRO component manufacturing
  • Part (c)(i): Producer's risk (α) as probability of rejecting good lot (AQL quality), typically set at 5% with Type I error interpretation
  • Part (c)(ii): AOQL as maximum average outgoing quality after rectifying inspection, formula AOQ = p·P_a·(N-n)/N for sampling with replacement
  • Part (d): Dual simplex form with primal maximization converting to dual minimization, conditions for optimality (all c_j - z_j ≤ 0), and iterative steps for pivot selection when b_i < 0
  • Part (e): Monte Carlo as stochastic simulation using random number generation, with applications in nuclear shielding design, financial risk modeling, or Indian monsoon prediction
Q2
50M solve Control charts, Weibull distribution, Simplex method

(a) Obtain the control limits for X̄-chart and R-chart and describe the significance of joint study of these charts. 20 marks (b) Find the reliability and hazard functions of Weibull distribution with scale parameter θ and shape parameter β, and interpret the findings. 10 marks (c) Use simplex method to solve the following LPP : Maximize z = 5x₁ + 2x₂ subject to 6x₁ + x₂ ≥ 6 4x₁ + 3x₂ ≥ 12 x₁ + 2x₂ ≥ 4 x₁ ≥ 0, x₂ ≥ 0 20 marks

हिंदी में पढ़ें

(a) X̄-चार्ट तथा R-चार्ट के लिए नियंत्रण सीमाएँ प्राप्त कीजिए और इन चार्टों के साम्मिलित अध्ययन के महत्व का वर्णन कीजिए। 20 अंक (b) वेबुल बंटन, जिसका मापक्रम प्राचल θ और आकृति प्राचल β है, के लिए विश्वसनीयता एवं संकटप्रस्त (हेजर्ड) फलनों को प्राप्त कीजिए, तथा निष्कर्षों की व्याख्या कीजिए। 10 अंक (c) एकदा विधि का उपयोग करके निम्नलिखित रैखिक प्रोग्रामन समस्या (एल पी पी) को हल कीजिए : अधिकतमीकरण कीजिए z = 5x₁ + 2x₂ निम्न प्रतिबंधों के अंतर्गत 6x₁ + x₂ ≥ 6 4x₁ + 3x₂ ≥ 12 x₁ + 2x₂ ≥ 4 x₁ ≥ 0, x₂ ≥ 0 20 अंक

Answer approach & key points

Begin by deriving control limits for X̄-chart and R-chart using standard formulae with A₂, D₃, D₄ constants, explaining why joint monitoring prevents Type I/II errors in SPC (40% time). For Weibull, derive R(t) = exp[-(t/θ)^β] and h(t) = (β/θ)(t/θ)^(β-1), interpreting bathtub curve relevance for Indian manufacturing/equipment reliability (20% time). For the LPP, convert ≥ constraints to standard form using surplus variables and artificial variables (Big-M method), then execute simplex iterations to reach optimality (40% time).

  • Part (a): Correct formulae for X̄-chart limits (UCL/LCL = X̄̄ ± A₂R̄) and R-chart limits (UCL = D₄R̄, LCL = D₃R̄) with proper identification of constants
  • Part (a): Explanation that X̄-chart monitors process mean while R-chart monitors process variability; joint study detects both shifts and dispersion changes, preventing misinterpretation from exclusive use of either chart
  • Part (b): Derivation of reliability function R(t) = exp[-(t/θ)^β] and hazard function h(t) = (β/θ)(t/θ)^(β-1) from Weibull PDF
  • Part (b): Interpretation of β < 1 (decreasing hazard, infant mortality), β = 1 (constant hazard, exponential), β > 1 (increasing hazard, wear-out) with industrial examples
  • Part (c): Conversion of ≥ constraints to standard form: subtract surplus variables s₁, s₂, s₃ and add artificial variables A₁, A₂, A₃ with Big-M penalty in objective
  • Part (c): Complete simplex tableau iterations showing entering/leaving variables, pivot operations, and final optimal solution with Z_max = 12 at (x₁=3, x₂=0) or verified corner point
  • Part (c): Verification of solution by checking constraint satisfaction and comparing objective values at all extreme points
Q3
50M explain Linear programming, Markov chains, reliability theory

(a) With respect to a given Linear Programming Problem (LPP), explain the following concepts : 15 marks (i) Extreme Point Solutions (ii) Duality Theorem (iii) Complementary Slackness Principle (b) Define a Transition Probability Matrix (TPM). When is it said to be Regular and Ergodic ? Check whether the following TPM is Regular or Ergodic. Hence or otherwise obtain the $\lim\limits_{n \to \infty} P^n$, where $P = \begin{pmatrix} 0.88 & 0.12 \\ 0.15 & 0.85 \end{pmatrix}$. 15 marks (c) The reliability function R(t) of a cutting assembly is given by : $$R(t) = \begin{cases} \left(1-\dfrac{t}{t_0}\right)^2, & 0 \leq t \leq t_0 \\ \quad 0 & , \quad t \geq t_0 \end{cases}$$ (i) Determine the failure rate. (ii) Does the failure rate increase or decrease with time ? (iii) Determine the mean time to failure. 8+4+8=20 marks

हिंदी में पढ़ें

(a) एक दी गई रैखिक प्रोग्रामन समस्या (एल पी पी) के संदर्भ में, निम्नलिखित संकल्पनाओं की व्याख्या कीजिए : 15 अंक (i) चरम बिन्दु समाधान (ii) द्वैत प्रमेय (iii) पूरक शिथिलता सिद्धांत (b) संक्रमण प्रायिकता मैट्रिक्स (टी पी एम) को परिभाषित कीजिए। इसे कब नियमित और अभ्यतिग्राय (एर्गोडिक) कहते हैं ? परीक्षण कीजिए कि क्या निम्नलिखित संक्रमण प्रायिकता मैट्रिक्स (टी पी एम) नियमित है या अभ्यतिग्राय (एर्गोडिक) है। इस प्रकार या अन्य प्रकार से $\lim\limits_{n \to \infty} P^n$ को प्राप्त कीजिए, जहाँ $P = \begin{pmatrix} 0.88 & 0.12 \\ 0.15 & 0.85 \end{pmatrix}$ है। 15 अंक (c) एक कतन समुच्चय (कटिंग असेंबली) का विश्वसनीयता फलन R(t) दिया गया है : $$R(t) = \begin{cases} \left(1-\dfrac{t}{t_0}\right)^2, & 0 \leq t \leq t_0 \\ \quad 0 & , \quad t \geq t_0 \end{cases}$$ (i) विफलता दर निर्धारित कीजिए। (ii) क्या विफलता दर समय के साथ बढ़ती या घटती है ? (iii) विफलता का औसत समय निर्धारित कीजिए। 8+4+8=20 अंक

Answer approach & key points

Begin with clear definitions for (a)(i)-(iii) on LPP concepts, allocating ~30% time; for (b) define TPM, establish regularity/ergodicity criteria, then compute steady-state probabilities using eigenvalue or algebraic methods (~35% time); for (c) derive failure rate from R(t), analyze its monotonicity, and integrate for MTTF (~35% time). Structure: definitions → theorems → computational steps → physical interpretation.

  • (a)(i) Extreme Point Solutions: Definition as feasible region vertices, convex combination property, and Fundamental Theorem of LPP optimality at extreme points
  • (a)(ii) Duality Theorem: Statement of weak and strong duality, primal-dual relationship, and economic interpretation of shadow prices
  • (a)(iii) Complementary Slackness: Conditions relating primal slack and dual variables, optimality verification tool
  • (b) TPM definition with row-stochastic property; regularity (some P^n has all positive entries) vs ergodicity (irreducible + aperiodic); classification of given P as regular and ergodic; computation of limiting distribution π = (5/9, 4/9)
  • (c)(i) Failure rate λ(t) = 2/(t₀-t) for 0 ≤ t < t₀ using λ(t) = -R'(t)/R(t)
  • (c)(ii) Increasing failure rate (IFR) demonstration as λ'(t) > 0, indicating wear-out phase
  • (c)(iii) MTTF = t₀/3 via integration ∫₀^t₀ R(t)dt, with proper handling of improper integral at t₀
Q4
50M solve Queueing theory, inventory management, statistical quality control

(a) A manually handled toll-booth has two tellers, who are each capable of handling an average of 60 vehicles per hour, with the actual service times exponentially distributed. Vehicles arrive at the booth according to a Poisson process, at an average rate of 100 per hour. Determine the following : 15 marks (i) The probability that there are more than three vehicles in the booth at the same time (ii) The probability that a given teller is idle (iii) The probability that a vehicle spends more than 3 minutes in the booth (b) PQR Electronics produces 300 transistors per day, which go into the inventory. It supplies 150 transistors per day to XYZ Radios. The annual demand is 37,500 units. The inventory holding cost is $ 0·25 per transistor per year and the setup cost per production run is $ 200. Find the following : 15 marks (i) Economic Order Quantity (EOQ) (ii) Production run length (iii) Number of production runs per year (iv) Maximum Inventory Level (c) (i) Explain the terms 'chance causes' and 'assignable causes' of variation in quality control. Also provide some principal advantages of statistical quality control. 10 marks (ii) Describe the procedure of obtaining OC curve for single sampling plan. 10 marks

हिंदी में पढ़ें

(a) एक हस्तेन संभालने वाले टोल बूथ के पास दो टेलर हैं, जो प्रत्येक औसतन 60 वाहनों को प्रति घंटे की दर से संभालने के लिए सक्षम हैं, इनकी वास्तविक सेवा का समय चरघातांकीय बंटित है। वाहन बूथ पर प्वासों प्रक्रम के अनुसार औसतन 100 प्रति घंटे की दर से पहुँचते हैं। निम्नलिखित को निर्धारित कीजिए (ज्ञात कीजिए) : 15 अंक (i) एक ही समय पर बूथ में तीन से अधिक वाहनों के होने की प्रायिकता (ii) एक दिया गया टेलर खाली है, इसकी प्रायिकता (iii) एक वाहन बूथ में 3 मिनट से अधिक बिताता है, इसकी प्रायिकता (b) PQR इलेक्ट्रॉनिक्स प्रतिदिन 300 ट्रांजिस्टरों का उत्पादन करता है, जो तालिका (इन्वेंटरी) में चले जाते हैं। यह 150 ट्रांजिस्टर प्रतिदिन XYZ रेडियोज को आपूर्ति करता है। वार्षिक मांग 37,500 यूनिट्स है। तालिका धारण लागत $ 0·25 प्रति ट्रांजिस्टर प्रति वर्ष है और व्यवस्था लागत प्रति प्रोडक्शन रन $ 200 है। निम्नलिखित को ज्ञात कीजिए : 15 अंक (i) आर्थिक आदेश मात्रा (ई ओ क्यू) (ii) प्रोडक्शन रन की लंबाई (iii) प्रति वर्ष प्रोडक्शन रनों की संख्या (iv) अधिकतम तालिका (इन्वेंटरी) स्तर (c) (i) गुणवत्ता नियंत्रण में विचरण के पदों 'संयोग हेतु' और 'निर्देश्य कारणों' की व्याख्या कीजिए। सांख्यिकीय गुणवत्ता नियंत्रण के कुछ मुख्य लाभों को भी बताइए। 10 अंक (ii) एकल प्रतिचयन आयोजना के लिए संकारक अभिलक्षण (OC) वक्र को प्राप्त करने की प्रक्रिया का वर्णन कीजिए। 10 अंक

Answer approach & key points

Solve this multi-part numerical and theoretical question by allocating approximately 35% time to part (a) on M/M/2 queueing, 30% to part (b) on production inventory EOQ model, and 35% to part (c) on SQC theory. Begin with clear identification of model parameters for each part, show all formulas with standard notation, perform step-by-step calculations with proper unit conversions, and conclude with precise numerical answers and brief interpretations where asked.

  • For (a): Correct identification of M/M/2 queue parameters (λ=100/hr, μ=60/hr, c=2), calculation of traffic intensity ρ=λ/(cμ)=0.833, and use of multi-server queue formulas for P₀, Pₙ, and waiting time distribution
  • For (a)(i): Computation of P(n>3) = 1 - P₀ - P₁ - P₂ - P₃ using the formula Pₙ = (1/n!)(λ/μ)ⁿP₀ for n≤c and Pₙ = (1/c!cⁿ⁻ᶜ)(λ/μ)ⁿP₀ for n>c
  • For (a)(ii): Probability a given teller is idle = P₀ + ½P₁ (or equivalently 1 - ρ), recognizing that idle probability per server differs from system idle probability
  • For (b): Identification of production model parameters (p=300/day, d=150/day, D=37,500/yr, C₁=$0.25/yr, C₃=$200), and application of EPQ formulas rather than simple EOQ
  • For (b)(i)-(iv): Correct formulas for EOQ/EPQ = √[2DC₃p/(C₁(p-d))], production run length = Q/p, number of runs = D/Q, and maximum inventory = Q(p-d)/p with proper unit consistency
  • For (c)(i): Clear distinction between chance causes (random, inherent, unavoidable) and assignable causes (special, identifiable, removable), with 4-5 specific advantages of SQC such as early detection, reduced inspection costs, and customer satisfaction
  • For (c)(ii): Systematic description of OC curve construction: define p (lot fraction defective), calculate Pₐ using binomial/Poisson approximation, plot Pₐ vs p, and identify key points (AQL, LTPD, α, β)

B

Q5
50M Compulsory discuss Indian Statistical System, AR(1) model, SLSE, Demography, Intelligence tests

(a) Discuss the Indian Statistical System. State some important organisations and explain the main working of the National Statistical Organisation (NSO). 10 marks (b) Consider AR(1) model with non-zero mean 74·3293 and φ = 0·5705. If the last observed value is 67, then obtain the forecasting 1 time unit into the future yields. What is the forecasted value of 5 time units into the future ? 10 marks (c) Discuss the concept of structure and model for Simultaneous Linear Statistical Equations (SLSE) model. The application of least squares method for estimating the parameters in SLSE model is inappropriate. Explain. 10 marks (d) (i) Determine the average age at death of those who die between ages x and x + n. (ii) If l(x) = 100√(100 – x) find μ(84) exactly using appropriate method. 10 marks (e) What are intelligence tests and how are they used in measuring intelligence ? Define the terms mental age and IQ in this connection. 10 marks

हिंदी में पढ़ें

(a) भारतीय सांख्यिकीय पद्धति पर चर्चा कीजिए । कुछ महत्वपूर्ण संस्थाएँ बताइए तथा राष्ट्रीय सांख्यिकीय संगठन (एन एस ओ) के प्रमुख प्रकार्यों को समझाइए । 10 (b) एक AR(1) मॉडल पर विचार कीजिए जिसका शून्येतर माध्य 74·3293 और φ = 0·5705 है । यदि अंतिम प्रेक्षित मान 67 है, तो भविष्य उपज में 1 समय इकाई पूर्वानुमान प्राप्त कीजिए । भविष्य में 5 समय इकाइयों का पूर्वानुमानित मान क्या है ? 10 (c) युगपत रैखिक सांख्यिकीय समीकरण (एस एल एस ई) मॉडल के लिए संरचना तथा मॉडल की संकल्पना की चर्चा कीजिए । एस एल एस ई मॉडल में प्राचलों का आकलन करने के लिए न्यूनतम वर्ग विधि का अनुप्रयोग अनुचित है । समझाइए । 10 (d) (i) जिनकी मृत्यु, आयु x और x + n के बीच होती है, उनकी मृत्यु के समय औसत आयु ज्ञात कीजिए । (ii) यदि l(x) = 100√(100 – x) है, तो उपयुक्त विधि का उपयोग करके μ(84) का यथार्थ मान ज्ञात कीजिए । 10 (e) बुद्धि परीक्षण क्या हैं और बुद्धि को मापने में ये कैसे उपयोग किए जाते हैं ? इस संबंध में मानसिक आयु तथा बौद्धिक स्तर (आई क्यू) पदों को परिभाषित कीजिए । 10

Answer approach & key points

This multi-part question requires balanced coverage across five thematic areas. Allocate approximately 20% time each to parts (a), (c), and (e) which demand descriptive-discursive treatment, and 20% combined to parts (b) and (d) which require precise calculations. Begin with a brief roadmap indicating coverage of all sub-parts, then proceed sequentially: Indian statistical infrastructure → AR(1) forecasting with proper formula application → SLSE theoretical exposition → life table computations with force of mortality derivation → psychometric concepts with Binet-Simon/Wechsler references. Conclude each calculation part with interpreted results in context.

  • (a) Indian Statistical System: Evolution from PC Mahalanobis era; distinction between Central and State statistical machinery; NSO's role under Ministry of Statistics and Programme Implementation (MoSPI); key organizations—CSO, NSSO, Registrar General of India, NITI Aayog; NSO's functioning through data collection, coordination, and dissemination under National Statistical Commission oversight
  • (b) AR(1) forecasting: Correct model specification X_t = μ + φ(X_{t-1} - μ) + ε_t; one-step ahead forecast = μ + φ(x_n - μ) = 74.3293 + 0.5705(67 - 74.3293); five-step ahead forecast converges to mean μ as φ^5 → 0; explicit numerical computation
  • (c) SLSE model: Structural form vs reduced form distinction; endogeneity problem causing correlation between regressors and error terms; simultaneous equation bias; why OLS is inconsistent (covariance between Y and u non-zero); need for IV/2SLS methods
  • (d)(i) Average age at death: n_a_x = (∫_0^n (x+t)μ(x+t)l(x+t)dt)/(l(x)-l(x+n)) or equivalent life table expression; (d)(ii) Force of mortality μ(x) = -d[ln l(x)]/dx; exact derivation for l(x)=100√(100-x) yielding μ(84)=1/32
  • (e) Intelligence tests: Definition as standardized measures of cognitive ability; types—verbal, performance, group vs individual; uses in education, clinical diagnosis, occupational selection; mental age (MA) as performance level relative to age norms; IQ formulas—Stern's ratio IQ (MA/CA×100) and deviation IQ; reference to Indian adaptations
Q6
50M derive OLS and GLS estimators, Index Number Tests, Population projection

(a) For the linear model Y = Xβ + u, obtain the expressions for Ordinary Least Squares (OLS) and Generalised Least Squares (GLS) estimators of the parameters. Discuss their properties and compare them. 20 marks (b) Explain Time Reversal Test, Factor Reversal Test and Circular Test in the Index Number Theory. Using the following data, verify whether the Laspeyres' formula satisfies Time Reversal Test. 15 marks (c) Distinguish between population estimates and population projections. Briefly describe the component method of population projection. 15 marks

हिंदी में पढ़ें

(a) रैखिक मॉडल Y = Xβ + u के लिए, प्राचलों के साधारण न्यूनतम वर्ग (OLS) तथा व्यापकीकृत न्यूनतम वर्ग (GLS) आकलकों के व्यंजक प्राप्त कीजिए। उनके गुणों की विवेचना कीजिए और उनकी तुलना कीजिए। 20 (b) सूचकांक सिद्धांत में समय उत्क्रमता परीक्षण, उपादान (तत्व) उत्क्रमता परीक्षण तथा चक्रीय परीक्षण की व्याख्या कीजिए। निम्नलिखित आँकड़ों का उपयोग करके जाँच कीजिए कि क्या लासपेयर्स का सूत्र समय उत्क्रमता परीक्षण को संतुष्ट करता है। 15 (c) जनसंख्या आकलन और जनसंख्या पूर्वानुमान के बीच अंतर स्पष्ट कीजिए। जनसंख्या पूर्वानुमान के लिए घटक विधि का संक्षेप में वर्णन कीजिए। 15

Answer approach & key points

Begin by deriving OLS and GLS estimators using matrix notation for part (a), allocating approximately 40% of time/space given its 20 marks. For part (b), explain the three index number tests conceptually before applying the Time Reversal Test to Laspeyres' formula with the given data (~30%). For part (c), clearly distinguish estimates from projections and outline the component method with fertility, mortality, and migration components (~30%). Conclude with a brief synthesis of how these statistical tools inform policy-making in India.

  • For (a): Derivation of β̂_OLS = (X'X)^(-1)X'Y and β̂_GLS = (X'Ω^(-1)X)^(-1)X'Ω^(-1)Y with proper assumptions
  • For (a): Comparison of BLUE properties under homoscedasticity vs. heteroscedasticity/autocorrelation; efficiency of GLS over OLS when Ω ≠ σ²I
  • For (b): Clear statement of Time Reversal (P01 × P10 = 1), Factor Reversal (P01 × Q01 = Σp1q1/Σp0q0), and Circular (P01 × P12 × P20 = 1) tests
  • For (b): Numerical verification showing Laspeyres' index fails Time Reversal Test with explicit calculation
  • For (c): Distinction between estimates (current/retrospective) and projections (future-oriented, assumption-dependent)
  • For (c): Component method: projection of births by age-specific fertility, deaths by life tables, migration by net migration rates
Q7
50M explain Identification problem and econometric models

(a) Explain the problem of identification with a suitable example. Also discuss the conditions of identification. Check the identifiability of each equation of the following structural model : y₁ = 3y₂ – 2x₁ + x₂ + u₁ y₂ = y₃ + x₂ + u₂ y₃ = y₁ – y₂ – 2x₃ + u₃ 15 (b) Explain why mortality situations at two places cannot be compared on the basis of crude death rates. Describe the construction of standardised death rates for this purpose. What is a comparative mortality index and how is it used ? 20 (c) (i) Define reliability of a test. What is the effect of test length on the reliability of a test ? 5 (ii) Give different methods for estimating the reliability of a psychological test. 10

हिंदी में पढ़ें

(a) एक उपयुक्त उदाहरण के साथ अभिनिर्धारण की समस्या को समझाइए। अभिनिर्धारण की शर्तों की भी चर्चा कीजिए । निम्नलिखित संरचनात्मक मॉडल के प्रत्येक समीकरण की अभिज्ञेयता (अभिनिर्धारणीयता) की जाँच कीजिए : y₁ = 3y₂ – 2x₁ + x₂ + u₁ y₂ = y₃ + x₂ + u₂ y₃ = y₁ – y₂ – 2x₃ + u₃ 15 (b) स्पष्ट कीजिए कि दो स्थानों पर मृत्यु दर की स्थिति की तुलना अशोधित मृत्यु दरों के आधार पर क्यों नहीं की जा सकती । इस उद्देश्य के लिए, मानकीकृत मृत्यु दरों के निर्माण का वर्णन कीजिए । तुलनात्मक मृत्यु दर सूचकांक क्या है और इसका उपयोग कैसे किया जाता है ? 20 (c) (i) एक परीक्षण की विश्वसनीयता को परिभाषित कीजिए । एक परीक्षण की विश्वसनीयता पर परीक्षण की लंबाई का क्या प्रभाव होता है ? 5 (ii) एक मनोवैज्ञानिक परीक्षण की विश्वसनीयता के आकलन के लिए विभिन्न विधियों को बताइए । 10

Answer approach & key points

Explain the identification problem with a concrete supply-demand example, then apply order and rank conditions to the given 3-equation system. For (b), explain why CDR fails using Indian state examples (e.g., Kerala vs. Uttar Pradesh age structures), then detail direct and indirect standardization methods. For (c), define reliability via Spearman-Brown prophecy, discuss test length effects, and enumerate split-half, KR-20, KR-21, and Cronbach's alpha methods. Allocate approximately 30% time to (a), 40% to (b), 20% to (c)(i), and 10% to (c)(ii) based on marks distribution.

  • (a) Clear explanation of identification problem using supply-demand simultaneous equations example; distinction between under-identified, just-identified, and over-identified equations
  • (a) Correct application of order condition (K - k ≥ m - 1) and rank condition to all three equations; accurate classification of each equation's identifiability status
  • (b) Explanation of CDR limitations due to varying age-sex compositions; use of Indian demographic examples (e.g., aging Kerala vs. younger Bihar populations)
  • (b) Construction of standardized death rates: direct method (standard population weights) and indirect method (standard mortality rates); formula for Comparative Mortality Index (CMI) and its interpretation
  • (c)(i) Formal definition of reliability as ratio of true score variance to observed score variance; statement and explanation of Spearman-Brown prophecy formula for test length effects
  • (c)(ii) Comprehensive coverage of reliability estimation methods: test-retest, parallel forms, split-half (including Spearman-Brown correction), Kuder-Richardson formulas (KR-20, KR-21), and Cronbach's coefficient alpha with appropriate formulas
Q8
50M solve Item difficulty scaling and fertility measures

(a) Four items are to be constructed so that they are equispaced on the difficulty scale. If the easiest item is passed by 85% of the group and the most difficult by 25%, find the percentage of individuals in the group passing the other two items. (Standard Normal tables are provided) 15 (b) Define Crude Birth Rate, General Fertility Rate and Age-specific Fertility Rate and indicate why each is considered an improvement on the preceding measure of fertility. Define Total Fertility Rate and its utility. 15 (c) (i) Discuss the problem of autocorrelation. What are the consequences of OLS estimators in estimating the parameters in the presence of autocorrelation ? (ii) Explain the Durbin-Watson test for testing the autocorrelation. 10+10=20

हिंदी में पढ़ें

(a) चार मदों का निर्माण किया जाना है ताकि वे कठिनाई पैमाने पर समान दूरी पर हों । यदि सबसे सरल मद समूह के 85% द्वारा पास किया जाता है तथा सबसे कठिन 25% द्वारा, तो समूह में अन्य दो मदों को पास करने वाले व्यक्तियों का प्रतिशत ज्ञात कीजिए । (मानक प्रसामान्य सारणी दी गई है) 15 (b) अशोधित जन्म दर, सामान्य प्रजनन दर और वय-विशिष्ट प्रजनन दर को परिभाषित कीजिए तथा बताइए कि प्रत्येक को प्रजनन के पूर्ववर्ती माप पर सुधार क्यों माना जाता है। संपूर्ण प्रजनन दर को परिभाषित कीजिए तथा इसकी उपयोगिता बताइए। 15 (c) (i) स्वसहसंबंध की समस्या की चर्चा कीजिए। स्वसहसंबंध की उपस्थिति में प्राचलों के आकलन में ओ एल एस आकलकों के क्या परिणाम होते हैं ? (ii) स्वसहसंबंध के परीक्षण के लिए डर्बिन-वॉटसन परीक्षण की व्याख्या कीजिए। 10+10=20

Answer approach & key points

Begin with part (a) by converting percentages to z-scores using standard normal tables, then apply linear interpolation for equispaced difficulty; allocate ~30% time here. For part (b), define each fertility measure sequentially showing progressive refinement from crude to age-specific rates, then explain TFR's utility for population projection—spend ~30% time. For part (c), discuss autocorrelation consequences on OLS properties (BLUE violation), then detail Durbin-Watson test procedure with critical values—allocate ~40% time as this carries highest marks. Conclude with integrated insights on statistical applications in demographic and econometric analysis.

  • Part (a): Convert 85% and 25% to z-scores (-1.036 and 0.674), establish equidistant points on difficulty scale, calculate intermediate z-values, convert back to percentages (~58% and ~42%)
  • Part (b): CBR definition and limitation (ignores age-sex structure); GFR improvement (restricts to women 15-49); ASFR refinement (age-specific exposure); TFR as sum of ASFRs and utility for replacement-level fertility analysis (India's TFR ~2.0)
  • Part (c)(i): Autocorrelation causes (inertia, specification error, cobweb, data manipulation); consequences: OLS estimators remain unbiased but inefficient, standard errors biased, t/F tests invalid, R² misleading
  • Part (c)(ii): DW test assumptions (no lagged dependent variable, intercept, non-stochastic regressors); test statistic formula; decision zones (0 to 4 scale); inconclusive region problem; alternative tests (Durbin's h, Breusch-Godfrey)

Practice Statistics 2024 Paper II answer writing

Pick any question above, write your answer, and get a detailed AI evaluation against UPSC's standard rubric.

Start free evaluation →