Statistics

UPSC Statistics 2021 — Paper II

All 8 questions from UPSC Civil Services Mains Statistics 2021 Paper II (400 marks total). Every stem reproduced in full, with directive-word analysis, marks, word limits, and answer-approach pointers.

8Questions

400Total marks

2021Year

Paper IIPaper

Topics covered

Quality control, sampling, game theory, Markov chains, reliability (1)Reliability, control charts, acceptance sampling (1)Queuing systems and linear programming (1)Linear programming and inventory control (1)Econometrics and vital statistics (1)Time series analysis and index numbers (1)Demography, psychometrics, and life table analysis (1)Econometrics and population projection (1)

A

50M Compulsory solve Quality control, sampling, game theory, Markov chains, reliability

Explain Single Sampling Plan with the help of an example. Also, write the importance of an Operating Characteristics Curve in a sampling plan. (10 marks) Solve the above assignment problem. Depot I II III IV V A 160 130 175 190 200 Town B 135 120 130 160 175 C 140 110 145 170 185 D 50 50 80 80 110 E 55 35 80 80 105 (10 marks) Use algebraic method to solve the above game. Player B B₁ B₂ B₃ B₄ A₁ 0·25 0·20 0·14 0·30 Player A A₂ 0·27 0·16 0·12 0·14 A₃ 0·35 0·08 0·15 0·19 A₄ −0·02 0·08 0·13 0·00 (10 marks) Consider the Markov Chain with transition probability matrix: 0 1 2 0 (0 1 0) 1 (½ 0 ½) 2 (0 1 0) Show that the states are periodic and persistent non-null. (10 marks) State the importance of the hazard function. If the hazard rate of a component is given by: h(t) = { 0.015, t ≤ 200 { 0.025, t > 200 then find an expression for the reliability function of the component. (10 marks)

हिंदी में पढ़ें

एक उदाहरण की सहायता से, एकल प्रतिचयन आयोजना की व्याख्या कीजिए। एक प्रतिचयन आयोजना में संकारक अभिलक्षण वक्र के महत्व को भी लिखिए। (10 अंक) निम्नलिखित नियतन समस्या को हल कीजिए : डिपो I II III IV V A 160 130 175 190 200 शहर B 135 120 130 160 175 C 140 110 145 170 185 D 50 50 80 80 110 E 55 35 80 80 105 (10 अंक) बीजीय विधि का उपयोग करके निम्नलिखित खेल को हल कीजिए : खिलाड़ी B B₁ B₂ B₃ B₄ A₁ 0·25 0·20 0·14 0·30 खिलाड़ी A A₂ 0·27 0·16 0·12 0·14 A₃ 0·35 0·08 0·15 0·19 A₄ −0·02 0·08 0·13 0·00 (10 अंक) संक्रमण प्रायिकता आव्यूह 0 1 2 0 (0 1 0) 1 (½ 0 ½) 2 (0 1 0) के साथ एक मार्कोव श्रृंखला पर विचार कीजिए। दर्शाइए कि अवस्थाएँ आवर्ती और सततावृत अनिराकरणीय हैं। (10 अंक) संकटप्रस्तता फलन के महत्व को बताइए। यदि किसी घटक की संकटप्रस्तता दर इस प्रकार दी गई है : h(t) = { 0.015, t ≤ 200 { 0.025, t > 200 तो घटक के विश्वसनीयता फलन के एक व्यंजक को प्राप्त कीजिए। (10 अंक)

Answer approach & key points

The directive 'solve' requires demonstrating complete analytical solutions for all five numerical problems. Structure your answer by addressing each sub-question sequentially: (1) Single Sampling Plan with OC curve illustration, (2) Assignment problem using Hungarian method, (3) Game theory problem via algebraic method for mixed strategies, (4) Markov chain periodicity and persistence proof, and (5) Reliability function derivation from piecewise hazard rate. Each solution must show method, calculation, and final interpretation.

Single Sampling Plan: Define n (sample size) and c (acceptance number) with concrete example; sketch OC curve showing P(A) vs p with points at p=0, p=AQL, p=LTPD, p=1
Assignment Problem: Apply Hungarian method—row reduction, column reduction, minimum lines to cover zeros, optimality check, and final assignment with minimum cost
Game Theory: Verify no saddle point exists, formulate as LPP or use algebraic method for 2×2 subgames, find mixed strategy probabilities and game value
Markov Chain: Compute P² to show period d=2, verify irreducibility, calculate stationary distribution to confirm persistent non-null states
Reliability: Derive R(t) = exp(-∫h(u)du) giving R(t)=exp(-0.015t) for t≤200 and R(t)=exp(-3)exp(-0.025(t-200)) for t>200 with continuity at t=200

Open full rubric & evaluate →

50M solve Reliability, control charts, acceptance sampling

A manufacturer finds that on the average, a television set is used 1.8 hours per day. A one year warranty is offered on the picture tube having a mean time to failure (MTTF) of 2000 hours. If the distribution of time to failure is exponential, then determine the percentage of tubes failing during the warranty period. (15 marks) The number of defects on 20 items were recorded as given above: | Item No. | No. of defects | Item No. | No. of defects | |----------|----------------|----------|----------------| | 1 | 2 | 11 | 6 | | 2 | 0 | 12 | 0 | | 3 | 4 | 13 | 2 | | 4 | 1 | 14 | 1 | | 5 | 0 | 15 | 0 | | 6 | 8 | 16 | 3 | | 7 | 0 | 17 | 2 | | 8 | 1 | 18 | 1 | | 9 | 2 | 19 | 0 | | 10 | 0 | 20 | 2 | Use a suitable control chart to identify whether the process is in control or not? (15 marks) Explain the concepts of producer's and consumer's risks. It has been decided to sample 100 items at random from each large batch. We reject the batch if more than 2 defectives are found. If the acceptable quality level is 1% and the unacceptable quality level is 5%, then find the producer's and consumer's risks. (20 marks)

हिंदी में पढ़ें

एक निर्माता को यह पता चलता है कि औसतन एक टेलीविजन सेट का उपयोग प्रतिदिन 1.8 घंटे होता है। पिक्चर ट्यूब पर, जिसका विफलता तक माध्य काल (एम टी टी एफ) 2000 घंटे है, एक वर्ष की वारंटी की पेशकश की जाती है। यदि विफलता तक के काल का बंटन चरघातांकी है, तो वारंटी अवधि के दौरान विफल हुई ट्यूबों का प्रतिशत ज्ञात कीजिए। (15 अंक) 20 मदों पर दोषों की संख्या दर्ज की गई, जो नीचे दी गई है : | मद संख्या | दोषों की संख्या | मद संख्या | दोषों की संख्या | |-----------|----------------|-----------|----------------| | 1 | 2 | 11 | 6 | | 2 | 0 | 12 | 0 | | 3 | 4 | 13 | 2 | | 4 | 1 | 14 | 1 | | 5 | 0 | 15 | 0 | | 6 | 8 | 16 | 3 | | 7 | 0 | 17 | 2 | | 8 | 1 | 18 | 1 | | 9 | 2 | 19 | 0 | | 10 | 0 | 20 | 2 | एक उपयुक्त नियंत्रण सांचित्र का उपयोग कीजिए और यह पहचानिये कि क्या प्रक्रम नियंत्रण में है या नहीं ? (15 अंक) उत्पादक और उपभोक्ता के जोखिमों की संकल्पनाओं को समझाइए। प्रत्येक बड़े बैच से 100 मदों का एक यादृच्छिक प्रतिदर्श निकालने का निर्णय लिया गया है। हम बैच को अस्वीकार करते हैं यदि 2 से अधिक दोषपूर्ण पाये जाते हैं। यदि स्वीकार्य गुणता स्तर 1% है और अस्वीकार्य गुणता स्तर 5% है तो उत्पादक और उपभोक्ता के जोखिमों को प्राप्त कीजिए। (20 अंक)

Answer approach & key points

Solve this three-part numerical problem by first calculating the warranty failure probability using exponential distribution properties, then constructing and interpreting a c-chart for defect data with proper control limits, and finally computing producer's and consumer's risks using binomial distribution for the given sampling plan. Present each part sequentially with clear headings, showing all formulas, substitutions, and final interpretations.

Part 1: Calculate total warranty period as 1.8 × 365 = 657 hours and use P(T ≤ 657) = 1 - e^(-657/2000) for exponential failure probability
Part 2: Compute c-bar = 33/20 = 1.65, then UCL = 1.65 + 3√1.65 ≈ 5.50 and LCL = 0, identifying Item 6 (8 defects) as out of control
Part 3: Define producer's risk α = P(reject | p=0.01) and consumer's risk β = P(accept | p=0.05) using binomial or Poisson approximation
Correct application of Poisson approximation with λ₁ = 1 for AQL and λ₂ = 5 for LTPD to find α = 1 - P(X≤2; λ=1) and β = P(X≤2; λ=5)
Numerical values: α ≈ 0.080 or 8% and β ≈ 0.125 or 12.5% (or precise binomial equivalents)

Open full rubric & evaluate →

50M solve Queuing systems and linear programming

(a) Explain M|G|1 queuing system. Obtain Pollaczek-kinchine formula. (15 marks) (b) Use MODI method to solve the above transportation problem: Store I II III IV A 4 6 8 13 B 13 11 10 8 C 14 4 10 13 D 9 11 13 8 Supply 50 70 30 50 Demand 25 35 105 20 (15 marks) (c) Use two-phase method to solve: Maximize z = 2x₁ + x₂ + x₃ subject to the constraints 4x₁ + 6x₂ + 3x₃ ≤ 8 3x₁ - 6x₂ - 4x₃ ≤ 1 2x₁ + 3x₂ - 5x₃ ≥ 4 and x₁, x₂, x₃ ≥ 0. (20 marks)

हिंदी में पढ़ें

(a) पंक्ति प्रणाली M|G|1 की व्याख्या कीजिए । पोलेकजेक-किंचिन सूत्र को प्राप्त कीजिए । (15 अंक) (b) निम्नलिखित परिवहन समस्या का MODI विधि का उपयोग करके हल निकालिए : भंडार I II III IV A 4 6 8 13 B 13 11 10 8 C 14 4 10 13 D 9 11 13 8 पूर्ति 50 70 30 50 मांग 25 35 105 20 (15 अंक) (c) द्विप्रावस्था विधि का उपयोग करके हल कीजिए : अधिकतमीकरण z = 2x₁ + x₂ + x₃ निम्न प्रतिबंधों के अंतर्गत 4x₁ + 6x₂ + 3x₃ ≤ 8 3x₁ - 6x₂ - 4x₃ ≤ 1 2x₁ + 3x₂ - 5x₃ ≥ 4 और x₁, x₂, x₃ ≥ 0. (20 अंक)

Answer approach & key points

Solve this three-part numerical problem by allocating approximately 30% time to part (a) for deriving the Pollaczek-Khinchine formula, 30% to part (b) for the MODI method transportation problem, and 40% to part (c) for the two-phase simplex method. Begin each part with clear problem setup, show all computational steps systematically, and conclude with verified final answers. For (a), explain M|G|1 characteristics before derivation; for (b), ensure initial basic feasible solution before MODI optimization; for (c), complete Phase I before proceeding to Phase II.

Part (a): Correct explanation of M|G|1 queuing system components (Poisson arrivals, General service time, single server) and derivation of Pollaczek-Khinchine formula for mean queue length Lq = λ²E(S²)/[2(1-ρ)] or equivalent forms
Part (b): Correct initial basic feasible solution using VAM or NWCR method, followed by MODI (UV method) iterations with proper stepping stone paths until optimality is reached with minimum transportation cost
Part (c): Proper conversion of inequalities to equations using slack, surplus and artificial variables; successful completion of Phase I to eliminate artificial variables; Phase II optimization yielding optimal solution
Verification of supply-demand balance (total supply = total demand = 200) in part (b) before solving, and correct handling of ≥ constraint in part (c) with surplus and artificial variables
Clear presentation of all simplex tableaus for part (c) showing entering and leaving variables, pivot operations, and final optimal value of objective function

Open full rubric & evaluate →

50M solve Linear programming and inventory control

(a) Solve the following linear programming problem: Maximize z = 3x₁ + 5x₂ subject to the constraints 3x₁ + 2x₂ ≤ 18 x₁ ≤ 4 x₂ ≤ 6 and x₁, x₂ ≥ 0. Discuss the change in Cⱼ on the optimality of the optimal basic feasible solution. (15 marks) (b) A manufacturer has to supply his customers with 600 units of his product per year. Shortages are not allowed and storage amounts to 60 paise per unit per year. The set up cost per run is Rs. 80. Find (i) economic order quantity (ii) optimum period of supply per optimum order and (iii) increase in the total cost associated with ordering 20 per cent more and 40% less. (15 marks) (c) A machine is set to deliver the packets of a given weight. Ten samples of size 5 each were examined and the following results were obtained: | Sample No. | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | |---|---|---|---|---|---|---|---|---|---|---| | Mean | 43 | 49 | 37 | 44 | 45 | 37 | 51 | 46 | 43 | 47 | | Range | 5 | 6 | 5 | 7 | 7 | 4 | 8 | 6 | 4 | 6 | Use mean and range charts to check whether process is under control. (Given for n = 5, d₂ = 2·326 and d₃ = 0·864) (20 marks)

हिंदी में पढ़ें

(a) निम्नलिखित रैखिक प्रोग्रामन समस्या को हल कीजिए : अधिकतमीकरण z = 3x₁ + 5x₂ निम्न प्रतिबंधों के अंतर्गत 3x₁ + 2x₂ ≤ 18 x₁ ≤ 4 x₂ ≤ 6 और x₁, x₂ ≥ 0. इष्टतम आधारी सुसंगत हल के इष्टतमत्व पर Cⱼ में परिवर्तन का वर्णन कीजिए । (15 अंक) (b) एक निर्माता को अपने ग्राहकों को प्रति वर्ष अपने उत्पाद की 600 इकाइयों की आपूर्ति करनी पड़ती है। अपयाप्तता की अनुमति नहीं है और गोदाम-भाड़ा 60 पैसे प्रति इकाई प्रति वर्ष है। झोंका लागत प्रति दौर 80 रुपये है। प्राप्त कीजिए (i) आर्थिक आदेश मात्रा (इकोनोमिक ऑर्डर क्वांटिटी) (ii) प्रति इष्टतम आदेश की आपूर्ति की इष्टतम अवधि और (iii) 20 प्रतिशत अधिक और 40% कम आदेश करने से संबंधित कुल लागत में वृद्धि। (15 अंक) (c) एक मशीन को दिये गये वजन के पैकेट देने के लिए सेट किया गया है। प्रत्येक आमाप 5 के दस प्रतिदर्शों की जाँच की गई और निम्नलिखित परिणाम प्राप्त हुए : | प्रतिदर्श संख्या | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | |---|---|---|---|---|---|---|---|---|---|---| | माध्य | 43 | 49 | 37 | 44 | 45 | 37 | 51 | 46 | 43 | 47 | | परिसर | 5 | 6 | 5 | 7 | 7 | 4 | 8 | 6 | 4 | 6 | माध्य और परिसर सांचित्रों का उपयोग करके जाँच कीजिए कि क्या प्रक्रम नियंत्रण में है ? (n = 5 के लिए दिया है, d₂ = 2·326 और d₃ = 0·864) (20 अंक)

Answer approach & key points

Begin with the directive 'solve' for part (a), applying the simplex method or graphical method to find the optimal solution, then analyze sensitivity of Cⱼ coefficients. For part (b), apply the EOQ formula with given parameters (D=600, C₀=₹80, Cₕ=₹0.60) and calculate percentage cost variations. For part (c), construct X̄ and R control charts using given constants d₂=2.326 and d₃=0.864, computing center lines and control limits to assess process stability. Allocate approximately 30% time to (a), 25% to (b), and 45% to (c) given their mark distribution and computational complexity.

Part (a): Correct identification of feasible region vertices (0,0), (4,0), (4,3), (2,6), (0,6) and optimal solution at (2,6) with Z=36
Part (a): Sensitivity analysis showing range for C₁ as [0, 7.5] and C₂ as [4, ∞) maintaining optimality of current basis
Part (b): EOQ calculation as √(2×600×80/0.6) = 400 units; optimum period = 400/600 × 12 = 8 months
Part (b): Total cost at EOQ = ₹240; cost at 480 units = ₹244 (1.67% increase); cost at 240 units = ₹264 (10% increase)
Part (c): Grand mean X̄̄ = 44.2, average range R̄ = 5.8; X̄ chart limits: 44.2 ± 3×5.8/(2.326×√5) = 44.2 ± 3.34
Part (c): R chart limits: UCL = 5.8×(1+3×0.864/2.326) = 12.27, LCL = 5.8×(1-3×0.864/2.326) = 0 (adjusted to 0)
Part (c): Correct conclusion that Sample 3 (mean=37) and Sample 7 (mean=51) fall outside X̄ control limits, indicating process is not under statistical control

Open full rubric & evaluate →

B

50M Compulsory explain Econometrics and vital statistics

(a) Explain Zellner's seemingly unrelated regression model and the feasible generalized least squares method of estimating the model. (10 marks) (b) Explain the functions of N.S.S.O. (10 marks) (c) Obtain the generalized least squares estimators in the two-variable model Yᵢ = β₁ + β₂Xᵢ + uᵢ assuming the heteroscedastic variances are known and obtain their variances. (10 marks) (d) Why is it considered desirable to convert gross scores to some standard scores? Define 'standard scores' and 'normalised scores' and describe how they are derived. (10 marks) (e) Fill in blanks which are marked with a query in the above skeleton life table and explain the meaning of the symbols at the heads of the columns. (10 marks)

हिंदी में पढ़ें

(a) जेलनर के प्रतीयमानतः असंबंधित समाश्रयण निदर्श की व्याख्या कीजिए और निदर्श का आकलन करने के लिए सुसंगत व्यापकीकृत न्यूनतम वर्ग विधि को समझाइए। (10 अंक) (b) राष्ट्रीय प्रतिदर्श सर्वेक्षण संगठन (एन.एस.एस.ओ.) के प्रकार्यों को समझाइए। (10 अंक) (c) यह मानते हुए कि विषम विचलिता प्रसरण ज्ञात है, द्विचर निदर्श Yᵢ = β₁ + β₂Xᵢ + uᵢ में व्यापकीकृत न्यूनतम वर्ग आकलकों को प्राप्त कीजिए और उनके प्रसरणों को प्राप्त कीजिए। (10 अंक) (d) सकल समकों को किन्हीं मानक समकों में रूपांतरित करने को क्यों वांछनीय माना जाता है? 'मानक समकों' और 'प्रसामान्यीकृत समकों' को परिभाषित कीजिए और इन्हें कैसे व्युत्पन्न किया जाता है, इसका वर्णन कीजिए। (10 अंक) (e) रिक्त स्थानों को भरें, जो निम्नलिखित कंकाल बय-सारणी में एक क्वेरी के साथ चिह्नित हैं और कॉलम के प्रमुखों पर प्रतीकों का अर्थ समझाइए। (10 अंक)

Answer approach & key points

The directive 'explain' demands clear exposition with theoretical foundations and derivations. Allocate approximately 20% (10 marks) to each sub-part equally. For (a), present SURE model structure and FGLS estimation steps; for (b), enumerate NSSO functions with Indian statistical system context; for (c), derive GLS estimators with matrix algebra; for (d), clarify standardization rationale with formulae; for (e), complete life table calculations and interpret actuarial symbols. Structure: brief introduction, systematic part-wise treatment with equations, and concluding synthesis on statistical estimation methods.

Part (a): SURE model specification with contemporaneous correlation across equations, disturbance covariance matrix structure, and FGLS two-step estimation (estimated Ω used for feasible estimator)
Part (b): NSSO functions—conducting large-scale sample surveys (NSS, ASI), data dissemination, methodological research, coordination with state DES, and international reporting (SDGs)
Part (c): GLS transformation with known heteroscedastic variances σᵢ², weighted least squares derivation, variance-covariance matrix of estimators, and efficiency comparison with OLS
Part (d): Desirability of standard scores (comparability, norm-referenced interpretation), z-score formula, normalized scores (T-scores, stanines), and derivation steps
Part (e): Life table completion—calculating nqx, npx, nLx, Tx, ex from given lx and ndx columns; explanation of q (mortality), p (survival), L (person-years), T (total), e (expectation)

Open full rubric & evaluate →

50M explain Time series analysis and index numbers

(a) Explain Box-Jenkins methodology to build ARIMA models. (15 marks) (b) Prepare the cost of living index for 2006 on the basis of 2005 from the above data by (i) aggregative method and (ii) method of weighted relatives and comment. (15 marks) (c) Explain price statistics relating to 'Price Quotations'. Elucidate publications of data concerning foreign trade of India. (20 marks)

हिंदी में पढ़ें

(a) अरीमा (ARIMA) निदर्शों को बनाने के लिए बाक्स-जेनकिंस विधि तंत्र की व्याख्या कीजिए। (15 अंक) (b) निम्नलिखित आंकड़ों से 2005 के आधार पर 2006 के लिए निर्वाह-सूचकांक तैयार कीजिए (i) सामुदायिक विधि द्वारा और (ii) भारित अनुपातों की विधि द्वारा, और अपनी टिप्पणी दीजिए। (15 अंक) (c) 'मूल्य कोटेशनों' से संबंधित मूल्य आंकड़े बताइए। भारत के विदेशी व्यापार के विषय में आंकड़ों के प्रकाशनों को स्पष्ट कीजिए। (20 अंक)

Answer approach & key points

The directive 'explain' demands clear exposition with logical flow and appropriate technical depth. Allocate approximately 30% time/words to part (a) on Box-Jenkins methodology, 30% to part (b) on index number calculations with explicit working, and 40% to part (c) covering price quotations and foreign trade publications. Structure: brief conceptual introduction for each part, detailed methodological explanation with formulae, worked calculations for (b), and specific Indian statistical system references for (c).

Part (a): Identification of ARIMA model components (AR, I, MA), stationarity testing via ADF or KPSS, differencing procedures, ACF/PACF analysis for order identification, parameter estimation via maximum likelihood, diagnostic checking with Ljung-Box test, and forecasting with confidence intervals
Part (b): Correct application of aggregative method (Laspeyres or Paasche formula with base year quantities as weights) and weighted relatives method (price relatives multiplied by base year expenditure weights), proper identification of missing data items from 'above data' reference, and meaningful economic interpretation of cost of living changes
Part (c): Explanation of price quotation system (selection of representative items, specification of quality, choice of markets, timing of collection), role of NSSO and CSO in price data collection, and detailed coverage of DGCI&S publications (Monthly Statistics of Foreign Trade, Annual Report, commodity-wise/country-wise trade data, ITC-HS classification)
Integration of Indian context: citing RBI's use of ARIMA for monetary policy forecasting, CPI-AL/RL/CPI-U/CPI-IW construction by Labour Bureau, and Ministry of Commerce's trade statistics dissemination
Critical commentary in (b) on limitations of index numbers (substitution bias, quality changes, new goods) and in (c) on challenges in price quotation representativeness and trade data timeliness
Correct mathematical notation: ∇^d for differencing, φ(B) and θ(B) operators, p/d/q orders, and index number formulae with proper subscripts

Open full rubric & evaluate →

50M derive Demography, psychometrics, and life table analysis

(a) If c(x, t) denote observed proportion of females in the age group (x, x+t) and f(x, t) is the observed proportion of females giving birth to female children in the age group (x, x+t) at time t. Let us assume that X is uniformly distributed in (α, β). Then show that $$ \hat{B}_f(t)=\left[\hat{r}_{c,f|t} \hat{\sigma}_c \hat{\sigma}_f (\beta-\alpha) + \frac{[\hat{T}_f(t)]^2}{(\beta-\alpha)} \frac{1}{\hat{G}_f(t)}\right], $$ where $\hat{T}_f(t)$ is the estimated total fertility. $\hat{B}_f(t)$ is the estimated female birthrate at time t. $\hat{G}_f(t)$ is the estimated General Fertility rate. $\hat{r}_{c,f|t}$ represents product moment correlation coefficient between c and f given t. $\hat{\sigma}_c, \hat{\sigma}_f$ are observed standard deviations of c and f respectively. (15 marks) (b) What do you mean by Intelligence Quotient (I.Q.) ? Describe the procedure and test of measuring I.Q. How does an aptitude test differ from an Intelligence Test ? The reliability coefficient of a test of 60 items is 0·65. How much the test should be lengthened to raise the self correlation to 0·95 ? What effect will the doubling and tripling the test's length have upon the reliability coefficients ? What is the reliability of a test having 135 comparable items ? (15 marks) (c) Define instantaneous force of mortality (μₓ). Show that qₓ = (1/lₓ) ∫₀¹ μₓ₊ₜ lₓ₊ₜ dx where qₓ is the probability of dying within one year following the attainment of age x. Also prove that μₓ = (1/eₓ⁰) [1 + (deₓ⁰/dx)] where eₓ⁰ is the complete expectation of life. (20 marks)

हिंदी में पढ़ें

(a) यदि c(x, t) आयु वर्ग (x, x+t) में महिलाओं का प्रेक्षित अनुपात है और f(x, t) आयु वर्ग (x, x+t) में उन महिलाओं का प्रेक्षित अनुपात है जो महिला बच्चों को जन्म देती हैं, समय t पर। हम यह मान लेते हैं कि X, (α, β) में एकसमान बंटित है। तब दर्शाइए कि $$ \hat{B}_f(t)=\left[\hat{r}_{c,f|t} \hat{\sigma}_c \hat{\sigma}_f (\beta-\alpha) + \frac{[\hat{T}_f(t)]^2}{(\beta-\alpha)} \frac{1}{\hat{G}_f(t)}\right], $$ जहाँ $\hat{T}_f(t)$ आकलित कुल उर्वरता है। $\hat{B}_f(t)$ समय t पर आकलित महिला जन्म दर है। $\hat{G}_f(t)$ आकलित सामान्य प्रजनन दर है। $\hat{r}_{c,f|t}$ निरूपित करता है c और f के बीच गुणन-आश्रित संबंध गुणांक को जब कि t दिया हुआ है। $\hat{\sigma}_c, \hat{\sigma}_f$ क्रमशः: c और f के प्रेक्षित मानक विचलन हैं। (15 अंक) (b) बौद्धिक स्तर (आई. क्यू.) से आप क्या समझते हैं ? आई. क्यू. को मापने की विधि और परीक्षण का वर्णन कीजिए। एक उपयुक्ता (एप्टीट्यूड) परीक्षण, एक बौद्धिक परीक्षण से किस प्रकार भिन्न है ? 60 मदों के एक परीक्षण का विश्वसनीयता गुणांक 0.65 है। परीक्षण को कितना लम्बा किया जाना चाहिए ताकि स्व-सहसंबंध (सेल्फ कोरिलेशन) बढ़ कर 0.95 हो जाए ? परीक्षण की लम्बाई को दो गुना और तीन गुना करने पर विश्वसनीयता गुणांकों पर क्या प्रभाव पड़ेगा ? 135 तुलनीय मदों वाले एक परीक्षण की विश्वसनीयता क्या है ? (15 अंक) (c) तत्क्षण मरता की तीव्रता (μₓ) को परिभाषित कीजिए । दर्शाइए कि qₓ = (1/lₓ) ∫₀¹ μₓ₊ₜ lₓ₊ₜ dx जहाँ qₓ आयु x प्राप्त करने के उपरान्त एक वर्ष के भीतर मरने की प्रायिकता है । यह भी सिद्ध कीजिए कि μₓ = (1/eₓ⁰) [1 + (deₓ⁰/dx)] जहाँ eₓ⁰ जीवन की पूर्ण प्रत्याशा है । (20 अंक)

Answer approach & key points

The directive 'derive' demands rigorous mathematical proofs and derivations. Allocate approximately 30% time to part (a) on female birthrate estimation using correlation structure, 30% to part (b) covering IQ definition, measurement procedures, aptitude-intelligence distinction, and reliability calculations using Spearman-Brown prophecy, and 40% to part (c) on force of mortality derivations and life table relationships. Structure with clear section headings, state assumptions explicitly, show step-by-step derivations, and conclude with precise final expressions.

Part (a): Derivation of female birthrate formula using uniform distribution assumption, correlation structure between c(x,t) and f(x,t), and proper substitution of T_f(t) and G_f(t) with algebraic manipulation of (β-α) terms
Part (b): Precise definition of IQ (Mental Age/Chronological Age × 100 or deviation IQ), Stanford-Binet and Wechsler procedures, distinction between aptitude (specific potential) and intelligence (general ability) tests
Part (b): Application of Spearman-Brown prophecy formula n = r₂(1-r₁)/r₁(1-r₂) to find required test length for reliability 0.95, and calculation of new reliabilities for doubled/tripled lengths and 135 items
Part (c): Definition of μₓ as instantaneous death rate and derivation of qₓ = (1/lₓ)∫₀¹ μₓ₊ₜ lₓ₊ₜ dt using relationship between force of mortality and survival function
Part (c): Proof of μₓ = (1/eₓ⁰)[1 + (deₓ⁰/dx)] using complete expectation of life definition eₓ⁰ = Tₓ/lₓ and differentiation with respect to age
Correct handling of Indian demographic context: mention of SRS (Sample Registration System) data for fertility estimation and applicability to Indian population studies

Open full rubric & evaluate →

50M explain Econometrics and population projection

(a) What is autocorrelation ? What are its consequences ? Explain the Goldfeld-Quandt test and Glesjer test for heteroscedasticity. (20 marks) (b) Check the identifiability of the following two-equation system : β₁₁y₁ₜ + β₁₂y₂ₜ + γ₁₁x₁ₜ + γ₁₂x₂ₜ = u₁ₜ β₂₁y₁ₜ + β₂₂y₂ₜ + γ₂₁x₁ₜ + γ₂₂x₂ₜ = u₂ₜ Given the restrictions (i) γ₁₂ = 0, γ₂₁ = 0 and (ii) γ₁₁ = 0, γ₁₂ = 0 (15 marks) (c) Describe Leslie matrix and describe Leslie Matrix Technique for the population projection. (15 marks)

हिंदी में पढ़ें

(a) स्वसहसंबंध क्या है ? इसके परिणाम क्या हैं ? विषम विचलितता (हेटेरोस्किडास्टिसिटी) के लिए गोल्डफेल्ड-क्वांट (Goldfeld-Quandt) परीक्षण और ग्लेसजर (Glesjer) परीक्षण को समझाइए । (20 अंक) (b) निम्नलिखित द्वि-समीकरण प्रणाली की अभिज्ञेयता (आइडेंटिफायबिलिटी) की जाँच कीजिए : β₁₁y₁ₜ + β₁₂y₂ₜ + γ₁₁x₁ₜ + γ₁₂x₂ₜ = u₁ₜ β₂₁y₁ₜ + β₂₂y₂ₜ + γ₂₁x₁ₜ + γ₂₂x₂ₜ = u₂ₜ दिये गये प्रतिबंध हैं (i) γ₁₂ = 0, γ₂₁ = 0 और (ii) γ₁₁ = 0, γ₁₂ = 0 (15 अंक) (c) लेस्ली (Leslie) आव्यूह का वर्णन कीजिए और समष्टि प्रक्षेपण के लिए लेस्ली आव्यूह तकनीक का वर्णन कीजिए । (15 अंक)

Answer approach & key points

The directive 'explain' demands clear exposition with causal reasoning and illustrative examples. Allocate approximately 40% of time/words to part (a) given its 20 marks weight, covering autocorrelation definition, consequences, and both heteroscedasticity tests with step-wise procedures. Devote roughly 30% each to parts (b) and (c): for (b), construct the identification analysis using order and rank conditions for both restriction sets; for (c), explain Leslie matrix structure, fertility/survival parameters, and iterative projection mechanics with Indian demographic application. Structure as: definitional clarity → methodological exposition → worked application → concluding synthesis.

Part (a): Precise definition of autocorrelation (correlation between error terms across observations) and distinction from heteroscedasticity; enumeration of consequences including inflated t-statistics, inefficient OLS estimates, and misleading R²
Part (a): Goldfeld-Quandt test: correct procedure of ordering observations, splitting samples, computing F-ratio of residual variances, and interpretation against critical values
Part (a): Glesjer test: auxiliary regression of absolute/squared residuals on explanatory variables, test statistic derivation, and comparison with Goldfeld-Quandt in terms of power and applicability
Part (b): Application of order condition (K-k ≥ m-1) and rank condition for identification under restriction set (i) γ₁₂=0, γ₂₁=0, showing both equations are identified
Part (b): Analysis under restriction set (ii) γ₁₁=0, γ₁₂=0, demonstrating identification failure for equation 1 due to rank deficiency
Part (c): Leslie matrix structure: age-specific fertility rates (Fᵢ) in first row, survival probabilities (Pᵢ) on sub-diagonal, zeros elsewhere; matrix dimensions matching age classes
Part (c): Population projection technique: iterative multiplication n(t+1) = L × n(t), stable population properties, intrinsic growth rate extraction, and application to Indian census projections

Open full rubric & evaluate →

Practice Statistics 2021 Paper II answer writing

Pick any question above, write your answer, and get a detailed AI evaluation against UPSC's standard rubric.

Start free evaluation →