Statistics

UPSC Statistics 2023

All 16 questions from the 2023 Civil Services Mains Statistics paper across 2 papers — 800 marks in total. Each question comes with a detailed evaluation rubric, directive word analysis, and model answer points.

16Questions

800Total marks

2Papers

2023Exam year

Paper I

8 questions · 400 marks

50M Compulsory solve Probability theory and distributions

(a) Out of 1000 persons born, only 900 reach the age of 15 years, and out of every 1000 who reach the age of 15 years, 950 reach the age of 50 years. Out of every 1000 who reach the age of 50 years, 40 die in one year. Accordingly, what is the probability that a person would attain the age of 51 years ? (10 marks) (b) Let X be a continuous random variable with probability density function : f(x) = $$ \begin{cases} \frac{x}{2}, & 0 \leq x < 1 \\\ \frac{1}{2}, & 1 \leq x < 2 \\\ \frac{3-x}{2}, & 2 \leq x < 3 \\\ 0, & \text{elsewhere} \end{cases} $$ Obtain the cumulative distribution function of X and hence find the value of $P\left(X > \frac{3}{2}\right)$. (10 marks) (c) Let {Xₙ, n ≥ 1} be a sequence of mutually independent random variables such that P(Xₙ = nᵅ) = P(Xₙ = – nᵅ) = 0·5, for any α > 0. Derive the condition on α under which the sequence {Xₙ, n ≥ 1} obeys WLLNs. (10 marks) (d) Apply Run Test to test the randomness of the following sequence of H and T at 5% level of significance : HHHHHHTHHHHHTHTHHHH TTHHHHTHHHTTHHHHHH THHTTHHTHHH Given : Z₍₀·₀₂₅₎ = 1·96 Z₍₀·₀₅₎ = 1·645 (10 marks) (e) Differentiate between prior and posterior distributions. In case of squared error loss function, find out the Bayes estimator for unknown parameter. (10 marks)

Answer approach & key points

Solve each sub-part systematically with clear mathematical working. For (a), apply conditional probability using survival data; (b) integrate piecewise to find CDF and evaluate tail probability; (c) apply Khinchin's WLLN condition checking variance behavior; (d) count runs and apply normal approximation for hypothesis testing; (e) state Bayes theorem and minimize posterior expected loss. Allocate approximately 2 minutes per mark, presenting each solution with clear labeling and logical flow from given information to final answer.

(a) Correct application of chain rule for conditional probability: P(age 51) = P(survive to 15) × P(survive to 50 | 15) × P(survive 50-51 | 50) = 0.9 × 0.95 × 0.96
(b) Proper piecewise integration of f(x) to obtain F(x) with continuity checks at x=1 and x=2, then P(X > 3/2) = 1 - F(3/2) = 5/8
(c) Derivation that E(Xₙ) = 0, Var(Xₙ) = n^(2α), and application of Khinchin's theorem requiring (1/n²)ΣVar(Xᵢ) → 0, yielding condition α < 1/2
(d) Correct counting of runs (r=12), expected runs μᵣ = 2n₁n₂/(n₁+n₂) + 1, variance σᵣ², and Z-test showing |Z| < 1.96 so randomness not rejected
(e) Clear distinction: prior π(θ) represents pre-sample belief, posterior π(θ|x) ∝ L(x|θ)π(θ) updates belief; Bayes estimator under squared error loss is posterior mean E[θ|x]

Paper I

Paper II

Practice any of these questions