Q1 50M Compulsory solve Probability theory and statistical inference
(a) A production unit manufacturing surgical masks is concerned about the quality of their masks. A random sample of n masks is inspected to estimate 'p', the probability of manufacturing a defective mask. How large a sample is required so that the estimate of p lies in the range p ± 0.1 with probability 0.95 ? (10 marks)
(b) An insurance company studies a sample of 150 policy-holders. There are three categories of policies : auto, home and medical. The following results are obtained about the policies held by the policy-holders :
(i) 30 have only home insurance
(ii) 10 have only medical insurance
(iii) 98 have auto insurance, but not all three types of insurance
(iv) 27 have medical insurance, but not all three types of insurance
(v) 13 have auto and medical insurance
Given that a policy-holder has medical insurance, calculate the probability that he has home insurance. (10 marks)
(c) Let X and Y be independent and identically distributed exponential random variables with mean λ > 0.
Define
$$Z = \begin{cases} 1, & \text{if} \quad X < Y \\ 0, & \text{if} \quad X \geq Y \end{cases}$$
Find E[X|Z = 1] + E[X|Z = 0]. (10 marks)
(d) Let X₁, X₂, ..., Xₙ be a random sample from
$$f(x, \theta) = \frac{\log(\theta)}{\theta - 1}\theta^x; \quad 0 < x < 1, \quad \theta > 1$$
Is there a function of θ, say g(θ), for which there exists an unbiased estimator whose variance attains the C-R lower bound ? If yes, find it. If not, show why not. (10 marks)
(e) Let f(x, θ) be the Cauchy pdf
$$f(x, \theta) = \frac{\theta}{\pi} \frac{1}{\theta^2 + x^2}; -\infty < x < \infty, \theta > 0$$
(i) Show that this family does not have Monotone Likelihood Ratio (MLR).
(ii) If X is one observation from f(x, θ), show that |X| is sufficient for θ and hence the distribution of |X| does have an MLR. (5+5 marks)
Answer approach & key points
Solve each sub-part systematically with clear mathematical derivations. For (a), apply normal approximation to binomial for sample size determination; for (b), use set theory and conditional probability with Venn diagram analysis; for (c), exploit memoryless property of exponential distribution and symmetry arguments; for (d), verify regularity conditions and apply Cramér-Rao inequality; for (e), construct likelihood ratio and apply factorization theorem. Allocate approximately 15% time to (a), 15% to (b), 20% to (c), 25% to (d), and 25% to (e) given their analytical complexity.
- (a) Sample size via the normal approximation: n = z²₀.₀₂₅ p(1−p)/d² with d = 0.1; the conservative choice p = 0.5 gives n = (1.96)²(0.25)/(0.01) ≈ 96.04, so n = 97
- (b) Complete Venn diagram construction: solving the count constraints gives all three = 4, home∩medical only = 8, auto∩medical only = 9, so 31 policy-holders have medical insurance and the conditional probability is (8 + 4)/31 = 12/31
- (c) E[X|Z=1] = E[X|X<Y] = λ/2 by memoryless property and E[X|Z=0] = λ + λ/2 = 3λ/2, sum = 2λ
- (d) Verification of regularity conditions, Fisher information calculation I(θ) = [(θ−1)² − θ(log θ)²]/[θ²(θ−1)²(log θ)²], and the exponential-family argument that the C-R bound is attained exactly for affine functions of E_θ(X) = θ/(θ−1) − 1/log θ
- (e)(i) Counterexample showing L(θ₂)/L(θ₁) is not monotone by comparing likelihood ratios at x = 0 and x → ∞ for θ₂ > θ₁
- (e)(ii) Factorization theorem application showing |X| sufficient, and proof that the density of |X|, g(t; θ) = 2θ/[π(θ² + t²)] for t > 0, has MLR in t
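The arithmetic in (a) and (b) can be cross-checked numerically; a minimal sketch (variable names and the brute-force loop are mine) that computes the conservative sample size and solves the Venn-diagram constraints exactly:

```python
from math import ceil
from fractions import Fraction

# Q1(a): normal approximation gives n >= z^2 p(1-p)/d^2; the conservative
# bound p(1-p) <= 1/4 removes the dependence on the unknown p.
z, d = 1.96, 0.1                       # z ~ 97.5th percentile of N(0,1)
n = ceil(z**2 * 0.25 / d**2)           # conservative sample size

# Q1(b): write ahm = #(all three), am = #(auto & medical only),
# hm = #(home & medical only). The stated counts give:
#   am + ahm = 13          (auto and medical)
#   10 + am + hm = 27      (medical but not all three)
#   98 + ahm + 30 + 10 + hm = 150   (total policy-holders)
for ahm in range(14):
    am = 13 - ahm
    hm = 17 - am
    if ahm + hm == 12:                 # total-count constraint
        p_home_given_medical = Fraction(hm + ahm, 10 + am + hm + ahm)
```

Running the loop, only one non-negative solution satisfies every constraint, which fixes the conditional probability uniquely.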
Q2 50M derive Limit theorems and characteristic functions
(a) Let Y₁, Y₂, Y₃, ... be independent and identically distributed Poisson random variables with parameter 1. Use the central limit theorem to establish
$$n! \simeq \sqrt{2\pi n}\left(\frac{n}{e}\right)^n$$
for large value of positive integer n. (20 marks)
(b) Let X₁, X₂, ..., Xₙ be a random sample such that log Xᵢ ~ N(θ, θ) distribution with θ > 0 unknown. Show that one of the solutions of the likelihood equation is the unique MLE of θ. Obtain asymptotic distribution of MLE of θ. (15 marks)
(c) (i) State the sufficient conditions for a function φ(t) to be a characteristic function.
(ii) Investigate if the following functions are characteristic functions :
1. $e^{-t^4}$
2. $[1 + |t|]^{-1}$
Justify your answer. (5+10 marks)
Answer approach & key points
Derive the Stirling approximation in part (a) by applying CLT to Poisson sums and carefully manipulating the resulting normal approximation. For part (b), derive the likelihood equation, verify the MLE solution, and obtain its asymptotic normality via Fisher information. In part (c), state Bochner's theorem precisely, then investigate the two functions using properties of positive definiteness and Polya's criteria. Allocate approximately 40% time to (a), 30% to (b), and 30% to (c), ensuring rigorous justification at each step.
- Part (a): Define Sₙ = Y₁ + ... + Yₙ ~ Poisson(n), apply CLT to (Sₙ - n)/√n → N(0,1), and use P(Sₙ = n) with Stirling's manipulation
- Part (a): Equate Poisson pmf at n to normal density approximation and solve for n! to obtain √2πn(n/e)ⁿ
- Part (b): Construct log-likelihood l(θ) = -n/2 log(2πθ) - 1/(2θ)Σ(log Xᵢ - θ)², derive score function and likelihood equation
- Part (b): Verify the second-order condition l″(θ̂) < 0 to confirm the root is the unique MLE, then apply standard asymptotic theory: √n(θ̂ − θ) → N(0, I(θ)⁻¹)
- Part (c)(i): State Bochner's theorem: φ(0)=1, continuous at 0, positive definite (non-negative definite matrices from φ(tᵢ-tⱼ))
- Part (c)(ii): Show e^{−t⁴} fails: if it were a characteristic function, φ″(0) = 0 would give E[X²] = 0, forcing X degenerate at 0 with φ(t) ≡ 1, a contradiction; alternatively exhibit a failure of positive definiteness
- Part (c)(ii): Verify [1+|t|]⁻¹ satisfies Polya's criteria (convex on t>0, φ(0)=1, even, continuous, φ(∞)=0) hence is characteristic function
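Stirling's approximation derived in (a) is easy to sanity-check numerically; a sketch (the function name is mine) comparing n! with √(2πn)(n/e)ⁿ at n = 50:

```python
import math

# CLT route: P(S_n = n) is approximated by the N(n, n) density at its mean,
# 1/sqrt(2*pi*n); equating with the Poisson pmf e^{-n} n^n / n! gives Stirling.
def stirling(n):
    return math.sqrt(2 * math.pi * n) * (n / math.e) ** n

# relative error of Stirling's formula is about 1/(12n), so the ratio
# n!/stirling(n) should sit just above 1
ratio = math.factorial(50) / stirling(50)
```

At n = 50 the ratio is roughly 1 + 1/(12·50) ≈ 1.0017, confirming the asymptotic.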
Q3 50M construct Sequential probability ratio test and convergence
(a) Let X and Y be two independent random variables following exponential distribution with mean $\frac{1}{\lambda}$ and $\frac{1}{\mu}$ respectively, $\lambda > 0$, $\mu > 0$. Suppose that $(X_1, X_2, ..., X_n)$ and $(Y_1, Y_2, ..., Y_n)$ are sequences of observations on X and Y respectively. A random variable $U_i$ is defined as $$U_i = \begin{cases} 1, & \text{if} \quad X_i \geq Y_i, \quad i = 1, 2, ..., n \\ 0, & \text{otherwise} \end{cases}$$ Construct Wald's SPRT procedure based on $U_i$'s for testing H : $\lambda = \mu$ versus K : $\lambda = 2\mu$ with strength $(\alpha, \beta)$. (20 marks)
(b) Let $Y_i$, $i \geq 1$ be independent and identically distributed $U(-1, 1)$ random variables. Determine if the following sequences converge in probability : (i) $\left\{\frac{Y_i}{i}\right\}$ (ii) $\left\{(Y_i)^i\right\}$ (5+10 marks)
(c) Let X₁, X₂, ..., Xₙ be a random sample from uniform distribution U(− θ, θ), θ > 0. Find the complete sufficient statistic for θ. Hence, obtain the best unbiased estimator of θ. (15 marks)
Answer approach & key points
Construct the Wald's SPRT procedure for part (a) by deriving the likelihood ratio for Bernoulli outcomes, then determine convergence properties for sequences in part (b) using appropriate limit theorems, and finally derive the complete sufficient statistic and MVUE for part (c). Allocate approximately 40% time to part (a) given its 20 marks, and 30% each to parts (b) and (c), which carry 15 marks apiece. Structure with clear headings for each sub-part, showing derivations step-by-step and concluding with explicit final answers.
- For (a): Derive P(X_i ≥ Y_i) = μ/(λ+μ) under H and K, showing U_i ~ Bernoulli with p = 1/2 under H and p = 1/3 under K
- For (a): Construct Wald's SPRT with likelihood ratio Λ_n = (2/3)^T_n × (4/3)^(n-T_n) where T_n = ΣU_i, and specify continuation region with bounds A ≈ (1-β)/α and B ≈ β/(1-α)
- For (b)(i): Show Y_i/i → 0 in probability using Chebyshev's inequality or direct calculation of P(|Y_i/i| > ε)
- For (b)(ii): Analyze (Y_i)^i convergence by considering cases Y_i ∈ (-1,1), showing convergence to 0 in probability
- For (c): Identify T = max(|X_(1)|, |X_(n)|) or equivalently max(-X_(1), X_(n)) as complete sufficient statistic using factorization theorem and completeness of uniform family
- For (c): Derive E[T] = nθ/(n+1) and construct unbiased estimator θ̂ = (n+1)T/n, verifying it is the UMVUE via Lehmann-Scheffé theorem
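The unbiasedness claim behind the UMVUE in (c) can be checked numerically; a sketch (the parameter choices n = 5, θ = 2 and the midpoint rule are mine) integrating against the density of T = max|Xᵢ|:

```python
# |X_i| ~ U(0, theta) iid, so T = max|X_i| has density
# f_T(t) = n t^(n-1) / theta^n on (0, theta).
# Midpoint-rule check that E[(n+1)T/n] = theta.
n, theta, m = 5, 2.0, 100_000
h = theta / m
ET = sum((i + 0.5) * h                              # value of t
         * n * ((i + 0.5) * h) ** (n - 1) / theta ** n  # density f_T(t)
         * h                                        # interval width
         for i in range(m))
estimator_mean = (n + 1) / n * ET                   # should equal theta
```

Since E[T] = nθ/(n+1) exactly, the rescaled estimator's expectation lands on θ up to quadrature error.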
Q4 50M prove UMVUE, joint distributions and non-parametric tests
(a) Let X₁, X₂, ..., Xₙ be a random sample from Poisson distribution with mean λ > 0. Define a statistic W = (1 − 1/n)^T, T = Σᵢ₌₁ⁿ Xᵢ (i) Show that T is a complete sufficient statistic. (ii) Show that W is unbiased for e^(−λ). (iii) Show that even though W is UMVUE, it does not attain the CRLB for g(λ) = e^(−λ). (20 marks)
(b) Let $f(x, y) = \frac{y^{3/2} e^{-yx^2/2} e^{-y}}{\sqrt{2\pi}}$, $-\infty < x < \infty$, $y > 0$. (i) Obtain the marginal distribution of Y and conditional distribution of X given Y. (ii) Find E(Y), V(Y), E(X|Y), V(X|Y). (iii) Use (ii) to find E(X), V(X). (5+5+5 marks)
(c) A company's trainees are randomly assigned to groups which are put through a certain industrial inspection procedure by three different methods. At the end of the instruction period they are tested for inspection performance quality. The following are their scores :
Method A : 80, 83, 79, 85, 90, 68
Method B : 82, 84, 60, 72, 86, 67, 91
Method C : 93, 65, 77, 78, 88
Using the appropriate non-parametric test, determine at the 0·05 level of significance whether the three methods are equally effective. (15 marks)
Answer approach & key points
Prove all required results systematically, spending approximately 40% of time on part (a) given its 20 marks, 30% on part (b) for 15 marks, and 30% on part (c) for 15 marks. Structure as: (a) establish completeness via exponential family, sufficiency via factorization, unbiasedness via expectation calculation, and CRLB non-attainment via variance comparison; (b) integrate to obtain marginal Gamma distribution, derive conditional Normal, then apply law of total expectation/variance; (c) state Kruskal-Wallis test assumptions, compute ranks, calculate H-statistic, and compare with χ² critical value.
- Part (a)(i): Apply factorization theorem to show T is sufficient; use completeness property of Poisson exponential family with natural parameter space containing an open set
- Part (a)(ii): Calculate E[W] = E[(1−1/n)^T] using the Poisson pgf E[s^T] = e^{nλ(s−1)} at s = 1 − 1/n to verify unbiasedness for e^(−λ)
- Part (a)(iii): Compute Var(W), derive CRLB for g(λ)=e^(-λ), and explicitly show strict inequality Var(W) > CRLB
- Part (b): Identify Y ~ Gamma(2, 1) with marginal density y e^(−y) (integrating out x contributes √(2π/y), leaving y^{3/2}e^(−y)/√y); X|Y ~ N(0, 1/Y); apply E(X) = E[E(X|Y)] = 0 and V(X) = E[V(X|Y)] + V[E(X|Y)] = E[1/Y] = 1
- Part (c): Apply Kruskal-Wallis H-test: pool and rank all 18 observations, compute rank sums per method, calculate H = [12/N(N+1)]Σ(Ri²/ni) - 3(N+1), compare with χ²₂,₀.₀₅ = 5.991
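The Kruskal-Wallis statistic for (c) can be computed directly; a minimal sketch using plain ranks, which suffices here because all 18 scores are distinct (no tie correction needed):

```python
# Kruskal-Wallis H-test for the three training methods
A = [80, 83, 79, 85, 90, 68]
B = [82, 84, 60, 72, 86, 67, 91]
C = [93, 65, 77, 78, 88]

# pooled ranks: 1 for the smallest score, ..., N for the largest
rank = {v: i + 1 for i, v in enumerate(sorted(A + B + C))}
N = len(A) + len(B) + len(C)                       # 18 observations

# H = [12 / N(N+1)] * sum(R_i^2 / n_i) - 3(N+1)
H = 12 / (N * (N + 1)) * sum(
    sum(rank[v] for v in g) ** 2 / len(g) for g in (A, B, C)
) - 3 * (N + 1)
# H is far below the critical value 5.991 = chi^2 with 2 df at 0.05,
# so the three methods may be regarded as equally effective
```

The rank sums come out to 61, 62 and 48, giving H ≈ 0.197, so the null hypothesis of equal effectiveness is not rejected.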
Q5 50M Compulsory derive Linear regression, experimental designs, sampling theory
(a) For a simple linear regression model Yᵢ = β₀ + β₁Xᵢ + εᵢ, i = 1, ..., n
(i) Derive the least square estimators of β₀ and β₁, clearly stating the conditions assumed.
(ii) For eᵢ = Yᵢ - Ŷᵢ where Ŷᵢ is the fitted value, show that
1. Σᵢ₌₁ⁿ eᵢ = 0
2. Σᵢ₌₁ⁿ Yᵢ = Σᵢ₌₁ⁿ Ŷᵢ
3. Σᵢ₌₁ⁿ Xᵢeᵢ = 0
4. Σᵢ₌₁ⁿ Ŷᵢeᵢ = 0
5. The regression line passes through (X̄, Ȳ). 5+5
(b) In usual notations, if v, b, r, k and λ are the parameters of a Balanced Incomplete Block Design, then show that :
(i) b ≥ r + 1 ≥ λ + 2
(ii) v ≤ b ≤ (r² - 1)/λ
10
(c) For the multiple linear regression model with two predictor variables X₁ and X₂, show that the estimate of regression coefficient of X₁ is unchanged when X₂ is added to the regression model, whenever X₁ and X₂ are uncorrelated.
10
(d) A sample of size n is drawn from a population having N units by simple random sampling without replacement. A sub-sample of n₁ units is drawn from the n units by simple random sampling without replacement. Let ȳ₁ denote the mean based on n₁ units and ȳ₂, the mean based on n₂ = n - n₁ units. Consider the estimator of the population mean Ȳₙ given by :
Ŷₙ = wȳ₁ + (1-w)ȳ₂ ; 0 < w < 1
Show that E(Ŷₙ) = Ȳₙ, and obtain its variance.
10
(e) How is the efficiency of a design measured ? Derive the expression to measure the efficiency of a Randomised Block Design over a Completely Randomised Design. 10
Answer approach & key points
This question demands rigorous step-by-step mathematical proofs with clear logical progression. Allocate time proportionally: ~20% for (a)(i)-(ii) on SLR properties, ~20% for (b) on BIBD inequalities, ~20% for (c) on multiple regression orthogonality, ~20% for (d) on two-phase sampling variance, and ~20% for (e) on design efficiency. Begin each sub-part by stating assumptions, proceed with systematic derivation, and conclude with the required result clearly boxed.
- (a)(i) Correct setup of normal equations minimizing Σ(Yᵢ - β₀ - β₁Xᵢ)²; explicit statement of Gauss-Markov conditions (E(εᵢ)=0, Var(εᵢ)=σ², Cov(εᵢ,εⱼ)=0)
- (a)(ii) All five residual properties proved using normal equations: Σeᵢ=0 from first normal equation; ΣXᵢeᵢ=0 from second; ΣŶᵢeᵢ=0 via substitution; (X̄,Ȳ) on regression line verified
- (b) BIBD parameter relationships bk = vr and r(k−1) = λ(v−1): since k < v, bk = vr gives b > r, hence b ≥ r + 1, and r(k−1) = λ(v−1) gives r > λ, hence r + 1 ≥ λ + 2; for (ii), v ≤ b is Fisher's inequality, and the upper bound on b follows from the same two identities with k < v
- (c) Multiple regression: β̂₁ = (S₁ᵧS₂₂ − S₂ᵧS₁₂)/(S₁₁S₂₂ − S₁₂²) in cross-product notation; when S₁₂ = 0, β̂₁ reduces to S₁ᵧ/S₁₁, the simple regression coefficient of Y on X₁
- (d) Sub-sampling: condition on the first-phase sample s; E(ȳ₁ | s) = E(ȳ₂ | s) = ȳₙ, so E(Ŷₙ) = Ȳ unconditionally; derive the variance via V(Ŷₙ) = V[E(Ŷₙ | s)] + E[V(Ŷₙ | s)] with SRSWOR variances and finite population corrections (note ȳ₁ and ȳ₂ are not independent: given s they are perfectly negatively correlated)
- (e) Efficiency defined as the ratio of error variances (precision) of the two designs; RE(RBD : CRD) = σ̂²(CRD)/σ̂²(RBD), estimated from the RBD analysis as [(r−1)E_b + r(t−1)E_e]/[(rt−1)E_e] with r blocks, t treatments, E_b the block mean square and E_e the error mean square
- Proper mathematical notation throughout: summation limits, subscripts, expectation and variance operators clearly distinguished
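The five residual identities in (a)(ii) hold for any data once the coefficients solve the normal equations; a sketch with illustrative data of my own choosing:

```python
# fit Y = b0 + b1*X by least squares and verify the residual identities
xs = [1.0, 2.0, 3.0, 4.0, 5.0]
ys = [2.1, 3.9, 6.2, 8.1, 9.8]      # illustrative data, not from the paper
n = len(xs)
xbar, ybar = sum(xs) / n, sum(ys) / n

# normal-equation solutions: b1 = Sxy/Sxx, b0 = ybar - b1*xbar
b1 = sum((x - xbar) * (y - ybar) for x, y in zip(xs, ys)) \
     / sum((x - xbar) ** 2 for x in xs)
b0 = ybar - b1 * xbar                # forces the line through (xbar, ybar)

fitted = [b0 + b1 * x for x in xs]
e = [y - f for y, f in zip(ys, fitted)]

checks = (sum(e),                                   # 1: sum of residuals = 0
          sum(ys) - sum(fitted),                    # 2: sum Y = sum Y-hat
          sum(x * ei for x, ei in zip(xs, e)),      # 3: sum X*e = 0
          sum(f * ei for f, ei in zip(fitted, e)))  # 4: sum Y-hat*e = 0
```

All four sums vanish to machine precision, and property 5 is built into the definition of b0.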
Q6 50M derive Multivariate analysis, correlation, cluster sampling, multivariate normal distribution
(a) For a multiple linear regression model with three covariates X₁, X₂ and X₃, let rᵢⱼ denote the correlation coefficient between Xᵢ and Xⱼ. For a data, it was found r₁₂ = 0·77, r₂₃ = 0·52, r₁₃ = 0·72.
(i) Check the consistency of the above data.
(ii) If r₁₃ is unknown, obtain the limits within which r₁₃ lies given the above values for r₁₂ and r₂₃. 20
(b) In cluster sampling with equal size clusters, obtain the unbiased estimate of population mean. Also obtain its sampling variance as
$$V(\bar{\bar{y}}) = \frac{(1-f)(NM-1)S^2}{M^2(N-1)n}\left\{1+(M-1)\rho_{cl}\right\},$$
where notations have their usual meanings. 15
(c) Let Z = (X, Yᵀ)ᵀ be a 3×1 random vector with X scalar and Y of order 2×1, where Z ~ N₃((0, 0, 1)ᵀ, [[1, 2, 1], [2, 5, 2], [1, 2, 2]]).
Show that conditional on X, the two components of Y are independent but marginally they are not.
15
Answer approach & key points
Derive the required mathematical results systematically across all three parts. For part (a)(i)-(ii), apply correlation matrix properties and determinant conditions first, then use partial correlation bounds. For part (b), build the cluster sampling theory from first principles with ANOVA decomposition. For part (c), partition the multivariate normal distribution and derive conditional distributions. Allocate approximately 40% time to part (a) given its 20 marks, 30% each to parts (b) and (c). Structure as: direct derivations without lengthy introductions, clear theorem statements, step-by-step proofs, and boxed final expressions.
- For (a)(i): Verify positive semi-definiteness of correlation matrix by checking det(R) ≥ 0 or all principal minors non-negative; compute 1 - r₁₂² - r₂₃² - r₁₃² + 2r₁₂r₂₃r₁₃ ≥ 0
- For (a)(ii): Derive bounds using r₁₃ ∈ r₁₂r₂₃ ± √[(1−r₁₂²)(1−r₂₃²)] = 0.4004 ± 0.5450, i.e. the interval (−0.145, 0.945)
- For (b): Define cluster sampling estimator ȳ̄ = (1/nM)ΣᵢΣⱼ yᵢⱼ; prove unbiasedness E(ȳ̄) = Ȳ; derive variance via between-cluster and within-cluster SS decomposition
- For (b): Express variance in ICC form using ρcl = (S_b² - S_w²)/(S_b² + (M-1)S_w²) or equivalent definition; manipulate to reach target formula
- For (c): Partition covariance matrix Σ = [[Σ_XX, Σ_XY], [Σ_YX, Σ_YY]]; derive conditional distribution Y|X ~ N(μ_Y + Σ_YXΣ_XX⁻¹(X-μ_X), Σ_YY - Σ_YXΣ_XX⁻¹Σ_XY)
- For (c): Show conditional covariance matrix is diagonal (implying independence given X₁) while marginal covariance Σ_YY is not diagonal
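Both the correlation bound in (a) and the conditional-covariance claim in (c) reduce to a few lines of arithmetic; a sketch (layout and variable names are mine):

```python
import math

# (a) admissible range of r13 given r12 and r23, from det(R) >= 0
r12, r23, r13 = 0.77, 0.52, 0.72
half_width = math.sqrt((1 - r12**2) * (1 - r23**2))
lo, hi = r12 * r23 - half_width, r12 * r23 + half_width
# consistency of the full triple: determinant of the correlation matrix
det = 1 - r12**2 - r23**2 - r13**2 + 2 * r12 * r23 * r13

# (c) conditional covariance of Y given X:
# Sigma_YY - Sigma_YX Sigma_XX^{-1} Sigma_XY, with Sigma_XX = 1, Sigma_YX = (2, 1)
Sxx = 1.0
Syy = [[5.0, 2.0], [2.0, 2.0]]
Syx = [2.0, 1.0]
cond = [[Syy[i][j] - Syx[i] * Syx[j] / Sxx for j in range(2)]
        for i in range(2)]
# cond is the 2x2 identity: components independent given X,
# while Syy itself is not diagonal, so they are marginally dependent
```

The computed interval is roughly (−0.145, 0.945), and det > 0 confirms the stated triple is consistent.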
Q7 50M derive Factorial experiments, principal components, regression estimator
(a) (i) What is confounding in factorial experiments ?
(ii) A $2^6$ factorial experiment is conducted in blocks of size $2^3$. Write the confounded effects such that no main effect or two factor interaction are confounded. Give the list of independent and generalised interactions confounded along with the elements of key block only.
(iii) Give the break-up of degrees of freedom for a $2^n$ factorial experiment in $2^k$ blocks.
(b) What are principal components ? Describe how to compute the principal components of the vectors X₁ = $\begin{bmatrix} 1 \\ 0 \\ -1 \end{bmatrix}$ and X₂ = $\begin{bmatrix} -1 \\ 1 \\ 0 \end{bmatrix}$. Give X₁ and X₂ in terms of the principal components.
(c) Define Regression estimator. Show bias = – Cov ($\bar{x}$, b). Under what conditions is bias negligible ? Find the mean square error of the estimator to first degree of approximation. Give comparison of Regression estimator with Ratio estimator.
Answer approach & key points
Derive the required expressions systematically across all five sub-parts. For (a)(i)-(iii), allocate ~35% time covering the confounding definition, the specific 2^6 in 2^3-blocks construction (eight blocks require three independent generators, e.g. ABD, ACE, BCF, confounding seven effects in all), and the general df breakdown. For (b), spend ~25% time on PCA computation: centre the data, form the cross-product matrix, find eigenvalues (3, 1), eigenvectors, and express X₁, X₂ in PC terms. For (c), allocate ~40% time deriving regression estimator bias, MSE approximation, and comparison with ratio estimator via Cochran's approach. Begin with definitions, proceed through step-by-step derivations, and conclude with clear interpretations.
- (a)(i) Define confounding as mixing of treatment effects with block effects; distinguish complete vs partial confounding
- (a)(ii) Choose three independent interactions, e.g. ABD, ACE, BCF, whose generalized interactions are BCDE, ACDF, ABEF and DEF; the seven confounded effects ABD, ACE, BCF, BCDE, ACDF, ABEF, DEF contain no main effect or two-factor interaction; key (principal) block: (1), abc, ade, bdf, cef, abef, acdf, bcde
- (a)(iii) State df breakdown for a single replicate: total 2ⁿ − 1, split into blocks (the 2ᵏ − 1 confounded effects) and the 2ⁿ − 2ᵏ unconfounded factorial effects; error df arise only on replication or by pooling higher-order interactions
- (b) Define PCs as uncorrelated linear combinations maximizing variance; the vectors are already centred, so compute the cross-product matrix [2 -1; -1 2], eigenvalues λ₁=3, λ₂=1, eigenvectors [1/√2, -1/√2]ᵀ and [1/√2, 1/√2]ᵀ; express X₁ = (1/√2)PC₁ + (1/√2)PC₂, X₂ = (-1/√2)PC₁ + (1/√2)PC₂
- (c) Define regression estimator Ŷ_reg = ȳ + b(X̄ - x̄); derive bias = -Cov(x̄, b), which is O(1/n) and hence negligible for large n; derive MSE ≈ (1/n - 1/N)S²_y(1-ρ²) to first order; compare: MSE(regression) ≤ MSE(ratio) to first order, with equality only when the regression line passes through the origin (β = R = Ȳ/X̄)
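The key block for (a)(ii) can be generated mechanically once generators are fixed; this sketch assumes the generator choice ABD, ACE, BCF, which is one valid option rather than the only one:

```python
from itertools import combinations

# principal (key) block for a 2^6 design in blocks of size 2^3:
# a treatment combination belongs to the key block iff it shares an even
# number of letters with every generator
gens = [set('abd'), set('ace'), set('bcf')]

key_block = [''.join(c) or '(1)'                  # () denotes the control "(1)"
             for r in range(7)
             for c in combinations('abcdef', r)
             if all(len(set(c) & g) % 2 == 0 for g in gens)]
# -> ['(1)', 'abc', 'ade', 'bdf', 'cef', 'abef', 'acdf', 'bcde']
```

The block has exactly 2³ = 8 elements, and the remaining seven blocks follow by multiplying the key block by any treatment combination outside it.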
Q8 50M solve Stratified sampling, polynomial regression, split-plot designs
(a) (i) In stratified sampling under optimum allocation, how will you proceed to select units from different strata, if one or more nᵢ's happens to be greater than Nᵢ (i ≥ 2) ?
(ii) A sample survey was conducted in a certain district of Himachal Pradesh. Four strata A, B, C and D of villages were formed according to the acreage of fruit trees as obtained from revenue records. A random sample of villages was selected from each stratum and the number of apple orchards in each selected village was noted. The data are shown below :
| Stratum | Total number of villages (Nᵢ) | Number of villages in sample (nᵢ) | Number of orchards in the selected villages |
|---------|------------------------------|-----------------------------------|---------------------------------------------|
| A (0 – 3 acres) | 275 | 15 | 2, 5, 1, 9, 6, 7, 0, 4, 7, 0, 5, 0, 0, 3, 0 |
| B (3 – 6 acres) | 146 | 10 | 21, 11, 7, 5, 6, 19, 5, 24, 30, 24 |
| C (6 – 15 acres) | 93 | 12 | 3, 10, 4, 11, 38, 11, 4, 46, 4, 18, 1, 39 |
| D (15 acres and above) | 62 | 11 | 30, 42, 20, 38, 29, 22, 31, 28, 66, 14, 15 |
Estimate the number of orchards in the district.
(b) (i) For a second order polynomial model with one predictor variable, derive the least squares normal equations clearly stating the conditions assumed. How will you interpret the parameters in this model ?
(ii) Describe why it is recommended to work with predictor variables centred around the mean. Comment on fitted values of the response variable in this case. Prove your claim.
(c) What are split-plot designs ? When do you recommend the use of such designs ? If e₁ and e₂ are the main plot and sub-plot errors respectively, both estimated in units of a single sub-plot, explain why e₁ is expected to be larger than e₂.
Answer approach & key points
This multi-part question demands solving numerical problems alongside theoretical derivations and explanations. Allocate approximately 35% effort to part (a) combining optimum allocation adjustment and stratified estimation with Himachal Pradesh data; 35% to part (b) covering polynomial regression derivation, centering benefits, and proof; and 30% to part (c) explaining split-plot designs with error comparison. Structure as: brief theoretical setup → step-by-step calculations/derivations → interpretation of results in context.
- For (a)(i): Explain the iterative adjustment procedure when nᵢ > Nᵢ in optimum allocation—set nᵢ = Nᵢ for such strata, recompute allocation for remaining strata using revised formula, and repeat until all nᵢ ≤ Nᵢ
- For (a)(ii): Calculate stratum sample means ȳᵢ and form the stratified estimate of the district total Ŷ = ΣNᵢȳᵢ = 275(49/15) + 146(152/10) + 93(189/12) + 62(335/11) ≈ 6470 orchards, with a standard error if required
- For (b)(i): Derive normal equations for Y = β₀ + β₁X + β₂X² by minimizing Σ(Yᵢ - β₀ - β₁Xᵢ - β₂Xᵢ²)²; interpret β₀ as response at X=0, β₁ as linear rate of change, β₂ as curvature/acceleration
- For (b)(ii): Explain that centering (X - X̄) eliminates correlation between linear and quadratic terms, stabilizes variance-covariance matrix; prove fitted values remain identical using algebraic expansion showing predicted Y unchanged
- For (c): Define split-plot designs as experiments with two sizes of experimental units where whole plots receive one factor and sub-plots receive another; recommend when one factor is harder/costlier to change; explain e₁ > e₂ due to additional whole-plot error component from main plot-to-main plot variation
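The stratified estimate in (a)(ii) follows from Ŷ = ΣNᵢȳᵢ; a sketch with the survey data transcribed from the table (the dictionary layout is mine):

```python
# stratum: (N_i = total villages, sampled orchard counts)
strata = {
    'A': (275, [2, 5, 1, 9, 6, 7, 0, 4, 7, 0, 5, 0, 0, 3, 0]),
    'B': (146, [21, 11, 7, 5, 6, 19, 5, 24, 30, 24]),
    'C': (93,  [3, 10, 4, 11, 38, 11, 4, 46, 4, 18, 1, 39]),
    'D': (62,  [30, 42, 20, 38, 29, 22, 31, 28, 66, 14, 15]),
}

# stratified estimate of the district total: sum of N_i * (stratum sample mean)
total = sum(N * sum(y) / len(y) for N, y in strata.values())
# -> about 6470 orchards in the district
```

The stratum means are 49/15, 15.2, 15.75 and 335/11, giving an estimated total of roughly 6470 orchards.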