Q3
(a) (i) If X is a random variable with finite variance, show that lim n² P{|X| > n} = 0. n → ∞ (10 marks) (ii) In a certain recruitment test, there are multiple choice questions. There are four possible options to each question, out of which one is correct. The probability of knowing correct option for an intelligent student is 90%, while it is 20% for a weaker student. An intelligent student ticks the correct option. What is the probability that he was guessing ? (10 marks) (b) Determine whether the sequence of mutually independent random variables {Xₙ, n ≥ 1}, in which P(Xₙ = ± n^λ) = 1/(2n^(2λ)) P(Xₙ = 0) = 1 - 1/n^(2λ) (λ < 1/2) obeys Central Limit Theorem (CLT) or not. (15 marks) (c) Define Sequential Probability Ratio Test (SPRT) along with its operating characteristic function and average sample number. Determine SPRT for testing H₀ : θ = 4 against H₁ : θ = 5 in N(θ, 1) with α = 0·5 and β = 0·2. (15 marks)
हिंदी में प्रश्न पढ़ें
(a) (i) यदि X परिमित प्रसरण के साथ एक यादृच्छिक चर है, तो दिखाइए कि lim n² P{|X| > n} = 0. n → ∞ (10 अंक) (ii) किसी एक भर्ती परीक्षा में, बहुविकल्पीय प्रश्न हैं । प्रत्येक प्रश्न में चार संभव विकल्प हैं, जिनमें से एक सही है । एक बुद्धिमान छात्र के सही विकल्प जानने की प्रायिकता 90% है, जबकि एक कमजोर छात्र की केवल 20% है । एक बुद्धिमान छात्र सही विकल्प पर निशान लगाता है । इसके अनुमान से सही विकल्प पर निशान लगाने की प्रायिकता क्या है ? (10 अंक) (b) परीक्षण कीजिए कि परस्पर स्वतंत्र यादृच्छिक चरों की श्रृंखला {Xₙ, n ≥ 1}, जिसमें P(Xₙ = ± n^λ) = 1/(2n^(2λ)) P(Xₙ = 0) = 1 - 1/n^(2λ) (λ < 1/2) केंद्रीय सीमा प्रमेय (CLT) का पालन करती है या नहीं । (15 अंक) (c) अनुक्रमिक प्रायिकता अनुपात परीक्षण (SPRT) को इसके संकारक अभिलक्षण फलन एवं औसत प्रतिदर्श संख्या के साथ परिभाषित कीजिए । N(θ, 1) में H₀ : θ = 4 विरुद्ध H₁ : θ = 5 के परीक्षण के लिए अनुक्रमिक प्रायिकता अनुपात परीक्षण (SPRT), α = 0·5 तथा β = 0·2 के साथ ज्ञात कीजिए । (15 अंक)
Directive word: Prove
This question asks you to prove. The directive word signals the depth of analysis expected, the structure of your answer, and the weight of evidence you must bring.
See our UPSC directive words guide for a full breakdown of how to respond to each command word.
How this answer will be evaluated
Approach
Prove the limit result in (a)(i) using Markov/Chebyshev inequalities; solve (a)(ii) using Bayes' theorem with clear event definitions; for (b), verify Lindeberg condition or Lyapunov's theorem to establish CLT validity; for (c), define SPRT components then derive boundaries A, B and continuation region. Allocate ~20% time to (a)(i), ~15% to (a)(ii), ~30% to (b), and ~35% to (c) given mark distribution. Structure: state definitions → apply methods → derive results → interpret findings.
Key points expected
- (a)(i): Application of Markov's inequality or direct variance bound to show n²P{|X|>n} ≤ E[X²I(|X|>n)] → 0
- (a)(ii): Bayes' theorem setup with events K (knows), G (guesses), C (correct); calculation of P(G|C) = P(C|G)P(G)/P(C)
- (b): Verification of Lindeberg condition or checking variance of sum → ∞; showing standardized sum converges to N(0,1)
- (b): Explicit computation that Var(Sₙ) = Σn^(2λ) → ∞ and Lyapunov condition holds for λ < 1/2
- (c): Definition of SPRT, OC function L(θ), and ASN Eθ(N); derivation of Wald's boundaries A ≈ (1-β)/α, B ≈ β/(1-α)
- (c): Specific SPRT construction for N(θ,1): continuation region as sum(Xi - 4.5) between adjusted log-boundaries
Evaluation rubric
| Dimension | Weight | Max marks | Excellent | Average | Poor |
|---|---|---|---|---|---|
| Setup correctness | 20% | 10 | Correctly defines all random variables, events (K, G, C in a-ii), hypotheses (H₀, H₁ in c), and states conditions (finite variance in a-i, λ<1/2 in b, normal distribution in c); proper notation throughout | Defines most variables correctly but misses some conditions (e.g., forgets independence assumption in b or confuses α, β in c) | Incorrect or missing definitions; confuses P{|X|>n} with P{|X|<n}; wrong hypothesis specification in SPRT |
| Method choice | 20% | 10 | Selects optimal methods: Markov inequality/variance bound for (a-i), Bayes' theorem for (a-ii), Lindeberg/Lyapunov CLT for (b), Wald's SPRT theory for (c); justifies why alternatives are inferior | Uses correct general approaches but suboptimal variants (e.g., Chebyshev instead of Markov for a-i, direct verification instead of Lindeberg for b) | Wrong methods entirely (e.g., tries Chebyshev with wrong scaling for a-i, uses standard CLT without checking conditions in b, treats SPRT as fixed-sample test) |
| Computation accuracy | 20% | 10 | Flawless calculations: correct limit derivation n²P{|X|>n} ≤ ∫_{|x|>n} x²dF → 0; P(G|C) = (0.25×0.1)/(0.9+0.025) = 10/185; Var(Sₙ) = n^(2λ+1)/(2λ+1); A=4, B=0.25; correct continuation inequalities | Minor computational slips (e.g., arithmetic error in Bayes' denominator, off-by-one in variance sum, approximate boundary values without derivation) | Major errors: incorrect probability values (e.g., P(C|G)=1/4 forgotten), wrong variance calculation, boundary formulas swapped or missing log terms |
| Interpretation | 20% | 10 | Explains why finite variance ensures tail decay faster than n⁻²; interprets P(G|C) ≈ 5.4% as low guessing probability despite 10% guess rate; explains why λ<1/2 ensures CLT via variance growth; interprets SPRT's efficiency vs fixed-sample test | States results without full interpretation; mentions CLT holds but doesn't explain role of λ; describes SPRT procedure without comparing to Neyman-Pearson | No interpretation; fails to explain what limit result means for tail behavior; no insight on why CLT fails/works; treats SPRT as black box |
| Final answer & units | 20% | 10 | All boxed/concluded answers: limit equals 0; P(guessing | correct) = 10/185 or 2/37 ≈ 0.054; explicit 'CLT holds' with justification; SPRT with explicit A=4, B=1/4, continuation region in terms of Sₙ and n; proper probabilistic notation throughout | Correct final values but poorly presented; missing explicit conclusion on CLT; SPRT boundaries stated without clear decision rules | Missing or wrong final answers; incomplete SPRT specification; no clear statement of whether CLT applies |
Practice this exact question
Write your answer, then get a detailed evaluation from our AI trained on UPSC's answer-writing standards. Free first evaluation — no signup needed to start.
Evaluate my answer →More from Statistics 2023 Paper I
- Q1 (a) Out of 1000 persons born, only 900 reach the age of 15 years, and out of every 1000 who reach the age of 15 years, 950 reach the age of…
- Q2 (a) Let X, Y, Z be three mutually independent standard exponential variates and W₁ = X + Y + Z, W₂ = (X + Y)/(X + Y + Z), W₃ = X/(X + Y). T…
- Q3 (a) (i) If X is a random variable with finite variance, show that lim n² P{|X| > n} = 0. n → ∞ (10 marks) (ii) In a certain recruitment tes…
- Q4 (a) What is the role of properties of completeness and sufficiency in Statistical Inference ? Explain. In U (0, θ), find out Uniformly Mini…
- Q5 (a) (i) If **X** = (X₁ X₂ X₃)' is distributed as N₃ (μ, Σ), find the distribution of [(X₁ – X₂) (X₂ – X₃)]'. (5 marks) (ii) Suppose that **…
- Q6 (a) Let **X** = (X₁ X₂ X₃)' be distributed as N₃ (μ, Σ), where μ = (2 −1 3)' and Σ = $\begin{pmatrix} 4 & 1 & 0 \\ 1 & 2 & 1 \\ 0 & 1 & 3 \…
- Q7 (a) Discuss the difference between sampling for variables and sampling for attributes with examples. For a qualitative characteristic, find…
- Q8 (a) Differentiate between randomised block design and balanced incomplete block design. In usual notations, for a balanced incomplete block…