Q4
(a) Consider Poisson distribution $$P_{\theta}(X = j) = \frac{e^{-\theta} \theta^{j}}{j!} = p_{j}, j = 0, 1, 2, ....$$ Let $f_{j}$ be the frequency for X = j and $E(f_{j}) = m_{j} = np_{j}$. Discuss how you obtain minimum chi-square estimate for $\theta$. Does minimum chi-square method necessarily yield a sufficient statistic even if it exists ? (20 marks) (b) (i) Let the joint probability density function of X and Y be $$f(x, y) = C . \exp \{-(4x^{2} + 9y^{2} - xy)\},$$ where C is a constant. Find E(X), V(X), E(Y), V(Y) and the correlation coefficient between X and Y. (10 marks) (ii) If $X_{1}, X_{2}, ..., X_{6}$ are independent random variables such that $$P(X_{i} = -1) = P(X_{i} = 1) = \frac{1}{2}, i = 1, 2, ..., 6,$$ then obtain the value of $$P\left[\sum_{i=1}^{6} X_{i} = 4\right].$$ (5 marks) (c) The following data present the time (in minutes), that a commuter had to wait to catch a bus to reach his destination : Use the sign-test at 0·05 level of significance to test the claim of the bus operators that commuters do not have to wait for more than 15 minutes before the bus is made available to them. [Given Z₍₀.₀₂₅₎ = 1·96, Z₍₀.₀₅₎ = 1·645] (15 marks)
हिंदी में प्रश्न पढ़ें
(a) प्वासों बंटन $$P_{\theta}(X = j) = \frac{e^{-\theta} \theta^{j}}{j!} = p_{j}, j = 0, 1, 2, ....$$ पर विचार कीजिए । मान लीजिए कि X = j की बारम्बारता $f_{j}$ है तथा $E(f_{j}) = m_{j} = np_{j}$ है । आप $\theta$ का न्यूनतम काई-वर्ग आकल कैसे प्राप्त करेंगे, इसकी विवेचना कीजिए । यदि पर्याप्त प्रतिदर्शज का अस्तित्व भी है तो क्या न्यूनतम काई-वर्ग विधि पर्याप्त प्रतिदर्शज अवश्य देगा ? (20 अंक) (b) (i) मान लीजिए कि X तथा Y का संयुक्त प्रायिकता घनत्व फलन $$f(x, y) = C . \exp \{-(4x^{2} + 9y^{2} - xy)\},$$ है, जहाँ C एक अचर है । E(X), V(X), E(Y), V(Y) और X और Y के बीच सहसंबंध गुणांक को ज्ञात कीजिए । (10 अंक) (ii) यदि स्वतंत्र यादृच्छिक चर $X_{1}, X_{2}, ..., X_{6}$ इस प्रकार हैं कि $$P(X_{i} = -1) = P(X_{i} = 1) = \frac{1}{2}, i = 1, 2, ..., 6$$ हैं, तब $$P\left[\sum_{i=1}^{6} X_{i} = 4\right]$$ का मान प्राप्त कीजिए । (5 अंक) (c) निम्नलिखित आँकड़े एक यात्री को उसके गंतव्य तक पहुँचने के लिए बस को पकड़ने के लिए किए गए प्रतीक्षा समय (मिनटों में) को दर्शाते हैं : साइन-परीक्षण का 0·05 सार्थकता स्तर पर उपयोग करते हुए बस संचालकों के द्वारा दावा कि यात्रियों को बस को पकड़ने के लिए 15 मिनट से अधिक प्रतीक्षा नहीं करनी पड़ती, का परीक्षण कीजिए। [दिया गया है Z₍₀.₀₂₅₎ = 1·96, Z₍₀.₀₅₎ = 1·645] (15 अंक)
Directive word: Discuss
This question asks you to discuss. The directive word signals the depth of analysis expected, the structure of your answer, and the weight of evidence you must bring.
See our UPSC directive words guide for a full breakdown of how to respond to each command word.
How this answer will be evaluated
Approach
The directive 'discuss' in part (a) requires a balanced analytical treatment with derivation and critical evaluation, while parts (b) and (c) are primarily computational. Allocate approximately 40% of effort to part (a) given its 20 marks and theoretical depth, 30% to part (b) covering both (i) bivariate normal properties and (ii) probability calculation, and 30% to part (c) for the non-parametric test. Structure as: brief theoretical exposition for (a), systematic derivations for (b), and complete hypothesis testing procedure for (c).
Key points expected
- For (a): Derivation of minimum chi-square estimator for Poisson parameter by minimizing Σ(f_j - np_j)²/(np_j) with respect to θ, leading to the estimating equation
- For (a): Critical discussion that minimum chi-square method does NOT necessarily yield sufficient statistics—contrast with MLE which preserves sufficiency via factorization theorem; cite example where MCS estimator differs from sufficient statistic
- For (b)(i): Recognition of bivariate normal form, completion of squares to identify μ_x = μ_y = 0, extraction of variances σ_x² = 9/35, σ_y² = 4/35 and covariance to find ρ = 1/6
- For (b)(ii): Identification that ΣX_i follows distribution of (number of +1's) - (number of -1's), equivalent to 2×Binomial(6,½) - 6, yielding P(ΣX_i=4) = P(5 successes) = 6/64 = 3/32
- For (c): Correct application of sign test with null hypothesis H₀: median ≤ 15 vs H₁: median > 15, counting positive signs (values > 15), using normal approximation with continuity correction, and proper conclusion based on Z = 1.645 critical value
Evaluation rubric
| Dimension | Weight | Max marks | Excellent | Average | Poor |
|---|---|---|---|---|---|
| Setup correctness | 20% | 10 | Correctly sets up chi-square minimization for (a) with proper likelihood and constraint; identifies bivariate normal structure in (b)(i) with accurate matrix completion; establishes correct Binomial framework for (b)(ii); properly formulates sign test hypotheses and identifies test statistic for (c) | Basic setup present but minor errors in constraint handling for (a), incomplete square completion for (b)(i), or incorrect direction of alternative hypothesis for (c) | Fundamental setup errors: wrong objective function for chi-square, failure to recognize bivariate normal form, incorrect probability model for sum of Rademacher variables, or completely wrong test selection for (c) |
| Method choice | 20% | 10 | Selects appropriate calculus-based optimization for MCS in (a), matrix inversion/completion of squares for (b)(i), exact combinatorial counting for (b)(ii), and correct normal approximation with continuity correction for sign test in (c) | Correct general methods but suboptimal choices: numerical instead of analytical optimization, unnecessary integration instead of pattern recognition, or omission of continuity correction | Inappropriate methods: using MLE instead of MCS in (a), treating variables as independent in (b)(i), using CLT unnecessarily for exact calculation in (b)(ii), or using t-test instead of sign test for (c) |
| Computation accuracy | 20% | 10 | Flawless calculations: correct MCS estimating equation solution, precise variance-covariance extraction yielding ρ = 1/6, exact probability 3/32, and accurate Z-score with proper sign count leading to correct decision | Minor computational slips: arithmetic errors in solving equations, slight miscalculations in correlation coefficient, or small errors in binomial probability | Major computational failures: incorrect derivatives, wrong determinant calculations, probability values outside [0,1], or critical value misapplication causing wrong conclusion |
| Interpretation | 20% | 10 | For (a), clearly explains why MCS may lose sufficiency compared to MLE with concrete reasoning; for (b), interprets correlation magnitude and distributional properties; for (c), correctly interprets p-value in context of bus operator's claim with practical significance | Adequate interpretation but lacking depth: generic statements about sufficiency without comparison, mechanical reporting of statistics without context, or correct but minimal conclusion | Misinterpretation of results: claiming MCS always preserves sufficiency, incorrect sign of correlation interpretation, or reversing null/alternative conclusion in hypothesis test |
| Final answer & units | 20% | 10 | All final answers explicitly stated with proper units where applicable: MCS estimator formula, numerical values for E(X), V(X), E(Y), V(Y), ρ, exact probability fraction, and clear reject/fail-to-reject decision with significance level for sign test | Most answers present but some missing: estimator without final simplified form, correlation without numerical value, or decision without justification | Missing or incorrect final answers: no explicit estimator, incomplete probability distribution results, or no conclusion for hypothesis test despite calculations |
Practice this exact question
Write your answer, then get a detailed evaluation from our AI trained on UPSC's answer-writing standards. Free first evaluation — no signup needed to start.
Evaluate my answer →More from Statistics 2022 Paper I
- Q1 (a) Let X and Y be independent random variables with exponential distribution having respective means $\frac{1}{\lambda_1}$ and $\frac{1}{\…
- Q2 (a) Let a random variable X have exponential distribution with mean 1/θ, θ > 0. To test H₀ : θ = 3 against H₁ : θ = 2, construct sequential…
- Q3 (a) (i) How large a sample must be taken in order that the probability will be at least 0·90 that the sample mean will be within 0·4 – neig…
- Q4 (a) Consider Poisson distribution $$P_{\theta}(X = j) = \frac{e^{-\theta} \theta^{j}}{j!} = p_{j}, j = 0, 1, 2, ....$$ Let $f_{j}$ be the f…
- Q5 (a) Define general linear model with usual assumptions. If y₁ = β₁ + u₁, y₂ = –β₁ + β₂ + u₂, y₃ = –β₂ + u₃, where u₁, u₂, u₃ are mutually i…
- Q6 (a) In a set of two-way classified data according to k levels of factor A and r levels of factor B, there is one observation in each cell.…
- Q7 (a) Consider the following data given for a BIBD with v = b = 4, r = k = 3, λ = 2 and N = 12 : Analyse the design. [Given that : F₃,₅ (0·05…
- Q8 (a) (i) What are orthogonal polynomials ? How do you fit an orthogonal polynomial of degree 'p' ? (ii) For the model Y_(n×1) = X_(n×k) β_(k…