Q4
(a) Find the most powerful test of size α(= 0·05) for testing H₀: μ = 0 vs. H₁: μ = 1, given a random sample of size 25 from N(μ, 16) population. (20 marks) (b) A lot consists of some defective items. A random sample of 25 items has 6 defective items with probability p₁ = θ and 19 non-defective items with probability p₂ = 1 – θ. Then estimate θ using the following: (i) MLE method (ii) Minimum χ²-method (iii) Modified minimum χ²-method (15 marks) (c) Differentiate between Mann-Whitney U-test and Wilcoxon sign test. The following data pertain to APGAR scores of 15 pregnant women in two care programmes A and B: Programme A : 8 7 6 2 5 8 7 3 Programme B : 9 9 7 8 10 9 6 Is there a significant difference in APGAR scores of pregnant women under the two care programmes? [Given, U₍₀.₀₅₎ = 10] (15 marks)
हिंदी में प्रश्न पढ़ें
(a) H₀: μ = 0 विरुद्ध H₁: μ = 1 के परीक्षण के लिए, α(= 0·05) आमाप का शक्तम परीक्षण प्राप्त कीजिए, जबकि 25 आमाप का एक यादृच्छिक प्रतिदर्श N(μ, 16) समष्टि से लिया गया है। (20 अंक) (b) एक प्रचय में कुछ दोषपूर्ण वस्तुएँ हैं। 25 वस्तुओं के एक यादृच्छिक प्रतिदर्श में 6 दोषपूर्ण वस्तुएँ हैं, जिसकी प्रायिकता p₁ = θ है और 19 दोष रहित वस्तुएँ हैं, जिसकी प्रायिकता p₂ = 1 – θ है। तब निम्न का उपयोग करके θ का आकलन कीजिए : (i) MLE विधि (ii) न्यूनतम χ²-विधि (iii) आपरिवर्तित न्यूनतम χ²-विधि (15 अंक) (c) मैन-हिटनी U-परीक्षण और विल्कॉक्सन चिह्न परीक्षण के बीच अंतर कीजिए। निम्नलिखित आँकड़े दो देखभाल कार्यक्रमों A और B में 15 गर्भवती महिलाओं के APGAR स्कोरों से सम्बन्धित हैं : कार्यक्रम A : 8 7 6 2 5 8 7 3 कार्यक्रम B : 9 9 7 8 10 9 6 क्या दोनों देखभाल कार्यक्रमों के अन्तर्गत गर्भवती महिलाओं के APGAR स्कोरों में सार्थक अंतर है? [दिया गया है, U₍₀.₀₅₎ = 10] (15 अंक)
Directive word: Solve
This question asks you to solve. The directive word signals the depth of analysis expected, the structure of your answer, and the weight of evidence you must bring.
See our UPSC directive words guide for a full breakdown of how to respond to each command word.
How this answer will be evaluated
Approach
Solve this multi-part numerical problem by allocating approximately 40% time to part (a) given its 20 marks, 30% to part (b) covering three estimation methods, and 30% to part (c) involving both differentiation and non-parametric testing. Structure as: (a) derive Neyman-Pearson lemma application with critical region, (b) present three estimation approaches with clear derivations, (c) tabulate differences then perform Mann-Whitney U-test with ranking and decision.
Key points expected
- Part (a): Apply Neyman-Pearson lemma to derive most powerful test; identify critical region as sample mean > k; compute critical value k = 0.4 + 1.645×(4/5) = 1.716 using Z-test; state rejection rule and power function
- Part (b)(i): Derive MLE as θ̂ = 6/25 = 0.24 using binomial likelihood L(θ) = θ⁶(1-θ)¹⁹ and log-likelihood differentiation
- Part (b)(ii): Set up minimum χ² by minimizing Σ(Oi-Ei)²/Ei; show equivalence to MLE in this case or derive modified form with expected frequencies 25θ and 25(1-θ)
- Part (b)(iii): Apply modified minimum χ² using weights in denominator; demonstrate Neyman modification or minimum logit χ² approach if applicable
- Part (c): Differentiate Mann-Whitney (two independent samples, ordinal/rank data) vs Wilcoxon signed-rank (paired/matched samples); correctly identify independent samples scenario here
- Part (c) computation: Rank pooled data (1-15), compute sum of ranks for smaller sample (Programme B, n=7), calculate U = 56 - 28 = 28 or U' = 49 - 28 = 21; compare with critical value U₀.₀₅ = 10; conclude no significant difference since U > 10
Evaluation rubric
| Dimension | Weight | Max marks | Excellent | Average | Poor |
|---|---|---|---|---|---|
| Setup correctness | 20% | 10 | Correctly identifies all distributions and parameters: N(μ,16) with σ=4 known for (a), Binomial(25,θ) for (b), independent samples for Mann-Whitney in (c); states hypotheses with correct inequalities and significance level α=0.05 | Identifies most distributions correctly but confuses paired vs independent samples in (c) or states hypotheses without proper notation; minor errors in parameter specification | Wrong distribution assumptions (e.g., uses t-test instead of Z-test in (a), treats (b) as Poisson); fails to state hypotheses or uses incorrect significance level |
| Method choice | 20% | 10 | Applies Neyman-Pearson lemma correctly for (a); selects appropriate estimation methods for (b) with clear distinction between three approaches; correctly chooses Mann-Whitney U-test over Wilcoxon signed-rank for (c) with proper justification | Uses correct general methods but applies Neyman-Pearson without full justification; estimates θ using MLE correctly but confuses minimum χ² variants; chooses correct test in (c) without clear rationale | Uses likelihood ratio test instead of Neyman-Pearson for (a); applies method of moments or wrong estimation technique for (b); uses Wilcoxon signed-rank incorrectly for independent samples in (c) |
| Computation accuracy | 20% | 10 | Accurate calculations throughout: critical value k=1.716 or Z=1.645 for (a); θ̂=0.24 with correct second-order condition for MLE, proper χ² minimization for (ii)-(iii); correct ranking, rank sums (RA=46, RB=59), U=17.5 or U=17, U'=17.5 for (c) | Minor arithmetic errors in critical value or rank sums; correct MLE but algebraic errors in χ² derivatives; ranking correct but U calculation formula slightly wrong | Major computational errors: wrong critical region, incorrect θ estimate, confused ranking with ties handled incorrectly, wrong U formula application leading to invalid conclusion |
| Interpretation | 20% | 10 | Clear interpretation: states rejection region and power for (a); explains why MLE and minimum χ² coincide for (b); interprets U statistic correctly, compares with critical value, states p-value direction, and gives contextual conclusion about APGAR programmes | States results without full interpretation; mentions significance but lacks context; partial explanation of method equivalence in (b); conclusion present but not well-linked to maternal health context | No interpretation of results; fails to compare test statistic with critical value; no conclusion about hypothesis or practical significance; missing context entirely |
| Final answer & units | 20% | 10 | All final answers boxed/highlighted: critical region X̄ > 1.716 with power 0.638; θ̂=0.24 (MLE), identical or properly derived modified estimate; U=17.5 (or 17), decision 'Do not reject H₀' with clear statement that no significant difference exists between programmes at 5% level | Most answers present but not clearly demarcated; missing power calculation or final decision statement; units correct but presentation inconsistent | Missing final answers or incorrect decisions; wrong conclusion about significance; no clear statement of test outcome; units omitted where relevant (e.g., APGAR score interpretation) |
Practice this exact question
Write your answer, then get a detailed evaluation from our AI trained on UPSC's answer-writing standards. Free first evaluation — no signup needed to start.
Evaluate my answer →More from Statistics 2024 Paper I
- Q1 (a) Two events A and B are such that P(A) = 1/3, P(B) = 1/4 and P(A|B) + P(B|A) = 2/3. Evaluate the following: (i) P(A^c ∪ B^c) (5 marks) (…
- Q2 (a) Let the joint probability density function of two random variables X and Y be f(x, y) = x/3, for 0 < 2x < 3y < 6; 0, otherwise. Compute…
- Q3 (a) Let moment generating function of random variable X exist in the neighbourhood of zero and if $$E(X^n) = \frac{1}{5} + (-1)^n \frac{2}{…
- Q4 (a) Find the most powerful test of size α(= 0·05) for testing H₀: μ = 0 vs. H₁: μ = 1, given a random sample of size 25 from N(μ, 16) popul…
- Q5 (a) How will you justify the usage of the principle of least squares in estimating the parameters of a linear regression model? With usual…
- Q6 (a) (X, Y) has bivariate normal distribution BN(μ₁, μ₂, σ₁², σ₂², ρ). (i) Show that X and Y are independent if and only if ρ = 0. (6 marks)…
- Q7 (a) A very big population is divided into two strata. The allocation of units of stratified random sample of size n for the two strata unde…
- Q8 (a) A 2²-factorial design was used to develop the yield of a crop. Two factors A and B were used at two levels: low (–1) and high (+1). The…