Q7
(a) Discuss the difference between sampling for variables and sampling for attributes with examples. For a qualitative characteristic, find an unbiased estimator of population proportion along with its variance when sample is drawn by simple random sampling without replacement. Also obtain an unbiased estimator of this variance. 20 (b) The table given below gives the population and sample sizes, stratum means and variance of a stratified random sample of size 50. Symbols used have their usual meanings. | Stratum Number | Nᵢ | nᵢ | ȳᵢ | sᵢ² | |---|---|---|---|---| | 1 | 30 | 5 | 35 | 36 | | 2 | 50 | 10 | 40 | 49 | | 3 | 60 | 15 | 40 | 81 | | 4 | 60 | 20 | 55 | 144 | Verify that the existing allocation is optimum for given 4 strata. Also calculate the estimate of population variance under this allocation. 15 (c) Differentiate between Simple Random Sampling and Probability Proportional to Size Sampling. How will you draw a PPS sample of size n from a population of size N (n < N) by (i) Cumulative Total Method and (ii) Lahri's Method ? Explain. 15
हिंदी में प्रश्न पढ़ें
(a) चरों के प्रतिचयन एवं गुणात्मक चरों के प्रतिचयन में अंतर का उदाहरणों सहित वर्णन कीजिए। एक गुणात्मक अभिलक्षण के लिए समष्टि अनुपात का अनभिनत आकलक तथा इस आकलक का प्रसरण ज्ञात कीजिए जबकि प्रतिचयन प्रतिस्थापन रहित सरल यादृच्छिक विधि द्वारा किया गया है। इस प्रसरण का अनभिनत आकलक भी निकालिए। 20 (b) नीचे दी गई सारणी में 50 आकार के स्तरीकृत यादृच्छिक प्रतिदर्श के स्तरों का माध्य एवं प्रसरण तथा स्तरों की समष्टि का आकार तथा स्तरों से चयनित प्रतिदर्शी आकारों को दिया गया है। चिह्नों को उनके सामान्य अर्थों में प्रयुक्त किया गया है। | स्तर संख्या | Nᵢ | nᵢ | ȳᵢ | sᵢ² | |:---:|:---:|:---:|:---:|:---:| | 1 | 30 | 5 | 35 | 36 | | 2 | 50 | 10 | 40 | 49 | | 3 | 60 | 15 | 40 | 81 | | 4 | 60 | 20 | 55 | 144 | प्रमाणित कीजिए कि दिए गए 4 स्तरों के लिए मौजूदा आवंटन इष्टतम है। समष्टि प्रसरण का इस आवंटन के सापेक्ष आकलक भी ज्ञात कीजिए। 15 (c) सरल यादृच्छिक प्रतिचयन तथा आकार अनुपातिक प्रायिकता प्रतिचयन में विभेद कीजिए । एक n आकार के आकार अनुपातिक प्रायिकता प्रतिदर्श को आप (i) संचयी योग विधि तथा (ii) लाहिरी विधि द्वारा N (n < N) आकार की समष्टि से कैसे चुनेंगे ? स्पष्ट कीजिए । 15
Directive word: Discuss
This question asks you to discuss. The directive word signals the depth of analysis expected, the structure of your answer, and the weight of evidence you must bring.
See our UPSC directive words guide for a full breakdown of how to respond to each command word.
How this answer will be evaluated
Approach
Begin with a clear conceptual distinction in part (a) between variables (quantitative) and attributes (qualitative) with Indian examples like agricultural yield vs literacy status. Derive the unbiased estimator p̂ = n'/n for population proportion and its variance V(p̂) = (N-n)/(N-1) · p(1-p)/n, then obtain unbiased estimator v(p̂). For part (b), verify Neyman optimum allocation by checking if nᵢ ∝ NᵢSᵢ/√cᵢ (assuming equal costs), then compute V(ȳ_st). For part (c), contrast SRS with PPS on selection probability basis, then detail both Cumulative Total and Lahri's methods with numerical illustration. Allocate approximately 40% time to part (a), 30% each to (b) and (c) based on marks distribution.
Key points expected
- Part (a): Clear distinction between sampling for variables (measurable quantities like income, yield) vs attributes (dichotomous characteristics like employment status, disease presence) with appropriate Indian examples
- Part (a): Derivation of unbiased estimator p̂ = n'/n for population proportion P, its variance V(p̂) = (N-n)/(N-1) · P(1-P)/n under SRSWOR, and unbiased estimator of variance v(p̂) = (N-n)/(N-1) · p̂(1-p̂)/(n-1)
- Part (b): Verification of Neyman optimum allocation condition nᵢ/n = NᵢSᵢ/ΣNⱼSⱼ using given data; calculation showing existing allocation matches or approximates this ratio
- Part (b): Computation of stratified mean estimate ȳ_st = ΣWᵢȳᵢ where Wᵢ = Nᵢ/N, and population variance estimate V(ȳ_st) = ΣWᵢ²(Nᵢ-nᵢ)/(Nᵢnᵢ) · sᵢ²
- Part (c): Systematic comparison of SRS (equal probability) vs PPS (probability ∝ size) on grounds of efficiency, especially for skewed populations like industrial output or agricultural holdings
- Part (c): Step-wise description of Cumulative Total Method: list cumulative totals, select random numbers between 1 and ΣXᵢ, identify selected units
- Part (c): Step-wise description of Lahri's Method: select random number i from 1 to N and random number j from 1 to M (M=max size), accept if j ≤ Xᵢ, else reject and repeat
Evaluation rubric
| Dimension | Weight | Max marks | Excellent | Average | Poor |
|---|---|---|---|---|---|
| Setup correctness | 20% | 10 | Correctly identifies all parameters for each sub-part: for (a) defines population proportion P and sample proportion p̂; for (b) computes N=200, stratum weights Wᵢ, and sets up Neyman allocation verification framework; for (c) clearly defines size measures Xᵢ and selection probabilities for both PPS methods | Identifies most parameters but makes minor errors in defining finite population correction or stratum weights; incomplete setup for one sub-part | Confuses variables vs attributes, misidentifies stratum parameters, or fails to define size measures for PPS; major conceptual errors in problem setup |
| Method choice | 20% | 10 | Selects appropriate methodological frameworks: hypergeometric/SRSWOR theory for (a), Neyman optimum allocation formula for (b), and correct algorithmic descriptions for both PPS methods in (c); justifies why each method is suitable | Uses correct general methods but misses nuances like finite population correction in (a) or assumes equal costs without mention in (b); describes one PPS method adequately but other superficially | Applies wrong methods (e.g., SRSWR instead of SRSWOR, proportional instead of optimum allocation); confuses cumulative total with Lahri's method or describes incorrect algorithms |
| Computation accuracy | 20% | 10 | All calculations precise: for (a) correct variance formula with FPC; for (b) accurate verification that nᵢ ∝ Nᵢsᵢ (computing N₁s₁=180, N₂s₂=350, N₃s₃=540, N₄s₄=720, ratios 5:10:15:20 match 180:350:540:720 approximately), correct V(ȳ_st) = ΣWᵢ²(Nᵢ-nᵢ)sᵢ²/(Nᵢnᵢ) = 0.0025×30 + 0.00196×35 + 0.0015×36 + 0.001×54 = 0.075+0.0686+0.054+0.054 ≈ 0.2516; for (c) numerical illustration of both methods | Correct approach but arithmetic errors in variance calculation or allocation verification; partial numerical demonstration for PPS methods | Major computational errors: incorrect stratum variance formula, wrong weights, or failure to perform any numerical verification; no worked example for PPS methods |
| Interpretation | 20% | 10 | Interprets results meaningfully: explains why optimum allocation improves precision over proportional allocation in (b); discusses efficiency gains of PPS over SRS for skewed populations (e.g., Indian industrial survey where few large units dominate); notes practical constraints like cost and feasibility in allocation decisions | Provides basic interpretation without depth; mentions efficiency but doesn't relate to specific context or data characteristics; limited discussion of practical implications | No interpretation of numerical results; fails to explain why methods differ in efficiency or when to prefer one over another; purely mechanical presentation |
| Final answer & units | 20% | 10 | All final answers clearly stated with appropriate units/notations: variance expressions in (a), explicit conclusion that allocation is approximately optimum with calculated V(ȳ_st) ≈ 0.252 in (b), and complete algorithmic summaries for both PPS methods in (c); proper mathematical notation throughout | Final answers present but poorly organized or missing units; incomplete presentation of one sub-part's conclusions; some notational inconsistencies | Missing final answers for one or more sub-parts; no clear conclusion on optimum allocation; algorithms described without final summary; messy or illegible presentation |
Practice this exact question
Write your answer, then get a detailed evaluation from our AI trained on UPSC's answer-writing standards. Free first evaluation — no signup needed to start.
Evaluate my answer →More from Statistics 2023 Paper I
- Q1 (a) Out of 1000 persons born, only 900 reach the age of 15 years, and out of every 1000 who reach the age of 15 years, 950 reach the age of…
- Q2 (a) Let X, Y, Z be three mutually independent standard exponential variates and W₁ = X + Y + Z, W₂ = (X + Y)/(X + Y + Z), W₃ = X/(X + Y). T…
- Q3 (a) (i) If X is a random variable with finite variance, show that lim n² P{|X| > n} = 0. n → ∞ (10 marks) (ii) In a certain recruitment tes…
- Q4 (a) What is the role of properties of completeness and sufficiency in Statistical Inference ? Explain. In U (0, θ), find out Uniformly Mini…
- Q5 (a) (i) If **X** = (X₁ X₂ X₃)' is distributed as N₃ (μ, Σ), find the distribution of [(X₁ – X₂) (X₂ – X₃)]'. (5 marks) (ii) Suppose that **…
- Q6 (a) Let **X** = (X₁ X₂ X₃)' be distributed as N₃ (μ, Σ), where μ = (2 −1 3)' and Σ = $\begin{pmatrix} 4 & 1 & 0 \\ 1 & 2 & 1 \\ 0 & 1 & 3 \…
- Q7 (a) Discuss the difference between sampling for variables and sampling for attributes with examples. For a qualitative characteristic, find…
- Q8 (a) Differentiate between randomised block design and balanced incomplete block design. In usual notations, for a balanced incomplete block…