Question

(a) Define time series. For a moving-average process with weights {a₁, a₂, ..., aₘ} of random components {eᵢ, i = 1, 2, ...}, where eᵢ's are i.i.d. N(0, σ²), obtain the correlogram function. Find its form, when all the weights are equal and their sum is 1. (15 marks)

(b) The marks obtained by student A in Mathematics and Language tests of maximum marks 150 each are 120 and 105 respectively. Find out in which subject, student A is more able as compared to other students based on the measure of T score. The following table gives a sample of marks obtained by 15 students of the same class :

Score in Mathematics | Score in Language
---|---
100 | 67
75 | 63
88 | 73
85 | 77
92 | 60
94 | 53
93 | 50
84 | 48
67 | 38
96 | 73
100 | 36
102 | 45
94 | 47
73 | 39
83 | 56

(15 marks)
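
The T scores asked for in part (b) can be cross-checked numerically. The following is a minimal Python sketch, not an official solution: it assumes the usual definition T = 50 + 10(X − X̄)/S and the sample standard deviation with the n − 1 denominator. The names `t_score`, `maths` and `language` are illustrative.

```python
from statistics import mean, stdev

# Marks of the 15 sampled students, copied from the table above.
maths = [100, 75, 88, 85, 92, 94, 93, 84, 67, 96, 100, 102, 94, 73, 83]
language = [67, 63, 73, 77, 60, 53, 50, 48, 38, 73, 36, 45, 47, 39, 56]


def t_score(x, sample):
    """T = 50 + 10 * (x - mean) / sd, where sd uses the n - 1 denominator (assumption)."""
    return 50 + 10 * (x - mean(sample)) / stdev(sample)


# Student A scored 120 in Mathematics and 105 in Language.
t_maths = t_score(120, maths)
t_language = t_score(105, language)

print(f"Mathematics: mean={mean(maths):.1f}, sd={stdev(maths):.2f}, T={t_maths:.1f}")
print(f"Language:    mean={mean(language):.1f}, sd={stdev(language):.2f}, T={t_language:.1f}")
```

Under these assumptions the sample means are 88.4 and 55.0, and the sketch gives T ≈ 80.1 for Mathematics and T ≈ 87.3 for Language, so student A stands relatively higher in Language. Using the population standard deviation (denominator n) changes the values slightly but not the ordering.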

(c) Describe the 2-stage least squares (2SLS) method of estimation of parameters in linear regression model. Also, state the assumptions and discuss its properties. (20 marks)

UPSC Answer Check · Accepted Answer

The directive 'describe' calls for a systematic exposition with technical precision. Suggested allocation of time and space, in line with the marks: (a) 30%: define a time series rigorously, derive the MA(m) autocorrelation structure, and simplify to the equal-weights case, showing the linear (triangular) decay of the correlogram; (b) 30%: calculate the sample means and standard deviations, compute the T scores for both subjects, and interpret student A's relative standing; (c) 40%: set out the 2SLS algorithm (first-stage reduced form, second-stage structural equation), list the assumptions in full (instrument relevance, exogeneity, rank condition), and establish consistency and asymptotic normality. Conclude with a comparison of 2SLS and OLS in simultaneous-equations settings relevant to Indian economic policy evaluation.
- For (a): Formal definition of a time series as a sequence of random variables ordered in time; derivation of the autocovariance γ(k) = σ²Σaᵢaᵢ₊ₖ (sum over i = 1, ..., m−k) for 0 ≤ k < m, with γ(−k) = γ(k) and γ(k) = 0 for |k| ≥ m; correlogram ρ(k) = γ(k)/γ(0), where γ(0) = σ²Σaᵢ²; special case aᵢ = 1/m yielding ρ(k) = (m−|k|)/m for |k| < m and zero otherwise
- For (b): Correct computation of the sample means (x̄_M = 88.4, x̄_L = 55.0) and sample standard deviations with the n−1 denominator (s_M ≈ 10.50, s_L ≈ 13.40); T-score formula T = 50 + 10×(X−X̄)/S; calculation yielding T_M ≈ 80.1 and T_L ≈ 87.3; correct interpretation that the higher T score in Language indicates better relative performance despite lower absolute marks
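- For (c): Complete 2SLS procedure: in stage 1, regress each endogenous regressor on all exogenous and instrumental variables; in stage 2, replace the endogenous regressors by their fitted values in the structural equation and apply OLS; explicit assumptions (linearity, instrument exogeneity E(Z'u) = 0, instrument relevance with E(Z'X) of full column rank, at least as many instruments as endogenous regressors, no perfect multicollinearity among the instruments)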
- For (c): Derivation of properties: consistency via the law of large numbers and the continuous mapping theorem; asymptotic normality, with estimated variance σ̂²(X'P_ZX)⁻¹ under homoskedasticity, where P_Z = Z(Z'Z)⁻¹Z' is the projection matrix onto the instrument space; comparison with the inconsistency of OLS under simultaneity
- For (c): Practical illustration such as estimating agricultural supply response where price is endogenous—using rainfall/transport cost as instruments, relevant to Indian agricultural policy analysis
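
The two-stage procedure and the closed-form estimator β̂ = (X'P_ZX)⁻¹X'P_Zy can be sketched on simulated data. This is an illustration under stated assumptions, not part of the model answer: the data-generating process, the coefficient values (true slope 1.5), the sample size and all variable names are hypothetical, and the single instrument `z` is valid by construction.

```python
import numpy as np

rng = np.random.default_rng(0)
n = 20_000

# Hypothetical data: x is endogenous because its noise v is correlated with u.
z = rng.normal(size=n)                   # instrument: relevant, and independent of u
u = rng.normal(size=n)                   # structural error
v = 0.8 * u + 0.6 * rng.normal(size=n)   # first-stage error, correlated with u
x = 1.0 * z + v                          # first-stage (reduced-form) relation
y = 2.0 + 1.5 * x + u                    # structural equation, true slope 1.5

X = np.column_stack([np.ones(n), x])     # regressors, including the constant
Z = np.column_stack([np.ones(n), z])     # instruments, including the constant

# Stage 1: regress every column of X on Z and keep the fitted values P_Z X.
pi_hat, *_ = np.linalg.lstsq(Z, X, rcond=None)
X_hat = Z @ pi_hat

# Stage 2: regress y on the fitted values.
beta_2sls, *_ = np.linalg.lstsq(X_hat, y, rcond=None)

# Closed form (X' P_Z X)^{-1} X' P_Z y, using X' P_Z = X_hat'.
beta_closed = np.linalg.solve(X_hat.T @ X, X_hat.T @ y)

# OLS on the structural equation, for comparison.
beta_ols, *_ = np.linalg.lstsq(X, y, rcond=None)

print("2SLS (two-stage):  ", beta_2sls)
print("2SLS (closed form):", beta_closed)
print("OLS:               ", beta_ols)
```

In this setup the two 2SLS computations coincide, while the OLS slope is pulled away from 1.5 by the correlation between x and u. Note that residuals for the variance estimate should be formed as y − Xβ̂ with the original X, not with the stage-1 fitted values.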

Dimension | Weight | Max marks | Excellent | Average | Poor
---|---|---|---|---|---
Setup correctness | 20% | 10 | For (a): correct MA(m) autocovariance formula with proper summation limits and lag notation; for (b): accurate identification of the T score as a standardized measure with mean 50 and SD 10; for (c): precise distinction between structural and reduced forms, correct specification of the projection matrix | Partial setup with minor notational errors (e.g., missing absolute value in the lag, confusion between population and sample quantities) or incomplete reduced-form specification in 2SLS | Fundamental setup errors: wrong autocovariance formula, treating the T score as a Z score, or conflating 2SLS with indirect least squares without proper justification
Method choice | 20% | 10 | Optimal methods: autocorrelation derived via the expectation of a product; sample standard deviation with n−1 denominator; explicit two-stage regression algebra in matrix notation; instrument validity tests mentioned | Correct but inefficient methods (e.g., population SD formula, scalar-only presentation of 2SLS), or missing discussion of why 2SLS is preferred to OLS in simultaneous systems | Inappropriate methods: simple correlation instead of autocorrelation, raw score comparison ignoring variability, or attempting OLS on a clearly simultaneous system
Computation accuracy | 20% | 10 | Numerically exact: for uniform weights ρ(k) = 1−\|k\|/m; T scores within ±0.5 of the true values; 2SLS variance formula correctly derived with all matrix operations explicit | Minor computational slips (e.g., rounding T scores to the nearest integer, sign error in the autocorrelation for negative lags, correct final formula but arithmetic errors in the illustration) | Major computational failures: wrong correlogram shape, T scores computed with the wrong mean or SD, or algebraic errors in the 2SLS consistency proof that invalidate the conclusion
Interpretation | 20% | 10 | For (a): explains why MA(m) has finite memory and how uniform weights create linear decay; for (b): correctly identifies Language as the better relative performance despite lower absolute marks, discusses normative implications; for (c): interprets 2SLS as an IV estimator, discusses efficiency loss versus consistency gain, cites Indian econometric applications | Correct but superficial interpretation: states results without explaining their economic or statistical significance, or gives a generic discussion of 2SLS without context-specific insight | Misinterpretation: claims the MA process has long memory, concludes Mathematics is better because of higher raw marks, or presents 2SLS as universally superior without noting its efficiency cost
Final answer & units | 20% | 10 | All final answers boxed or highlighted: correlogram formula ρ(k) = γ(k)/γ(0) = [σ²(m−\|k\|)/m²] / [σ²/m] = (m−\|k\|)/m for \|k\|<m; explicit T-score values with the subject conclusion; 2SLS estimator formula β̂_2SLS = (X'P_ZX)⁻¹X'P_Zy with assumptions checklist and properties summary | Final answers present but poorly formatted or with an incomplete properties list (ρ and the T scores are dimensionless, so omitting units is acceptable) | Missing final answers, wrong boxed results, or no conclusion on relative subject ability; 2SLS description without the estimator formula or properties
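
The equal-weights correlogram quoted in the rubric can be checked directly from the general formula ρ(k) = Σaᵢaᵢ₊ₖ / Σaᵢ². The following is a minimal sketch assuming i.i.d. errors with common variance σ² (which cancels in the ratio); the function name `ma_correlogram` and the choice m = 5 are illustrative only.

```python
def ma_correlogram(weights):
    """Return [rho(0), ..., rho(m-1)] for a moving average with the given m weights.

    rho(k) = sum_{i=1}^{m-k} a_i * a_{i+k} / sum_{i=1}^{m} a_i^2 for 0 <= k < m;
    rho(k) = 0 for k >= m and rho(-k) = rho(k), so only these lags are returned.
    """
    m = len(weights)
    gamma0 = sum(a * a for a in weights)  # gamma(0) divided by sigma^2
    return [
        sum(weights[i] * weights[i + k] for i in range(m - k)) / gamma0
        for k in range(m)
    ]


m = 5
rho = ma_correlogram([1 / m] * m)  # equal weights summing to 1

for k, r in enumerate(rho):
    print(f"k={k}: rho={r:.3f}, 1 - k/m={1 - k / m:.3f}")
```

For equal weights each product aᵢaᵢ₊ₖ equals 1/m² and there are m − k of them, so the computed values agree with 1 − k/m up to floating-point rounding, falling linearly from 1 at lag 0 to 1/m at lag m − 1.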

Q8

Directive word: Describe

More from Statistics 2025 Paper II