Q6
(a) Explain Akaike information criterion for order selection in an ARMA (p, q) process. 15 marks (b) Define autocorrelation coefficient. What are its consequences for ordinary least squares? Discuss the maximum likelihood estimation of the model, in the usual notations, Y = Xβ + u with AR (autoregressive)(1) disturbance. 20 marks (c) Explain the method of collection of industrial data. Describe the (i) official publications for data collection and (ii) statistics collected by the various official agencies pertaining to industrial production. 15 marks
हिंदी में प्रश्न पढ़ें
(a) एक ए० आर० एम० ए० (p, q) प्रक्रम में क्रम चयन के लिए अकैके सूचना मानदंड की व्याख्या कीजिए। 15 (b) स्वसहसंबंध गुणांक को परिभाषित कीजिए। साधारण न्यूनतम वर्गों के लिए इसके परिणाम क्या हैं? प्रचलित संकेतों में, ए० आर० (स्वसमाश्रयी)(1) विघोष के साथ, निदर्श Y = Xβ + u के अधिकतम संभाविता आकलन का वर्णन कीजिए। 20 (c) औद्योगिक आँकड़ों के संग्रह की विधि की व्याख्या कीजिए। (i) आँकड़ों के संग्रह के लिए राजकीय प्रकाशनों का और (ii) औद्योगिक उत्पादन से संबंधित विभिन्न राजकीय एजेंसियों द्वारा एकत्र किये गये आँकड़ों का वर्णन कीजिए। 15
Directive word: Explain
This question asks you to explain. The directive word signals the depth of analysis expected, the structure of your answer, and the weight of evidence you must bring.
See our UPSC directive words guide for a full breakdown of how to respond to each command word.
How this answer will be evaluated
Approach
Explain the theoretical foundations across all three parts with appropriate mathematical derivations where required. Allocate approximately 30% effort to part (a) on AIC and ARMA order selection, 40% to part (b) on autocorrelation and MLE estimation given its higher marks, and 30% to part (c) on industrial data collection methods. Structure with clear sectional headings, begin each part with precise definitions, develop through step-by-step reasoning, and conclude with practical implications or limitations.
Key points expected
- Part (a): Definition of AIC as -2log(L) + 2k where L is likelihood and k is number of parameters; trade-off between goodness-of-fit and model complexity; comparison with BIC/AICc; application to ARMA(p,q) via minimization over candidate orders
- Part (b): Autocorrelation coefficient ρ_k = Cov(u_t, u_{t-k})/Var(u_t); consequences for OLS: biased standard errors, inefficient estimates, invalid t/F tests; MLE derivation for AR(1) errors with transformation matrix Ω, concentrated likelihood, and iterative estimation
- Part (c): Methods: census vs sample surveys, ASI (Annual Survey of Industries) schedule, establishment surveys; Official publications: ASI Summary Results, Index of Industrial Production (IIP), Economic Census; Agencies: CSO (now NSO), DIPP, Labour Bureau, RBI industrial data
- Mathematical rigor: Proper likelihood functions, matrix notation for GLS transformation, stationarity conditions for AR(1) parameter |ρ| < 1
- Applied context: Indian industrial statistics system, ASI coverage of registered manufacturing, limitations of informal sector data
Evaluation rubric
| Dimension | Weight | Max marks | Excellent | Average | Poor |
|---|---|---|---|---|---|
| Setup correctness | 20% | 10 | Correctly defines AIC formula with proper likelihood specification for (a); accurately states autocorrelation definition and AR(1) error structure u_t = ρu_{t-1} + ε_t for (b); precisely identifies ASI, NSS, and relevant Indian statistical machinery for (c) | Basic definitions present but with minor notational errors; AR(1) structure vaguely described; lists some agencies but confuses their functions | Incorrect AIC formula, wrong autocorrelation definition, or fundamental misunderstanding of Indian industrial data sources; omits key institutional names |
| Method choice | 20% | 10 | For (a) explains penalization rationale and order selection algorithm; for (b) derives GLS/ML approach with proper Ω matrix and Prais-Winsten transformation; for (c) distinguishes between census, survey, and administrative data methods with ASI methodology | Mentions MLE and GLS without full derivation; describes data collection superficially without methodological distinctions | Confuses AIC with information criteria properties; applies OLS to autocorrelated errors without correction; conflates different data collection mechanisms |
| Computation accuracy | 20% | 10 | Accurate log-likelihood expression for ARMA: log L = -T/2 log(2π) - T/2 log(σ²) - 1/2 log|Ω| - 1/2σ² u'Ω^{-1}u; correct concentrated likelihood and first-order conditions; proper variance formulas under AR(1) | Correct general likelihood form but errors in matrix specifics or missing |Ω| term; incomplete derivation steps | Major computational errors in likelihood construction; incorrect variance formulas; no derivation attempted for MLE |
| Interpretation | 20% | 10 | Interprets AIC's bias-variance tradeoff and overfitting prevention; explains why OLS becomes BLUE only when ρ=0 and efficiency loss magnitude; evaluates reliability of Indian industrial statistics, informal sector gaps, and ASI limitations | Generic statements about model selection and autocorrelation problems; descriptive account of data sources without critical evaluation | No interpretation of why methods matter; fails to discuss consequences or limitations; purely descriptive without analytical insight |
| Final answer & units | 20% | 10 | Synthesizes across parts: connects model selection (a) with estimation under misspecification (b) and data quality implications (c); notes consistency of MLE vs AIC's asymptotic properties; concludes with integrated view on statistical reliability for policy | Concludes each part separately without cross-references; missing synthesis of time series and industrial statistics themes | No conclusion or abrupt ending; parts treated as disconnected questions; missing any integrative insight |
Practice this exact question
Write your answer, then get a detailed evaluation from our AI trained on UPSC's answer-writing standards. Free first evaluation — no signup needed to start.
Evaluate my answer →More from Statistics 2022 Paper II
- Q1 (a) Distinguish between process control and product control. Explain the various sources of variation encountered in a process control stud…
- Q2 (a) Samples of size n = 5 units are taken from a process every hour. The x̄ and R̄ values for a particular quality characteristic are deter…
- Q3 (a) It is planned to raise a research team to a strength of 50 chemists, which is to be maintained. The wastage of recruits depends on thei…
- Q4 (a) Use penalty method to solve the following linear programming problem : Maximize Z = x₁ + 2x₂ + 3x₃ - x₄ subject to the constraints x₁ +…
- Q5 (a) Apply the method of link relatives to the following data and calculate the seasonal indices : Price of Rice (in ₹ per 10 kg) | Quarter…
- Q6 (a) Explain Akaike information criterion for order selection in an ARMA (p, q) process. 15 marks (b) Define autocorrelation coefficient. Wh…
- Q7 (a) What are the various indices of mortality measure? Explain the purpose and procedure for standardizing them. (20 marks) (b) With usual…
- Q8 (a) How does the concept of wholesale price index work? Describe the major components of wholesale price index. Explain the methodology of…