(a) (i) If **X** = (X₁ X₂ X₃)' is distributed as N₃ (μ, Σ), find the distribution of [(X₁ – X₂) (X₂ – X₃)]'. (5 marks)

(ii) Suppose that **X** = (X₁ X₂ X₃)' ~ N₃ (**0**, Σ), where Σ = $\begin{pmatrix} 1 & ho & 0 \ ho & 1 & ho \ 0 & ho & 1

Question

(a) (i) If **X** = (X₁ X₂ X₃)' is distributed as N₃ (μ, Σ), find the distribution of [(X₁ – X₂) (X₂ – X₃)]'. (5 marks)

(ii) Suppose that **X** = (X₁ X₂ X₃)' ~ N₃ (**0**, Σ), where Σ = $\begin{pmatrix} 1 & ho & 0 \ ho & 1 & ho \ 0 & ho & 1 \end{pmatrix}$. Is there a value of ρ for which (X₁ + X₂ + X₃) and (X₁ – X₂ – X₃) are independent ? (5 marks)

(b) Show that **X** = (X₁, X₂, ..., Xₚ)' has p-variate normal distribution if and only if every linear combination (l₁X₁ + l₂X₂ + ... + lₚXₚ) of **X** follows a univariate normal distribution. (10 marks)

(c) Let x₁, x₂, ..., xₙ be n given observations, and suppose that Yᵢ = β₀ + β₁xᵢ + eᵢ; i = 1, 2, ..., n, where β₀, β₁ are unknown parameters and eᵢ are mutually independent normal random variables with E(eᵢ) = 0 and V(eᵢ) = σ², i = 1, 2, ..., n. Also, σ² is assumed to be unknown. Test the null hypothesis H₀ : β₀ = β₁ = 0. (10 marks)

(d) Complete the following analysis of variance table of a design and examine whether there is a significant difference between the treatments at 5% level of significance:

| Source of Variation | Degrees of Freedom | Sum of Squares | Mean Sum of Squares | Variance Ratio |
|---------------------|-------------------|----------------|---------------------|----------------|
| Blocks | — | 21 | 4·2 | — |
| Treatments | — | — | 5·0 | — |
| Error | 15 | 12 | — | |
| Total | — | — | | |

Given that F_{·05}(3, 15) = 8·70, F_{·05}(5, 15) = 4·62 (10 marks)

(e) Define regression estimator used for the estimation of population mean. Obtain its bias and Mean Square Error (MSE) to the first order of approximation. (10 marks)

UPSC Answer Check · Accepted Answer

Solve this multi-part numerical problem by allocating time proportionally to marks: spend ~20% on (a)(i)-(ii) combined, ~20% on (b), ~20% on (c), ~20% on (d), and ~20% on (e). Begin each sub-part by stating the relevant theorem or formula, show complete derivation/calculation steps, and conclude with precise final answers. For (d), complete the ANOVA table systematically before hypothesis testing. For (e), clearly define the estimator before deriving bias and MSE.
- (a)(i) Apply linear transformation theorem: if Y = AX, then Y ~ N₂(Aμ, AΣA') with correct matrix A = [[1,-1,0],[0,1,-1]]
- (a)(ii) Use independence condition Cov(X₁+X₂+X₃, X₁-X₂-X₃) = 0; solve for ρ = -1/2 and verify validity
- (b) Prove both directions: (⇒) by definition of MVN, (⇒) using characteristic functions or Cramér-Wold theorem
- (c) Set up F-test for H₀: β₀=β₁=0 using extra sum of squares; compute F = [(SSR/2)]/[SSE/(n-2)] with correct df
- (d) Complete ANOVA table: Blocks df=5, Treatments df=3, Total df=23, Total SS=33, Error MS=0.8; compute F_Treatments=6.25 and compare with critical value
- (e) Define regression estimator Ŷ_reg = ȳ + b(X̄ - x̄); derive bias ≈ 0 and MSE ≈ (1-f)S²_y(1-ρ²)/n to first order

Dimension	Weight	Max marks	Excellent	Average	Poor
Setup correctness	20%	10	Correctly identifies distributions, states all assumptions, and sets up proper null hypotheses for all sub-parts; for (a) correctly specifies transformation matrices and for (c) properly defines the linear model with error structure	Identifies most distributions and hypotheses correctly but misses some assumptions or has minor errors in transformation matrix setup; incomplete specification of model conditions	Wrong distributional assumptions, incorrect hypothesis formulation, or fundamental misunderstanding of linear transformation properties; fails to specify the linear model correctly
Method choice	20%	10	Selects optimal methods: linear transformation theorem for (a), Cramér-Wold/characteristic functions for (b), extra sum of squares F-test for (c), standard ANOVA procedure for (d), and large-sample approximation for (e); justifies choices appropriately	Uses correct but suboptimal methods (e.g., direct covariance calculation instead of matrix approach for (a)); or applies correct methods without clear justification; minor errors in method selection for one sub-part	Wrong methods chosen (e.g., t-tests instead of F-test for (c), or ignores Cramér-Wold for (b)); applies ANOVA incorrectly or uses wrong approximation order for (e)
Computation accuracy	20%	10	All matrix multiplications, covariance calculations, F-statistics, and ANOVA table entries computed flawlessly; correct algebraic manipulation in bias/MSE derivation; no arithmetic errors	Most calculations correct with minor arithmetic slips (e.g., sign error in ρ solution, small df miscalculation, or rounding errors in final ANOVA entries); correct approach but execution lapses	Major computational errors: wrong matrix products, incorrect covariance values, wrong F-statistic formula, or fundamentally flawed ANOVA table completion; incorrect bias/MSE derivation
Interpretation	20%	10	Correctly interprets independence condition in (a)(ii), explains significance of Cramér-Wold characterization, draws proper conclusion about H₀ in (c) with clear decision rule, correctly interprets F-test result in (d) with context, and explains when regression estimator is preferred	Basic interpretation present but lacks depth; states conclusions without clear connection to computed values; misses nuanced interpretation of why methods work	No interpretation of results, wrong conclusion from test statistics, or fails to state final decision; confuses independence with zero correlation; misinterprets ANOVA output
Final answer & units	20%	10	All final answers explicitly stated: complete distribution parameters for (a), verified ρ value, clear statement of iff proof, explicit F-statistic with rejection decision, completed ANOVA table with conclusion, and final bias/MSE expressions; proper notation throughout	Most final answers present but some incomplete or buried in working; missing explicit statement of distribution in (a)(i) or final decision in (c); ANOVA table filled but conclusion not clearly stated	Missing final answers, incomplete ANOVA table, no conclusion on hypothesis tests, or answers without proper statistical notation; fails to box or highlight key results

Q5

Directive word: Solve

How this answer will be evaluated

Approach

Key points expected

Evaluation rubric

Practice this exact question

More from Statistics 2023 Paper I