Answer Writing

Evaluate / Critically Evaluate in UPSC: Worked Examples + Rubric

Published 2026-04-27 · UPSC Answer Check Editorial

In the UPSC Mains examination, the directive "evaluate" is often mistaken for "discuss" or "explain." While a discussion explores various facets of a topic, an evaluation demands a judgment. To evaluate is to determine the value, merit, or success of a policy, system, or statement based on specific criteria. "Critically evaluate" takes this a step further; it requires you to challenge underlying assumptions and provide a more incisive, nuanced appraisal, often questioning the prevailing narrative while remaining objective.

What "evaluate" demands beyond description

Most aspirants lose marks because they provide a descriptive answer—telling the examiner what a scheme is, rather than how well it has worked. Evaluation is not a summary; it is an audit.

When you see "evaluate" or "critically evaluate," the examiner is looking for four specific cognitive steps:

  1. Establishment of Criteria: On what basis are you judging? (e.g., Is the Collegium system being evaluated on "transparency" or "judicial independence"?)
  2. Evidence Gathering: Providing factual data, judicial pronouncements, or committee reports to support the claim.
  3. Weighting: Balancing the strengths against the weaknesses.
  4. The Verdict: A reasoned conclusion that states whether the subject is successful, flawed, or requires specific reforms.

Comparison with Near-Cousin Directives

DirectivePrimary GoalMental Process
DiscussExplorePresent multiple perspectives on a topic.
ExamineInvestigateLook closely at the components to understand how they work.
AnalyzeDeconstructBreak the topic into parts to show the relationship between them.
EvaluateJudgeWeigh the pros and cons to arrive at a verdict of merit.

3 PYQ worked examples

To master this, you must move from "listing points" to "building an argument." Below are three examples from recent papers.

Example 1: The Direct Evaluation (Critical Examination)

PYQ: "Discuss the evolution of collegium system in India. Critically examine the advantages and disadvantages of the system of appointment of the Judges of the Supreme Court of India and that of the USA." (2025 Paper 2 Q13)

The Demand: The first part is descriptive (evolution), but the second part is a critical evaluation. You must not just list pros and cons; you must weigh them to see which system better serves the ideal of an independent judiciary.

The Evidence:

  • India: Mention the First Judges Case (1993) and Third Judges Case (1998) for evolution. For evaluation, cite the lack of transparency (disadvantage) vs. protection from executive whims (advantage).
  • USA: Mention Senate confirmation. Evaluate the democratic accountability (advantage) vs. the extreme political polarization/partisan bias (disadvantage).

The Critical Nuance: A high-scoring answer will argue that while the US system is more transparent, the Indian system prioritizes "independence" over "accountability," creating a unique tension in Indian jurisprudence.


Example 2: The Implicit Evaluation (Examine)

PYQ: "e-governance projects have a built-in bias towards technology and back-end integration than user-centric designs. Examine." (2025 Paper 2 Q7)

The Demand: Although the word is "examine," the statement is a claim. You are being asked to evaluate whether this "bias" actually exists.

The Evidence:

  • Support for the claim: Cite Aadhaar's biometric failures in rural areas (technology over user) or the UMANG app's complex UI/UX despite high back-end integration.
  • Counter-evidence: Mention MyGov's crowdsourcing and the move towards "Jan Centric" portals.

The Verdict: The answer should conclude that while the "digital backbone" is strong, the "last-mile interface" remains a gap, confirming the statement to a large extent.


Example 3: The Justification Evaluation (Do you agree?)

PYQ: "Civil Society Organizations are often perceived as being anti-State actors than non-State actors. Do you agree? Justify." (2025 Paper 2 Q8)

The Demand: This is a direct call for a verdict. "Do you agree?" is the most explicit form of evaluation.

The Evidence:

  • Non-State Actor Role: Cite CSOs like CRY working in education or collaborating on mid-day meal programmes.
  • Anti-State Perception: Cite the Narmada Bachao Andolan or the tightening of FCRA regulations as evidence of the state's perception of CSOs as threats.

The Verdict: A balanced view—CSOs are essential for Article 19 rights and acting as watchdogs, but their perception as "anti-state" often stems from a clash between developmental goals and grassroots rights.

Contrast: 5/10 vs 8/10 Approach

For the CSO question (Example 3), here is the difference in quality:

Feature5/10 Answer (Descriptive)8/10 Answer (Evaluative)
ApproachLists what CSOs are and then lists why some people dislike them.Analyzes the tension between state sovereignty and democratic dissent.
Evidence"Some NGOs get foreign funding which is bad.""The amendment of FCRA regulations reflects a state effort to curb perceived foreign interference."
Conclusion"CSOs should work with the government for the better of India.""While CSOs are vital for democratic health, a regulatory framework that balances national security with civic space is required."

If you are unsure if your answer is too descriptive, you can evaluate your own answer against the official rubric to see if you've actually provided a verdict.

The verdict-with-evidence formula

To ensure you don't slip into "description mode," use this structural formula for every evaluation question:

1. The Opening (The Premise) Avoid generic introductions. Start with a sentence that acknowledges the tension in the question.

  • Pattern: "While [Subject X] has been instrumental in achieving [Positive Goal], its effectiveness is often contested due to [Primary Challenge/Bias]."
  • Example for e-governance: "While e-governance has streamlined service delivery in India, there is a growing consensus that the focus on back-end integration has often overshadowed the necessity of user-centric design."

2. The Evidence Pillars (The Weighting) Divide your body into two clear sections:

  • Pillar A (The Merit): "The success/validity of [Subject] is evident in..." (Insert 3-4 points with data/examples).
  • Pillar B (The Limitation): "However, the efficacy of [Subject] is undermined by..." (Insert 3-4 points with data/examples).

3. The Synthesis (The Verdict) The conclusion must be a "reasoned judgment." Do not simply summarize.

  • Avoid: "Thus, both sides have points."
  • Use: "Based on the above, it is evident that while [Subject] is conceptually sound, its operational success depends on [Specific Reform/Condition]."

Common over/under-evaluation traps

The Under-Evaluation Trap (The "Safe" Player)

This is the most common mistake. The candidate provides a perfect "Discuss" answer but forgets to "Evaluate."

  • The Symptom: The answer reads like a textbook chapter. It explains the topic but never takes a stand or judges the merit.
  • The Fix: Use evaluative adjectives: ineffective, suboptimal, landmark, flawed, transformative, superficial.

The Over-Evaluation Trap (The "Opinionated" Player)

The candidate takes a strong stand but forgets that UPSC is a test for a civil servant, not an editorial columnist.

  • The Symptom: Using emotional language ("The government's failure is catastrophic") or taking extreme ideological stances without citing a report or a judgment.
  • The Fix: Anchor every criticism in a source. Instead of "The policy is bad," use "The [Name] Committee highlighted that the policy failed to address [X]."

The "Critical" Oversight

In "Critically Evaluate," candidates often ignore the "Critically" part.

  • The Mistake: Treating "Critically Evaluate" as the same as "Evaluate."
  • The Fix: "Critically" means you must look at the assumptions. If a question asks to critically evaluate a scheme, ask: Was the premise of the scheme correct? Did it address the root cause or just the symptom?

To see how these traps impact your score, you can get scored on this question using our 5-dimensional rubric, which specifically weights "Demand-Directive" as 20% of your total marks.

Practice Prompt

To apply this framework, try answering the following prompt from the 2025 papers:

"The Government of India recently stated that Left Wing Extremism (LWE) will be eliminated by 2026. Evaluate the feasibility of this target in light of the socio-economic conditions of the affected regions." (10M / 150 words)

Checklist for your practice:

  • Did you start with a premise acknowledging the ambition of the 2026 target?
  • Did you provide evidence of success (e.g., reduction in violence, infrastructure growth)?
  • Did you provide evidence of remaining challenges (e.g., land alienation, tribal distrust)?
  • Did your conclusion give a "verdict" on whether 2026 is a realistic deadline or an aspirational one?

Conclusion: Your Next Step

Evaluation is the bridge between being a student (who knows facts) and being an officer (who can make decisions). The key is to stop describing and start judging—but always with evidence as your shield.

Your next action: Pick one "Evaluate" PYQ from the last three years. Write the answer using the Verdict-with-Evidence formula, and specifically highlight the sentence where you provide your final judgment.

Put it into practice

Write an answer, get AI-powered feedback in minutes.