📌 Day 4: AI-Powered Assessment for English Teaching and Learning.

📌 Day 4: AI-Powered Assessment — Measuring What Matters | 60-Day ELT Masterclass
60-Day ELT Masterclass

📌 Day 4: AI-Powered Assessment

Measuring What Matters in 2026: Precision, Personalization, and Pedagogy

📌 Introduction: The AI Assessment Revolution

Assessment in ELT has evolved from **pen-and-paper tests** (1980s) to **computer-adaptive testing** (2000s) to **AI-powered, personalized feedback** (2026). Today’s tools don’t just measure learning—they predict gaps, adapt in real-time, and scaffolding next steps.

🔍 Why AI?

AI solves three core problems in assessment:

  1. Bias: Human graders favor fluency over accuracy; AI evaluates predefined criteria (e.g., CEFR descriptors).
  2. Scalability: AI can assess 100 speaking samples in the time a human grades 5.
  3. Actionable Data: AI doesn’t just score—it diagnoses (e.g., “Your learners struggle with past perfect form, not use“).

Example: Cambridge’s Linguaskill uses AI to score writing/speaking against CEFR bands instantly, with feedback like:

“Band B1: You used simple linkers (‘and’, ‘but’). For B2, try ‘although’ or ‘despite’.”

📚 Assessment Types in 2026

1. Formative Assessment

Definition: Ongoing, low-stakes feedback to improve learning (not grade it).

🔍 Key Features

  • Frequency: Daily/weekly (e.g., exit tickets, peer feedback).
  • Purpose: Identify gaps during learning (e.g., “80% of students confused ‘affect’ vs. ‘effect'”).
  • AI Tools:

Example: After a lesson on conditionals, AI analyzes students’ chatbot conversations and flags:

“12/20 students used ‘will’ instead of ‘would’ in 2nd conditionals.”

🛠 Practical Activity

AI-Powered Exit Ticket:

  1. Students record a 30-second audio response to: “Explain a 2nd conditional rule with an example.”
  2. Upload to Speechace (free tier).
  3. AI generates:
    “Accuracy: 75% · Fluency: 80% · Error: ‘If I will go’ → Teach: ‘If + subject + past simple’.”

2. Summative Assessment

Definition: High-stakes evaluation of learning after instruction (e.g., final exams).

🔍 Key Features

  • Frequency: End of unit/term (e.g., Cambridge B2 First).
  • Purpose: Certify proficiency (e.g., “This student is CEFR B2”).
  • AI Tools:
    • Aptis: AI-scored speaking/writing.
    • TOEFL iBT: AI evaluates integrated tasks.

Example: Oxford’s Oxford Test of English uses AI to:

“Score speaking in 5 minutes with CEFR-aligned rubrics for fluency, pronunciation, and grammar.”

🛠 Practical Activity

AI Mock Exam:

  1. Students take a free EF SET practice test (AI-scored).
  2. AI generates:
    “Listening: C1 · Reading: B2 · Focus: Improve skimming for gist in reading tasks.”
  3. Teacher uses data to plan review sessions.

3. Diagnostic Assessment

Definition: Identifies strengths/weaknesses before instruction to personalize learning.

🔍 Key Features

  • Frequency: Start of course/unit (e.g., placement tests).
  • Purpose: Answer: “What does this student already know?”
  • AI Tools:

Example: British Council’s Listening Diagnostic:

“You scored 65% on inferencing questions. Practice: B1 Listening for Gist.”

🛠 Practical Activity

AI Gap Analysis:

  1. Students complete a 15-minute EF SET Quick Check.
  2. AI reports:
    “Grammar: 80% · Vocabulary: 60% · Weakness: Phrasal verbs. Assign: British Council Phrasal Verbs Unit.”

4. Ipsative Assessment

Definition: Compares students to their past performance (not peers/norms).

🔍 Key Features

  • Frequency: Ongoing (e.g., portfolio reviews).
  • Purpose: Answer: “How much has this student improved?”
  • AI Tools:
    • Seesaw: AI tracks progress in student portfolios.
    • Kaizena: Voice comments + skill tagging.

Example: A student’s writing samples over 3 months:

September: “I go to park.” (A1)
December: “Last week, I went to the park with my friends.” (A2) → +1 CEFR sub-level.

🛠 Practical Activity

AI Progress Portfolio:

  1. Students record a 1-minute speaking sample monthly (e.g., “Describe your weekend”).
  2. Upload to VoiceThread; AI compares fluency/accuracy to prior samples.
  3. Teacher + student review:
    “↑ Fluency: 15% faster · ↓ Errors: ‘goed’ → ‘went’ (fixed!). Next goal: Use 2 past perfect sentences.”

🤖 AI Tools for Assessment: A Practical Guide

1. AI for Speaking Assessment

Tool Features Example Use Case Link
Speechace
  • Scores fluency, pronunciation, grammar.
  • CEFR-aligned feedback.
  • Free for 3-min samples.
Student records: “My favorite holiday was…” → AI flags: “‘was go’ → Teach: ‘went’.”
Try Speechace
ELSA Speak
  • Focuses on pronunciation.
  • Breaks down errors by phoneme.
  • Free tier available.
Student says: “I leef in Paris.” → AI: “/iː/ → /ɪ/. Practice: ‘ship’ vs. ‘sheep’.”
Try ELSA
Pearson Versant
  • Certified for high-stakes tests.
  • Scores in 5 minutes.
  • Paid, but free demo.
Job applicant takes test → AI report: “Fluency: B2 · Grammar: B1 · Recommend: Practice complex sentences.”
Try Versant

💡 Pro Tip: Combine tools! Use ELSA for pronunciation + Speechace for grammar/fluency.

2. AI for Writing Assessment

Tool Features Example Use Case Link
Grammarly
  • Checks grammar, clarity, engagement.
  • Free for basic feedback.
  • Integrates with Google Docs.
Student writes: “I goed to the store.” → AI: “Did you mean ‘went’?”
Try Grammarly
Quill.org
  • Free for teachers.
  • Focuses on sentence-level skills.
  • Provides lesson plans.
Activity: “Combine these sentences using ‘although’.” → AI scores responses and suggests scaffolding.
Try Quill
Write & Improve
  • Cambridge’s free tool.
  • CEFR-aligned feedback.
  • Track progress over time.
Student submits: “My weekend was bore.” → AI: “B1 · Error: ‘bore’ → ‘boring’. Tip: Use ‘-ing’ for adjectives.”
Try Write & Improve

💡 Pro Tip: Use Write & Improve for CEFR alignment + Quill for targeted practice.

3. AI for Listening Assessment

Tool Features Example Use Case Link
ELSA Listen
  • Dictation + comprehension tasks.
  • Scores accuracy/speed.
  • Free for basic use.
Student listens: “The meeting is at 3 PM.” → Types: “The meet is at 3.” → AI: “Error: ‘meet’ → ‘meeting’.”
Try ELSA Listen
British Council Listen & Watch
  • Free videos + quizzes.
  • AI tracks progress.
  • CEFR-aligned.
Student watches a B1 video → Takes quiz → AI: “70% on detail questions. Practice: B1 Listening for Gist.”
Try BC Listen
Sanako Connect
  • Classroom listening labs.
  • AI grades responses.
  • Paid, but free trial.
Teacher assigns a lecture → AI reports: “Class average: 65% on inference questions. Focus: Teach signal words (‘however’, ‘therefore’).”
Try Sanako

4. AI for Grammar/Vocabulary

Tool Features Example Use Case Link
NoRedInk
  • Adaptive grammar/vocab practice.
  • Aligns with CCSS/CEFR.
  • Free for teachers.
Student practices: “She go to school.” → AI: “Error: ‘go’ → ‘goes’. Next: 3rd person -s rules.”
Try NoRedInk
Kahoot!
  • AI-generated quizzes.
  • Gamified learning.
  • Free for basic use.
Teacher inputs: “Past simple irregular verbs.” → AI generates a 10-question quiz with explanations.
Try Kahoot!
Quizlet
  • AI-powered flashcards.
  • Diagnostic tests.
  • Free tier available.
Student studies “phrasal verbs” → AI quiz flags: “‘look up’ confused with ‘look for’.”
Try Quizlet

💡 Pro Tip: Use NoRedInk for grammar + Quizlet for vocab retention.

🛠 AI Assessment Designer

Use this tool to generate a **custom AI-powered assessment plan** for your class. Select your focus and level:

🎯 Practical Activities: AI in Your Classroom

Activity 1: AI Speaking Rubric

Use Speechace to assess a student’s 1-minute response to: “Describe a time you felt proud.

What will the AI not evaluate?
  1. A. Fluency (speech rate/pauses)
  2. B. Creativity of ideas
  3. C. Grammar accuracy
  4. D. Pronunciation (phonemes/stress)
✅ Correct! AI tools like Speechace focus on linguistic features (A/C/D), not content creativity (B). Use a human rubric for ideas/originality.

Activity 2: AI Writing Feedback

A student submits this to Write & Improve:

“Last weekend I go to the beach with friends. We eated picnic.”

What feedback will the AI most likely provide?

A. “Your ideas are unclear. Add more details about the beach.”
B. “Band: A2. Errors: ‘go’ → ‘went’; ‘eated’ → ‘ate’. Tip: Use past simple for finished actions.”
C. “Your handwriting is messy. Rewrite neatly.”
✅ Correct! Write & Improve focuses on grammar/accuracy and CEFR banding. It ignores handwriting (C) and content clarity (A) unless severely lacking.
❌ Try again! Write & Improve is designed for linguistic accuracy, not content or handwriting.

Activity 3: AI Listening Diagnostic

Your B1 students take the EF SET Quick Check. The AI reports:

“Listening: 58% · Weakness: Inferencing (30% correct).”

What’s the best next step?

A. Teach past perfect tense (unrelated to listening).
C. Have students listen to a C1-level podcast (too difficult).
✅ Correct! The AI identified a specific skill gap (inferencing). Targeted practice at the right level (B1) is most effective.
❌ Not ideal. The AI data shows a listening sub-skill issue (inferencing), not a grammar (A) or level (C) problem.

📚 References & Further Reading

Academic sources:

  • Bachman, L. F., & Palmer, A. S. (2010). Language Assessment in Practice (2nd ed.). Oxford University Press.
  • Fulcher, G., & Davidson, F. (2012). Language Testing and Assessment (4th ed.). Routledge.
  • Cambridge English. (2020). AI in Language Assessment. Read more.
  • Oxford University Press. (2021). Assessment for Learning. Explore.

Free tools:

60-Day ELT Masterclass by Sourov Deb

45 episodes · 15 bonus specials · 5–30 min each · Interactive HTML format · Practice-first · YouTube + LinkedIn

Leave a Comment

Your email address will not be published. Required fields are marked *

Scroll to Top