📌 Day 4: AI-Powered Assessment — Measuring What Matters | 60-Day ELT Masterclass

60-Day ELT Masterclass

📌 Day 4: AI-Powered Assessment

Measuring What Matters in 2026: Precision, Personalization, and Pedagogy

📌 Introduction: The AI Assessment Revolution

Assessment in ELT has evolved from **pen-and-paper tests** (1980s) to **computer-adaptive testing** (2000s) to **AI-powered, personalized feedback** (2026). Today’s tools don’t just measure learning—they predict gaps, adapt in real-time, and scaffolding next steps.

🔍 Why AI?

AI solves three core problems in assessment:

Bias: Human graders favor fluency over accuracy; AI evaluates predefined criteria (e.g., CEFR descriptors).
Scalability: AI can assess 100 speaking samples in the time a human grades 5.
Actionable Data: AI doesn’t just score—it diagnoses (e.g., “Your learners struggle with past perfect form, not use“).

Example: Cambridge’s Linguaskill uses AI to score writing/speaking against CEFR bands instantly, with feedback like:

“Band B1: You used simple linkers (‘and’, ‘but’). For B2, try ‘although’ or ‘despite’.”

📚 Assessment Types in 2026

1. Formative Assessment

Definition: Ongoing, low-stakes feedback to improve learning (not grade it).

🔍 Key Features

Frequency: Daily/weekly (e.g., exit tickets, peer feedback).
Purpose: Identify gaps during learning (e.g., “80% of students confused ‘affect’ vs. ‘effect'”).
AI Tools:
- Pearson Versant: Scores speaking in real-time.
- EduApps: Auto-grades short writing tasks.

Example: After a lesson on conditionals, AI analyzes students’ chatbot conversations and flags:

“12/20 students used ‘will’ instead of ‘would’ in 2nd conditionals.”

🛠 Practical Activity

AI-Powered Exit Ticket:

Students record a 30-second audio response to: “Explain a 2nd conditional rule with an example.”
Upload to Speechace (free tier).
AI generates:
“Accuracy: 75% · Fluency: 80% · Error: ‘If I will go’ → Teach: ‘If + subject + past simple’.”

2. Summative Assessment

Definition: High-stakes evaluation of learning after instruction (e.g., final exams).

🔍 Key Features

Frequency: End of unit/term (e.g., Cambridge B2 First).
Purpose: Certify proficiency (e.g., “This student is CEFR B2”).
AI Tools:
- Aptis: AI-scored speaking/writing.
- TOEFL iBT: AI evaluates integrated tasks.

Example: Oxford’s Oxford Test of English uses AI to:

“Score speaking in 5 minutes with CEFR-aligned rubrics for fluency, pronunciation, and grammar.”

🛠 Practical Activity

AI Mock Exam:

Students take a free EF SET practice test (AI-scored).
AI generates:
“Listening: C1 · Reading: B2 · Focus: Improve skimming for gist in reading tasks.”
Teacher uses data to plan review sessions.

3. Diagnostic Assessment

Definition: Identifies strengths/weaknesses before instruction to personalize learning.

🔍 Key Features

Frequency: Start of course/unit (e.g., placement tests).
Purpose: Answer: “What does this student already know?”
AI Tools:
- Oxford Placement Test: AI adapts difficulty in real-time.
- Cambridge English Test: 25-question adaptive quiz.

Example: British Council’s Listening Diagnostic:

“You scored 65% on inferencing questions. Practice: B1 Listening for Gist.”

🛠 Practical Activity

AI Gap Analysis:

Students complete a 15-minute EF SET Quick Check.
AI reports:
“Grammar: 80% · Vocabulary: 60% · Weakness: Phrasal verbs. Assign: British Council Phrasal Verbs Unit.”

4. Ipsative Assessment

Definition: Compares students to their past performance (not peers/norms).

🔍 Key Features

Frequency: Ongoing (e.g., portfolio reviews).
Purpose: Answer: “How much has this student improved?”
AI Tools:
- Seesaw: AI tracks progress in student portfolios.
- Kaizena: Voice comments + skill tagging.

Example: A student’s writing samples over 3 months:

September: “I go to park.” (A1)
December: “Last week, I went to the park with my friends.” (A2) → +1 CEFR sub-level.

🛠 Practical Activity

AI Progress Portfolio:

Students record a 1-minute speaking sample monthly (e.g., “Describe your weekend”).
Upload to VoiceThread; AI compares fluency/accuracy to prior samples.
Teacher + student review:
“↑ Fluency: 15% faster · ↓ Errors: ‘goed’ → ‘went’ (fixed!). Next goal: Use 2 past perfect sentences.”

🤖 AI Tools for Assessment: A Practical Guide

1. AI for Speaking Assessment

Tool	Features	Example Use Case	Link
Speechace	Scores fluency, pronunciation, grammar. CEFR-aligned feedback. Free for 3-min samples.	Student records: “My favorite holiday was…” → AI flags: “‘was go’ → Teach: ‘went’.”	Try Speechace
ELSA Speak	Focuses on pronunciation. Breaks down errors by phoneme. Free tier available.	Student says: “I leef in Paris.” → AI: “/iː/ → /ɪ/. Practice: ‘ship’ vs. ‘sheep’.”	Try ELSA
Pearson Versant	Certified for high-stakes tests. Scores in 5 minutes. Paid, but free demo.	Job applicant takes test → AI report: “Fluency: B2 · Grammar: B1 · Recommend: Practice complex sentences.”	Try Versant

💡 Pro Tip: Combine tools! Use ELSA for pronunciation + Speechace for grammar/fluency.

2. AI for Writing Assessment

Tool	Features	Example Use Case	Link
Grammarly	Checks grammar, clarity, engagement. Free for basic feedback. Integrates with Google Docs.	Student writes: “I goed to the store.” → AI: “Did you mean ‘went’?”	Try Grammarly
Quill.org	Free for teachers. Focuses on sentence-level skills. Provides lesson plans.	Activity: “Combine these sentences using ‘although’.” → AI scores responses and suggests scaffolding.	Try Quill
Write & Improve	Cambridge’s free tool. CEFR-aligned feedback. Track progress over time.	Student submits: “My weekend was bore.” → AI: “B1 · Error: ‘bore’ → ‘boring’. Tip: Use ‘-ing’ for adjectives.”	Try Write & Improve

💡 Pro Tip: Use Write & Improve for CEFR alignment + Quill for targeted practice.

3. AI for Listening Assessment

Tool	Features	Example Use Case	Link
ELSA Listen	Dictation + comprehension tasks. Scores accuracy/speed. Free for basic use.	Student listens: “The meeting is at 3 PM.” → Types: “The meet is at 3.” → AI: “Error: ‘meet’ → ‘meeting’.”	Try ELSA Listen
British Council Listen & Watch	Free videos + quizzes. AI tracks progress. CEFR-aligned.	Student watches a B1 video → Takes quiz → AI: “70% on detail questions. Practice: B1 Listening for Gist.”	Try BC Listen
Sanako Connect	Classroom listening labs. AI grades responses. Paid, but free trial.	Teacher assigns a lecture → AI reports: “Class average: 65% on inference questions. Focus: Teach signal words (‘however’, ‘therefore’).”	Try Sanako

4. AI for Grammar/Vocabulary

Tool	Features	Example Use Case	Link
NoRedInk	Adaptive grammar/vocab practice. Aligns with CCSS/CEFR. Free for teachers.	Student practices: “She go to school.” → AI: “Error: ‘go’ → ‘goes’. Next: 3rd person -s rules.”	Try NoRedInk
Kahoot!	AI-generated quizzes. Gamified learning. Free for basic use.	Teacher inputs: “Past simple irregular verbs.” → AI generates a 10-question quiz with explanations.	Try Kahoot!
Quizlet	AI-powered flashcards. Diagnostic tests. Free tier available.	Student studies “phrasal verbs” → AI quiz flags: “‘look up’ confused with ‘look for’.”	Try Quizlet

💡 Pro Tip: Use NoRedInk for grammar + Quizlet for vocab retention.

🛠 AI Assessment Designer

Use this tool to generate a **custom AI-powered assessment plan** for your class. Select your focus and level:

🎯 Practical Activities: AI in Your Classroom

Activity 1: AI Speaking Rubric

Use Speechace to assess a student’s 1-minute response to: “Describe a time you felt proud.”

What will the AI not evaluate?

A. Fluency (speech rate/pauses)
B. Creativity of ideas
C. Grammar accuracy
D. Pronunciation (phonemes/stress)

✅ Correct! AI tools like Speechace focus on linguistic features (A/C/D), not content creativity (B). Use a human rubric for ideas/originality.

Activity 2: AI Writing Feedback

A student submits this to Write & Improve:

“Last weekend I go to the beach with friends. We eated picnic.”

What feedback will the AI most likely provide?

A. “Your ideas are unclear. Add more details about the beach.”

B. “Band: A2. Errors: ‘go’ → ‘went’; ‘eated’ → ‘ate’. Tip: Use past simple for finished actions.”

C. “Your handwriting is messy. Rewrite neatly.”

✅ Correct! Write & Improve focuses on grammar/accuracy and CEFR banding. It ignores handwriting (C) and content clarity (A) unless severely lacking.

❌ Try again! Write & Improve is designed for linguistic accuracy, not content or handwriting.

Activity 3: AI Listening Diagnostic

Your B1 students take the EF SET Quick Check. The AI reports:

“Listening: 58% · Weakness: Inferencing (30% correct).”

What’s the best next step?

A. Teach past perfect tense (unrelated to listening).

B. Assign British Council’s “Inferring Attitudes” lesson.

C. Have students listen to a C1-level podcast (too difficult).

✅ Correct! The AI identified a specific skill gap (inferencing). Targeted practice at the right level (B1) is most effective.

❌ Not ideal. The AI data shows a listening sub-skill issue (inferencing), not a grammar (A) or level (C) problem.

📚 References & Further Reading

Academic sources:

Bachman, L. F., & Palmer, A. S. (2010). Language Assessment in Practice (2nd ed.). Oxford University Press.
Fulcher, G., & Davidson, F. (2012). Language Testing and Assessment (4th ed.). Routledge.
Cambridge English. (2020). AI in Language Assessment. Read more.
Oxford University Press. (2021). Assessment for Learning. Explore.

Free tools:

Speechace (speaking)
Write & Improve (writing)
EF SET Quick Check (diagnostic)
Quill (grammar)

📌 Day 4: AI-Powered Assessment for English Teaching and Learning.

📌 Day 4: AI-Powered Assessment

📌 Introduction: The AI Assessment Revolution

🔍 Why AI?

📚 Assessment Types in 2026

1. Formative Assessment

🔍 Key Features

🛠 Practical Activity

2. Summative Assessment

🔍 Key Features

🛠 Practical Activity

3. Diagnostic Assessment

🔍 Key Features

🛠 Practical Activity

4. Ipsative Assessment

🔍 Key Features

🛠 Practical Activity

🤖 AI Tools for Assessment: A Practical Guide

1. AI for Speaking Assessment

2. AI for Writing Assessment

3. AI for Listening Assessment

4. AI for Grammar/Vocabulary

🛠 AI Assessment Designer

🎯 Practical Activities: AI in Your Classroom

Activity 1: AI Speaking Rubric

Activity 2: AI Writing Feedback

Activity 3: AI Listening Diagnostic

📚 References & Further Reading

Leave a Comment Cancel Reply