# Evaluate AI response accuracy against reference answers from Google Sheets
This workflow automatically assesses the correctness of AI agent responses by comparing them with reference answers using semantic analysis powered by OpenAI. It is designed for teams building AI assistants who need reliable quality metrics.
## Who it's for
- AI assistant developers needing to test response accuracy
- Teams deploying chatbots and requiring quality metrics
- Analysts evaluating LLM performance on benchmark datasets
- Educators automating the grading of answers to history questions
## What the automation does
- Retrieves a question from a Google Sheets row or incoming message
- Sends the query to an OpenAI-based AI agent via LangChain
- Generates a concise answer using model knowledge
- Compares the generated answer with the reference using GPT-4o-mini
- Assigns a score from 1 to 5 based on factual accuracy and semantic alignment
- Stores the result as a structured evaluation metric
- Can be triggered via webhook, chat message, or batch processing
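
The comparison and scoring steps above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the prompt wording, the function names `build_judge_prompt` and `parse_score`, and the "reply with the number only" convention are hypothetical and are not taken from the template itself. The call to the model is left out so the sketch stays self-contained.

```python
import re

SCORE_MIN, SCORE_MAX = 1, 5  # scoring range described in the listing


def build_judge_prompt(question: str, reference: str, candidate: str) -> str:
    """Assemble a grading prompt (hypothetical wording, not the template's)."""
    return (
        "You are grading an AI answer against a reference answer.\n"
        f"Question: {question}\n"
        f"Reference answer: {reference}\n"
        f"AI answer: {candidate}\n"
        f"Rate factual accuracy and semantic alignment from {SCORE_MIN} "
        f"to {SCORE_MAX}. Reply with the number only."
    )


def parse_score(reply: str) -> int:
    """Extract the first standalone digit from 1 to 5 in the judge's reply."""
    match = re.search(r"\b([1-5])\b", reply)
    if match is None:
        raise ValueError(f"no score between 1 and 5 found in: {reply!r}")
    return int(match.group(1))
```

In practice the prompt would be sent to the judge model (GPT-4o-mini in this workflow) and the raw reply passed to `parse_score`, which rejects anything outside the 1 to 5 range instead of silently storing a bad metric.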
## What's included
- Ready-to-use n8n workflow
- Trigger and handler logic using LangChain and OpenAI
- Integrations with Google Sheets API and OpenAI API
- Basic written instructions for launching and adapting the workflow
## Requirements for setup
- n8n account (cloud or self-hosted)
- OpenAI API key
- Access to a Google Sheet with columns: question, reference answer, ID
- Basic understanding of JSON and API operations
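
Before connecting the sheet, it can help to check that it has the expected columns. The sketch below assumes the sheet has been exported as CSV and that the header row uses exactly the names `ID`, `question`, and `reference answer`. Both the export step and the exact header spelling are assumptions, and `load_rows` is a hypothetical helper, not part of the template.

```python
import csv
import io

# Assumed header names; adjust to match the actual sheet.
REQUIRED_COLUMNS = ("ID", "question", "reference answer")


def load_rows(csv_text: str) -> list[dict[str, str]]:
    """Parse a CSV export and keep rows that have a question and a reference."""
    reader = csv.DictReader(io.StringIO(csv_text))
    header = reader.fieldnames or []
    missing = [col for col in REQUIRED_COLUMNS if col not in header]
    if missing:
        raise ValueError(f"missing columns: {missing}")

    rows = []
    for row in reader:
        # Short rows yield None for absent cells, so normalize to "".
        cleaned = {col: (row.get(col) or "").strip() for col in REQUIRED_COLUMNS}
        if cleaned["question"] and cleaned["reference answer"]:
            rows.append(cleaned)
    return rows
```

Rows with an empty question or reference answer are skipped here rather than raising an error, which is one possible design choice; a stricter check may suit benchmark datasets better.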
## Benefits and outcomes
- Objective AI quality scoring without manual review
- Track model improvements after updates or retraining
- Support regression testing for chatbots
- Batch-evaluate hundreds of responses automatically
- Integrate into CI/CD pipelines for AI products
- Simplify auditing of educational bot knowledge
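
For regression testing or a CI/CD gate, the stored 1 to 5 scores can be reduced to a couple of summary numbers and compared between runs. The following is a minimal sketch, not part of the template: the helper names `summarize` and `regression_detected`, the pass mark of 4, and the tolerance of 0.25 are all illustrative assumptions.

```python
from statistics import mean


def summarize(scores: list[int], pass_mark: int = 4) -> dict[str, float]:
    """Return the mean score and the share of answers at or above pass_mark."""
    if not scores:
        raise ValueError("no scores to summarize")
    if any(not 1 <= s <= 5 for s in scores):
        raise ValueError("scores must be between 1 and 5")
    return {
        "mean": mean(scores),
        "pass_rate": sum(s >= pass_mark for s in scores) / len(scores),
    }


def regression_detected(
    baseline: list[int], current: list[int], tolerance: float = 0.25
) -> bool:
    """Flag a run whose mean drops more than `tolerance` below the baseline."""
    return summarize(current)["mean"] < summarize(baseline)["mean"] - tolerance
```

A pipeline step could call `regression_detected` with the scores from the previous release and the current run, and fail the build when it returns `True`.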
## Important: template only
You are purchasing a ready-made automation workflow template only. Rollout into your infrastructure, connection of specific accounts and services, 1:1 setup help, custom adjustments for non-standard stacks, and any consulting support are provided as a separate paid service at an individual rate. To discuss custom work or 1:1 help, contact us via chat.