- A conceptual framework in natural language processing (NLP) that employs large language models (LLMs) as evaluators to assess the performance of other language-based systems or outputs. Instead of relying solely on human annotators, the approach leverages the general language capabilities of advanced language models to serve as automated judges. ← Wikipedia
This term is sponsored by: your name/company?
- Previous term: LLM-as-a-Judge
- Next term: LMB
- Random term: QA (webglossary.info/random 🎲)