LLM Agents at the Roundtable: A Multi-Perspective and Dialectical Reasoning Framework for Essay Scoring

Jang, Jinhee; Moon, Ayoung; Jung, Minkyoung; Kim, Youngbin; Lee, Seung Jin

doi:10.18653/v1/2025.findings-emnlp.1072



Abstract

The emergence of large language models (LLMs) has brought a new paradigm to automated essay scoring (AES), a long-standing and practical application of natural language processing in education. However, achieving human-level multi-perspective understanding and judgment remains a challenge. In this work, we propose Roundtable Essay Scoring (RES), a multi-agent evaluation framework designed to perform precise and human-aligned scoring under a zero-shot setting. RES constructs evaluator agents based on LLMs, each tailored to a specific prompt and topic context. Each agent independently generates a trait-based rubric and conducts a multi-perspective evaluation. Then, by simulating a roundtable-style discussion, RES consolidates individual evaluations through a dialectical reasoning process to produce a final holistic score that more closely aligns with human evaluation. By enabling collaboration and consensus among agents with diverse evaluation perspectives, RES outperforms prior zero-shot AES approaches. Experiments on the ASAP dataset using ChatGPT and Claude show that RES achieves up to a 34.86% improvement in average QWK over straightforward prompting (Vanilla) methods.
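The abstract reports results in quadratic weighted kappa (QWK), the standard agreement metric for essay scoring. As a rough illustration of how that number is computed, here is a minimal pure-Python sketch of the usual definition, kappa = 1 - sum(w_ij * O_ij) / sum(w_ij * E_ij) with w_ij = (i - j)^2 / (N - 1)^2. The function name, the argument names, and the sample scores are assumptions made for this example; they are not taken from the paper, and this is not the authors' evaluation code.

```python
from collections import Counter


def quadratic_weighted_kappa(human, model, min_score, max_score):
    """Quadratic weighted kappa between two equal-length lists of integer scores.

    Illustrative sketch only: names and conventions are assumptions,
    not the paper's evaluation code.
    """
    if len(human) != len(model) or not human:
        raise ValueError("score lists must be non-empty and of equal length")
    num_levels = max_score - min_score + 1
    if num_levels < 2:
        raise ValueError("the score range must contain at least two levels")

    total = len(human)
    observed = Counter(zip(human, model))  # O_ij: joint counts of (human, model)
    hist_human = Counter(human)            # marginal counts for the human rater
    hist_model = Counter(model)            # marginal counts for the model

    numerator = 0.0
    denominator = 0.0
    for i in range(min_score, max_score + 1):
        for j in range(min_score, max_score + 1):
            # Quadratic penalty: larger disagreements cost more.
            weight = (i - j) ** 2 / (num_levels - 1) ** 2
            # E_ij: count expected if the two raters were independent.
            expected = hist_human[i] * hist_model[j] / total
            numerator += weight * observed[(i, j)]
            denominator += weight * expected

    if denominator == 0:
        # Both raters gave one identical constant score; kappa is undefined.
        # Returning 1.0 is a convention chosen for this sketch.
        return 1.0
    return 1.0 - numerator / denominator


# Made-up scores on a 1-4 scale, for illustration only.
human_scores = [1, 2, 3, 4]
print(quadratic_weighted_kappa(human_scores, human_scores, 1, 4))  # 1.0 (perfect agreement)
```

A value of 1 means perfect agreement with the human scores, 0 means agreement no better than chance, and negative values mean systematic disagreement.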

Title: LLM Agents at the Roundtable: A Multi-Perspective and Dialectical Reasoning Framework for Essay Scoring

Authors: Jang, Jinhee; Moon, Ayoung; Jung, Minkyoung; Kim, Youngbin; Lee, Seung Jin

DOI: 10.18653/v1/2025.findings-emnlp.1072

Publication date: 2025-11

Type: Conference Paper

Venue: EMNLP 2025 - 2025 Conference on Empirical Methods in Natural Language Processing, Findings of EMNLP 2025

Pages: 19674-19687

ScholarWorks@Chung-Ang University
