A Bengali-language evaluation model aimed at assessing multilingual Large Language Models by scoring outputs against English reference responses.
Hercule-bn is a Bengali-language evaluation model from the CIA Suite, designed to assess multilingual Large Language Models (LLMs). By leveraging English reference responses, it evaluates Bengali outputs, ensuring accurate scoring and alignment with human evaluations. The model is fine-tuned on the INTEL dataset and supports zero-shot evaluations for languages not seen during training. It provides feedback with scores ranging from 1 to 5, and users can access wrapper functions and classes for seamless integration from the associated GitHub repository.
MIT
Sumanth Doddapaneni and Mohammed Safi Ur Rahman Khan and Dilip Venkatesh and Raj Dabre and Anoop Kunchukuttan and Mitesh M. Khapra
Evaluator Language model
open
AI4Bharat
Sector Agnostic
21/02/25 13:21:04
Nikhil Narasimhan
0
MIT
© 2025 - Copyright AIKosha. All rights reserved. This portal is developed by National e-Governance Division for IndiaAI mission.