Bhashini's Bengali-Gujarati Translation Benchmark is a text dataset for evaluating machine translation quality. It includes document-level information and is intended to help researchers build better multilingual translation systems.
The NTREX_bn_gu_benchmark dataset provides news test references for machine translation (MT) evaluation in the Bengali-to-Gujarati direction. It is part of a larger collection that supports translation into 128 target languages, and it includes document-level information. The data covers the news domain and is intended for assessing translation quality and improving multilingual translation systems. It was submitted by Microsoft as a benchmark for researchers and practitioners working on Bengali-to-Gujarati translation.
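As an illustration of how test references like these are typically used, the sketch below scores system output against reference translations with a simplified character n-gram F-score (in the spirit of chrF, not the official metric). The function names, the `max_n` and `beta` defaults, and the placeholder sentence pairs are assumptions made for this example; they are not part of the dataset or of any library. Real evaluations would normally use an established metric implementation and the actual Gujarati references.

```python
from collections import Counter


def char_ngrams(text, n):
    """Count character n-grams of length n, ignoring whitespace."""
    chars = "".join(text.split())
    return Counter(chars[i:i + n] for i in range(len(chars) - n + 1))


def simple_chrf(hypothesis, reference, max_n=3, beta=2.0):
    """Simplified chrF-style score in [0, 1].

    Precision and recall over character n-grams are averaged for
    n = 1..max_n, then combined into an F-beta score. This is an
    illustrative approximation, not the official chrF definition.
    """
    precisions, recalls = [], []
    for n in range(1, max_n + 1):
        hyp = char_ngrams(hypothesis, n)
        ref = char_ngrams(reference, n)
        if not hyp or not ref:
            continue  # string too short to contain n-grams of this order
        overlap = sum((hyp & ref).values())  # clipped n-gram matches
        precisions.append(overlap / sum(hyp.values()))
        recalls.append(overlap / sum(ref.values()))
    if not precisions:
        return 0.0
    p = sum(precisions) / len(precisions)
    r = sum(recalls) / len(recalls)
    if p + r == 0:
        return 0.0
    return (1 + beta ** 2) * p * r / (beta ** 2 * p + r)


def corpus_score(hypotheses, references):
    """Mean sentence-level score over aligned hypothesis/reference pairs."""
    if len(hypotheses) != len(references):
        raise ValueError("hypotheses and references must be aligned")
    if not hypotheses:
        return 0.0
    scores = [simple_chrf(h, r) for h, r in zip(hypotheses, references)]
    return sum(scores) / len(scores)


if __name__ == "__main__":
    # Placeholder strings standing in for system output and reference text.
    system_output = ["the minister spoke today", "rain is expected"]
    reference_text = ["the minister spoke today", "rain is likely"]
    print(round(corpus_score(system_output, reference_text), 3))
```

Python strings are sequences of Unicode code points, so the same functions accept Gujarati text unchanged, although combining marks are then counted as separate characters.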
License: CC BY-SA 4.0
© 2025 - Copyright AIKosha. All rights reserved. This portal is developed by National e-Governance Division for IndiaAI mission.