Bengali to Gujarati Translation Benchmark Dataset

Bhashini's Bengali-Gujarati Translation Benchmark is a detailed text dataset for testing machine translation quality. It includes document-level information and helps researchers build better multilingual translation systems.

About Dataset

The NTREX_bn_gu_benchmark dataset provides news test references for machine translation (MT) evaluation from Bengali to Gujarati. It is part of a larger collection that supports translation into 128 target languages, and it includes document-level information. The dataset is tailored to the news domain and is intended for assessing translation quality and improving multilingual translation systems. Submitted by Microsoft, it serves as a benchmark for researchers and practitioners working on Bengali-to-Gujarati translation.

Activity Overview

  • Downloads 2
  • Views 31
  • File Size 1.37 MB

Tags

  • Translation
  • Document-Level Evaluation
  • NLP Dataset
  • Language Modeling
  • Bilingual Translation
  • Benchmark
  • News Domain
  • Machine Translation
  • Microsoft
  • Bengali-Gujarati

License Control

CC BY-SA 4.0

params.json (1.04 KB)


Version Control

Version 1 (1.37 MB)
  • admin · 18 day(s) ago
    • params.json (application/json)
    • data.json (application/json)
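
The page does not document the layout of data.json, so the following is only a minimal sketch of how the file might be read into source/reference pairs for MT evaluation. The helper name load_pairs and the field names "source" and "target" are assumptions made for illustration, not part of the published dataset; inspect the downloaded file and adjust the keys before relying on it.

```python
import json
from pathlib import Path


def load_pairs(path, src_key="source", tgt_key="target"):
    """Return (source, reference) pairs from a JSON file.

    Assumes the file holds a list of records, or a dict that contains
    such a list. src_key and tgt_key are hypothetical field names.
    """
    with Path(path).open(encoding="utf-8") as fh:
        payload = json.load(fh)

    # Accept a top-level list, or fall back to the first list-valued field.
    if isinstance(payload, dict):
        records = next((v for v in payload.values() if isinstance(v, list)), [])
    else:
        records = payload

    pairs = []
    for record in records:
        # Skip anything that does not carry both assumed fields.
        if isinstance(record, dict) and src_key in record and tgt_key in record:
            pairs.append((record[src_key], record[tgt_key]))
    return pairs
```

Under these assumptions, a call such as load_pairs("data.json") would give a list of Bengali source and Gujarati reference strings that can be passed to an evaluation metric. If the call returns an empty list, the assumed keys most likely do not match the actual file.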