Government Of India
English to Bengali Translation Benchmark Dataset

Bhashini's English-Bengali Translation Benchmark is a text dataset for evaluating machine translation quality. It includes document-level information and is intended to help researchers build and compare multilingual translation systems.

About Dataset

The dataset NTREX_en_bn_benchmark_2 provides news-domain test references for evaluating Machine Translation (MT) from English to Bengali. It is part of a broader collection that supports translation into 128 target languages, and it includes document-level information, which makes it suitable for multilingual MT benchmarking. The dataset was submitted by Microsoft and is aimed at researchers and developers working on English-to-Bengali translation.

Activity Overview

  • Downloads: 2
  • Views: 28
  • File Size: 1007.50 KB

Tags

  • Translation
  • Document-Level Evaluation
  • NLP Dataset
  • Language Modeling
  • Bilingual Translation
  • Benchmark
  • News Domain
  • Machine Translation
  • Microsoft
  • English-Bengali

License Control

CC BY-SA 4.0

data.json (1006.46 KB)

Version Control

Version 1 (1007.50 KB)
  • admin · 18 day(s) ago
    • data.json (application/json)
    • params.json (application/json)
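
This page does not document the internal layout of data.json or params.json, so the following is only a minimal Python sketch for inspecting a downloaded copy before writing a loader. It assumes the files are valid UTF-8 JSON and sit in the current directory. The helper names `describe` and `describe_file` are hypothetical and are not part of any Bhashini or NTREX tooling.

```python
from __future__ import annotations

import json
from pathlib import Path
from typing import Any


def describe(value: Any, depth: int = 0, max_depth: int = 3) -> list[str]:
    """Return indented lines describing the shape of a parsed JSON value."""
    pad = "  " * depth
    if isinstance(value, dict):
        lines = [f"{pad}object with {len(value)} key(s)"]
        if depth < max_depth:
            for key, child in value.items():
                lines.append(f"{pad}  {key!r}:")
                lines.extend(describe(child, depth + 2, max_depth))
        return lines
    if isinstance(value, list):
        lines = [f"{pad}array with {len(value)} item(s)"]
        # Only the first item is inspected; this assumes items share one shape.
        if value and depth < max_depth:
            lines.extend(describe(value[0], depth + 1, max_depth))
        return lines
    # Scalars (str, int, float, bool, None) are reported by type name only.
    return [f"{pad}{type(value).__name__}"]


def describe_file(path: str) -> list[str]:
    """Parse a UTF-8 JSON file and describe its structure."""
    with Path(path).open(encoding="utf-8") as handle:
        return describe(json.load(handle))


if __name__ == "__main__":
    # File names are taken from the version listing above.
    for name in ("data.json", "params.json"):
        if Path(name).exists():
            print(name)
            print("\n".join(describe_file(name)))
```

Reading with an explicit `encoding="utf-8"` matters here because Bengali text would otherwise depend on the platform's default encoding. Once the actual keys are known, the generic inspection can be replaced with a loader that pairs each English source segment with its Bengali reference.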