Indian Flag
Government Of India
A-
A
A+
Punjabi ASR Benchmark Dataset (Kathbath hard Punjabi)

Punjabi ASR Benchmark Dataset (Kathbath hard Punjabi)

Hard Punjabi ASR (Automatic Speech Recognition) benchmark dataset from Bhashini for supporting the development of robust regional speech recognition systems.

About Dataset

The kathbath_hard_punjabi dataset is a Punjabi Automatic Speech Recognition (ASR) benchmark dataset. Designed to evaluate ASR models for the Punjabi language, it includes challenging scenarios and diverse data from news and general domains. This dataset is an essential resource for researchers and developers working on Punjabi speech recognition, offering a robust foundation for building and benchmarking ASR systems. Submitted by AI4Bharat, it contributes to advancing speech technology for low-resource languages.

Activity Overview Activity Overview

  • Downloads 2
  • Views 18
  • File Size 171.68 MB

Tags Tags

  • NLP Dataset
  • Benchmark
  • News Domain
  • Punjabi
  • General Domain
  • Low-Resource Languages
  • Automatic Speech Recognition
  • AI4Bharat
  • ASR
  • Speech Processing

License Control License Control

CC BY-SA 4.0

844424930437090-34-f.wav ( 140.09 KB )


To preview this file, you need to be a registered user. Please complete the registration process to gain access and continue viewing the content.

Version Control Version Control

FolderVersion 1(171.68 MB)
  • admin·19 day(s) ago
    • audio/wav
      844424930437090-34-f.wav
    • audio/wav
      844424930437087-34-f.wav
    • audio/wav
      844424930437081-34-f.wav
    • audio/wav
      844424930437114-34-f.wav
    • audio/wav
      844424930437111-34-f.wav
    • audio/wav
      844424930437085-34-f.wav
    • audio/wav
      844424930437099-34-f.wav
    • audio/wav
      844424930437105-34-f.wav
    • audio/wav
      844424930437073-34-f.wav
    • audio/wav
      844424930437060-34-f.wav
    • more_horiz 40 more