Indian Flag
Government Of India
A-
A
A+

Indic-Conformer model for ASR

Indo-Aryan Indic-Conformer is a multilingual speech model for North-Indian languages. This model is based on Conformer large architecture, with 115M parameters.

  • Digital India BHASHINI Division
    Digital India BHASHINI Division
  • BHASHINI_shailendra
    BHASHINI_shailendra

About Model

Bhashini - The Indo-Aryan Indic-Conformer is a multilingual automatic speech recognition (ASR) model designed specifically for North-Indian languages. It is based on the Conformer large architecture, which is known for its efficiency and accuracy in processing speech signals. The model contains 115 million parameters, enabling it to effectively transcribe spoken language into text with high precision.

This ASR model has been trained on the Shrutlip dataset, a rich dataset designed to enhance automatic speech recognition capabilities in Indian languages. The model primarily supports the Odia language and has been developed by AI4Bharat, a leading research initiative focused on advancing AI-driven solutions for Indian languages.

With a batch processing setup, this model is optimized for large-scale speech-to-text tasks across general domains. It is a valuable resource for applications in speech transcription, voice-enabled interfaces, digital accessibility, and natural language processing (NLP) research. Given the increasing demand for multilingual ASR systems, this model serves as a foundational tool for improving speech technology in India’s diverse linguistic landscape.

The Indo-Aryan Indic-Conformer is open-source, and its implementation is available on GitHub, making it accessible for researchers, developers, and AI practitioners working in the domain of Indian language speech processing.

For more details about the use of model, refer to github: https://github.com/AI4Bharat/IndicTrans2/tree/main

Indic-Conformer model for ASR

Metadata Metadata

MIT

AI4Bharat

Speech Recognition Model

Open

Digital India BHASHINI Division

Sector Agnostic

05/03/25 15:23:44

Admin

64.91 KB

Activity Overview Activity Overview

  • Downloads 13
  • Views 192
  • File Size 64.91 KB

Tags Tags

  • Automatic Speech Recognition
  • Speech Technology
  • Speech Processing
  • Speech Lab
  • Bhashini

License Control License Control

MIT

Version Control Version Control

FolderVersion 1(64.91 KB)
  • admin·1 month(s) ago
    • zip
      indic-asr-api-backend-master.zip