AI4Bharat - Fastspeech2 Model using Hybrid Segmentation (HS): Text to Speech Model

Text-to-speech models trained with FastPitch and a HiFi-GAN vocoder, separately for each language. Both female and male voices are supported.

  • AI4Bharat
  • Nikhil_Narasimhan

About Model

This repository contains Fastspeech2 (FS2) models for 16 Indian languages, with both male and female voices, built using Hybrid Segmentation (HS) for speech synthesis. The model generates mel-spectrograms from text input and can be used to synthesize speech. FS2 is composed of 6 feed-forward Transformer blocks, with multi-head self-attention and 1D convolution, on both the phoneme encoder and the mel-spectrogram decoder.
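To make the block structure described above concrete, here is a minimal pure-Python sketch of a stack of 6 feed-forward Transformer blocks, each combining multi-head self-attention with two 1D convolutions. Only the block count comes from the description; the toy sizes (`d_model=8`, `d_hidden=16`, kernel size 3, 2 heads), the random weights, the post-norm residual layout, and all function names are illustrative assumptions. It is not the repository's code, and it omits positional encodings, biases, and everything between encoder and decoder.

```python
import math
import random

def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]

def softmax(row):
    m = max(row)
    exps = [math.exp(v - m) for v in row]
    total = sum(exps)
    return [e / total for e in exps]

def layer_norm(x, eps=1e-5):
    """Normalize each time step to zero mean and (near) unit variance."""
    out = []
    for row in x:
        mean = sum(row) / len(row)
        var = sum((v - mean) ** 2 for v in row) / len(row)
        out.append([(v - mean) / math.sqrt(var + eps) for v in row])
    return out

def add(a, b):
    return [[x + y for x, y in zip(ra, rb)] for ra, rb in zip(a, b)]

def rand_matrix(rows, cols, rng):
    return [[rng.uniform(-0.1, 0.1) for _ in range(cols)] for _ in range(rows)]

def multi_head_self_attention(x, params, n_heads):
    d_head = len(x[0]) // n_heads
    q, k, v = (matmul(x, params[name]) for name in ("wq", "wk", "wv"))
    heads = []
    for h in range(n_heads):
        sl = slice(h * d_head, (h + 1) * d_head)
        qh, kh, vh = ([row[sl] for row in m] for m in (q, k, v))
        scores = matmul(qh, [list(col) for col in zip(*kh)])  # (T x T)
        weights = [softmax([s / math.sqrt(d_head) for s in row]) for row in scores]
        heads.append(matmul(weights, vh))
    # Concatenate the heads per time step, then apply the output projection.
    concat = [sum((head[t] for head in heads), []) for t in range(len(x))]
    return matmul(concat, params["wo"])

def conv1d(x, kernels):
    """Same-padded 1D convolution over time; kernels[k] is a (d_in x d_out) matrix."""
    pad = len(kernels) // 2
    zero = [0.0] * len(x[0])
    padded = [zero] * pad + x + [zero] * pad
    out = []
    for t in range(len(x)):
        acc = [0.0] * len(kernels[0][0])
        for k, kernel in enumerate(kernels):
            contrib = matmul([padded[t + k]], kernel)[0]
            acc = [a + c for a, c in zip(acc, contrib)]
        out.append(acc)
    return out

def fft_block(x, params, n_heads):
    """One block: self-attention, then conv -> ReLU -> conv, each with residual + norm."""
    x = layer_norm(add(x, multi_head_self_attention(x, params, n_heads)))
    hidden = [[max(0.0, v) for v in row] for row in conv1d(x, params["conv1"])]
    return layer_norm(add(x, conv1d(hidden, params["conv2"])))

def init_block(d_model, d_hidden, ksize, rng):
    p = {name: rand_matrix(d_model, d_model, rng) for name in ("wq", "wk", "wv", "wo")}
    p["conv1"] = [rand_matrix(d_model, d_hidden, rng) for _ in range(ksize)]
    p["conv2"] = [rand_matrix(d_hidden, d_model, rng) for _ in range(ksize)]
    return p

def run_stack(x, blocks, n_heads):
    for params in blocks:
        x = fft_block(x, params, n_heads)
    return x

rng = random.Random(0)
N_BLOCKS = 6  # from the model description; every other size here is a toy assumption
blocks = [init_block(d_model=8, d_hidden=16, ksize=3, rng=rng) for _ in range(N_BLOCKS)]
phoneme_embeddings = rand_matrix(5, 8, rng)  # 5 phonemes, 8-dim embeddings (made up)
encoded = run_stack(phoneme_embeddings, blocks, n_heads=2)
print(len(encoded), len(encoded[0]))
```

The stack preserves the sequence length and feature width, so the same kind of block can in principle serve on both the phoneme side and the mel-spectrogram side, as the description states. A practical implementation would use a tensor library rather than nested lists.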

Metadata

MIT

IIT Madras

Text to Speech

open

AI4Bharat

Sector Agnostic

21/02/25 13:21:39

Nikhil Narasimhan

0

Activity Overview

  • Downloads 45
  • Views 556
  • File Size 0

Tags

  • Multilingual
  • NLP
  • Text Processing
  • Transformer
  • Text to Speech
  • Language Detection

License Control

MIT

Version Control

No Record(s) Found

No Version(s) Found