Vistaar-ASR

This repository evaluates the Whisper Hindi speech-to-text model on the Kathbath dataset. The model is trained using the Whisper framework and is designed to recognize Hindi speech. The evaluation script uses the evaluation.py file to test the model on the Kathbath dataset.

Setup

Clone the repository using git clone https://github.com/AI4Bharat/vistaar.git.
Install the required packages by running sudo apt install ffmpeg and pip install -r /content/vistaar/requirements.txt.
Navigate to the repository directory using cd /content/vistaar.

Download Dataset

Download the Kathbath dataset using wget https://objectstore.e2enetworks.net/indic-asr-public/indicwhisper/vistaar_benchmarks/kathbath.zip.
Download the Whisper Hindi model using wget https://objectstore.e2enetworks.net/indic-asr-public/indicwhisper/all_lang_models/hindi_models.zip.
Download the Gramvaani dataset using wget https://asr.iitm.ac.in/Gramvaani/NEW/GV_Eval_3h.tar.gz.

Evaluation

Unzip the downloaded files using unzip kathbath.zip, unzip hindi_models.zip, and tar -xf GV_Eval_3h.tar.gz.
Run the evaluation script using python evaluation.py with the following arguments:

python evaluation.py \
  --model_path /path/to/whisper-hindi-model \
  --manifest_path /path/to/kathbath/manifest.json \
  --manifest_name manifest \
  --device GPU/or/CPU \
  --batch_size 8 \
  --language Hindi/or/other/language

Output

The evaluation script outputs the model's performance metrics, including the character error rate (CER): 3.76% and word error rate (WER): 10.42% with the time taken for evaluation, 56 min.

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
LICENSE		LICENSE
README.md		README.md
indic_whisper.ipynb		indic_whisper.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Vistaar-ASR

Setup

Download Dataset

Evaluation

Output

About

Uh oh!

Releases

Packages

Languages

License

j0gi-18/Vistaar-ASR

Folders and files

Latest commit

History

Repository files navigation

Vistaar-ASR

Setup

Download Dataset

Evaluation

Output

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages