Convert FireRedASR-AED to ONNX format with batch inference support, to accelerate inference while maintaining the original ASR performance.
- Create and activate the conda environment
  ```shell
  conda create -n asr_export python=3.12
  conda activate asr_export
  ```
- Install dependencies
  ```shell
  pip install -r requirements.txt
  ```
  Note: onnxruntime-gpu 1.22.0 requires glibc >= 2.27.
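Since onnxruntime-gpu 1.22.0 requires glibc >= 2.27, it can help to check the host's glibc before installing. A minimal standard-library sketch (the helper name is ours, not part of this repo):

```python
import platform

def meets_glibc_requirement(version_str: str, required=(2, 27)) -> bool:
    """Compare a dotted glibc version string against a required (major, minor)."""
    parts = tuple(int(p) for p in version_str.split(".")[:2])
    return parts >= required

# platform.libc_ver() reports e.g. ("glibc", "2.31") on most Linux systems
libc_name, libc_version = platform.libc_ver()
if libc_name == "glibc":
    print("glibc >= 2.27:", meets_glibc_requirement(libc_version))
else:
    print("Not glibc (e.g. musl or non-Linux); check manually.")
```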
- Download or prepare the FireRedASR-AED weights, e.g.,
  ```shell
  huggingface-cli download FireRedTeam/FireRedASR-AED-L --local-dir ./weights/FireRedASR-AED-L
  ```
- Export FireRedASR-AED to ONNX (saved to `onnx_folder_path`)
  ```shell
  python Export_FireRedASR_AED_Batch.py --model_path ./weights/FireRedASR-AED-L --project_path ./FireRedASR --onnx_folder_path ./onnx_model
  ```
- (Optional, with limited improvement) Optimize the exported ONNX models with ONNXSlim (saved to `./onnx_model` by default)
  ```shell
  python Optim_FireRedASR_AED_ONNX_Batch.py --input onnx_model --output onnx_slim
  ```
- Inference with CUDA
  ```shell
  python Inference_FireRedASR_AED_ONNX_Batch.py --model_path ./weights/FireRedASR-AED-L --project_path ./FireRedASR --onnx_folder_path ./onnx_slim --batch_size 4
  ```
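Batch inference requires padding variable-length utterances to a common length before feeding them to the exported encoder. The sketch below illustrates the padding idea with plain Python lists; the function name and scalar "features" are ours for illustration, not this repo's API, which additionally tracks true lengths so padded frames can be masked out:

```python
def pad_batch(feature_seqs, pad_value=0.0):
    """Pad variable-length feature sequences to the longest one in the batch.

    feature_seqs: list of per-utterance frame lists.
    Returns (padded_batch, lengths); lengths records each utterance's true
    frame count so padding can be masked during attention.
    """
    lengths = [len(seq) for seq in feature_seqs]
    max_len = max(lengths)
    padded = [seq + [pad_value] * (max_len - len(seq)) for seq in feature_seqs]
    return padded, lengths

# Three "utterances" of 2, 4, and 3 frames (scalar features for brevity)
batch, lens = pad_batch([[1.0, 2.0], [1.0, 2.0, 3.0, 4.0], [5.0, 6.0, 7.0]])
print(lens)           # → [2, 4, 3]
print(len(batch[0]))  # → 4 (every row padded to the max length)
```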