This project is a video processing bot that extracts audio from video files, performs speech recognition, generates subtitles, and allows for text correction and dialect conversion. It utilizes various libraries for audio processing, speech recognition, and translation.
- Audio Extraction: Extracts audio from video files.
- Speech Recognition: Converts audio to text, using VAD (Voice Activity Detection) to split the audio into speech segments.
- Subtitle Generation: Creates SRT files for subtitles.
- Text Correction: Uses OpenAI's API to correct transcription errors.
- Dialect Conversion: Converts subtitles to specified dialects.
- Web Interface: A Flask-based web interface for user interaction.
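As a sketch of the subtitle-generation step: SRT output boils down to numbering each segment and formatting its start/end times as `HH:MM:SS,mmm`. The helper below is a minimal stdlib-only illustration, not the project's actual implementation:

```python
# Minimal sketch of SRT subtitle generation (illustrative only).
# Segments are (start_seconds, end_seconds, text) tuples.

def srt_timestamp(seconds: float) -> str:
    """Format a time in seconds as the SRT timestamp HH:MM:SS,mmm."""
    ms = int(round(seconds * 1000))
    h, ms = divmod(ms, 3_600_000)
    m, ms = divmod(ms, 60_000)
    s, ms = divmod(ms, 1_000)
    return f"{h:02d}:{m:02d}:{s:02d},{ms:03d}"

def to_srt(segments) -> str:
    """Render a list of timed text segments as an SRT document."""
    blocks = []
    for i, (start, end, text) in enumerate(segments, start=1):
        blocks.append(
            f"{i}\n{srt_timestamp(start)} --> {srt_timestamp(end)}\n{text}\n"
        )
    return "\n".join(blocks)
```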
- Python 3.x
- Required libraries:
  - flask
  - pyrogram
  - librosa
  - soundfile
  - numpy
  - deep_translator
  - transformers
  - openai
  - noisereduce
  - pyloudnorm
  - av
  - werkzeug
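These can be collected into a `requirements.txt` (versions left unpinned here; pin them as needed for your environment):

```
flask
pyrogram
librosa
soundfile
numpy
deep_translator
transformers
openai
noisereduce
pyloudnorm
av
werkzeug
```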
- Clone the repository:

  ```bash
  git clone <repository-url>
  cd <repository-directory>
  ```
- Install the required dependencies.
- Set up your OpenAI API key in the `make_it_correct.py` file.
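One common pattern for this step is to read the key from an environment variable instead of hard-coding it. The sketch below is an assumption about how `make_it_correct.py` could be wired up; the environment variable name and prompt wording are illustrative, not taken from the project:

```python
# Hedged sketch: loading the OpenAI key and building a correction prompt.
# OPENAI_API_KEY and the prompt text are assumptions, not project code.
import os

def get_api_key() -> str:
    """Read the key from the environment rather than hard-coding it."""
    key = os.environ.get("OPENAI_API_KEY", "")
    if not key:
        raise RuntimeError("OPENAI_API_KEY is not set")
    return key

def build_correction_prompt(transcript: str) -> str:
    """Assemble the instruction sent to the model (hypothetical wording)."""
    return (
        "Correct any speech-recognition errors in the following transcript, "
        "preserving the original meaning:\n\n" + transcript
    )

# The key would then be passed to the OpenAI client before the API call,
# e.g. openai.api_key = get_api_key()
```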
- Create the necessary directories for input and output files:

  ```bash
  mkdir -p data/input_videos data/audio_outputs data/text_outputs data/subtitles data/chunks
  ```
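If you prefer to create these directories from Python (for example at startup), a minimal stdlib sketch that mirrors the `mkdir -p` command above:

```python
# Sketch: ensure the working directories exist at startup.
# Directory names are taken from the setup step above.
from pathlib import Path

DIRS = [
    "data/input_videos",
    "data/audio_outputs",
    "data/text_outputs",
    "data/subtitles",
    "data/chunks",
]

def ensure_dirs(base: str = ".") -> None:
    """Create each directory if missing, like mkdir -p."""
    for d in DIRS:
        Path(base, d).mkdir(parents=True, exist_ok=True)
```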
To run the bot, execute the following command:

```bash
python bot.py
```

To access the web interface, run:

```bash
python app.py
```

Then navigate to http://localhost:5000 in your web browser.
- /start: Start the bot and receive instructions.
- Upload a video: Send a video file to the bot for processing.
- Upload a video file.
- Select the original language of the video.
- Choose the target language for subtitles.
- Decide if you want to enhance audio quality.
- Choose whether to correct the subtitles.
- Select the desired dialect for the subtitles.
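The workflow above collects a handful of per-video choices. One way to model them is a small dataclass; the field names below are assumptions for illustration, not the project's actual code:

```python
# Hypothetical container for the choices collected in the workflow above.
from dataclasses import dataclass
from typing import Optional

@dataclass
class ProcessingOptions:
    source_language: str            # original language of the video
    target_language: str            # language for the generated subtitles
    enhance_audio: bool             # denoise/normalize before recognition
    correct_subtitles: bool         # run OpenAI-based correction
    dialect: Optional[str] = None   # target dialect, if any
```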
Contributions are welcome! Please feel free to submit a pull request or open an issue for any suggestions or improvements.
This project is licensed under the MIT License. See the LICENSE file for details.
- OpenAI for providing the API for text correction.
- Hugging Face for the speech recognition models.
- Deep Translator for translation capabilities.