This is the official GitHub repository for the WACV'25 paper NarrAD: Automatic Generation of Audio Descriptions for Movies with Rich Narrative Context.
[https://ieeexplore.ieee.org/abstract/document/10944028]
Qualitative results of NarrAD are available below.

You can check out several demo videos here: [https://bit.ly/4aSwOTr].
You can check out full outputs on the MAD evaluation set here: [https://drive.google.com/drive/folders/1PIjL6qpZt4D2nxQwwMD7iZ9xmlfjRiuh?usp=drive_link].
- Download the MAD dataset from https://github.com/Soldelli/MAD and place the `mad-v2-ad-named.csv` file in the `datasets` directory, renaming it to `MAD_train.csv`.
- Movie frames are saved in the format `frame_000000.png` in the `videos/{movie}` directory. Due to file size constraints, only frames for selected samples are provided.
- Movie scripts used for AD creation can be found in the `scripts` directory. We have pre-parsed the movie scripts and generated `lines.csv`, `stage_directions.txt`, and `scenes.csv`. `transcribe.csv` has been pre-generated using the Google Cloud API.
- Prompts for using GPT can be found in the `prompts` directory, and the generated results are saved in the `results` directory.
`ROOT_DIR` refers to the root directory of the project, and `API_KEY` refers to your OpenAI API key.
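The setup steps above can be sketched as a short shell snippet. The directory names come from this README; the MAD download itself is stubbed here with a placeholder file, so replace the `touch` line with the real `mad-v2-ad-named.csv` from the MAD repository:

```shell
# Sketch of the expected project layout (placeholder file stands in for the
# real annotations downloaded from https://github.com/Soldelli/MAD).
ROOT_DIR="${ROOT_DIR:-$PWD/narrad_demo}"
mkdir -p "$ROOT_DIR/datasets" "$ROOT_DIR/videos" "$ROOT_DIR/scripts" \
         "$ROOT_DIR/prompts" "$ROOT_DIR/results"
touch mad-v2-ad-named.csv                 # placeholder for the downloaded CSV
mv mad-v2-ad-named.csv "$ROOT_DIR/datasets/MAD_train.csv"
```

Frames then go under `videos/{movie}` as `frame_000000.png`, `frame_000001.png`, and so on.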
```shell
python src/main.py --rootdir $ROOT_DIR --api_key $API_KEY --task synchronize
python src/main.py --rootdir $ROOT_DIR --api_key $API_KEY --task generate
python src/main.py --rootdir $ROOT_DIR --api_key $API_KEY --task curate
```
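Since the three stages run in a fixed order, a small wrapper can drive them in sequence. The helper `run_narrad` below is hypothetical (not part of the repository); it only prints each command, so drop the `echo` to actually execute the pipeline:

```shell
# Hypothetical wrapper: emits the three NarrAD stage commands in order.
# Usage: run_narrad <root_dir> <api_key>
run_narrad() {
  for task in synchronize generate curate; do
    echo python src/main.py --rootdir "$1" --api_key "$2" --task "$task"
  done
}
run_narrad /path/to/NarrAD "$OPENAI_API_KEY"
```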
Please cite NarrAD as:
```bibtex
@inproceedings{park2025narrad,
  title={NarrAD: Automatic Generation of Audio Descriptions for Movies with Rich Narrative Context},
  author={Park, Jaehyeong and Ye, Juncheol and Lee, Seungkook and Ka, Hyun W and Han, Dongsu},
  booktitle={2025 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV)},
  pages={409--419},
  year={2025},
  organization={IEEE}
}
```
