Minibench: ExecuTorch Android Benchmark App

Minibench is a benchmarking app for testing the performance of the ExecuTorch runtime on Android devices.

It supports both generic (vision, audio, etc.) models and LLMs.

For generic models, it reports metrics such as model load time and average inference time.
For LLMs, it reports metrics such as model load time and tokens per second.
We plan to add more metrics in the future.

Minibench is useful for providing reference performance data when developers integrate ExecuTorch into their own Android apps.

Build

You will need the ExecuTorch AAR for the Java and JNI dependencies.

export ANDROID_NDK=<path_to_android_ndk>
sh build/build_android_llm_demo.sh

Then copy the AAR to app/libs.

mkdir -p app/libs
cp $BUILD_AAR_DIR/executorch.aar app/libs

You can also refer to this script to see how it is built.

Then you can build and install the app in Android Studio, or simply run

./gradlew installDebug

Usage

This APK does not come with a launcher icon. Instead, trigger it from the command line.

Push model to a directory

adb shell mkdir /data/local/tmp/minibench
adb push my_model.pte /data/local/tmp/minibench
# optionally, push tokenizer for LLM
adb push tokenizer.bin /data/local/tmp/minibench
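The push step above can be wrapped in a small helper that fails early when a local file is missing and lists the device directory afterwards. This is only a sketch: the function name `push_files` and the `ADB` and `DEVICE_DIR` variables are hypothetical and not part of Minibench.

```shell
# Hypothetical helper, not part of Minibench.
# ADB can be overridden to point at a specific adb binary.
ADB="${ADB:-adb}"
DEVICE_DIR=/data/local/tmp/minibench

# Push each given file to DEVICE_DIR, stopping at the first missing local file.
push_files() {
  "$ADB" shell mkdir -p "$DEVICE_DIR" || return 1
  for f in "$@"; do
    if [ ! -f "$f" ]; then
      echo "missing local file: $f" >&2
      return 1
    fi
    "$ADB" push "$f" "$DEVICE_DIR" || return 1
  done
  # List the directory so you can confirm what is on the device.
  "$ADB" shell ls -l "$DEVICE_DIR"
}
```

For example, `push_files my_model.pte tokenizer.bin` pushes both files, and `push_files my_model.pte` pushes only the model.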

Generic model

adb shell am start -W -S -n org.pytorch.minibench/org.pytorch.minibench.LlmBenchmarkActivity \
 --es model_dir /data/local/tmp/minibench

LLM

adb shell am start -W -S -n org.pytorch.minibench/org.pytorch.minibench.LlmBenchmarkActivity \
 --es model_dir /data/local/tmp/minibench --es tokenizer_path /data/local/tmp/minibench/tokenizer.bin

Fetch results

adb shell run-as org.pytorch.minibench cat files/benchmark_results.json

If the ExecuTorch runner is initialized and loads your model, but there is a load error or run error, you will see the error code in that JSON.
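To keep a copy of the results on the host, the fetch command can be redirected to a local file. The sketch below assumes nothing about the JSON layout; the function name `fetch_results`, the `ADB` variable, and the default output file name are hypothetical.

```shell
# Hypothetical helper, not part of Minibench.
# ADB can be overridden to point at a specific adb binary.
ADB="${ADB:-adb}"

# Copy the results JSON from the app's private storage to a local file.
fetch_results() {
  out="${1:-benchmark_results.json}"
  "$ADB" shell run-as org.pytorch.minibench cat files/benchmark_results.json > "$out" || return 1
  # Fail if nothing was written; an empty file suggests no results are available yet.
  [ -s "$out" ]
}
```

For example, `fetch_results results.json && python3 -m json.tool results.json` saves the file and pretty-prints it, assuming `python3` is installed on the host.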
