Python sample code that runs TranslateGemma on Intel devices (CPU, GPU, NPU) using the OpenVINO GenAI pipeline.
Files in this repo:

- `translate.py`: the main pipeline (modified from `visual_language_chat.py`)
- `export-requirements.txt`: required Python packages for model export
- `deployment-requirements.txt`: required Python packages for model deployment
- `chat_template-gemma3.json`: Gemma3 chat_template used to work around the validation error of the OV GenAI VLM pipeline
- `text_en.txt`: an English text file used to test text translation
- `text_zh-TW.txt`: a Traditional Chinese text file used to test text translation
- `image_cs.jpg`: an image containing Czech characters, used to test image translation
- `image_en.png`: an image containing English characters, used to test image translation
Run the following command to install the packages required for model export. The `--upgrade-strategy eager` option is needed to ensure optimum-intel is upgraded to the latest version.
pip install --upgrade-strategy eager -r export-requirements.txt

The script needs to download models from Hugging Face. To get access, visit https://huggingface.co/google/translategemma-4b-it and log in.
Make sure your access token is ready and `huggingface-cli` is installed. Open a Command Prompt and run `huggingface-cli login` with your access token:

pip install "huggingface_hub[cli]<1.0,>=0.34.0"
huggingface-cli login
- Note: Transformers 4.55.4 requires `huggingface-hub<1.0,>=0.34.0`
Then, run the export with Optimum CLI:
optimum-cli export openvino --model google/translategemma-4b-it --trust-remote-code translategemma-4b-it

- Models will be exported under the model directory (`translategemma-4b-it` in this example)
- The `--weight-format` argument can be used to quantize the model; see Quantization for details
Run the following command to install the packages required for model deployment.
pip install -r deployment-requirements.txt
translate.py --model_dir MODEL_DIR
--text TEXT
--image IMAGE
--device {CPU,GPU,NPU}
--source_lang_code SOURCE_LANG_CODE
--target_lang_code TARGET_LANG_CODE
- The arguments `--model_dir`, `--source_lang_code` and `--target_lang_code` are required
- Either `--text TEXT` or `--image IMAGE` should be provided
- `--device` can be `CPU`, `GPU` or `NPU`
- Language code examples: `en`, `en-GB`, `zh` or `zh-TW`. The full list of language codes can be found here, or in `chat_template.jinja` under the model directory
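The argument handling described above can be sketched with `argparse`. This is an illustrative sketch of the command-line interface, not the repo's actual `translate.py`; the real script may differ in defaults and help text.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    # Illustrative sketch of translate.py's CLI (an assumption,
    # not the repo's actual implementation).
    parser = argparse.ArgumentParser(
        description="Translate text or an image with TranslateGemma via OpenVINO GenAI")
    parser.add_argument("--model_dir", required=True,
                        help="directory of the exported OpenVINO model")
    parser.add_argument("--device", choices=["CPU", "GPU", "NPU"], default="CPU",
                        help="inference device")
    parser.add_argument("--source_lang_code", required=True,
                        help="e.g. en, en-GB, zh, zh-TW")
    parser.add_argument("--target_lang_code", required=True,
                        help="e.g. en, en-GB, zh, zh-TW")
    # Exactly one of --text / --image must be given.
    group = parser.add_mutually_exclusive_group(required=True)
    group.add_argument("--text", help="path to a text file to translate")
    group.add_argument("--image", help="path to an image to translate")
    return parser


if __name__ == "__main__":
    args = build_parser().parse_args(
        ["--model_dir", "translategemma-4b-it", "--device", "GPU",
         "--source_lang_code", "zh-TW", "--target_lang_code", "en",
         "--text", "text_zh-TW.txt"])
    print(args.device, args.source_lang_code, args.target_lang_code)
```

With this structure, argparse itself enforces the rules listed above: missing required arguments, an invalid `--device`, or supplying both `--text` and `--image` all fail at parse time.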
Command:
python translate.py --model_dir translategemma-4b-it --device GPU --source_lang_code zh-TW --target_lang_code en --text text_zh-TW.txt
Result:
Input:
白日依山盡,黃河入海流;欲窮千里目,更上一層樓。
Output:
As the sun sets behind the mountains, the Yellow River flows into the sea. To gain a broader perspective, one must climb to a higher vantage point.
Command:
python translate.py --model_dir translategemma-4b-it --device GPU --source_lang_code cs --target_lang_code en --image image_cs.jpg
Output:
Pedestrian Zone
Child Supervision
IZS, CBS in Supervision
0 - 24 hours
When exporting a model, the `--weight-format` argument can be used to quantize the model. The supported weight formats are `int8`, `int4` and `nf4`. See the OpenVINO model preparation guide for details.
optimum-cli export openvino --model google/translategemma-4b-it --trust-remote-code --weight-format int8 translategemma-4b-it_int8

The pipeline is verified on an Intel(R) Core(TM) Ultra 5 238V (Lunar Lake) system with 32 GB memory. GPU/NPU driver info:

- GPU: Intel(R) Arc(TM) 130V GPU, driver 32.0.101.8425 (1/16/2026)
- NPU: Intel(R) AI Boost, driver 32.0.100.4514 (12/17/2025)
| Model | Weight | CPU | GPU | NPU |
|---|---|---|---|---|
| translategemma-4b-it | fp16 | OK | OK | OK |
| translategemma-4b-it | int8 | OK | OK | OK |
| translategemma-4b-it | int4 | OK | OK | OK (1) |
| translategemma-4b-it | nf4 | OK | OK | OK (1) |
| translategemma-12b-it | int8 | OK | OK | NG (2) |
| translategemma-12b-it | int4 | OK | OK | NG (3) |
| translategemma-12b-it | nf4 | OK | OK | NG (4) |
- (1) To run `int4` or `nf4` models on NPU, the arguments below are required when exporting the model. See LLM Inference on NPU for details.
  - for `int4`: `--weight-format int4 --sym --ratio 1.0 --group-size 128`
  - for `nf4`: `--weight-format nf4 --sym --ratio 1.0 --group-size -1`
- (2) Failed due to insufficient memory; see `log.txt` for details
- (3) Output is garbage; see `log.txt` for details
- (4) No output; see `log.txt` for details
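The NPU-specific export flags above can be captured in a small helper that builds the extra optimum-cli arguments for a given weight format. This is a hypothetical convenience function for illustration, not part of the repo; the flag values mirror the list above.

```python
def npu_weight_args(weight_format):
    """Return the extra optimum-cli flags needed so int4/nf4 exports run on NPU.

    Hypothetical helper (not part of this repo); values mirror the flags
    listed above. See LLM Inference on NPU for background.
    """
    # int4 needs a group size of 128; nf4 needs per-channel (-1) grouping.
    group_size = {"int4": "128", "nf4": "-1"}
    if weight_format not in group_size:
        raise ValueError("NPU-specific flags only apply to int4 and nf4")
    return ["--weight-format", weight_format,
            "--sym", "--ratio", "1.0",
            "--group-size", group_size[weight_format]]


if __name__ == "__main__":
    # prints: --weight-format int4 --sym --ratio 1.0 --group-size 128
    print(" ".join(npu_weight_args("int4")))
```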
The full log (`log.txt`) is provided for reference.
