Zing Forum

Reading

ComfyUI-TranslateGemma: Integrating Google's Multilingual Translation Model into AI Art Workflows

ComfyUI-TranslateGemma is a node plugin that integrates Google's open-source TranslateGemma translation model into ComfyUI. It supports mutual translation between 55 languages, image text translation, and offers multiple model specifications and quantization options to adapt to different hardware configurations.

ComfyUITranslateGemmaGemmaAI绘画翻译模型多语言开源项目ComfyUI插件
Published 2026-06-13 17:16Recent activity 2026-06-13 17:21Estimated read 5 min
ComfyUI-TranslateGemma: Integrating Google's Multilingual Translation Model into AI Art Workflows
1

Section 01

Introduction: Core Overview of the ComfyUI-TranslateGemma Plugin

This article introduces the ComfyUI-TranslateGemma plugin, which integrates Google's open-source TranslateGemma translation model into ComfyUI. It supports mutual translation between 55 languages, image text translation, and provides multiple model specifications (4B/12B/27B) and quantization options to adapt to different hardware. This plugin addresses the pain point of cross-language prompt processing for AI art users, allowing translation without switching tools.

2

Section 02

Background of the TranslateGemma Model

TranslateGemma is an open-source translation model released by Google in 2025. Trained on the Gemma3 architecture, it can run completely locally (protecting privacy). It offers three specifications: 4B (lightweight, suitable for devices with limited VRAM), 12B (balanced performance and resource usage), and 27B (best quality, requiring large VRAM).

3

Section 03

Core Features of ComfyUI Integration

Core Functions: Supports mutual translation between 55 languages, image text translation, and automatic language detection. Highlights of January 2026 Update: Added a dedicated Chinese simplified-traditional conversion mode (implemented via OpenCC), intelligent token budget (automatic resource calculation), long text processing strategies (disable/auto-continue/segmented), device management (specify CPU/GPU), and BitsAndBytes quantization (8bit/4bit to reduce VRAM usage).

4

Section 04

Installation and Configuration Guide

Installation Methods:

  1. ComfyUI-Manager (Recommended): Search for "TranslateGemma", install, and restart.
  2. Manual: Clone the repository to ComfyUI/custom_nodes and install dependencies (requirements.txt; for quantization, use requirements-quantization.txt). Hugging Face Authorization: Accept the Gemma terms, log in to HF, or set the HF_TOKEN. Model Storage: Default path is ComfyUI/models/LLM/TranslateGemma/. If network is restricted, download manually and place it there.
5

Section 05

Usage Scenarios and Tips

Common Scenarios:

  • Prompt Translation: For example, Chinese "一位穿着汉服的少女在樱花树下" → English "A young girl wearing Hanfu under cherry blossom trees"
  • Image Text Translation: Extract and translate text from images (requires image input and source language)
  • Simplified-Traditional Conversion: Enable the chinese_conversion_only mode (fast, no VRAM usage) Optimization Suggestions: Choose 4B or 4bit quantization if VRAM is insufficient; use 27B for quality; use the segmented strategy for batch processing; download models in advance if network is restricted.
6

Section 06

Troubleshooting Solutions

Solutions to Common Issues:

  • Download Failure: Set up a proxy, use an HF mirror, or download manually.
  • 401/403 Errors: Unauthorized for Gemma; need to accept terms on HF and configure the token.
  • Insufficient VRAM: Switch to a smaller model or enable quantization.
7

Section 07

Project Significance and Summary

Significance: Fills the gap of local high-quality translation in the ComfyUI ecosystem, solves issues like latency, privacy, and restrictions of online APIs, and supports fully offline workflows. Summary: ComfyUI-TranslateGemma is a practical tool that integrates TranslateGemma's multilingual capabilities. It is suitable for scenarios like prompt translation and image text processing, and is worth trying for ComfyUI users.