Reading

ComfyUI-TranslateGemma: Integrating Google's Multilingual Translation Model into AI Art Workflows

ComfyUI-TranslateGemma is a node plugin that integrates Google's open-source TranslateGemma translation model into ComfyUI. It supports mutual translation between 55 languages, image text translation, and offers multiple model specifications and quantization options to adapt to different hardware configurations.

ComfyUITranslateGemmaGemmaAI绘画翻译模型多语言开源项目ComfyUI插件

Published 2026-06-13 17:16Recent activity 2026-06-13 17:21Estimated read 5 min

ComfyUI-TranslateGemma: Integrating Google's Multilingual Translation Model into AI Art Workflows

Section 01

Introduction: Core Overview of the ComfyUI-TranslateGemma Plugin

This article introduces the ComfyUI-TranslateGemma plugin, which integrates Google's open-source TranslateGemma translation model into ComfyUI. It supports mutual translation between 55 languages, image text translation, and provides multiple model specifications (4B/12B/27B) and quantization options to adapt to different hardware. This plugin addresses the pain point of cross-language prompt processing for AI art users, allowing translation without switching tools.

Section 02

Background of the TranslateGemma Model

TranslateGemma is an open-source translation model released by Google in 2025. Trained on the Gemma3 architecture, it can run completely locally (protecting privacy). It offers three specifications: 4B (lightweight, suitable for devices with limited VRAM), 12B (balanced performance and resource usage), and 27B (best quality, requiring large VRAM).

Section 03

Core Features of ComfyUI Integration

Core Functions: Supports mutual translation between 55 languages, image text translation, and automatic language detection. Highlights of January 2026 Update: Added a dedicated Chinese simplified-traditional conversion mode (implemented via OpenCC), intelligent token budget (automatic resource calculation), long text processing strategies (disable/auto-continue/segmented), device management (specify CPU/GPU), and BitsAndBytes quantization (8bit/4bit to reduce VRAM usage).

Section 04

Installation and Configuration Guide

Installation Methods:

ComfyUI-Manager (Recommended): Search for "TranslateGemma", install, and restart.
Manual: Clone the repository to ComfyUI/custom_nodes and install dependencies (requirements.txt; for quantization, use requirements-quantization.txt). Hugging Face Authorization: Accept the Gemma terms, log in to HF, or set the HF_TOKEN. Model Storage: Default path is ComfyUI/models/LLM/TranslateGemma/. If network is restricted, download manually and place it there.

Section 05

Usage Scenarios and Tips

Common Scenarios:

Prompt Translation: For example, Chinese "一位穿着汉服的少女在樱花树下" → English "A young girl wearing Hanfu under cherry blossom trees"
Image Text Translation: Extract and translate text from images (requires image input and source language)
Simplified-Traditional Conversion: Enable the chinese_conversion_only mode (fast, no VRAM usage) Optimization Suggestions: Choose 4B or 4bit quantization if VRAM is insufficient; use 27B for quality; use the segmented strategy for batch processing; download models in advance if network is restricted.

Section 06

Troubleshooting Solutions

Solutions to Common Issues:

Download Failure: Set up a proxy, use an HF mirror, or download manually.
401/403 Errors: Unauthorized for Gemma; need to accept terms on HF and configure the token.
Insufficient VRAM: Switch to a smaller model or enable quantization.

Section 07

Project Significance and Summary

Significance: Fills the gap of local high-quality translation in the ComfyUI ecosystem, solves issues like latency, privacy, and restrictions of online APIs, and supports fully offline workflows. Summary: ComfyUI-TranslateGemma is a practical tool that integrates TranslateGemma's multilingual capabilities. It is suitable for scenarios like prompt translation and image text processing, and is worth trying for ComfyUI users.

Continue Reading

Keep going with more reads from the same topic.

Nornir MCP Server: An Enterprise-Grade Bridge for Integrating Large Language Models into Network Automation

Nornir MCP Server is an enterprise-level server based on the Model Context Protocol (MCP). It seamlessly integrates large language models (such as Claude) with the Nornir network automation framework, supporting natural language orchestration for multi-vendor network devices (Cisco, Arista, Juniper, etc.), and providing production-grade features like a dual-engine architecture (NAPALM + Netmiko), intelligent filtering, and a secure sandbox.

Recent activity 2026-05-06 20:51

Bibliothèque Française LLM: A French Public Domain Literature Index System Optimized for Large Language Models

Bibliothèque Française LLM is a structured indexing and annotation project for French public domain literature designed specifically for large language models (LLMs). It integrates multiple authoritative sources such as DraCor, Common Corpus, and Wikisource, providing metadata indexing categorized by genre, author, and era, as well as in-depth annotations for dramatic texts (including characters, lines, stage directions, etc.). Its aim is to enable LLMs to efficiently read and understand classic French literary works.

Recent activity 2026-05-06 20:50

Splinter: A Lock-Free Zero-Copy Shared Memory KV and Vector Storage Library That Eliminates Socket and Memcpy Overhead for LLM Inference

Splinter is a minimalist, high-performance key-value (KV) and vector storage system enabling zero-latency inter-process communication via shared memory and atomic operations. With only 766 lines of core code, it supports millions of operations per second and 768-dimensional vector storage, offering a new architectural approach for local LLM inference and data-intensive applications.

Recent activity 2026-04-03 08:49

libmlxforge: An Embedded MLX LLM Inference Engine for Apple Silicon

libmlxforge is an embeddable MLX large language model (LLM) inference engine designed specifically for Apple Silicon. It provides a unified C ABI interface, supports calls from Node.js, Swift, and Rust, and features continuous batching, streaming output, JSON-constrained structured output, and embedding vector generation.

Recent activity 2026-06-09 17:23