System v2.4.0 Production Ready
High-Performance Gemma Deployments.
From edge devices to production clusters — run, deploy, and debug Gemma 4 models. Built for developers who move fast.
Choose Your Path
Modular workflows for developer integration.
Current Model Lineup
| Model Variant | Context | Min. VRAM | Best For |
|---|---|---|---|
| E2B | 8K | 1.4 GB | Embedded Systems |
| E4B | 32K | 3.2 GB | Personal Coding |
| 26B A4B (Popular) | 128K | 16.4 GB | Technical Writing |
| 31B | 256K | 24.0 GB | Scientific Analysis |
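The minimum-VRAM column can be turned into a quick selection helper. The following is a minimal sketch, not an official tool: the `Variant` class and `largest_variant_that_fits` function are hypothetical names, and the numbers are copied from the lineup table as listed.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Variant:
    name: str
    context: str        # context window as listed in the lineup table
    min_vram_gb: float  # minimum VRAM as listed in the lineup table


# Values copied from the lineup table; adjust if the table changes.
LINEUP = [
    Variant("E2B", "8K", 1.4),
    Variant("E4B", "32K", 3.2),
    Variant("26B A4B", "128K", 16.4),
    Variant("31B", "256K", 24.0),
]


def largest_variant_that_fits(available_vram_gb: float) -> Optional[Variant]:
    """Return the variant with the highest minimum VRAM that still fits, or None."""
    candidates = [v for v in LINEUP if v.min_vram_gb <= available_vram_gb]
    return max(candidates, key=lambda v: v.min_vram_gb, default=None)


if __name__ == "__main__":
    for vram in (1.0, 8.0, 24.0):
        choice = largest_variant_that_fits(vram)
        print(vram, choice.name if choice else "no variant fits")
```

The helper treats the listed figure as a hard floor. In practice, leaving headroom above the minimum is a sensible precaution, since long contexts typically need additional memory.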
Fixes & Guides
Hot / Recent Fixes
Critical
'Failed to Load Model'
Commonly caused by insufficient VRAM or corrupted GGUF weights in the local cache.
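One cheap way to rule out a badly corrupted or mis-downloaded weights file is to check its header. The sketch below relies on the fact that GGUF files begin with the four ASCII bytes `GGUF`; the function name `looks_like_gguf` is hypothetical, and the check only catches damaged headers, empty files, or non-GGUF content (for example a saved HTML error page), not corruption deeper in the file.

```python
from pathlib import Path

GGUF_MAGIC = b"GGUF"  # the first four bytes of a GGUF file


def looks_like_gguf(path: Path) -> bool:
    """Return True if the file can be read and starts with the GGUF magic bytes."""
    try:
        with path.open("rb") as f:
            return f.read(4) == GGUF_MAGIC
    except OSError:
        # Missing file, permission problem, or path is a directory.
        return False
```

If the check fails, deleting the cached file and downloading it again is usually the simplest next step. If it passes, insufficient VRAM becomes the more likely cause.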
Error: 0xCF2
Amber
'MLX module not found'
macOS only. Occurs when building from source without the Apple Silicon toolchain enabled.
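Before reinstalling, it can help to confirm what the running interpreter actually sees. The sketch below uses only the standard library; `mlx_environment_report` is a hypothetical helper name. It assumes a native Apple Silicon Python reports `arm64` from `platform.machine()`, while an x86_64 build running under Rosetta reports `x86_64`.

```python
import importlib.util
import platform


def mlx_environment_report() -> dict:
    """Collect the facts most relevant to an 'MLX module not found' error."""
    return {
        # MLX targets macOS on Apple Silicon.
        "is_macos": platform.system() == "Darwin",
        # False on Intel Macs and for x86_64 Python builds running under Rosetta.
        "is_arm64": platform.machine() == "arm64",
        # True if the 'mlx' package can be found by this interpreter.
        "mlx_importable": importlib.util.find_spec("mlx") is not None,
    }


if __name__ == "__main__":
    for key, value in mlx_environment_report().items():
        print(f"{key}: {value}")
```

If `is_macos` is true but `is_arm64` is false, the interpreter itself is the likely problem, and installing the package into that environment may not help.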
pip install mlx
Diagnostic
'Unused24 tokens'
Token alignment error. Requires manual update of the tokenizer_config.json to map pad_token to eos_token.
// Quick Patch: sed -i 's/unused24/eos_token/g' config.json
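As an alternative to a blanket text substitution, the mapping described above can be applied as a structured JSON edit. This is a minimal sketch under stated assumptions: it assumes `tokenizer_config.json` is a JSON object with a top-level `eos_token` key, and `map_pad_to_eos` is a hypothetical helper name. It writes a `.bak` copy before changing anything.

```python
import json
from pathlib import Path


def map_pad_to_eos(config_path: Path) -> bool:
    """Set pad_token to the file's eos_token. Return True if the file was changed."""
    original_text = config_path.read_text(encoding="utf-8")
    config = json.loads(original_text)

    if "eos_token" not in config:
        raise KeyError("eos_token is missing, so there is nothing to map pad_token to")
    if config.get("pad_token") == config["eos_token"]:
        return False  # already mapped; leave the file untouched

    # Keep a backup next to the original before rewriting it.
    backup_path = config_path.with_name(config_path.name + ".bak")
    backup_path.write_text(original_text, encoding="utf-8")

    config["pad_token"] = config["eos_token"]
    config_path.write_text(
        json.dumps(config, indent=2, ensure_ascii=False) + "\n", encoding="utf-8"
    )
    return True
```

Calling `map_pad_to_eos(Path("tokenizer_config.json"))` from the model directory changes only the `pad_token` entry, whereas the `sed` one-liner rewrites every occurrence of the matched string in the file it targets.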
FAQ
Answers to frequently asked questions about Gemma 4
Mailing List
Join our community
Subscribe to the mailing list to get the latest news and updates.