Model translategemma 12b it q4_k_m

hydropix edited this page Jan 17, 2026 · 1 revision

translategemma:12b-it-q4_K_M

Ollama Model ID: translategemma:12b-it-q4_K_M
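As a rough illustration of how this model ID might be used, the sketch below sends one non-streaming request to a local Ollama server through its `/api/generate` endpoint. The endpoint URL assumes a default local install, and the prompt wording and the helper names `build_request` and `translate` are hypothetical; the prompt used for this benchmark is not shown on this page.

```python
import json
import urllib.request

# Assumption: Ollama is running locally on its default port.
OLLAMA_URL = "http://localhost:11434/api/generate"
MODEL_ID = "translategemma:12b-it-q4_K_M"


def build_request(text, target_language):
    """Build the JSON payload for a single non-streaming generation call."""
    # Hypothetical prompt wording, not the prompt used by the benchmark.
    prompt = f"Translate the following text into {target_language}:\n\n{text}"
    return {"model": MODEL_ID, "prompt": prompt, "stream": False}


def translate(text, target_language, timeout=120):
    """Send the request and return the generated text (requires a running server)."""
    payload = json.dumps(build_request(text, target_language)).encode("utf-8")
    request = urllib.request.Request(
        OLLAMA_URL,
        data=payload,
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(request, timeout=timeout) as response:
        body = json.loads(response.read().decode("utf-8"))
    # With "stream": False the reply is one JSON object whose "response"
    # field holds the generated text.
    return body["response"]
```

With the model pulled and the server running, `translate("Good morning.", "Spanish")` would return the model's output as a string.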


Summary

Metric Value
Average Score 🟠 6.8/10
Accuracy 7.3/10
Fluency 6.8/10
Style 6.2/10
Languages Tested 19
Total Translations 95
Best Language Portuguese (7.8)
Worst Language Bengali (6.2)

Language Performance

Top Languages

Rank Language Overall Accuracy Fluency Style
1 Portuguese 🟡 7.8 8.0 8.0 6.8
2 Spanish 🟡 7.6 7.6 7.6 6.6
3 French 🟡 7.4 7.4 7.4 6.4
4 Russian 🟡 7.4 7.8 7.4 6.8
5 Chinese (Simplified) 🟡 7.0 7.2 7.0 6.0
6 Ukrainian 🟡 7.0 7.4 7.0 6.4
7 Italian 🟠 6.8 7.4 7.0 6.2
8 Chinese (Traditional) 🟠 6.8 7.0 6.8 6.0
9 Arabic 🟠 6.8 7.2 7.0 6.4
10 Hebrew 🟠 6.8 7.2 6.8 6.2
All 19 Languages
Rank Language Overall Accuracy Fluency Style
1 Portuguese 🟡 7.8 8.0 8.0 6.8
2 Spanish 🟡 7.6 7.6 7.6 6.6
3 French 🟡 7.4 7.4 7.4 6.4
4 Russian 🟡 7.4 7.8 7.4 6.8
5 Chinese (Simplified) 🟡 7.0 7.2 7.0 6.0
6 Ukrainian 🟡 7.0 7.4 7.0 6.4
7 Italian 🟠 6.8 7.4 7.0 6.2
8 Chinese (Traditional) 🟠 6.8 7.0 6.8 6.0
9 Arabic 🟠 6.8 7.2 7.0 6.4
10 Hebrew 🟠 6.8 7.2 6.8 6.2
11 Polish 🟠 6.6 7.0 7.0 5.8
12 German 🟠 6.4 7.2 6.4 6.0
13 Korean 🟠 6.4 7.2 6.4 6.0
14 Vietnamese 🟠 6.4 7.2 6.4 5.8
15 Thai 🟠 6.4 7.4 6.4 6.2
16 Tamil 🟠 6.4 7.2 6.4 6.0
17 Japanese 🟠 6.2 7.0 6.2 6.0
18 Hindi 🟠 6.2 7.2 6.2 6.0
19 Bengali 🟠 6.2 7.0 6.2 5.8

Performance by Category

European Major Languages

Language Overall Accuracy Fluency Style
Portuguese 🟡 7.8 8.0 8.0 6.8
Spanish 🟡 7.6 7.6 7.6 6.6
French 🟡 7.4 7.4 7.4 6.4
Italian 🟠 6.8 7.4 7.0 6.2
Polish 🟠 6.6 7.0 7.0 5.8
German 🟠 6.4 7.2 6.4 6.0

Category Average: 🟡 7.1

Cyrillic Languages

Language Overall Accuracy Fluency Style
Russian 🟡 7.4 7.8 7.4 6.8
Ukrainian 🟡 7.0 7.4 7.0 6.4

Category Average: 🟡 7.2

Asian Languages

Language Overall Accuracy Fluency Style
Chinese (Simplified) 🟡 7.0 7.2 7.0 6.0
Chinese (Traditional) 🟠 6.8 7.0 6.8 6.0
Korean 🟠 6.4 7.2 6.4 6.0
Vietnamese 🟠 6.4 7.2 6.4 5.8
Thai 🟠 6.4 7.4 6.4 6.2
Tamil 🟠 6.4 7.2 6.4 6.0
Japanese 🟠 6.2 7.0 6.2 6.0
Hindi 🟠 6.2 7.2 6.2 6.0
Bengali 🟠 6.2 7.0 6.2 5.8

Category Average: 🟠 6.4

Semitic Languages

Language Overall Accuracy Fluency Style
Arabic 🟠 6.8 7.2 7.0 6.4
Hebrew 🟠 6.8 7.2 6.8 6.2

Category Average: 🟠 6.8
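The category averages above are consistent with an unweighted mean of each group's Overall column, rounded to one decimal place. That is an inference from the numbers, not something the page states. A minimal sketch that recomputes them from the tabulated values:

```python
from statistics import mean

# Overall scores copied from the category tables above.
overall_by_category = {
    "European Major": [7.8, 7.6, 7.4, 6.8, 6.6, 6.4],
    "Cyrillic": [7.4, 7.0],
    "Asian": [7.0, 6.8, 6.4, 6.4, 6.4, 6.4, 6.2, 6.2, 6.2],
    "Semitic": [6.8, 6.8],
}


def category_averages(groups):
    """Unweighted mean per category, rounded to one decimal (assumed method)."""
    return {name: round(mean(scores), 1) for name, scores in groups.items()}


averages = category_averages(overall_by_category)

# The same approach over all 19 languages gives the summary's average score.
all_scores = [s for scores in overall_by_category.values() for s in scores]
overall_average = round(mean(all_scores), 1)
```

Under this assumption the recomputed values match the reported category averages and the 6.8 average score in the summary.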


Translation Examples

Best Translation

Spanish - Pride and Prejudice Score: 🟢 9.0/10

Original:

Translation:

Es un hecho universalmente reconocido que un hombre soltero, en posesión de una buena fortuna, debe estar buscando esposa. Por poco conocidas que sean las ideas o sentimientos de un hombre al establecerse en un nuevo vecindario, esta verdad está tan arraigada en la mente de las familias vecinas que él es considerado como la posesión legítima de alguna de sus hijas.

Feedback: The translation faithfully preserves Austen's meaning and ironic tone with excellent fluency in Spanish. Minor style consideration: 'debe estar buscando' is slightly more literal than the original's ironic 'must be in want of,' though the overall satirical effect remains intact.

Worst Translation

Chinese (Traditional) - A Study in Scarlet Score: 🟠 5.0/10

Original:

Translation:

「您曾經到過阿富汗,我猜得到。」我驚訝地問道:「您怎麼會知道?」 「別在意,」他自嘲地笑了笑,「現在的問題是關於血紅蛋白。您肯定能理解我這個發現的重要性,對吧?」 「這在化學上無疑是個有趣的發現,」我回答道,「但從實際應用上來說,」 「我的確,先生,這可是近年來最實用的醫學和法律學交叉領域的發現。您沒看到這為我們提供了一項驗證血跡的絕對可靠的方法嗎?」

Feedback: The translation captures the basic plot but has significant structural and linguistic issues: the dialogue attribution is confused (mixing who says what), "自嘲地" (self-mockingly) misrepresents "chuckling to himself," and the final response contains awkward phrasing ("我的確,先生" is grammatically incorrect). The conversational flow and Victorian-era tone are notably diminished.


Score Distribution

Score Range Count Percentage
🟢 Excellent (9-10) 2 2.1%
🟡 Good (7-8) 61 64.2%
🟠 Acceptable (5-6) 32 33.7%
🔴 Poor (3-4) 0 0.0%
⚫ Failed (1-2) 0 0.0%
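
The percentages in this table follow from dividing each count by the 95 total translations. A short sketch of that arithmetic, with the labels written as plain strings for illustration:

```python
# Counts copied from the score distribution table above.
counts = {
    "Excellent (9-10)": 2,
    "Good (7-8)": 61,
    "Acceptable (5-6)": 32,
    "Poor (3-4)": 0,
    "Failed (1-2)": 0,
}


def percentages(bucket_counts):
    """Share of each bucket as a percentage of all translations, to one decimal."""
    total = sum(bucket_counts.values())
    return {label: round(100 * n / total, 1) for label, n in bucket_counts.items()}


total_translations = sum(counts.values())
shares = percentages(counts)
```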

Performance Metrics

Metric Value
Average Translation Time 1926.0 ms
Success Rate 100.0%
