Model translategemma 4b

hydropix edited this page Jan 16, 2026 · 1 revision

translategemma:4b

Ollama Model ID: translategemma:4b
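
For readers who want to try the model themselves, here is a minimal sketch of calling it through a locally running Ollama server. It assumes the default local endpoint (`http://localhost:11434/api/generate`) and a non-streaming request. The helper names `build_payload` and `translate` are hypothetical, and the prompt wording is a placeholder, not the prompt used for this benchmark.

```python
import json
import urllib.request

MODEL_ID = "translategemma:4b"  # Ollama model ID listed on this page
OLLAMA_URL = "http://localhost:11434/api/generate"  # assumption: default local Ollama endpoint


def build_payload(text: str, target_language: str) -> dict:
    """Build a non-streaming generate request (hypothetical helper).

    The prompt wording is a placeholder, not the benchmark's actual prompt.
    """
    prompt = f"Translate the following text into {target_language}:\n\n{text}"
    return {"model": MODEL_ID, "prompt": prompt, "stream": False}


def translate(text: str, target_language: str, timeout: float = 60.0) -> str:
    """Send the request to a running Ollama server and return the generated text."""
    request = urllib.request.Request(
        OLLAMA_URL,
        data=json.dumps(build_payload(text, target_language)).encode("utf-8"),
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(request, timeout=timeout) as response:
        body = json.loads(response.read().decode("utf-8"))
    return body["response"]
```

With the model pulled and the server running, `translate("Good morning.", "French")` would return the model's output as a string.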


Summary

| Metric | Value |
|---|---|
| Average Score | 🟠 6.1/10 |
| Accuracy | 6.5/10 |
| Fluency | 6.3/10 |
| Style | 5.3/10 |
| Languages Tested | 19 |
| Total Translations | 95 |
| Best Language | French (7.0) |
| Worst Language | Bengali (5.0) |

Language Performance

Top Languages

| Rank | Language | Overall | Accuracy | Fluency | Style |
|---|---|---|---|---|---|
| 1 | French | 🟡 7.0 | 7.0 | 7.6 | 6.2 |
| 2 | Spanish | 🟠 6.8 | 7.0 | 7.2 | 5.8 |
| 3 | Portuguese | 🟠 6.6 | 6.8 | 6.8 | 5.8 |
| 4 | Italian | 🟠 6.4 | 6.6 | 7.0 | 5.4 |
| 5 | Chinese (Simplified) | 🟠 6.4 | 6.6 | 6.6 | 5.4 |
| 6 | Vietnamese | 🟠 6.4 | 6.8 | 6.4 | 5.6 |
| 7 | Russian | 🟠 6.4 | 7.0 | 6.6 | 5.6 |
| 8 | Ukrainian | 🟠 6.4 | 6.8 | 6.6 | 5.6 |
| 9 | Chinese (Traditional) | 🟠 6.2 | 6.6 | 6.2 | 5.4 |
| 10 | Arabic | 🟠 6.2 | 6.4 | 6.6 | 5.4 |
| 11 | Thai | 🟠 6.0 | 6.6 | 6.0 | 5.6 |
| 12 | German | 🟠 5.8 | 6.6 | 6.0 | 5.4 |
| 13 | Polish | 🟠 5.8 | 6.0 | 6.0 | 4.8 |
| 14 | Japanese | 🟠 5.8 | 6.2 | 6.2 | 5.0 |
| 15 | Hebrew | 🟠 5.8 | 6.2 | 5.8 | 5.2 |
| 16 | Korean | 🟠 5.4 | 6.2 | 5.6 | 4.8 |
| 17 | Hindi | 🟠 5.4 | 6.4 | 5.4 | 4.8 |
| 18 | Tamil | 🟠 5.2 | 6.0 | 5.2 | 4.6 |
| 19 | Bengali | 🟠 5.0 | 6.0 | 5.0 | 4.8 |

Performance by Category

European Major Languages

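| Language | Overall | Accuracy | Fluency | Style |
|---|---|---|---|---|
| French | 🟡 7.0 | 7.0 | 7.6 | 6.2 |
| Spanish | 🟠 6.8 | 7.0 | 7.2 | 5.8 |
| Portuguese | 🟠 6.6 | 6.8 | 6.8 | 5.8 |
| Italian | 🟠 6.4 | 6.6 | 7.0 | 5.4 |
| German | 🟠 5.8 | 6.6 | 6.0 | 5.4 |
| Polish | 🟠 5.8 | 6.0 | 6.0 | 4.8 |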

Category Average: 🟠 6.4

Asian Languages

| Language | Overall | Accuracy | Fluency | Style |
|---|---|---|---|---|
| Chinese (Simplified) | 🟠 6.4 | 6.6 | 6.6 | 5.4 |
| Vietnamese | 🟠 6.4 | 6.8 | 6.4 | 5.6 |
| Chinese (Traditional) | 🟠 6.2 | 6.6 | 6.2 | 5.4 |
| Thai | 🟠 6.0 | 6.6 | 6.0 | 5.6 |
| Japanese | 🟠 5.8 | 6.2 | 6.2 | 5.0 |
| Korean | 🟠 5.4 | 6.2 | 5.6 | 4.8 |
| Hindi | 🟠 5.4 | 6.4 | 5.4 | 4.8 |
| Tamil | 🟠 5.2 | 6.0 | 5.2 | 4.6 |
| Bengali | 🟠 5.0 | 6.0 | 5.0 | 4.8 |

Category Average: 🟠 5.8

Cyrillic Languages

| Language | Overall | Accuracy | Fluency | Style |
|---|---|---|---|---|
| Russian | 🟠 6.4 | 7.0 | 6.6 | 5.6 |
| Ukrainian | 🟠 6.4 | 6.8 | 6.6 | 5.6 |

Category Average: 🟠 6.4

Semitic Languages

| Language | Overall | Accuracy | Fluency | Style |
|---|---|---|---|---|
| Arabic | 🟠 6.2 | 6.4 | 6.6 | 5.4 |
| Hebrew | 🟠 5.8 | 6.2 | 5.8 | 5.2 |

Category Average: 🟠 6.0
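
The category averages above are consistent with an unweighted mean of each category's per-language Overall scores, rounded to one decimal place. The page does not state the formula, so treat the sketch below as an assumption that happens to reproduce the published figures, including the 6.1 average score in the Summary.

```python
from statistics import fmean

# Per-language Overall scores, copied from the category tables on this page.
overall_by_category = {
    "European Major Languages": [7.0, 6.8, 6.6, 6.4, 5.8, 5.8],
    "Asian Languages": [6.4, 6.4, 6.2, 6.0, 5.8, 5.4, 5.4, 5.2, 5.0],
    "Cyrillic Languages": [6.4, 6.4],
    "Semitic Languages": [6.2, 5.8],
}

# Assumption: a category average is the plain mean of its languages' Overall scores.
category_average = {
    name: round(fmean(scores), 1) for name, scores in overall_by_category.items()
}

# Averaging all 19 languages the same way gives the summary-level average.
all_scores = [s for scores in overall_by_category.values() for s in scores]
overall_average = round(fmean(all_scores), 1)

for name, value in category_average.items():
    print(f"{name}: {value}")
print(f"All languages: {overall_average}")
```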


Translation Examples

Best Translation

French - Pride and Prejudice Score: 🟡 8.0/10

Original:

Translation:

Il est un fait universellement reconnu qu'un homme célibataire, fortuné, doit nécessairement trouver une épouse. Quoique les sentiments ou les opinions de cet homme soient méconnus au moment de son entrée dans un quartier, cette vérité est si bien ancrée dans l'esprit des familles environnantes, qu'il est considéré comme étant la propriété de l'une de leurs filles.

Feedback: The translation accurately conveys the meaning and ironic tone of Austen's opening, with natural French phrasing. However, 'quartier' (neighborhood/district) does not fully carry the social connotation of the original term, and the plainer rendering of 'universally acknowledged' loses some of the original's rhythmic emphasis.

Worst Translation

Bengali - The Picture of Dorian Gray Score: ⚫ 2.0/10

Original:

Translation:

स्टूडियोটি গোলাপের সুগন্ধ দিয়ে পরিপূর্ণ ছিল, এবং যখন গ্রীষ্মের হালকা বাতাস বাগানের গাছের মধ্যে দিয়ে বয়ে যেত, তখন খোলা দরজার মাধ্যমে লিলি花的 তীব্র সুবাস অথবা গোলাপের সুগন্ধ প্রবেশ করত। পারস্যের চামড়ার তৈরি সোফার কোণা থেকে, যেখানে তিনি आराम করে ছিলেন এবং ক্রমাগত সিগারেট খাচ্ছিলেন, লর্ড হেনরি ওয়োটন ল্যাবুরনামের মিষ্টি এবং সোনালী ফুলগুলোর হালকা ঝলক দেখতে পেতেন।

Feedback: The translation contains critical errors including mixed scripts (Devanagari and Chinese characters), incorrect botanical terminology ('lilac' mistranslated as 'lily flower'), and loss of aesthetic nuance. The sensory richness and refined tone of Wilde's prose are severely compromised by awkward phrasing and script inconsistencies that make it nearly incomprehensible.
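
The mixed-script problem described in this feedback can be detected automatically. The sketch below is a rough heuristic, not part of the benchmark: it uses the first word of each letter's Unicode character name as a proxy for its script, and the helper names `scripts_in` and `has_mixed_scripts` are hypothetical.

```python
import unicodedata


def scripts_in(text: str) -> set[str]:
    """Return approximate script labels for the letters in `text`.

    Heuristic: the first word of a character's Unicode name (for example
    BENGALI, DEVANAGARI, CJK) stands in for its script. Only letters are
    checked, so shared punctuation and combining vowel signs are ignored.
    """
    scripts = set()
    for ch in text:
        if ch.isalpha():
            name = unicodedata.name(ch, "")
            if name:
                scripts.add(name.split()[0])
    return scripts


def has_mixed_scripts(text: str, expected: str = "BENGALI") -> bool:
    """True if any letter belongs to a script other than `expected`."""
    return bool(scripts_in(text) - {expected})


# Fragments taken from the Bengali translation above.
for fragment in ["স্টূডিয়ো", "स्टूडियोটি", "লিলি花的"]:
    print(fragment, sorted(scripts_in(fragment)), has_mixed_scripts(fragment))
```

A check like this only flags foreign letters. It would not catch the mistranslation of 'lilac', which still needs a human or model reviewer.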


Score Distribution

| Score Range | Count | Percentage |
|---|---|---|
| 🟢 Excellent (9-10) | 0 | 0.0% |
| 🟡 Good (7-8) | 29 | 30.5% |
| 🟠 Acceptable (5-6) | 60 | 63.2% |
| 🔴 Poor (3-4) | 5 | 5.3% |
| ⚫ Failed (1-2) | 1 | 1.1% |
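
The percentages in this distribution follow directly from the counts: each is the band's share of the 95 translations, rounded to one decimal place. The sketch below recomputes them and adds a hypothetical `band` helper that maps an integer score to the band labels used on this page.

```python
# Counts per score band, copied from the Score Distribution table.
counts = {
    "Excellent (9-10)": 0,
    "Good (7-8)": 29,
    "Acceptable (5-6)": 60,
    "Poor (3-4)": 5,
    "Failed (1-2)": 1,
}

total = sum(counts.values())  # should equal the 95 translations in the Summary
percentages = {label: round(100 * n / total, 1) for label, n in counts.items()}


def band(score: int) -> str:
    """Map an integer score from 1 to 10 to its band label (hypothetical helper)."""
    if not 1 <= score <= 10:
        raise ValueError("score must be between 1 and 10")
    if score >= 9:
        return "Excellent"
    if score >= 7:
        return "Good"
    if score >= 5:
        return "Acceptable"
    if score >= 3:
        return "Poor"
    return "Failed"


for label, pct in percentages.items():
    print(f"{label}: {counts[label]} ({pct}%)")
```

Under this mapping, the best translation (8.0) lands in the Good band and the worst (2.0) in the Failed band, matching the markers shown in the Translation Examples.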

Performance Metrics

| Metric | Value |
|---|---|
| Average Translation Time | 1099.0 ms |
| Success Rate | 100.0% |
