Skip to content

Commit 5dfcc95

Browse files
authored
[ru] Added the glossary and translation guide (#490)
* Added the glossary and translation guide * Fixed casing * Minor fixes * Updated glossary * Glossary update * Glossary update * Glossary update
1 parent 92671bc commit 5dfcc95

File tree

3 files changed

+197
-0
lines changed

3 files changed

+197
-0
lines changed

chapters/ru/TRANSLATING.txt

Lines changed: 47 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,47 @@
1+
1. We use the formal "you" (i.e. "вы" instead of "ты") to keep the neutral tone.
2+
However, don't make the text too formal to keep it more engaging.
3+
4+
2. Don't translate industry-accepted acronyms. e.g. TPU or GPU.
5+
6+
3. The Russian language accepts English words especially in modern contexts more than
7+
many other languages (i.e. Anglicisms). Check for the correct usage of terms in
8+
computer science and commonly used terms in other publications.
9+
10+
4. Russian word order is often different from English. If after translating a sentence
11+
it sounds unnatural try to change the word or clause order to make it more natural.
12+
13+
5. Beware of "false friends" in Russian and English translations. Translators are trained
14+
for years to specifically avoid false English friends and avoid anglicised translations.
15+
e.g. "точность" is "accuracy", but "carefulness" is "аккуратность". For more examples refer to:
16+
http://falsefriends.ru/ffslovar.htm
17+
18+
6. Keep voice active and consistent. Don't overdo it but try to avoid a passive voice.
19+
20+
7. Refer and contribute to the glossary frequently to stay on top of the latest
21+
choices we make. This minimizes the amount of editing that is required.
22+
23+
8. Keep POV consistent.
24+
25+
9. Smaller sentences are better sentences. Apply with nuance.
26+
27+
10. If translating a technical word, keep the choice of Russian translation consistent.
28+
This does not apply for non-technical choices, as in those cases variety actually
29+
helps keep the text engaging.
30+
31+
11. This is merely a translation. Don't add any technical/contextual information
32+
not present in the original text. Also don't leave stuff out. The creative
33+
choices in composing this information were the original authors' to make.
34+
Our creative choices are in doing a quality translation.
35+
36+
12. Be exact when choosing equivalents for technical words. Package is package.
37+
Library is library. Don't mix and match. Also, since both "batch" and "package"
38+
can be translated as "пакет", use "батч" for "batch" and "пакет" for "package" to
39+
avoid ambiguity.
40+
41+
13. Library names are kept in the original forms, e.g. "🤗 Datasets", however,
42+
the word dataset in a sentence gets a translation to "датасет".
43+
44+
14. As a style choice prefer the imperative over constructions with auxiliary words
45+
to avoid unnecessary verbosity and addressing of the reader, which seems
46+
unnatural in Russian. e.g. "см. главу X" - "See chapter X" instead of
47+
"Вы можете найти это в главе X" - "You can see this in chapter X".

chapters/ru/_toctree.yml

Lines changed: 4 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -88,3 +88,7 @@
8888
title: Введение
8989
- local: chapter6/2
9090
title: Обучение токенизатора на основе существующего
91+
- title: Глоссарий
92+
sections:
93+
- local: glossary/1
94+
title: Глоссарий

chapters/ru/glossary/1.mdx

Lines changed: 146 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,146 @@
1+
# Глоссарий
2+
3+
| Оригинал | Перевод |
4+
|---------------------------------|-----------------------------------------|
5+
| Abstraction | абстракция |
6+
| Account | учетная запись |
7+
| Accuracy | accuracy |
8+
| Artificial General Intelligence | сильный искусственный интеллект |
9+
| Attention | внимание |
10+
| Attention mask (layer) | маска внимания (слой) |
11+
| Backward Pass\* | обратный проход |
12+
| Batch | батч |
13+
| Bias | смещение |
14+
| Causal Language Modeling | каузальное языковое моделирование |
15+
| Chapter | глава |
16+
| Checkpoint(s) | чекпоинт |
17+
| Class | класс |
18+
| Classification | классификация |
19+
| Code | код |
20+
| Colab Notebook | блокнот Colab |
21+
| Command | команда |
22+
| Computer Vision | компьютерное зрение |
23+
| Configuration | конфигурация |
24+
| Course | курс |
25+
| Decoder | декодировщик / декодер |
26+
| Dependency | зависимость |
27+
| Deployment | развертывание (программного обеспечения)|
28+
| Development | разработка |
29+
| Dictionary | dictionary |
30+
| Distribution | распределение |
31+
| Download | download |
32+
| Encoder | кодировщик / энкодер |
33+
| Extractive question answering | выделительная вопросно-ответная система |
34+
| F1 score | F1-мера |
35+
| Feature | признак |
36+
| Fine-tune | дообучать |
37+
| Fine-tuning | дообучение |
38+
| Folder | папка / директория |
39+
| Forward Pass\* | прямой проход |
40+
| Function | функция |
41+
| Generative question answering | генеративная вопросно-ответная система |
42+
| Google | Google |
43+
| Hugging Face | Hugging Face |
44+
| Incompatibility | несовместимость |
45+
| Inference | инференс |
46+
| Input | вход |
47+
| Input data | входные данные |
48+
| Label (verb) | размечать |
49+
| Label (subj) | метка класса |
50+
| Layer | слой |
51+
| Library | библиотека |
52+
| Linux | Linux |
53+
| Load | загружать |
54+
| Loss function | функция потерь |
55+
| Machine Learning | машинное обучение |
56+
| macOS | macOS |
57+
| Mask | маска |
58+
| Mask Filling | предсказание замаскированного токена |
59+
| Mask Token | токен-маска |
60+
| Masked Language Modeling | маскированное языковое моделирование |
61+
| Model | модель |
62+
| Model Hub | Model Hub |
63+
| Module | модуль |
64+
| Named Entities | именованные сущности |
65+
| Named Entity Recognition | распознавание именованных сущностей |
66+
| Natural Language Processing | обработка естественного языка |
67+
| Output | выход |
68+
| Package | пакет |
69+
| Package Manager | менеджер пакетов |
70+
| Padding (объект) | padding |
71+
| Padding (действие) | дополнение |
72+
| Parameter | параметр |
73+
| Postprocessing | постобработка / последующая обработка |
74+
| Preprocessing | предобработка / предварительная обработка|
75+
| Pretraining | предварительное обучение / предобучение |
76+
| Pretrained model | предварительно обученная модель |
77+
| Pretrained model | предобученная модель |
78+
| Prompt | начальный текст |
79+
| Python | Python |
80+
| Pytorch | Pytorch |
81+
| Question Answering | вопросно-ответная система |
82+
| Save | сохранять |
83+
| Sample | пример |
84+
| Script | скрипт |
85+
| Self-Attention | самовнимание |
86+
| Self-Contained | самостоятельный |
87+
| Sentiment analysis | анализ тональности текста (сентимент-анализ)|
88+
| Sequence-to-sequence models | sequence-to-sequence модель |
89+
| Setup | установка (программы) / настройка (среды)|
90+
| Speech Processing | обработка речи |
91+
| Speech Recognition | распознавание речи |
92+
| Summarization | суммаризация |
93+
| Target | целевая переменная |
94+
| Task | задача |
95+
| TensorFlow | Tensorflow |
96+
| Terminal | терминал |
97+
| Text generation | генерация текста |
98+
| Tokenizer | Tokenizer (библиотека) / токенизатор |
99+
| Train | обучение (обучать) |
100+
| Transfer Learning | Transfer Learning / трансферное обучение|
101+
| Transformer | трансформер |
102+
| Transformer models | архитектура трансформер |
103+
| Translation | (машинный) перевод |
104+
| Virtual Environment | виртуальное окружение |
105+
| Weight | вес |
106+
| Weights | веса |
107+
| Windows | Windows |
108+
| Working Environment | рабочее окружение |
109+
| Workload | нагрузка |
110+
| Workspace | Workspace |
111+
| Zero-shot classification | zero-shot классификация |
112+
=======
113+
114+
\* Данные термины могут употребляться взаимозаменяемо с их английской версией
115+
116+
## Сокращения
117+
118+
| Оригинал | Перевод |
119+
|-----------|-------------|
120+
| NLP | NLP |
121+
| API | API |
122+
| GPU | GPU |
123+
| TPU | TPU |
124+
| ML | ML |
125+
126+
## Notes
127+
128+
Please refer to [TRANSLATING.txt](/chapters/ru/TRANSLATING.txt) for a translation guide. Here are some excerpts relevant to the glossary:
129+
130+
- Refer and contribute to the glossary frequently to stay on top of the latest
131+
choices we make. This minimizes the amount of editing that is required.
132+
Add new terms alphabetically sorted.
133+
134+
- The Russian language accepts English words especially in modern contexts more
135+
than many other languages (i.e. Anglicisms). Check for the correct usage of
136+
terms in computer science and commonly used terms in other publications.
137+
138+
- Don't translate industry-accepted acronyms. e.g. TPU or GPU.
139+
140+
- If translating a technical word, keep the choice of Russian translation consistent.
141+
This does not apply for non-technical choices, as in those cases variety actually
142+
helps keep the text engaging.
143+
144+
- Be exact when choosing equivalents for technical words. Package is package.
145+
Library is library. Don't mix and match.
146+

0 commit comments

Comments
 (0)