Commit 6a3e260

chore: update with latest data and internships
1 parent 4c45d7e commit 6a3e260

3 files changed: +46 -39 lines changed

latex-cv/cv_template.jinja.tex

Lines changed: 5 additions & 0 deletions
@@ -184,6 +184,11 @@
 \bigskip%
 (# endfor #)
 
+\smallskip\divider%
+\vspace{-0.4em}
+\textbf{Internships:} (#- for internship in internships_list -#) \ihref{(( internship.url ))}{(( _process_text_to_latex(internship.company) ))} ((internship.date))((", " if not loop.last else ";")) (#- endfor -#)
+\bigskip
+
 \vspace{-0.3em} % TODO: fix
 \cvsection{Education}
 (# for education in education_list #)
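
The added Internships loop can be approximated in plain Python to see what text it produces. This is a minimal sketch, not code from the repository: `render_internships` is a hypothetical helper, it skips the `_process_text_to_latex` escaping step, and it assumes the `-` markers on the loop tags trim surrounding whitespace as in standard Jinja whitespace control.

```python
def render_internships(internships):
    """Approximate the text emitted by the Internships loop in the template.

    Hypothetical helper for illustration only: it mirrors the loop body
    (link, company, date, then ", " between items and ";" after the last one)
    and does not escape LaTeX special characters in the company name.
    """
    parts = []
    for index, item in enumerate(internships):
        is_last = index == len(internships) - 1  # plays the role of loop.last
        separator = ";" if is_last else ", "
        parts.append(
            f"\\ihref{{{item['url']}}}{{{item['company']}}} {item['date']}{separator}"
        )
    return "".join(parts)


# Sample entries copied from the internships section added to user-data.yml.
sample = [
    {
        "company": "JetBrains Research",
        "url": "https://www.jetbrains.com/research/",
        "date": "Summer 2023",
    },
    {
        "company": "LATNA Lab at Higher School of Economics",
        "url": "https://nnov.hse.ru/en/latna/",
        "date": "2019 - 2020",
    },
]

print(render_internships(sample))
```

Under these assumptions the entries are joined with ", " and the list is closed with ";". The `location` and `description` fields of each entry are not read by this loop.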

latex-cv/generate_tex.py

Lines changed: 1 addition & 0 deletions
@@ -64,6 +64,7 @@ def main(
 short_summary=data['summary']['short'],
 summary=data['summary']['long'],
 experience_list=data['experience'],
+internships_list=data['internships'],
 education_list=data['education'],
 certificates_list=data['certificates'],
 skills_list=data['skills'],
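
If `data` is a plain dict loaded from `user-data.yml`, the new `data['internships']` lookup raises `KeyError` whenever the section is absent. One possible guard is sketched below. `get_internships` and `REQUIRED_KEYS` are hypothetical names that do not appear in the commit, and the key list is inferred from the fields the template loop reads.

```python
# Fields read by the Internships loop in cv_template.jinja.tex.
REQUIRED_KEYS = ("company", "url", "date")


def get_internships(data):
    """Return the internships list, tolerating a missing or empty section.

    Hypothetical helper: raises ValueError when an entry lacks a field
    that the template would try to render.
    """
    internships = data.get("internships") or []
    for position, entry in enumerate(internships):
        missing = [key for key in REQUIRED_KEYS if key not in entry]
        if missing:
            raise ValueError(
                f"internships[{position}] is missing: {', '.join(missing)}"
            )
    return internships


# Usage: an absent section yields an empty list instead of a KeyError.
print(get_internships({"experience": []}))
```

With a helper like this, the call site could pass `internships_list=get_internships(data)`. Whether an empty list should also suppress the "Internships:" heading in the template is left open here.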

user-data.yml

Lines changed: 40 additions & 39 deletions
@@ -25,32 +25,31 @@ bio:
 
 summary:
 tagline: NLP Researcher and Engineer
-short: Passionate Researcher with 6+ years of experience, now doing evals, XAI & LLM Compression
+short: Passionate Researcher with 6+ years of experience, doing XAI, Pruning and Agents;
 long:
-- BaSc with Honors in HSE, Russia, MsA Erasmus Mundus LCT till 2024;
-- 6+ years of programming experience;
-- 5+ years of Data Science Research experience in Startups, Yandex DS School, EPAM, and JetBrains;
-- Completed 8+ ML research projects.
+- 6+ years of programming, 5+ of NLP Research experience in Startups, EPAM, JetBrains and Toloka AI;
+- MsA with Honors at Erasmus Mundus LCT; BaSc with Honors in HSE, Russia;
+- Completed 10+ ML research projects.
 github_profile: |
+- 💼 NLP Researcher at [Toloka AI](https://toloka.ai/), former NLP at [EPAM Systems](https://www.epam.com/), and Intern at [JetBrains Research](https://www.jetbrains.com/research/);
 - 📄 Erasmus Mundus **['Language & Communication Technologies'](https://lct-master.org/) student** at the University of Groningen and Saarland University;
-- 💼 Former NLP Data Scientist at [EPAM Systems](https://www.epam.com/) and NLP Intern at [Jetbrains Research](https://www.jetbrains.com/research/);
-- 👨‍🏫 Lecturer and Python Course manager at the [Yandex School of Data Analysis](https://academy.yandex.com/dataschool/);
-- 💻 Interested in NLP, Interpretability, SP, as well as in efficient DL-models Inference;
+- 👨‍🏫 Lecturer and Ex. Python Course manager at the [Yandex School of Data Analysis](https://academy.yandex.com/dataschool/);
+- 💻 Interested in NLP, Interpretability, Pruning and Human-AI collaboration;
 - 📝 More: [CV file](https://docs.google.com/viewer?url=https://raw.githubusercontent.com/k4black/k4black/main/chernyshev_cv.pdf) or [linkedin.com/in/kdchernyshev](https://www.linkedin.com/in/kdchernyshev/) or mail me 😊.
 
 
 personal:
 tags: [Music Production, Juggling, Slackline]
-summary: >
-Cheerful and sociable person, keen on slackline and juggling,
-love music making creativity and strive to master a guitar.
+summary: Cheerful and sociable person, keen on slackline and juggling, love music making.
 
 skills:
 - group: Data Science
 tags:
 - name: NLP
 level: 3
-- name: DL
+# - name: DL
+# level: 3
+- name: Agents
 level: 3
 - name: XAI
 level: 2
@@ -119,8 +118,8 @@ skills:
 
 
 achievements:
-- Erasmus Mundus Scholarship 2022-2024;
-- Placed 2nd at Moscow State hackathon "Digital Transformation 2021";
+- Erasmus Mundus Scholarship 2022-2024; Honours Master's degree in LCT;
+# - Placed 2nd at Moscow State hackathon "Digital Transformation 2021";
 - Honours Bachelor's degree in CS;
 - Largely improved Python course at YSDA, Top-1 by students' rating;
 - Finished YSDA - Master’s-level Data Science program, 3% acceptance rate.
@@ -143,30 +142,29 @@ publications:
 In our experiments, multi-task learning performs on par with standard fine-tuning for sexism
 detection and noticeably better for coarse-grained sexism classification, while fine-tuning is
 preferable for fine-grained classification.
+- title: "U-MATH: A University-Level Benchmark for Evaluating Mathematical Skills in LLMs"
+venue: accepted, ACL-2025 workshop
+url: https://toloka.ai/math-benchmark
+year: 2025
+authors: [K.Chernyshev, V.Polshkov, E.Artemova, A.Myasnikov, V.Stepanov, A.Miasnikov, S.Tilga]
+abstract: >
+The current evaluation of mathematical skills in LLMs is limited, as existing benchmarks are either relatively small, primarily focus on elementary and high-school problems, or lack diversity in topics. Additionally, the inclusion of visual elements in tasks remains largely under-explored.
+To address these gaps, we introduce U-MATH, a novel benchmark of 1,100 unpublished open-ended university-level problems sourced from teaching materials. It is balanced across six core subjects, with 20% of multimodal problems. Given the open-ended nature of U-MATH problems, we employ an LLM to judge the correctness of generated solutions. To this end, we release μ-MATH, a dataset to evaluate the LLMs' capabilities in judging solutions.
+The evaluation of general domain, math-specific, and multimodal LLMs highlights the challenges presented by U-MATH. Our findings reveal that LLMs achieve a maximum accuracy of only 63% on text-based tasks, with even lower 45% on visual problems. The solution assessment proves challenging for LLMs, with the best LLM judge having an F1-score of 80% on μ-MATH.
 
 
 experience:
 - role: Machine Learning Researcher
-company: Toloka.ai
+company: Toloka AI
 location: Germany
 url: https://toloka.ai
 start: Jun 2024
 end: Present
 description:
-- To be updated.
+- Collected and published benchmark for text+visual university-level math (U-MATH, ACL 2025 accepted);
+- Developed a substantial part of Agentic Platform for Human-AI collaboration, improving the quality on 30%+;
 tags: [PyTorch, HuggingFace, GenAi, Data Quality]
 
-# - role: NLP Intern
-# company: JetBrains Research
-# location: Netherlands
-# url: https://www.jetbrains.com/research/
-# start: Jun 2023
-# end: Present
-# description:
-# - Analysing Internal Representation of code generation models;
-# - To be updated.
-# tags: [Python, PyTorch, HuggingFace, RL, SkLearn, DataLore]
-
 - role: NLP Data Scientist
 company: EPAM Systems
 location: Serbia
@@ -207,16 +205,19 @@ experience:
 - Designed and developed a solution for scanned document analysis, trained a high mAP (~0.94) CV model for tables, imgs, and stamps.
 tags: [Python, PyTorch, HuggingFace, SkLearn, PyTest, ONNX, Docker, Gitlab-CI]
 
-# - role: Research Intern
-# company: LATNA Lab at Higher School of Economics
-# location: Russia
-# url: https://nnov.hse.ru/en/latna/
-# start: Apr 2019
-# end: Jan 2021
-# description:
-# - Conducted research on Compressed Sensing with l1 and l0 norms, resulting in a near SoTA recovery algorithm with faster convergence;
-# - Created Abstractive Summarization model using Knowledge Graphs.
-# tags: [Statistics, Python, HuggingFace, CoreNLP, SkLearn, SciPy]
+
+internships:
+- company: JetBrains Research
+location: Netherlands
+url: https://www.jetbrains.com/research/
+date: Summer 2023
+description: Analyzed Internal Representation of code generation models.
+
+- company: LATNA Lab at Higher School of Economics
+location: Russia
+url: https://nnov.hse.ru/en/latna/
+date: 2019 - 2020
+description: Created Abstractive Summarization model using Knowledge Graphs.
 
 
 education:
@@ -225,10 +226,10 @@ education:
 location: Netherlands & Germany
 url: https://lct-master.org/
 start: Sep 2022
-end: Present
+end: Aug 2024
 description: |
 Erasmus Mundus "Language & Communication Technologies";
-GPA: 8.7/10 (ongoing) +Assistant at Language Technology Project;
+GPA: 8.7/10 +Assistant at Language Technology Project;
 Thesis on Mechanistic Interpretability for LLM pruning.
 
 - degree: Post Graduate 2-year Program (Data Science)
