You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: user-data.yml
+40-39Lines changed: 40 additions & 39 deletions
Original file line number
Diff line number
Diff line change
@@ -25,32 +25,31 @@ bio:
25
25
26
26
summary:
27
27
tagline: NLP Researcher and Engineer
28
-
short: Passionate Researcher with 6+ years of experience, now doing evals, XAI & LLM Compression
28
+
short: Passionate Researcher with 6+ years of experience, doing XAI, Pruning and Agents;
29
29
long:
30
-
- BaSc with Honors in HSE, Russia, MsA Erasmus Mundus LCT till 2024;
31
-
- 6+ years of programming experience;
32
-
- 5+ years of Data Science Research experience in Startups, Yandex DS School, EPAM, and JetBrains;
33
-
- Completed 8+ ML research projects.
30
+
- 6+ years of programming, 5+ of NLP Research experience in Startups, EPAM, JetBrains and Toloka AI;
31
+
- MsA with Honors at Erasmus Mundus LCT; BaSc with Honors in HSE, Russia;
32
+
- Completed 10+ ML research projects.
34
33
github_profile: |
34
+
- 💼 NLP Researcher at [Toloka AI](https://toloka.ai/), former NLP at [EPAM Systems](https://www.epam.com/), and Intern at [JetBrains Research](https://www.jetbrains.com/research/);
35
35
- 📄 Erasmus Mundus **['Language & Communication Technologies'](https://lct-master.org/) student** at the University of Groningen and Saarland University;
36
-
- 💼 Former NLP Data Scientist at [EPAM Systems](https://www.epam.com/) and NLP Intern at [Jetbrains Research](https://www.jetbrains.com/research/);
37
-
- 👨🏫 Lecturer and Python Course manager at the [Yandex School of Data Analysis](https://academy.yandex.com/dataschool/);
38
-
- 💻 Interested in NLP, Interpretability, SP, as well as in efficient DL-models Inference;
36
+
- 👨🏫 Lecturer and Ex. Python Course manager at the [Yandex School of Data Analysis](https://academy.yandex.com/dataschool/);
37
+
- 💻 Interested in NLP, Interpretability, Pruning and Human-AI collaboration;
39
38
- 📝 More: [CV file](https://docs.google.com/viewer?url=https://raw.githubusercontent.com/k4black/k4black/main/chernyshev_cv.pdf) or [linkedin.com/in/kdchernyshev](https://www.linkedin.com/in/kdchernyshev/) or mail me 😊.
40
39
41
40
42
41
personal:
43
42
tags: [Music Production, Juggling, Slackline]
44
-
summary: >
45
-
Cheerful and sociable person, keen on slackline and juggling,
46
-
love music making creativity and strive to master a guitar.
43
+
summary: Cheerful and sociable person, keen on slackline and juggling, love music making.
47
44
48
45
skills:
49
46
- group: Data Science
50
47
tags:
51
48
- name: NLP
52
49
level: 3
53
-
- name: DL
50
+
# - name: DL
51
+
# level: 3
52
+
- name: Agents
54
53
level: 3
55
54
- name: XAI
56
55
level: 2
@@ -119,8 +118,8 @@ skills:
119
118
120
119
121
120
achievements:
122
-
- Erasmus Mundus Scholarship 2022-2024;
123
-
- Placed 2nd at Moscow State hackathon "Digital Transformation 2021";
121
+
- Erasmus Mundus Scholarship 2022-2024; Honours Master's degree in LCT;
122
+
# - Placed 2nd at Moscow State hackathon "Digital Transformation 2021";
124
123
- Honours Bachelor's degree in CS;
125
124
- Largely improved Python course at YSDA, Top-1 by students' rating;
The current evaluation of mathematical skills in LLMs is limited, as existing benchmarks are either relatively small, primarily focus on elementary and high-school problems, or lack diversity in topics. Additionally, the inclusion of visual elements in tasks remains largely under-explored.
152
+
To address these gaps, we introduce U-MATH, a novel benchmark of 1,100 unpublished open-ended university-level problems sourced from teaching materials. It is balanced across six core subjects, with 20% of multimodal problems. Given the open-ended nature of U-MATH problems, we employ an LLM to judge the correctness of generated solutions. To this end, we release μ-MATH, a dataset to evaluate the LLMs' capabilities in judging solutions.
153
+
The evaluation of general domain, math-specific, and multimodal LLMs highlights the challenges presented by U-MATH. Our findings reveal that LLMs achieve a maximum accuracy of only 63% on text-based tasks, with even lower 45% on visual problems. The solution assessment proves challenging for LLMs, with the best LLM judge having an F1-score of 80% on μ-MATH.
146
154
147
155
148
156
experience:
149
157
- role: Machine Learning Researcher
150
-
company: Toloka.ai
158
+
company: Toloka AI
151
159
location: Germany
152
160
url: https://toloka.ai
153
161
start: Jun 2024
154
162
end: Present
155
163
description:
156
-
- To be updated.
164
+
- Collected and published benchmark for text+visual university-level math (U-MATH, ACL 2025 accepted);
165
+
- Developed a substantial part of Agentic Platform for Human-AI collaboration, improving the quality on 30%+;
157
166
tags: [PyTorch, HuggingFace, GenAi, Data Quality]
158
167
159
-
# - role: NLP Intern
160
-
# company: JetBrains Research
161
-
# location: Netherlands
162
-
# url: https://www.jetbrains.com/research/
163
-
# start: Jun 2023
164
-
# end: Present
165
-
# description:
166
-
# - Analysing Internal Representation of code generation models;
0 commit comments