Ddulaev/diploma successful exps #3

Timoniche · 2025-11-09T21:02:49Z

No description provided.

refactored sasrec train_sasrec.py to ipynb format (datasphere tested)

tiger_baseline full run added and tested

tensorboards for sasrec & tiger + rq-vae runs

kmeans, 20 epochs trained + tensorboard logs

NonameUntitled · 2025-11-10T11:03:53Z

tiger/configs/tiger_kmeans_train_config.json

+    "d_kv": 64,
+    "dropout": 0.1,
+    "activation": "relu",
+    "num_beams": 100,


This can be lowered to 30; it will speed up the code, and the results should not change much.

NonameUntitled · 2025-11-10T11:05:06Z

tiger/configs/tiger_kmeans_train_config.json

+    "sampler_type": "tiger"
+  },
+  "dataloader": {
+    "train_batch_size": 256,


In your experiments you can use a larger batch size to utilize the GPU better; the main thing is that it stays the same across all experiments.
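
One way to guard against accidental drift is a small check run before launching a series of experiments. This is only a sketch: the function name `shared_batch_size` and the idea of passing already-loaded configs as a dict are assumptions, and only the `dataloader` / `train_batch_size` keys come from the config in this diff.

```python
def shared_batch_size(configs: dict) -> int:
    """Return the train batch size shared by all experiment configs.

    `configs` maps an experiment name (hypothetical) to its parsed JSON config.
    Raises ValueError if the experiments disagree or no configs are given.
    """
    sizes = {
        name: cfg["dataloader"]["train_batch_size"]
        for name, cfg in configs.items()
    }
    distinct = set(sizes.values())
    if len(distinct) != 1:
        raise ValueError(f"train_batch_size differs across experiments: {sizes}")
    return distinct.pop()
```

The configs can be loaded with `json.load` and passed in keyed by file name, so a mismatch is reported together with the offending experiment.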

NonameUntitled · 2025-11-10T12:24:50Z

tiger/modeling/dataset/base.py

+            if item_frequency_counts is None:
+                # We do not yet know final max, so start conservatively and grow if needed
+                item_frequency_counts = {}


It is unclear why this could not be removed. What is the logic of setting it to None above?
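
For illustration, the counter could be created once, up front, so that no `None` sentinel and no later check are needed. This is a sketch under assumptions: the surrounding code in `base.py` is not shown in this diff, so the function name `count_item_frequencies` and its input shape are hypothetical.

```python
from collections import Counter


def count_item_frequencies(sequences):
    """Count how often each item id occurs across all user sequences.

    The counter is initialized directly instead of starting as None
    and being replaced by an empty dict later.
    """
    item_frequency_counts = Counter()
    for sequence in sequences:
        item_frequency_counts.update(sequence)
    return item_frequency_counts
```

If the `None` default is there because the counts may be passed in by the caller, that would be worth stating in a comment instead.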

NonameUntitled · 2025-11-15T20:14:36Z

notebooks/DatasetProcessing.ipynb

Please clean up the notebooks so that they can be compared properly; remove your metadata about outputs and runs.
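
A minimal sketch of one way to do this with only the standard library, assuming the notebooks use the nbformat v4 JSON layout (a top-level `cells` list, with `outputs` and `execution_count` on code cells). The helper names are hypothetical; dedicated tools exist for this as well.

```python
import json


def strip_notebook_outputs(notebook: dict) -> dict:
    """Return a copy of a notebook dict with outputs and execution counts cleared."""
    cleaned = json.loads(json.dumps(notebook))  # deep copy via a JSON round trip
    for cell in cleaned.get("cells", []):
        if cell.get("cell_type") == "code":
            cell["outputs"] = []
            cell["execution_count"] = None
    return cleaned


def strip_notebook_file(path: str) -> None:
    """Rewrite the notebook at `path` in place without outputs or run counters."""
    with open(path, encoding="utf-8") as f:
        notebook = json.load(f)
    with open(path, "w", encoding="utf-8") as f:
        json.dump(strip_notebook_outputs(notebook), f, indent=1, ensure_ascii=False)
        f.write("\n")
```

Running `strip_notebook_file("notebooks/DatasetProcessing.ipynb")` before committing should leave only source changes in the diff.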

Timoniche and others added 20 commits October 22, 2025 17:31

sasrec baseline

3ecc1cc

refactored sasrec train_sasrec.py to ipynb format (datasphere tested)

tiger_baseline added

2142ad3

tiger_baseline full run added and tested

baselines tensorboards

8df00e9

tensorboards for sasrec & tiger + rq-vae runs

Respect max epoch count in trainer

2e15136

Respect max epoch count in trainer

68ebc84

kmeans tiger baseline (1 epoch wip check)

a67912d

ddulaev

baee340

kmeans, 20 epochs trained + tensorboard logs

minor: README.md typo + .idea gitignore

3680abd

built simple positive_pairs dataset

7a484f0

wip merge (minor)

da513ad

positive pairs json to txt

a625071

cf dataset builder json to txt

a785403

finetuning notebook impl

4df9d58

[success] cf kmeans run

07f12a0

sasrec cold-warm-hot ndcg + recall

6a7d95e

bugfix, semantic_* .ids is stored flattened in the batch

e6a8f75

Beauty -> Beauty_legacy

10a56fa

correct inter.json & index_rqkmeans.json

2df21b5

[success] kmeans tuned/not tuned/sasrec runs on correct embeddings

a1b26d3

Merge branch 'main' into ddulaev/diploma

b5513e5

NonameUntitled suggested changes Nov 15, 2025
