Replies: 2 comments
-
There are a few things
|
Beta Was this translation helpful? Give feedback.
0 replies
-
Other repos trying their hands on Titans
|
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
Uh oh!
There was an error while loading. Please reload this page.
-
There are multiple systems out there that go beyond adding mixture-of-experts and Mamba to Transformers. HRM and Titan seemed to be a good template for what is to come, and they are apparently good enough for Zebra Puzzles and Sudoku variants. Yet there are not enough text-based tasks for these models, so it would be fun to explore in that general direction. https://github.com/lucidrains/titans-pytorch https://github.com/sapientinc/HRM
Some have noted tht HRM has data leaks in training, that would be a bit concerning sapientinc/HRM#12 (comment)
Beta Was this translation helpful? Give feedback.
All reactions