File tree Expand file tree Collapse file tree 1 file changed +7
-1
lines changed
Expand file tree Collapse file tree 1 file changed +7
-1
lines changed Original file line number Diff line number Diff line change @@ -73,6 +73,9 @@ Here we list projects undertaken in the RAM framework that are shared publicly,
7373
7474#### _ synthetic data for complex reasoning & tools_
7575
76+ - ** NaturalReasoning: Reasoning in the Wild with 2.8M Challenging Questions** [[ paper]] ( https://arxiv.org/abs/2502.13124 ) .
77+ _ Scaling reasoning capabilities with diverse and high-quality questions._
78+
7679- ** Source2Synth** [[ paper]] ( https://arxiv.org/abs/2409.08239 ) .
7780 _ Generating synthetic data from real sources to improve LLMs on complex reasoning tasks._
7881
@@ -139,8 +142,11 @@ Here we list projects undertaken in the RAM framework that are shared publicly,
139142
140143#### _ architectures_
141144
145+ - ** Multi-token attention** [[ paper]] ( https://arxiv.org/abs/2504.00927 )
146+ _ Attention mechanism that can focus on multiple tokens simultaneously_ .
147+
142148- ** Byte Latent Transformer** [[ paper]] ( https://arxiv.org/abs/2412.09871 )
143- _ New Byte-level LLM architecture that matches tokenization-based LLM performance at scale.
149+ _ New Byte-level LLM architecture that matches tokenization-based LLM performance at scale._
144150
145151- ** Adaptive Decoding via Latent Preference Optimization** [[ paper]] ( https://arxiv.org/abs/2411.09661 )
146152 _ New layer that selects decoding params automatically * per token* _ .
You can’t perform that action at this time.
0 commit comments