-
Northeastern University
- https://simonucl.github.io/
- @simon_ycl
Pinned Loading
-
CHATS-lab/verbalized-sampling
CHATS-lab/verbalized-sampling PublicVerbalized Sampling, a training-free prompting strategy to mitigate mode collapse in LLMs by requesting responses with probabilities. Achieves 2-3x diversity improvement while maintaining quality. …
-
spiral-rl/spiral
spiral-rl/spiral PublicSPIRAL: Self-Play on Zero-Sum Games Incentivizes Reasoning via Multi-Agent Multi-Turn Reinforcement Learning
-
-
LeonGuertler/TextArena
LeonGuertler/TextArena PublicA Collection of Competitive Text-Based Games for Language Model Evaluation and Reinforcement Learning
-
-
Cohere-Labs-Community/iterative-data-selection
Cohere-Labs-Community/iterative-data-selection Public
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.


