RL starter files to immediately train, visualize, and evaluate an agent without writing a single line of code
Updated May 12, 2024 - Python
Inference-time scaling for LLMs-as-a-judge.
Multi-hop knowledge graph reasoning learned via policy gradient with reward shaping and action dropout
Recurrent and multi-process PyTorch implementation of deep reinforcement Actor-Critic algorithms A2C and PPO
Robotics research demonstrating reliability and robustness in the real world (continuously updated)
This repo implements our paper, "Learning to Search Feasible and Infeasible Regions of Routing Problems with Flexible Neural k-Opt", which has been accepted at NeurIPS 2023.
TraderNet-CRv2 - Combining Deep Reinforcement Learning with Technical Analysis and Trend Monitoring on Cryptocurrency Markets
Guide Your Agent with Adaptive Multimodal Rewards (accepted at NeurIPS 2023)
Dota 2 bot that is trained by Deep RL with expert demonstrations
Compiling strategy guides into reward functions for reinforcement learning. Uses Claude Vision to extract unit tests from game guides, then trains agents with dense, interpretable rewards.
Code for NeurIPS 2022 paper Exploiting Reward Shifting in Value-Based Deep RL
A Gymnasium-compatible framework for creating reinforcement learning (RL) environments for solving the optimal power flow (OPF) problem. Contains five OPF benchmark environments for comparable research.
Bayesian Reward Shaping Framework for Deep Reinforcement Learning
Code for "DrS: Learning Reusable Dense Rewards for Multi-Stage Tasks"
Subtask-Aware Visual Reward Learning from Segmented Demonstrations (accepted at ICLR 2025)
3D gym environments for training RL agents to play the Slime Volleyball game in three dimensions, using Webots as the simulator.
Python + PyTorch. Advanced Reinforcement Learning (SAC/PPO/A2C) for ✨autonomous Robot Sumo combat featuring competitive self-play in continuous action spaces.
Benchmarks for risk-aware reward shaping of autonomous driving
⚔️ Playing Jedi Academy with Deep Curiosity ⚔️