Skip to content
This repository was archived by the owner on Jan 21, 2025. It is now read-only.

Conversation

@copybara-service
Copy link
Contributor

Adding a new Gradient Estimator for Routing using REINFORCE with a leave-one-out baseline.

…ave-one-out baseline.

PiperOrigin-RevId: 435129337
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants