Skip to content

[RLlib-contrib] Alpha Zero#36736

Merged
sven1977 merged 9 commits intoray-project:masterfrom
avnishn:rllib_contrib_alpha_zero
Oct 4, 2023
Merged

[RLlib-contrib] Alpha Zero#36736
sven1977 merged 9 commits intoray-project:masterfrom
avnishn:rllib_contrib_alpha_zero

Conversation

@avnishn
Copy link
Member

@avnishn avnishn commented Jun 22, 2023

Signed-off-by: Avnish avnishnarayan@gmail.com

Why are these changes needed?

Related issue number

Checks

  • I've signed off every commit(by using the -s flag, i.e., git commit -s) in this PR.
  • I've run scripts/format.sh to lint the changes in this PR.
  • I've included any doc changes needed for https://docs.ray.io/en/master/.
    • I've added any new APIs to the API Reference. For example, if I added a
      method in Tune, I've added it in doc/source/tune/api/ under the
      corresponding .rst file.
  • I've made sure the tests are passing. Note that there might be a few flaky tests, see the recent failures at https://flakey-tests.ray.io/
  • Testing Strategy
    • Unit tests
    • Release tests
    • This PR is not tested :(

avnishn added 4 commits June 22, 2023 16:18
Signed-off-by: Avnish <avnishnarayan@gmail.com>
…b_contrib_alpha_zero

Signed-off-by: Avnish <avnishnarayan@gmail.com>
Signed-off-by: Avnish <avnishnarayan@gmail.com>
Signed-off-by: Avnish <avnishnarayan@gmail.com>
@stale
Copy link

stale bot commented Aug 10, 2023

This pull request has been automatically marked as stale because it has not had recent activity. It will be closed in 14 days if no further activity occurs. Thank you for your contributions.

  • If you'd like to keep this open, just leave any comment, and the stale label will be removed.

@stale stale bot added the stale The issue is stale. It will be closed within 7 days unless there are further conversation label Aug 10, 2023
@sven1977 sven1977 self-assigned this Oct 3, 2023
@stale stale bot removed the stale The issue is stale. It will be closed within 7 days unless there are further conversation label Oct 3, 2023
@sven1977 sven1977 added rllib RLlib related issues rllib-contrib labels Oct 3, 2023
Copy link
Contributor

@sven1977 sven1977 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM.

@sven1977 sven1977 merged commit 331c5b7 into ray-project:master Oct 4, 2023
Zandew pushed a commit to Zandew/ray that referenced this pull request Oct 10, 2023
vymao pushed a commit to vymao/ray that referenced this pull request Oct 11, 2023
Signed-off-by: Victor <vctr.y.m@example.com>
value = leaf.reward
else:
child_priors, value = self.model.compute_priors_and_value(leaf.obs)
if self.add_dirichlet_noise:

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Are we sure we want to add dirichlet noise to every child prior, not only to the ones starting in the root node? This behaviour differs from the one proposed in the paper and the open spiel implementation

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

rllib RLlib related issues

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants