added toysgd benchmark #103

Merged
TheEimer merged 4 commits into main from AndySGD on Sep 9, 2021

Conversation

@benjamc
Contributor

benjamc commented Aug 9, 2021

Please have a look. :-)
Especially on the bounds for the coefficients and the initial x (not sure about that).

benjamc requested a review from TheEimer on August 9, 2021 at 15:27
import importlib

import dacbench.envs.toysgd
importlib.reload(dacbench.envs.toysgd)

HISTORY_LENGTH = 40
Contributor

Is the history length used somewhere I don't see right now?

Contributor Author

oops forgot to delete, not used right now

{
"action_space_class": "Box",
"action_space_args": [-np.inf * np.ones((2,)), np.inf * np.ones((2,))],
"observation_space_class": "Dict",
Contributor

I'll not say you shouldn't or can't do that, but Dict spaces are impractical without the dict wrapper. If you don't care, feel free to ignore
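For context, a minimal sketch of why Dict observation spaces are awkward without a flattening wrapper; the sub-space names below are made up for illustration and are not the benchmark's actual keys:

import numpy as np
from gym import spaces

# Dict spaces keep named sub-observations; most agents expect a single flat vector.
obs_space = spaces.Dict({
    "remaining_budget": spaces.Box(low=0.0, high=np.inf, shape=(1,), dtype=np.float32),
    "gradient": spaces.Box(low=-np.inf, high=np.inf, shape=(2,), dtype=np.float32),
})

sample = obs_space.sample()                # an OrderedDict of arrays
flat = spaces.flatten(obs_space, sample)   # 1-D array of length 3, usable by standard agents
print(flat.shape)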

Contributor Author

I just took inspiration from the sgd benchmark. If that's suboptimal, feel free to remodel it :) or I could also do it tomorrow

@TheEimer
Contributor

TheEimer commented Aug 9, 2021

Initial x as 0 makes sense, I think. Without having ever run it, I can't really say if the coefficient range is good or not, but it's super easy to adapt, so I think that's good.
So thanks for the benchmarks, I'll merge as soon as you've addressed my comments (the second one is optional).
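For reference, a minimal sketch of the kind of toy objective under discussion; the quadratic form, coefficient range, and log-learning-rate parameterization are illustrative assumptions, not the PR's actual defaults:

import numpy as np

rng = np.random.default_rng(0)

# Hypothetical instance: f(x) = a*x^2 + b*x + c with coefficients drawn from a bounded range.
a, b, c = rng.uniform(-1.0, 1.0, size=3)
a = abs(a) + 1e-3            # keep the quadratic convex so gradient steps make progress

x = 0.0                      # initial x = 0, as suggested above
log_lr = -2.0                # the agent's action, interpreted here as a log10 learning rate

for step in range(5):
    grad = 2 * a * x + b                 # df/dx
    x = x - 10 ** log_lr * grad          # one SGD step with lr = 10**log_lr
    print(step, x, a * x ** 2 + b * x + c)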

@TheEimer
Contributor

TheEimer commented Sep 8, 2021

Any updates here, @benjamc? I found another thing to fix in the meantime: the reset doesn't seem to return a state. I missed that, but it's definitely not desired behavior ;D
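For reference, a minimal sketch of the gym reset contract the fix targets; the state contents here are placeholders:

import numpy as np

class ToyEnvSketch:
    def reset(self):
        self.x = 0.0
        # reset() has to return the initial observation, otherwise the agent's
        # first action is conditioned on None instead of a real state.
        return np.array([self.x, 0.0], dtype=np.float32)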

@TheEimer
Contributor

TheEimer commented Sep 8, 2021

Also, we may want to set an upper limit on the action values. The lr gets huge pretty fast with actions > 1.
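To see the scale problem, assuming the action is interpreted as a base-10 log learning rate (the log parameterization is mentioned later in this thread; the base is an assumption):

# lr = 10 ** action blows up quickly once the action exceeds 1:
for action in (0.5, 1.0, 2.0, 5.0):
    print(action, 10 ** action)    # 3.16..., 10.0, 100.0, 100000.0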

@TheEimer
Contributor

TheEimer commented Sep 8, 2021

Changes from my side:

  • limited actions to stay below 1
  • made reset return a state
  • initialized lr with 0 (might not be smart at all, but None is worse)

Things where input would be nice:

  • how can we initialize the lr without giving the agent a nan in the reset state?
  • is 1 a good limit for the actions?

As soon as we've settled those two, we can merge.

@benjamc
Contributor Author

benjamc commented Sep 8, 2021

  • [lr] Setting the learning rate to 0 as a starting value should be fine; that way the agent is definitely forced to do something :D
    Note that the agent does not set the learning rate itself but the log learning rate. I am not sure if this is really good, or if we should instead initialize the learning rate with a really small value, because with the log parameterization the agent can never actually reach a learning rate of 0 (sketched below).
  • [actions] Do not limit the actions.
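A minimal sketch of the nan/inf problem and the small-value workaround discussed above, assuming the reset state exposes the log10 of the learning rate; the exact state contents are not taken from this PR:

import numpy as np

# If the reset state contains log10(lr), initializing lr = 0 poisons it:
lr = 0.0
print(np.log10(lr))      # -inf (with a divide-by-zero RuntimeWarning)

# A tiny positive initial value keeps the log finite, as suggested above:
lr = 1e-10
print(np.log10(lr))      # -10.0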

TheEimer merged commit ac31683 into main on Sep 9, 2021
TheEimer deleted the AndySGD branch on March 21, 2023