Update run_one_step_off_policy.sh #29
e2e_ascend.yml
on: push
E2E Ascend testing for non-RL algorithm scenarios
E2E Ascend testing for RL training scenarios of LLM models
E2E Ascend testing for RL training scenarios of VLM models
E2E Ascend testing for experimental features