I followed all the steps in the reinforcement learning tutorial, but I'm facing an issue and that is the reward is not updating like it's supposed to. Plus, when I open the model, the time inside it skyrockets to something huge. Can't figure out what's going wrong here. Can anyone help on this?
* I've got all the Python packages updated and in place, and the model is set up in meters, so that's why the Boxes might seem a bit on the small side.
Socket connected Waiting for READY message READY Waiting for ActionSpace message Waiting for ObservationSpace message Sending Reset message Waiting for Observation message {"done":0,"reward":0,"state":2} Sending Action message: 3 Waiting for Observation message
Also, is this the last version of env.py after update to gymnasium from gym?example-flexsim-env.py