Reinforcement learning - problem by test runs

Question

question

katerina-fratczak asked Jul 03 2024 at 10:36 AM katerina-fratczak commented Jul 04 2024 at 6:25 AM

Reinforcement learning - problem by test runs

Hello,

I have problem by running the training test runs of my model with push logic and two decision events:

In python stops it like this:

The python scripts (modified long time ago by us because the original training scripts did not run well) run by the training model well. I am not sure now, if the problem is in FlexSim or in python.

Thank you

15_our_3_model_push.fsm

flexsim_env.py

flexsim_training.py

reinforcement learning

1720002556798.png (49.7 KiB)

1720002593625.png (18.7 KiB)

15-our-3-model-push.fsm (35.0 KiB)

flexsim-env.py (8.4 KiB)

flexsim-training.py (2.4 KiB)

Answer 1 · 2024-07-03T19:45:41Z

Jordan Johnson answered Jul 03 2024 at 7:45 PM katerina-fratczak commented Jul 04 2024 at 6:25 AM

I did a little debugging. I think something may have changed with stable_baselines3 since we wrote the demo script. It looks like the return value of model.predict() is a tuple containing the array of actions, rather than just the array of actions.

To fix it, I changed:

action = model.predict(observation)

to this:

(action, _) = model.predict(observation)

The second version "unpacks" the tuple and saves the first value in the action variable.

I'll add an issue to our bug list to investigate this issue, to see if we need to adjust to changing software.

· 1

question