I am following the tutorial (link) on the FlexSim website to integrate RL with an existing FlexSim model. Using the codes provide there too for training and inference.
As I was training the RL agent, I could see that the fps (frames per second) showing a consistent decrease (see graph) with model training time. High fps means more iterations of FlexSim environment (faster training) and it usually should fluctuate around a constant value.
I was wondering if this is a known case and if someone could point me to a way to fix is.