Using reinforcement learning I want the operator1 to unload the products in the shortest queue. Since the queue3 is closer to the queue4, the operator3 will release the queue faster rather than the operator2. Anyway the operator1 mustn't unload products only in queue3. Can someone help?
Please refer to Operator RL Probem.fsm in the FlexSim's share files site.