0325.fsm @Felix Möhlmann @Jason Lightfoot @Kavika F @
Hi, thank you very much for your assistance, but I have some questions about the heuristic algorithm. I want to know how you design such a code and why there are three situations of 0, 1, 2, and 0, The effect of the value of 1,2 on the overall model
And I have to clarify the definition and logic of each state action
There are currently three questions
1. The state and action of reinforcement learning cases provided in flexsim
My understanding is that Model.parameters["LastItemType"].value = getvarnum(Model.find("Processor1"), "f_lastlabelval");
The definition of this grammar is that the value of "f_lastlabelval" of the observation machine will be our state
But I would like to ask if there is a better understanding of the instructions.
And the syntax of action
item.Type == Model.parameters["ItemType1"].value Then what is the logic of this action.
2. Odd-job production extended by myself
My grammar is shown in the figure I want to understand the overall logic and description more clearly
Also about the action, I also want to know how the action is generated.
3. There is no set up time topic
Just like the previous question
I want to believe that I can understand the complete problem better after I clarify it