I want to understand what this line from the example on the official website does:
int done = (Model.time > 1000);
Does this mean that it will calculate the value of Reward every 1000 seconds?
Hi @Ryan_Wei, was Felix Möhlmann's answer helpful? If so, please click the "Accept" button at the bottom of their answer. Or if you still have questions, add a comment and we'll continue the conversation.
If we haven't heard back from you within 3 business days we'll auto-accept an answer, but you can always comment back to reopen your question.
The reward function passes an array with two elements to the reinforcement learning algorithm. The first value is the reward itself. The second value controls whether the algorithm continues the current simulation run (0) or concludes the run and starts a new one (1).
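(Model.time > 1000) evaluates to either 0 or 1, depending on the current time in the simulation. So the first time the reward is sent after the simulation time passes 1000 s, a new replication will be started. The line therefore does not set how often the reward is calculated; it only decides when the current run ends.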
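To make the difference between "when the reward is calculated" and "when the run ends" concrete, here is a small Python sketch of the logic described above. It is not FlexSim code: `reward_function`, `run_episode`, the constant reward of 1.0 and the list of request times are all made up for illustration. Only the `time > 1000` check and the two-element `[reward, done]` result come from the thread.

```python
def reward_function(sim_time, reward):
    """Hypothetical stand-in for the reward function: returns [reward, done]."""
    # Same comparison as in the question: 0 while time <= 1000, 1 afterwards.
    done = int(sim_time > 1000)
    return [reward, done]


def run_episode(request_times):
    """Step through the times at which a reward is requested.

    Returns the time of the first request with done == 1,
    or None if the run never ends within the given times.
    """
    for t in request_times:
        reward, done = reward_function(t, reward=1.0)  # placeholder reward
        if done:
            return t  # run concludes here; a new replication would start
    return None


# Assumed request times: the reward is requested every 250 s in this sketch.
end_time = run_episode([250, 500, 750, 1000, 1250, 1500])
print(end_time)
```

In this sketch the reward is computed at every request, and the run ends at the first request after time 1000 (here the one at 1250, since 1000 itself is not greater than 1000). How often the reward is actually requested depends on how the model is set up, not on this line.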