Hi everyone, what is the intention of the following three lines of code?
a. Model.find("Sink1").Reward = 0;
b. int done = (Model.time > 1000);
c. return [reward, done];
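To make the question concrete, here is how I currently read these three lines, as a minimal Python sketch. This is only my assumption about the behavior: `Sink` and `reward_function` are hypothetical stand-ins and not FlexSim API, and I am also assuming the reward label is read into `reward` just before line a.

```python
from dataclasses import dataclass


@dataclass
class Sink:
    """Hypothetical stand-in for the Sink1 object and its Reward label."""
    reward: float = 0.0


def reward_function(sink, model_time, time_limit=1000.0):
    # Assumed step before line a: read the reward accumulated since the last call.
    reward = sink.reward
    # Line a: reset the label so the same reward is not counted twice.
    sink.reward = 0.0
    # Line b: the episode is done once the model time passes the limit.
    done = int(model_time > time_limit)
    # Line c: hand both values back to the training side.
    return [reward, done]


sink = Sink()
sink.reward += 5.0                       # an item arrives and adds reward
first = reward_function(sink, 200.0)     # before the time limit
second = reward_function(sink, 1200.0)   # after the time limit, no new reward
```

If this reading is right, removing line a would let the label keep growing, so every later call would return reward that had already been reported.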
Can I delete line a so that the Reward label is shown in the dashboard?
If I want to improve the accuracy of the trained model, do I need to change the number in "model.learn(total_timesteps=1000)"?
If that is still not enough, can I add a penalty or increase the reward to achieve this goal?
Thank you!