Questions about reinforcement learning example

Question

question

Scarlett X asked May 16 2022 at 12:41 AM Jordan Johnson commented Jun 20 2022 at 2:59 PM

Questions about reinforcement learning example

Hi everyone, https://docs.flexsim.com/en/22.1/ModelLogic/ReinforcementLearning/Training/Training.html

I have same questions about reinforcement learning in flexsim,

If I reset and run model 1000 times, it means I had trained model for 1000 times?
If Q1 is true, how can I use the trained model which had been trained for 1000 times to another model?
If I want to increase processor from 1 to 3, I only need to set 3 decision events in reinforcement learning?

Thank you!!

Software Version:

FlexSim 22.0.0

reinforcement learning

· 1

______

Cookie preferences

Your privacy is important to us and so is an optimal experience. To help us customize information and build applications, we collect data about your use of this site.

May we collect and use your data?

Learn more about the Third Party Services we use and our Privacy Statement.

Strictly necessary – required for our site to work and to provide services to you

These cookies allow us to record your preferences or login information, respond to your requests or fulfill items in your shopping cart.

YES

Improve your experience – allows us to show you what is relevant to you

These cookies enable us to provide enhanced functionality and personalization. They may be set by us or by third party providers whose services we use to deliver information and experiences tailored to you. If you do not allow these cookies, some or all of these services may not be available for you.

YES

NO

Customize your advertising – permits us to offer targeted advertising to you

These cookies collect data about you based on your activities and interests in order to show you relevant ads and to track effectiveness. By collecting this data, the ads you see will be more tailored to your interests. If you do not allow these cookies, you will experience less targeted advertising.

YES

NO

Are you sure you want a less customized experience?

We can access your data only if you select "yes" for the categories on the previous screen. This lets us tailor our marketing so that it's more relevant for you. You can change your settings at any time by visiting our privacy statement

Your experience. Your choice.

We care about your privacy. The data we collect helps us understand how you use our products, what information you might be interested in, and what we can improve to make your engagement with Autodesk more rewarding.

May we collect and use your data to tailor your experience?

Explore the benefits of a customized experience by managing your privacy settings for this site or visit our Privacy Statement to learn more about your options.

Answer 1 · 2022-05-16T16:27:28Z

Jordan Johnson answered May 16 2022 at 4:27 PM Jordan Johnson commented Jun 20 2022 at 2:59 PM

To train the model, you need to run configure the Reinforcement Learning tool. Then you need to run flexsim_training.py. When you run the model yourself, using the reset and run buttons, the model will fire the OnRequestAction trigger, whenever you request an action. But this does not train the AI. When you run flexsim_training.py, the Python script will launch the model and run it. In this case, when you request an action, the OnRequestAction trigger does not fire. Instead, FlexSim sends the observations to the RL algorithm, and gets an action in response. The RL algorithm uses FlexSim as the environment. Running flexsim_training.py is how you train the AI.

Once you have run flexsim_training.py, you'll see that it produces a .zip file. That .zip file contains the AI. You can use the trained brain by running flexsim_inference.py, which becomes a server. You can then configure your OnRequestAction to send the observations to that server, and get an action back.

https://docs.flexsim.com/en/22.1/ModelLogic/ReinforcementLearning/UsingATrainedModel/UsingATrainedModel.html

To increase the number of processors, you would need to make sure the LastItemType parameter matches the last item processed. For example, suppose Processor1 works on a Type 2 item, and Processor 2 works on a Type 5 item. Then, when Processor1 requests an action, the LastItemType value needs to be 2. And when Processor 2 requests an action, the LastItemType value needs to be 5. And yes, then you can listen to the three decision events.

· 7

Scarlett X commented · May 20 2022 at 3:59 AM

Thank you for your prompt reply.Does it means I had trained the model when I run flexsim_training and python show "Saving model...

Waiting for input to do some test runs..." ?

0 ·

Jordan Johnson ♦♦ Scarlett X commented · May 20 2022 at 3:40 PM

Yes, that is correct. When the Python scripts we provide show "Saving model...", that means that python is saving the agent as a .zip file.

0 ·

Scarlett X Jordan Johnson ♦♦ commented · May 23 2022 at 12:02 AM

I'm sorry ,what's "Waiting for input to do some test runs" ? What it want me to do?

0 ·

Show more comments

question