Throughput Reinforcement Learning

Question

This question was closed Dec 18, 2022 at 10:52 AM by Jason Lightfoot for the following reason: New question asked

question

mark zhen asked Dec 12, '22 Jason Lightfoot commented Jan 4, '23

Throughput Reinforcement Learning

I want to learn how to increase my total throughput within a fixed period of time. My current reward function is as follows, but his learning is not very smooth

Model.find("Sink2").as(Object).inObjects.length

I would like to ask if there are any other ideas that can make my reward function learning more effective, (and I would also like to ask how I should write about the punishment mechanism) For example, if he is less than a standard I set, I will deduct points for him. If it is bigger than the standard I set, I will give him extra points

Software Version:

FlexSim 22.0.0

reinforcement learning

1670822008630.png (10.5 KiB)

5 |100000

Attachments: Up to 12 attachments (including images) can be used with a maximum of 23.8 MiB each and 47.7 MiB total.

Answer 1 · 2022-12-12T09:03:43Z

Joerg Vogel answered Dec 12, '22 Jason Lightfoot commented Jan 4, '23

currently you evaluate the static value of inObjects by this code or more visually the number of input ports.

Maybe you want to get a more progressiv value like input of items, then you can try to do this by stats property over input.value

· 15

5 |100000

Attachments: Up to 12 attachments (including images) can be used with a maximum of 23.8 MiB each and 47.7 MiB total.

mark zhen commented · Dec 15, 2022 at 09:48 AM

Then if I want to write a function that is the total processing time of a single machine minus the total idle time of a single machine, how should I write it?

0 ·

Jason Lightfoot ♦♦ mark zhen commented · Dec 15, 2022 at 12:08 PM

To access those values you want to use :

<object>.stats.state().getTotalTimeAt(STATE_IDLE)

replacing STATE_IDLE with the appropriate macro.

1 ·

mark zhen Jason Lightfoot ♦♦ commented · Dec 18, 2022 at 06:42 AM

I wonder if I should write like this?

0 ·

1671345753064.png (5.8 KiB)

Show more comments

Joerg Vogel mark zhen commented · Dec 15, 2022 at 09:54 AM

@mark zhen, we were at this point already. Please ask this request as a new question, because then more users, distributors and developers can participate ! Thank you.

0 ·

Joerg Vogel mark zhen commented · Dec 15, 2022 at 10:05 AM

Direct hint: you shouldn’t consider this, because there might occurs a situation when total idle time is larger than total processing time. I am not sure how your reward system reacts on negative values.

0 ·

mark zhen commented · Dec 17, 2022 at 02:33 PM

I don't know why he reported an error, can you help me deal with it?allcombos-22-0-1.fsm

0 ·

allcombos-22-0-1.fsm (328.8 KiB)

mark zhen mark zhen commented · Dec 18, 2022 at 06:34 AM

@Joerg Vogel @Jason Lightfoot

0 ·

Joerg Vogel mark zhen commented · Dec 18, 2022 at 08:48 AM

Comment as new question created.

0 ·

mark zhen Joerg Vogel commented · Dec 18, 2022 at 11:03 AM

why mine is closed

0 ·

Show more comments

question

Throughput Reinforcement Learning

1 Answer

Things to know…

question details

question

Throughput Reinforcement Learning

1 Answer

Things to know…

question details

Related Questions