58.4k views
9 votes
Suppose that the performance measure is concerned with just the first T time steps of the environment and ignores everything thereafter. Show that a rational agentâs action may depend not just on the state of the environment but also on the time step it has reached.

User Kulan
by
5.4k points

1 Answer

7 votes

Answer:

the first T time steps are a factor in the performance measure. So for instance, if the environment is in state A at time step 1, the performance measure can be different than being in state A at step 2 since the state of the environment in step 1 is relevant to the performance measure in the latter case. Thus as the performance measures can be different, the rational agent may make different actions.

User ProXicT
by
5.7k points