Answer:
This refers to reinforcement learning, a machine learning approach in which an agent interacts with its environment, receives feedback in the form of rewards, and learns to choose the actions that maximize its total reward over time.
Step-by-step explanation:
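To make the idea concrete, here is a minimal sketch of one common way an agent can learn from reward feedback: tabular Q-learning. Everything in it is an assumption chosen for illustration and not part of the original question: the five-cell corridor environment, the `step`, `train`, and `greedy_policy` helpers, and the values of `alpha`, `gamma`, and `epsilon`. The agent starts in cell 0, earns a reward of 1 only when it reaches cell 4, and gradually learns that moving right is the best action.

```python
import random

N_STATES = 5          # corridor cells 0..4; cell 4 is the goal (assumed toy setup)
ACTIONS = (-1, +1)    # index 0 = step left, index 1 = step right


def step(state, action):
    """Hypothetical environment: returns (next_state, reward, done)."""
    next_state = min(max(state + action, 0), N_STATES - 1)
    done = next_state == N_STATES - 1
    return next_state, (1.0 if done else 0.0), done


def train(episodes=500, alpha=0.5, gamma=0.9, epsilon=0.2, seed=0):
    """Learn a table of action values q[state][action] from reward feedback."""
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(N_STATES)]
    for _ in range(episodes):
        state, done = 0, False
        while not done:
            if rng.random() < epsilon:
                # Explore: occasionally try a random action.
                a = rng.randrange(2)
            else:
                # Exploit: take the best-known action, breaking ties randomly.
                best = max(q[state])
                a = rng.choice([i for i, v in enumerate(q[state]) if v == best])
            next_state, reward, done = step(state, ACTIONS[a])
            # Feedback from the environment: the reward plus the discounted
            # value of the best action available in the next state.
            target = reward + (0.0 if done else gamma * max(q[next_state]))
            q[state][a] += alpha * (target - q[state][a])
            state = next_state
    return q


def greedy_policy(q):
    """Best learned action for each non-goal cell."""
    return ["left" if row[0] > row[1] else "right" for row in q[:-1]]


if __name__ == "__main__":
    print(greedy_policy(train()))
```

In this sketch the agent is never told which action is correct. It only sees the rewards it receives, and the update rule slowly passes the value of the goal back to earlier cells, so the learned policy ends up preferring the action that leads toward the reward.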
Hope this helps.