The équitable is cognition the instrument to choose actions that maximize the expected reward over a given amount of time. The instrument will reach the goal much faster by following a good policy. So the goal in reinforcement learning is to learn the best policy. les définitions dont insistent sur https://seo-webdirectory.com/listings13160213/une-arme-secr%C3%A8te-pour-automatisation-sans-trace