Greedy Policy In Reinforcement Learning, a policy that always chooses the action with the highest expected return.