Reinforcement Learning(DL) QUIZ (MCQ QUESTIONS AND ANSWERS)

Total Correct: 0

Time:20:00

Question: 1

What is the term for the technique in reinforcement learning that encourages exploration by adding noise to the policy during training?

Question: 2

In reinforcement learning, what is the main purpose of using a replay buffer in Deep Q-Networks (DQN)?

Question: 3

What is the term for the technique in reinforcement learning that reduces the learning rate as the agent gains more experience?

Question: 4

Which reinforcement learning algorithm is known for its stability and ease of use in practice, often used in both continuous and discrete action spaces?

Question: 5

In reinforcement learning, what is the term for the process of estimating the expected future rewards of a state-action pair using a learned value function?

Question: 6

What is the primary advantage of using policy gradient methods in reinforcement learning?

Question: 7

Which reinforcement learning algorithm uses an epsilon-greedy exploration strategy to balance exploration and exploitation?

Question: 8

What is the primary role of the critic in the Actor-Critic reinforcement learning architecture?

Question: 9

In reinforcement learning, what is the term for the measure of the advantage of taking a specific action in a given state compared to the expected value?

Question: 10

Which reinforcement learning algorithm is specifically designed for continuous action spaces and uses an actor-critic architecture?

Question: 11

What is the primary difference between model-based and model-free reinforcement learning methods?

Question: 12

Which reinforcement learning algorithm is based on the idea of estimating the advantage of taking a particular action in a given state?

Question: 13

In reinforcement learning, what is the term for the prediction of future rewards given a specific state and action?

Question: 14

What is the term for the process of iteratively improving a policy through trial and error in reinforcement learning?

Question: 15

Which reinforcement learning algorithm uses a neural network to approximate the value function and is known for its success in game playing?

Question: 16

In reinforcement learning, what is the primary goal of an agent?

Question: 17

What is the primary difference between on-policy and off-policy methods in reinforcement learning?

Question: 18

Which reinforcement learning algorithm is known for its ability to handle continuous action spaces and is often used in robotics?

Question: 19

What is the primary limitation of using a high discount factor in reinforcement learning?

Question: 20

In reinforcement learning, what does the term "exploration vs. exploitation" refer to?

Question: 21

What is the term for the numerical factor used to discount future rewards in reinforcement learning?

Question: 22

Which reinforcement learning approach combines elements of both value-based and policy-based methods by using a value function and a policy?

Question: 23

What is the primary difference between on-policy and off-policy reinforcement learning methods?

Question: 24

In reinforcement learning, what is the term for the measure of the long-term expected rewards for an agent following a policy?

Question: 25

Which reinforcement learning algorithm estimates the value of being in a particular state and following a particular policy?

Question: 26

What is the term for the mapping from states to actions in reinforcement learning?

Question: 27

Which reinforcement learning paradigm involves learning from trial and error by interacting with the environment?

Question: 28

What is the role of a reward signal in reinforcement learning?

Question: 29

In reinforcement learning, what is the environment?

Question: 30

What is the term used to describe the process of selecting actions to maximize expected rewards in reinforcement learning?