Q-learning

concept · part of Machine Learning & Data

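A value-based, off-policy reinforcement learning algorithm that learns a Q-value (the expected discounted return) for each state-action pair by repeatedly applying an update derived from the Bellman equation. In its tabular form it is best suited to problems with discrete action spaces.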
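
For reference, the update that this definition describes is usually written as shown here. The symbols are standard textbook notation rather than anything taken from these notes: \alpha is the learning rate, \gamma the discount factor, r the reward received, and s' the next state.

```latex
Q(s, a) \leftarrow Q(s, a) + \alpha \left[ r + \gamma \max_{a'} Q(s', a') - Q(s, a) \right]
```

The bracketed term is the difference between the Bellman target (immediate reward plus discounted best future value) and the current estimate.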

Inside Q-learning (2)

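Bellman equation: The recursive relation Q-learning uses to update a Q-value, with a target equal to the immediate reward plus the discounted value of the best action available in the next state.
Epsilon-greedy strategy: An exploration strategy used in Q-learning that takes a random action with probability epsilon and otherwise exploits the action with the highest learned Q-value.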
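
The two pieces above can be sketched together in a few lines of Python. This is a minimal illustration, not a reference implementation: the function names, the 5-state chain environment, and the values of `alpha`, `gamma`, and `epsilon` are all assumptions made up for the example.

```python
import random


def epsilon_greedy(q_row, epsilon, rng):
    """Pick a random action with probability epsilon, else a greedy one."""
    if rng.random() < epsilon:
        return rng.randrange(len(q_row))
    best = max(q_row)
    # Break ties at random so early all-zero rows do not favour one action.
    return rng.choice([a for a, v in enumerate(q_row) if v == best])


def q_update(q, s, a, r, s_next, done, alpha=0.5, gamma=0.9):
    """Apply one Bellman-style update to the table entry q[s][a]."""
    # Terminal transitions have no future value, so the target is just r.
    target = r if done else r + gamma * max(q[s_next])
    q[s][a] += alpha * (target - q[s][a])


def train_chain(n_states=5, episodes=500, epsilon=0.2, seed=0):
    """Hypothetical toy task: walk a chain; action 0 = left, 1 = right.

    Reaching the last state ends the episode with reward 1.
    """
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(n_states)]
    for _ in range(episodes):
        s = 0
        while s != n_states - 1:
            a = epsilon_greedy(q[s], epsilon, rng)
            s_next = max(s - 1, 0) if a == 0 else s + 1
            done = s_next == n_states - 1
            r = 1.0 if done else 0.0
            q_update(q, s, a, r, s_next, done)
            s = s_next
    return q


if __name__ == "__main__":
    for state, row in enumerate(train_chain()):
        print(state, [round(v, 3) for v in row])
```

After training, the learned value for moving right should exceed the value for moving left in every non-terminal state, which is the behaviour the greedy part of the strategy then exploits.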

Connections

Alternative to Policy gradients
Uses Epsilon-greedy strategy
Uses Bellman equation
