Tetris AI — DQN Agent

Score

Lines

Episodes: 1,000 · Best Score: — · Parameters: 103,300

← Left

→ Right

↻ Rotate

↓ Drop

Higher Q-value = agent believes this action leads to better future score

← Left

—

→ Right

—

↻ Rotate

—

↓ Drop

—

The only human judgment in the system — everything else the agent learned itself

Line clear+100

Tetris (4 lines)+800

Per hole−5

Bumpiness−0.5

Height−0.3

Death−500

Input: 15 board features

Hidden: 256 neurons

Hidden: 128 neurons

Output: 4 Q-values

Trained with experience replay + target network stabilisation

• 10 column heights
• Total holes
• Surface bumpiness
• Max column height
• Average height
• Lines cleared so far