Making Similar Mistakes Twice

Learnings from 12 Hour Hackathon

7 min readOct 19, 2021

We’re exactly like Artificial Intelligence.

Deep learning, specifically.

One of the core parts of deep learning is reinforcement learning. It’s similar to operant conditioning in psychology where we’re rewarded or punished based on a set of behaviors. In reinforcement learning, an agent is exploring a new environment and as they are taking actions, changing states they get rewarded or punished.

Making Similar Mistakes Twice

Learnings from 12 Hour Hackathon

Written by Zayn Patel