Brainstorming Session – Supervised Learning
Q1: Why does the model add a constant (bias/intercept)?
Imagine we’re fitting a line through dots.
Without a constant:
If we only use y = m * x, our line would always go through (0,0). But what if our data looks like this?
| x | y |
|---|---|
| 1 | 3 |
| 2 | 5 |
| 3 | 7 |
Clearly, y = 2x + 1 fits best — not y = 2x.
That +1 is the bias (or intercept) — it lets the line shift up or down to match the data.
Think of it like this:
- The slope (m) tilts the line.
- The bias (c) moves the line up or down.
- So: the model adds the constant to better fit data that doesn’t start at 0 (the quick sketch below shows the difference).
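A minimal sketch in plain Python (the variable and function names here are just for illustration) comparing the two candidate lines on the table above:

```python
# Compare the line without a bias (forced through the origin) to the line with one.
xs = [1, 2, 3]
ys = [3, 5, 7]

def predict(x, m, c):
    return m * x + c

no_bias = [predict(x, 2, 0) for x in xs]    # [2, 4, 6] -> every guess is 1 too low
with_bias = [predict(x, 2, 1) for x in xs]  # [3, 5, 7] -> matches the data exactly

print(no_bias, with_bias)
```

Shifting the whole line up by 1 is exactly what the bias term does.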
Q2: When does the model adjust the slope and the constant?
Answer: Every training step!
During training (every loop in gradient descent), the model:
- Sees the prediction is wrong (error)
- Figures out how wrong it is
- Calculates how much to adjust both the slope (m) and bias (c)
- Updates them to reduce the error
This happens on every iteration of the training loop (one full pass over the training data is called an “epoch”), and it continues until the error becomes really small.
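A rough sketch of what such a loop might look like for the table above, assuming mean squared error and made-up values for the learning rate and step count:

```python
# Every pass through this loop predicts, measures the error, and nudges m and c.
xs = [1, 2, 3]
ys = [3, 5, 7]
m, c = 0.0, 0.0
learning_rate = 0.05

for step in range(1000):
    preds = [m * x + c for x in xs]                                  # 1. make predictions
    errors = [p - y for p, y in zip(preds, ys)]                      # 2. see how wrong they are
    grad_m = 2 * sum(e * x for e, x in zip(errors, xs)) / len(xs)    # 3. how much to adjust m
    grad_c = 2 * sum(errors) / len(xs)                               #    ...and c
    m -= learning_rate * grad_m                                      # 4. update to reduce the error
    c -= learning_rate * grad_c

print(m, c)   # ends up close to 2 and 1, i.e. y = 2x + 1
```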
Q3: How does the model decide how to adjust them?
It uses gradients (think: slopes of the error curve).
Let’s visualize it simply:
- The model wants to find the lowest error.
- Picture the error plotted against the slope (or the bias) as a little hill.
- The model walks downhill toward less error, using calculus to find the downhill direction.
The math (we saw this in the gradient descent part), where total_error_m and total_error_c are the gradients of the error with respect to m and c:
m -= learning_rate * total_error_m
c -= learning_rate * total_error_c
If the error says “your guess is too low”, the update increases m or c.
If the guess is too high, it decreases m or c.
This continues until the predictions are very close to the actual values.
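A tiny numeric check of that sign behavior, assuming the squared-error gradients from the loop sketch above (the starting guesses are made up for illustration):

```python
# When the guess is too low, the error (pred - y) is negative, so the gradients are
# negative and "m -= learning_rate * grad" pushes m and c upward; too high does the opposite.
x, y = 2.0, 5.0                            # one point from the true line y = 2x + 1
learning_rate = 0.1

for m, c in [(1.0, 0.0), (3.0, 2.0)]:      # first guess too low, second too high
    pred = m * x + c
    error = pred - y
    grad_m = 2 * error * x                 # d(error^2)/dm
    grad_c = 2 * error                     # d(error^2)/dc
    print(m, "->", m - learning_rate * grad_m, "|", c, "->", c - learning_rate * grad_c)
```

The first guess moves both m and c up; the second moves both down, exactly as described above.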
Summary: One-Liner Answers
| Question | Simple Answer |
|---|---|
| Why add a constant? | To shift the line up/down and fit data that doesn’t pass through zero. |
| When adjust? | On every training step (iteration) during learning. |
| How adjust? | Using the gradient of the error: small steps that make predictions more accurate. |
Supervised Learning – Connecting the Dots