Xavier Initialization Applicability in Neural Networks

1. Simple Explanation – Where Xavier Helps (Real-World Use Case)

Use Case: Handwritten Digit Recognition (like MNIST)

Imagine we’re building a neural network to identify handwritten digits (0–9) from images.

Problem Without Proper Initialization:

If we randomly assign weights that are too small or too large:

  • Activations shrink toward zero as they pass through the layers → Vanishing Gradient
  • Or they grow out of control → Exploding Gradient

Our network may:

  • Learn very slowly
  • Or never converge at all (the sketch below shows both failure modes)
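
To make this concrete, here is a minimal NumPy sketch (the layer width of 256, the depth of 10, and the weight scales 0.01 and 1.0 are illustrative choices, not values from the original text). It pushes a random batch through a stack of tanh layers and prints how the activation spread either collapses or saturates:

    import numpy as np

    rng = np.random.default_rng(0)
    x = rng.standard_normal((64, 256))  # a random batch standing in for flattened images

    for scale, label in [(0.01, "too small"), (1.0, "too large")]:
        h = x
        for _ in range(10):  # 10 dense tanh layers, all 256 -> 256
            W = rng.standard_normal((256, 256)) * scale
            h = np.tanh(h @ W)
        # Tiny weights shrink activations toward zero (vanishing gradients);
        # large weights saturate tanh at +/-1, which also kills gradients.
        print(f"weights {label}: std of final activations = {h.std():.6f}")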

How Xavier Helps:

Xavier initialization scales the random starting weights so that:

  • The variance of activations stays roughly the same from layer to layer (inputs ≈ outputs)
  • Gradients flow smoothly during backpropagation (a quick check follows this list)
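
As a quick check, this sketch repeats the depth-10 tanh stack from above but draws the weights from the Xavier (Glorot) uniform distribution instead; the sizes are again illustrative. The printed standard deviation now stays roughly constant across layers instead of collapsing:

    import numpy as np

    rng = np.random.default_rng(0)
    n = 256
    h = rng.standard_normal((64, n))

    for layer in range(10):
        limit = np.sqrt(6.0 / (n + n))  # Xavier uniform limit with n_in = n_out = n
        W = rng.uniform(-limit, limit, size=(n, n))
        h = np.tanh(h @ W)
        print(f"layer {layer + 1}: activation std = {h.std():.4f}")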

Simple Step-by-Step:

  • Find the number of inputs (n_in) and outputs (n_out) of the layer.
  • Use this formula to set the range of the initial weights (Xavier/Glorot uniform):

        W ~ U[ -sqrt(6 / (n_in + n_out)), +sqrt(6 / (n_in + n_out)) ]

  • This avoids overly large or small initial signals and keeps training stable (see the Python sketch below).
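
Putting those steps into code, here is a minimal pure-NumPy sketch; the helper name xavier_uniform and the MNIST-like shape (784 inputs, 10 outputs) are my own illustrative choices:

    import numpy as np

    def xavier_uniform(n_in, n_out, rng):
        """Sample an (n_in, n_out) weight matrix from U[-limit, +limit]
        with limit = sqrt(6 / (n_in + n_out))."""
        limit = np.sqrt(6.0 / (n_in + n_out))
        return rng.uniform(-limit, limit, size=(n_in, n_out))

    # Example: one dense layer mapping 28x28 = 784 pixels to 10 digit classes.
    rng = np.random.default_rng(42)
    W = xavier_uniform(784, 10, rng)
    b = np.zeros(10)  # biases are commonly initialized to zero
    print(W.shape, round(W.min(), 4), round(W.max(), 4))  # values stay within +/- sqrt(6/794), about 0.087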

Xavier Initialization Applicability in Neural Networks – Xavier Initialization Example with Simple Python