AlexNet Architecture (2012)

AlexNet (2012)

AlexNet won the ImageNet Large Scale Visual Recognition Challenge (ILSVRC) in 2012 by a wide margin and helped usher in the deep learning era. Its key innovations are summarized below.

5 Convolutional Layers + 3 Fully Connected Layers
Total: about 60 million parameters

Eight learned layers made the network unusually deep for its time.
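The 60 million figure can be checked with a quick count. The sketch below assumes the layer shapes from the published AlexNet paper's two-GPU layout (filter sizes, channel counts, and the 6x6x256 input to the first FC layer); none of those shapes are given in the text above, so treat them as assumptions.

```python
# Layer shapes follow the two-GPU layout described in the AlexNet paper;
# they are assumptions here, not values stated in this article.
# Each conv entry: (kernel_h, kernel_w, input_channels_per_filter, filters)
CONV_LAYERS = [
    (11, 11, 3, 96),    # conv1
    (5, 5, 48, 256),    # conv2: each filter sees only the 48 maps on its own GPU
    (3, 3, 256, 384),   # conv3: connected to the maps on both GPUs
    (3, 3, 192, 384),   # conv4
    (3, 3, 192, 256),   # conv5
]
# Each FC entry: (inputs, outputs)
FC_LAYERS = [
    (6 * 6 * 256, 4096),  # fc6
    (4096, 4096),         # fc7
    (4096, 1000),         # fc8: one output per ImageNet class
]


def conv_params(kh, kw, c_in, filters):
    """Weights plus one bias per filter."""
    return kh * kw * c_in * filters + filters


def fc_params(n_in, n_out):
    """Weights plus one bias per output unit."""
    return n_in * n_out + n_out


conv_total = sum(conv_params(*layer) for layer in CONV_LAYERS)
fc_total = sum(fc_params(*layer) for layer in FC_LAYERS)
print(f"conv: {conv_total:,}  fc: {fc_total:,}  total: {conv_total + fc_total:,}")
```

Under these assumptions the three fully connected layers hold the great majority of the parameters, while the five convolutional layers contribute only a few million.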

First major CNN to use ReLU (replacing tanh/sigmoid)

Activation	Effect on training
Sigmoid/Tanh	Saturates for large inputs, causing vanishing gradients
ReLU	Faster training; gradient preserved for positive inputs
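A small numerical sketch of the contrast in the table: it multiplies the local derivative of each activation across several layers, ignoring weights entirely. The helper names and the choice of input 2.0 and depth 8 are illustrative assumptions, and real networks behave less uniformly than this.

```python
import math


def sigmoid_grad(x):
    """Derivative of the logistic sigmoid; at most 0.25."""
    s = 1.0 / (1.0 + math.exp(-x))
    return s * (1.0 - s)


def tanh_grad(x):
    """Derivative of tanh; approaches 0 as |x| grows."""
    return 1.0 - math.tanh(x) ** 2


def relu_grad(x):
    """Derivative of ReLU: 1 for positive inputs, 0 otherwise."""
    return 1.0 if x > 0 else 0.0


def chained_gradient(grad_fn, x, depth):
    """Product of `depth` identical local derivatives (weights ignored)."""
    return grad_fn(x) ** depth


for name, fn in [("sigmoid", sigmoid_grad), ("tanh", tanh_grad), ("relu", relu_grad)]:
    print(name, chained_gradient(fn, 2.0, depth=8))
```

For a positive pre-activation the ReLU product stays at 1, whereas the sigmoid and tanh products shrink by orders of magnitude over eight layers. Note that ReLU passes no gradient at all for negative inputs.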

50% Dropout applied to the first two FC layers

A then-recent regularization technique that reduces overfitting by randomly zeroing hidden units during training.
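As a minimal sketch of dropout on a list of activations, the code below follows the scheme described in the AlexNet paper: units are zeroed with probability 0.5 during training, and all outputs are scaled by 0.5 at test time. The function names are hypothetical, and most modern libraries use the "inverted" variant instead, which rescales during training.

```python
import random


def dropout_train(activations, p_drop=0.5, rng=None):
    """Zero each activation independently with probability p_drop."""
    if rng is None:
        rng = random.Random()
    return [0.0 if rng.random() < p_drop else a for a in activations]


def dropout_test(activations, p_drop=0.5):
    """Keep every unit but scale by the keep probability."""
    return [a * (1.0 - p_drop) for a in activations]


hidden = [0.8, 1.5, 0.0, 2.3, 0.4, 1.1]
print("train:", dropout_train(hidden, rng=random.Random(0)))
print("test: ", dropout_test(hidden))
```

The test-time scaling keeps the expected input to the next layer roughly the same as it was during training, when about half the units were missing.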

Trained on 2x GTX 580 GPUs in parallel

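The model was split across the two GPUs because it did not fit in the 3 GB of memory on a single card; GPU acceleration is what made training a network of this size practical.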

Data augmentation: artificially expanding the training data with random crops, horizontal flips, and color perturbations for better generalization.
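A rough sketch of two of the augmentations the AlexNet paper describes, random cropping and horizontal reflection, on an image stored as nested lists. The function name and the tiny 8x8 example are illustrative assumptions; the paper itself takes 224x224 patches from 256x256 images and also perturbs RGB intensities, which is not shown here.

```python
import random


def random_crop_and_flip(image, crop_size, rng=None):
    """Return a random crop_size x crop_size patch, mirrored half the time.

    image: list of rows, each row a list of pixel values.
    """
    if rng is None:
        rng = random.Random()
    height, width = len(image), len(image[0])
    top = rng.randint(0, height - crop_size)    # randint is inclusive on both ends
    left = rng.randint(0, width - crop_size)
    patch = [row[left:left + crop_size] for row in image[top:top + crop_size]]
    if rng.random() < 0.5:
        patch = [row[::-1] for row in patch]     # horizontal reflection
    return patch


# Toy 8x8 "image" whose pixel value encodes its position.
image = [[r * 8 + c for c in range(8)] for r in range(8)]
for row in random_crop_and_flip(image, crop_size=6, rng=random.Random(1)):
    print(row)
```

Because each call can return a different patch, one stored image yields many distinct training examples without any extra labeling.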

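Local Response Normalization (LRN): a normalization technique applied after ReLU in the first two convolutional layers.

In later architectures it was largely superseded by Batch Normalization.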
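For reference, a minimal sketch of local response normalization (LRN), the scheme the AlexNet paper uses, at a single spatial position. Each activation is divided by a term built from the squared activations of neighboring channels. The function name is hypothetical; the defaults (n=5, k=2, alpha=1e-4, beta=0.75) are the values reported in the paper, and library implementations may scale alpha differently.

```python
def local_response_norm(channels, n=5, k=2.0, alpha=1e-4, beta=0.75):
    """Normalize across channels at one spatial position.

    channels: post-ReLU activations, one value per channel.
    b_i = a_i / (k + alpha * sum of a_j^2 over up to n adjacent channels) ** beta
    """
    num = len(channels)
    half = n // 2
    out = []
    for i, a in enumerate(channels):
        lo = max(0, i - half)            # clip the window at the first channel
        hi = min(num - 1, i + half)      # and at the last channel
        sq_sum = sum(channels[j] ** 2 for j in range(lo, hi + 1))
        out.append(a / (k + alpha * sq_sum) ** beta)
    return out


print(local_response_norm([0.0, 50.0, 1.0, 1.0, 1.0, 1.0, 1.0]))
```

A strongly active channel raises the denominator for its neighbors, so nearby channels are damped relative to distant ones. Batch Normalization instead normalizes each channel using statistics computed over the mini-batch.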