Deep Learning · study

Deep learning is how computers learn patterns too subtle to program by hand — turning pixels into objects, audio into words, and prompts into prose.

FIG_002

A neural network: layers of weighted connections.

The basic unit is simple: multiply inputs by weights, add a bias, pass the result through a nonlinearity, and repeat. Stack enough of those units and, given sufficient width, the network can approximate any continuous function on a bounded domain to arbitrary accuracy.
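As a rough illustration of that unit, here is a minimal sketch in plain Python. The function name `dense`, the ReLU nonlinearity, and the hand-picked weights are assumptions made for the example, not something this page specifies.

```python
def relu(z):
    """A common nonlinearity: pass positives through, clamp negatives to zero."""
    return max(0.0, z)

def dense(inputs, weights, biases, activation=relu):
    """One layer: each unit takes a weighted sum of the inputs, adds its bias,
    and passes the result through the activation."""
    return [
        activation(sum(w * x for w, x in zip(unit_weights, inputs)) + b)
        for unit_weights, b in zip(weights, biases)
    ]

# Hypothetical two-layer network with hand-picked (not learned) weights.
x = [1.0, 2.0]
hidden = dense(x, weights=[[0.5, -1.0], [1.0, 1.0]], biases=[0.0, -1.0])
output = dense(hidden, weights=[[2.0, 1.0]], biases=[0.5], activation=lambda z: z)

print(hidden)  # [0.0, 2.0]
print(output)  # [2.5]
```

The second call feeds the first layer's output into another layer, which is all "and repeat" means here.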

Stacking layers is what adds power: the early ones learn edges and textures, the later ones objects and meaning — features composed from features, with no hand engineering.

FIG_003

A transformer cell: each token attends to the others.

Learning reduces to calculus. Backpropagation carries the error from the output back through every layer, computing for each weight the direction to adjust — and gradient descent takes the step.
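To make the two steps concrete, here is a small sketch that trains a one-hidden-unit network by hand-written backpropagation and plain gradient descent. The network shape, the tanh nonlinearity, the squared-error loss, the learning rate, and the toy data are all assumptions chosen to keep the example short.

```python
import math

def train(data, steps=200, lr=0.1):
    # Hypothetical tiny network: y_hat = w2 * tanh(w1 * x + b1) + b2
    w1, b1, w2, b2 = 0.5, 0.0, 0.5, 0.0
    n = len(data)
    losses = []
    for _ in range(steps):
        g_w1 = g_b1 = g_w2 = g_b2 = 0.0
        loss = 0.0
        for x, y in data:
            # Forward pass.
            h = math.tanh(w1 * x + b1)
            y_hat = w2 * h + b2
            err = y_hat - y
            loss += err * err / n
            # Backward pass: apply the chain rule from the output back to each weight.
            d_y_hat = 2.0 * err / n        # d(loss)/d(y_hat)
            g_w2 += d_y_hat * h
            g_b2 += d_y_hat
            d_h = d_y_hat * w2             # error carried back into the hidden unit
            d_z = d_h * (1.0 - h * h)      # through tanh: tanh'(z) = 1 - tanh(z)^2
            g_w1 += d_z * x
            g_b1 += d_z
        losses.append(loss)
        # Gradient descent: step each weight against its gradient.
        w1 -= lr * g_w1
        b1 -= lr * g_b1
        w2 -= lr * g_w2
        b2 -= lr * g_b2
    return (w1, b1, w2, b2), losses

# Toy data, invented for illustration.
data = [(-1.0, -0.8), (-0.5, -0.5), (0.0, 0.0), (0.5, 0.5), (1.0, 0.8)]
params, losses = train(data)
print(losses[0], losses[-1])  # the loss at the last step is lower than at the first
```

Real frameworks derive the backward pass automatically, but the arithmetic they perform follows the same chain-rule pattern.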

Inside each cell, attention lets every token draw on every other; the mixed signal then passes through ordinary weighted units, and gradient descent tunes millions of those weights until the output matches the data.
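The mixing step can be sketched as follows. This assumes the scaled dot-product form of attention with a single head, and for brevity uses the token vectors directly as queries, keys, and values. A real transformer would first apply learned projections, which are omitted here.

```python
import math

def softmax(scores):
    """Turn raw scores into positive weights that sum to 1."""
    m = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def attention(queries, keys, values):
    """Scaled dot-product attention: each query scores every key, and the
    softmaxed scores weight a sum over the values."""
    d = len(keys[0])
    outputs, all_weights = [], []
    for q in queries:
        scores = [sum(qi * ki for qi, ki in zip(q, k)) / math.sqrt(d) for k in keys]
        weights = softmax(scores)
        mixed = [
            sum(w * v[j] for w, v in zip(weights, values))
            for j in range(len(values[0]))
        ]
        outputs.append(mixed)
        all_weights.append(weights)
    return outputs, all_weights

# Three hypothetical 2-dimensional token vectors.
tokens = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
outputs, weights = attention(tokens, tokens, tokens)
for row in weights:
    print([round(w, 3) for w in row])
```

Each row of `weights` shows how much one token draws on every token, itself included. Each row of `outputs` is the resulting mixed signal.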

FIG_004

Training: the loss falls as the model learns.

FIG_001

Neural super-sampling: a network upscales a cheap low-res frame into a crisp one, as NVIDIA's DLSS does for every rendered frame.

Architecture encodes assumptions about the data. Convolutions bake in the structure of images, recurrence the order of sequences, attention the long-range links in language — each a different prior about the world.

FIG_005

A convolution: a kernel slides across the image, building a feature map.
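The sliding operation in the caption can be sketched in a few lines. The sketch assumes stride 1 and no padding, and like most deep learning libraries it does not flip the kernel (strictly speaking, a cross-correlation). The image and the edge-detecting kernel are made up for the example.

```python
def conv2d(image, kernel):
    """Slide the kernel over every position where it fits entirely inside the
    image, taking the sum of elementwise products at each position."""
    kh, kw = len(kernel), len(kernel[0])
    out_h = len(image) - kh + 1
    out_w = len(image[0]) - kw + 1
    return [
        [
            sum(
                image[i + di][j + dj] * kernel[di][dj]
                for di in range(kh)
                for dj in range(kw)
            )
            for j in range(out_w)
        ]
        for i in range(out_h)
    ]

# Hypothetical 3x4 image: dark on the left, bright on the right.
image = [
    [0, 0, 1, 1],
    [0, 0, 1, 1],
    [0, 0, 1, 1],
]
# Hand-picked kernel that responds to a dark-to-bright vertical edge.
edge_kernel = [
    [-1, 1],
    [-1, 1],
]
feature_map = conv2d(image, edge_kernel)
print(feature_map)  # [[0, 2, 0], [0, 2, 0]]
```

The feature map is nonzero only where the kernel straddles the edge. In a trained network the kernel values are learned rather than chosen by hand.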

Scale changed what the method could do. The same gradients and the same layers, given enough data and compute, went from research curiosity to systems that write, draw, and reason.

The trend has held: with more data, compute, and parameters, the same methods keep improving, which is why they now underpin most of modern AI.

Contents.

1. Mathematical Background

2. Foundations

3. Neural Networks

4. Optimization

5. Regularization

6. Architectures

7. Theory & Frontiers

8. Generative Models

9. Probabilistic Methods

10. Practical Deep Learning

11. Large Models & Agents

12. Reinforcement Learning
