Channel: SOTA Deep Learning Tutorials

Triton Vector Addition Kernel, part 4: Benchmarking vs PyTorch and tuning

Triton Vector Addition Kernel, part 3: Verifying Numerical Accuracy

Triton Vector Addition Kernel, part 2: Coding the Triton Kernel

Triton Vector Addition Kernel, part 1: Making the Shift to Parallel Programming

Intro to Triton: A Parallel Programming Compiler and Language, esp for AI acceleration (updated)

Tiled Matrix Multiplication in Triton - part 1

Triton Compiler Reserved Keywords, or ... what happened to all my params?

Coding Online Softmax in PyTorch - a faster Softmax via reduced memory access

Coding a Triton Kernel for Softmax (fwd pass) Computation

Intro to Triton: Coding Softmax in PyTorch

Leetcode explained - Web Crawler Multithreaded, implemented in Python 3 (leetcode 1242)

Hot dog detector - not so much state of the art deep learning, but funny

In 20 minutes: Build an AI pet breed classifier with Deep Learning... 20 minutes & 50 cents.

Meet AdaMod: New Deep Learning Optimizer with Long Term Memory

Deep Learning (AI) Optimizer - Meet DiffGrad
