Gaurav Tadkapally
projects
everything from scratch
writeups
large language models
gpt-2 + pre-training
multi-head attn · layer-norm
llama-2
rope embeddings · rms-norm · silu
llama-3 and llama-3.1
rope embeddings · group query attn
lstm
machine learning
linear regression
numpy-only
logistic regression
numpy-only
neural network w/ backprop
numpy-only
k-means