\(\rightarrow\) Some CUDA/C++ learning notes:
- GPU architecture and warp scheduling.
- Occupancy, Compute intensity, and Tiling.
- DRAM banks and why it matters for code optimization.
- CUDA block indexing, and coalesced memory accesses
\(\rightarrow\) Random C++:
\(\rightarrow\) Technical
- Handling checkpoints from terminal - some useful tricks.
- Quick reset of my compute pod.
- Speed up your migration to VIM.
\(\rightarrow\) LLMs
- Some notes on perplexity and beyond!
- Recent trends to speed up autoregressive inference of LLMs (unifinished).
\(\rightarrow\) Math:
Some GitHub repos:
- GPT2 factorized with (multiple) Kronecker Factors: \(\rightarrow\) Github link.
- Backpropagation from scratch \(\rightarrow\) Github link.
- Randomized Algorithms \(\rightarrow\) Github link.
- Reinforcement Learning \(\rightarrow\) Github link.
Bio/Contact:
đź“ŤPassau, Germany.
- I’m a part-time ML research assistant @Uni Passau, working on LLMs.
- I’m also a computational math master’s student at @UniPassau.
- Gmail:
ayoub.benayad.467
; LinkedIn ; Instagram.