Projects — Kashif

smol-llama

360M parameter LLaMA trained from scratch on 6B tokens. GQA, RoPE, SwiGLU. Single H100, 22hrs, $53.

Curated 6B token dataset from FineWeb. Pre-tokenized with custom 49K BPE vocab.

Autograd engine and neural networks in ~500 lines of NumPy. Educational deep learning.

Lightweight vector DB with hnswlib + SQLite. Crash recovery, 2K+ PyPI downloads.

AI Discord bot. Natural conversations, image understanding, and generation.

NumPy-like ndarray in Rust with Python bindings. Learning project.

VS Code light and dark theme generated from OpenCode reference JSON.