- theory
- math
- applied
- code
- philosophy
- external-services
-
Recommender Systems - From Item Neighbors to Generative Sequence Models
A research-grounded story of how recommender systems went from item-item neighbors at Amazon to generative sequence models at Meta. Each wave shipped real business wins, hit a structural ceiling, and seeded the next. Curated through cornerstone papers (Linden 2003, Koren 2009, YouTube DNN 2016, DIN 2018, PinSage 2018, EBR 2020, TIGER 2023, HSTU 2024), industry deployments, and the failure modes that overturned consensus.
-
Retrieval Encoder Training Operationalization
A method guide for deciding when to train your own retrieval encoder. Pins down what an encoder serves (relevance), how success is measured, and the levers that move nDCG, latency, and index size. Instantiates the formulation across web QA, legal, code, and biomedical retrieval. Cites GradCache, NV-Retriever, Gecko, GOR, Matryoshka, EmbeddingGemma, E5, and BGE.
-
Why Innovation Does Not Live in Indonesia
A tech founder's economic argument for why Southeast Asia's largest economy cannot make its innovation flywheel spin. From rent-seeking capital allocation and a colonial trust deficit to missing exit markets, every stage of the cycle is broken.
-
Encoders - Squash Reality into a Vector Space
From SIFT histograms to CLIP's billion-parameter shared space, this post traces how the vision and text communities independently discovered the same compression trick and then converged on one approach. Covers handcrafted features, shallow embeddings, deep CNNs, BERT, ViT, self-supervised pre-training, and multimodal alignment, with open problems and portfolio projects.
-
World Models - From Dyna to Foundation Simulators
More than three decades of teaching machines to imagine. From Sutton's 1990 Dyna planning loop through Ha & Schmidhuber's dreaming agents, the RSSM latent-dynamics lineage, foundation-scale video simulators, and LeCun's JEPA thesis. Includes open problems and a portfolio project guide.