Main Contents

Jun 22, 2025
Middle Aged Work Advice
Work Experiences
Mar 25, 2025
Dynamic Batching for Training Large Sequence Models (LLMs)
Code PyTorch
Jan 24, 2025
RAG System Architecture
Projects
Nov 1, 2024
Vibe Coding Car Racing Simulator (Fail)
Code Misc
Sep 29, 2024
Deriving the Basic Policy Gradient Update (REINFORCE)
Reinforcement Learning
Sep 15, 2024
Data Extraction for Unstructured Document Data
Projects
Sep 7, 2024
First 100 words
Misc
Aug 24, 2024
Temporal Difference Learning: Taking advantage of Incomplete Trajectories
Reinforcement Learning
Aug 3, 2024
Synthetic Question Generation for Retrieval Evaluation of RAG
Jul 25, 2024
Deriving the minimax equation for GANs
Generative Models
Jul 7, 2024
Chunking code for RAG; parsing-recursion-stack
Code
Feb 15, 2024
A classical NLP researcher and a GPT-era Engineer meet at the coffee machine
Generative Models
Jan 25, 2024
Python Decorators for Monitoring GPU Usage
Code PyTorch
Jan 14, 2024
A baby and a PhD
Work Experiences
Dec 5, 2023
LLM Research and Adaptation Landscape
Generative Models
Oct 10, 2023
Monitoring Jobs on the Server
Code
Oct 5, 2023
Taking a break from NLP Research
Work Experiences
Oct 1, 2023
Lean OmegaConf Argparse System
Code
Apr 6, 2023
Training Sparse Neural Networks with L0 Regularisation
Machine Learning
Feb 1, 2023
Dynamic Programming for Reinforcement Learning, the importance of the Bellman equations; (with Gymnasium)
Reinforcement Learning
Nov 27, 2022
Could Large Language Models be conscious? (David Chalmers @ Neurips 2022)
Misc
Oct 24, 2022
NLP Papers at ICML2022
Review
Jun 28, 2022
You don't feel like you're good enough, but its not a competition
Work Experiences
Jun 23, 2022
Stochastic Gradient Langevin Dynamics
Bayesian Inference Machine Learning Optimization
Jun 16, 2022
NYCMidnight-100words
Misc
May 10, 2022
Recipe for connecting to Google Drive from Remote Server
Code
Feb 15, 2022
Minimum Bayes Risk Decoding
Bayesian Inference
Nov 18, 2021
Formalising Analogies for A.I
Machine Learning
Sep 2, 2021
You're not doing well, but motivation is optional
Work Experiences
Apr 25, 2021
Likelihood weighted Sequential Importance Sampling
Machine Learning
Jan 10, 2021
Neural Tangent Kernel, Every Model trained by GD is a kernel machine (Review)
Review
Dec 31, 2020
Some QA from Deep Learning (CS 462/482)
Machine Learning
Dec 12, 2020
NEURIPS 2020
Review
Dec 2, 2020
Adversarial NLP examples with Fast Gradient Sign Method
Misc
Nov 21, 2020
EMNLP 2020
Review
Sep 4, 2020
Variance of the Estimator in Machine Learning
Machine Learning
May 8, 2020
Some Clustering Papers at ICLR20
Review
Apr 23, 2020
A minimum keystroke (py)Debugger for Lazy ML/DS people who don't IDE
Projects Code
Mar 31, 2020
Recipe for building jq from source without admin(sudo) rights
Code
Jan 17, 2020
The Sigmoid in Regression, Neural Network Activation and LSTM Gates
Machine Learning
Sep 30, 2019
Arithmetic(Book)
Review
Aug 11, 2019
Clean TreeLSTMs implementation in PyTorch using NLTK treepositions and Easy-First Parsing
PyTorch
Jul 1, 2019
Pad pack sequences for Pytorch batch processing with DataLoader
PyTorch
May 20, 2019
Modes of Convergence
Misc
Mar 20, 2019
Coordinate Ascent Mean-field Variational Inference (Univariate Gaussian Example)
Bayesian Inference Machine Learning
Mar 1, 2019
Dirichlet Process Gaussian Mixture Models (Generation)
Bayesian Inference
Dec 1, 2018
Gotchas in Cython; Handling numpy arrays in cython class
Code
Aug 30, 2018
Onboarding for Practical Machine Learning Research
Machine Learning
Jul 20, 2018
Equivalence of constrained and unconstrained form for Ridge Regression
Optimization
Jul 7, 2018
Studying drug-drug interactions and predictors of adverse vascular outcomes
Projects
Jul 1, 2018
PyTorch Automatic differentiation for non-scalar variables; Reconstructing the Jacobian
Calculus PyTorch
May 30, 2018
From psychologist to CS PhD Student
Work Experiences
May 28, 2018
Capturing Last-mile Transactions of Smallholder Palm Oil Farmers
Projects
May 10, 2018
Migrating from python 2.7 to python 3 (and maintaining compatibility)
Code
Apr 7, 2018
Lagrange Multipliers and Constrained Optimization
Calculus Optimization
Apr 6, 2018
Taylor Series approximation, newton's method and optimization
Calculus Optimization
Apr 5, 2018
Hessian, second order derivatives, convexity, and saddle points
Calculus
Apr 4, 2018
Jacobian, Chain rule and backpropagation
Calculus Machine Learning
Apr 3, 2018
Gradients, partial derivatives, directional derivatives, and gradient descent
Calculus Machine Learning Optimization
Apr 2, 2018
Derivatives, differentiability and loss functions
Calculus
Apr 1, 2018
Calculus for Machine Learning
Machine Learning Calculus
Mar 30, 2018
Algorithms on Graphs: Fastest Route
Misc
Dec 5, 2017
Gibbs Sampling on Dirichlet Multinomial Naive Bayes (Text)
Bayesian Inference
Nov 18, 2017
Markov Chain Monte-Carlo
Bayesian Inference
Nov 16, 2017
EM Algorithm for Gaussian mixtures
Bayesian Inference
Nov 1, 2017
Communicating Data Science
Work Experiences
Sep 20, 2017
Cross disciplinary projects
Work Experiences
May 6, 2017
Conjugate Priors
Bayesian Inference
Mar 1, 2017
Closed form Bayesian Inference for Binomial distributions
Bayesian Inference
Jan 13, 2015
DSO Advice
Work Experiences