About

About

I’m a machine learning research engineer with a strong software engineering background.

I really love building things. I spend my time doing independent research, trying to improve the state-of-the-art in ML, and tinkering with emerging technologies.

Some interests:

  • Transformers, VAEs, multimodality, LLMs, diffusion, RL, SSL.
  • Improving capabilities via mechanistic interpretability
  • Mathematical foundations of ML
  • Software architecture and system design
  • Performance optimization and distributed systems
  • DevOps and automation

You can DM me on Twitter if you have any interesting ideas or opportunities.