About
I’m a machine learning research engineer with a strong software engineering background.
I really love building things. I spend my time doing independent research, trying to improve the state of the art in ML, and tinkering with emerging technologies.
Some interests:
- Transformers, VAEs, multimodality, LLMs, diffusion, RL, SSL
- Improving capabilities via mechanistic interpretability
- Mathematical foundations of ML
- Software architecture and system design
- Performance optimization and distributed systems
- DevOps and automation
You can DM me on Twitter if you have any interesting ideas or opportunities.