Back in 2019, while juggling undergrad life, fate led me to the brilliant FORai research group
founded by Aidan Gomez, Ivan Zhang, and Bryan M. Li. Little did I know, diving into fascinating RL
projects would introduce me to the amazing Nick Frosst, who gave me a life-changing intro to Cohere.
That single connection reshaped my path forever!
Fast forward, I joined Cohere as an early employee and founding team member, riding the rocket ship
🚀 from a cozy crew of 25 to a dynamic family of 250+ talented folks. This journey wasn't just about
numbers; it was about building something extraordinary with some of the brightest, kindest, and
most inspiring minds of our time. 🧠✨
• Built cutting-edge distributed training frameworks for scaling massive language models using JAX,
Ray, TPUs, and GPUs 🛠️
• Trained, evaluated, and shipped Cohere's multilingual embeddings model, enabling AI to speak my
native language ❤️
• Scaled semantic search to mind-blowing billion-scale efficiency, achieving a 1,000x cost reduction
with quantization magic! 💪
• Designed an ironclad evaluation framework to measure performance across 250+ tasks alongside an
epic team. 💪
Every moment with the incredible Adrien Morisot, Siddhartha Rao Kamalakara, Sylvie Shi, Bharat
Venkitesh, Acyr Locatelli, Dwarak Talupuru, Elliott Choi, Jay Alammar, and so many more was pure
gold.
The countless discussions, late-night breakthroughs, and shared laughs? Absolutely
unforgettable. ❤️
Tech Arsenal: JAX, TPUs, PyTorch, Triton, PySpark, Beam 🛠️
Focus Areas: Large Language Models, Transformers, Semantic Search, Distributed
Systems, Model Optimization
This chapter of my life was nothing short of legendary 🔥