Making Models Fit: Quantization and Memory Optimization for Single-GPU Inference
The model weights alone consume 28 GB of a 48 GB GPU. This post explores what happens when you shrink them, and what you can do with the memory you get back.
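The memory figure follows from simple arithmetic. A back-of-the-envelope sketch, assuming the 28 GB corresponds to roughly 14 billion parameters stored in fp16 (2 bytes each); the parameter count is an inference from the stated size, not a figure from the post:

```python
GB = 1e9  # decimal gigabytes, matching the 28 GB figure above

params = 14e9  # hypothetical parameter count implied by 28 GB of fp16 weights
bytes_per_weight = {"fp16": 2.0, "int8": 1.0, "int4": 0.5}

# Weight memory at each precision: params * bytes per weight.
for dtype, width in bytes_per_weight.items():
    print(f"{dtype}: {params * width / GB:.0f} GB")
```

Halving the precision halves the weight footprint, which is where the reclaimed memory for KV cache and batching comes from.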
I’m a software engineer focused on building infrastructure for large-scale AI systems. My work spans distributed systems and machine learning infrastructure.
I’m interested in the engineering challenges behind modern AI systems: how models move from research prototypes to reliable production systems that people rely on every day. I work on evaluation systems, data generation pipelines, and platforms for deploying AI.
I like problems that involve both machine learning and systems engineering. Many of the hard parts of AI today are not about better models, but about building systems that hold up in production. I write here about what I’m learning.
A closer look at what happens inside an LLM inference server, from prompt arrival to streamed tokens, with vLLM as the serving baseline.
That one line of code that loads a model can take minutes. Here is everything that happens behind it: resolving the model ID, downloading sharded weights, building the architecture, and placing tensors on the GPU.
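Those stages can be sketched end to end. This is a schematic, pure-Python sketch of the pipeline; every function name, the two-shard layout, and the in-memory "tensors" are illustrative stand-ins, not the API of any real loading library:

```python
# Schematic sketch of the four loading stages behind a one-line model load.
# All helpers here are hypothetical stand-ins for illustration only.

def resolve(model_id: str) -> dict:
    # 1. Resolve the model ID to a config and a list of weight-shard names.
    return {"config": {"layers": 2}, "shards": ["shard-1", "shard-2"]}

def download_shards(shards: list) -> list:
    # 2. Fetch each sharded weight file (stubbed as in-memory dicts).
    return [{"name": s, "tensors": {f"{s}.weight": [0.0]}} for s in shards]

def build_architecture(config: dict) -> dict:
    # 3. Instantiate the module tree before any weights exist on it.
    return {f"layer{i}": None for i in range(config["layers"])}

def place_on_device(model: dict, shards: list, device: str) -> dict:
    # 4. Copy each shard's tensors into the model on the target device.
    for shard in shards:
        model.update({k: (device, v) for k, v in shard["tensors"].items()})
    return model

def load_model(model_id: str, device: str = "cuda:0") -> dict:
    meta = resolve(model_id)
    return place_on_device(
        build_architecture(meta["config"]),
        download_shards(meta["shards"]),
        device,
    )
```

The minutes of wall-clock time live almost entirely in stages 2 and 4: network transfer of the shards, then host-to-device copies of the tensors.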