Nick Gupta

Bachelor's Degree in Computer Science
Columbia University
Email: [email protected]
LinkedIn
GitHub
Pronouns: He, his, him


Overview

As a Staff/Principal-level Machine Learning Engineer and technical lead, I design, ship, and operate production ML and GenAI systems end-to-end, turning ambiguous goals into measurable roadmaps, reliable architectures, and scalable execution. I hold a Bachelor's Degree in Computer Science from Columbia University in the City of New York. I am a U.S. citizen and do not require visa sponsorship.

I specialize in building high-impact, reusable foundations that make multiple teams faster: modern retrieval and ranking stacks, LLM/VLM-powered assistants, and distributed agentic systems with strong evaluation and safety guardrails. My work emphasizes practical optimization: cost-aware model routing, caching and batching, efficient serving (CPU/GPU), and compression techniques such as distillation and quantization, all aimed at meeting strict p95/p99 latency, reliability, and cost constraints in real production environments.

I am known for cross-functional leadership and technical decision-making that scales: aligning product, data, platform/SRE, privacy/security, and research stakeholders through crisp design docs, clear success metrics, and fast feedback loops. I value inclusive, high-trust teams and bring a calm, metrics-driven approach to building systems that are secure, observable, and maintainable over the long term.

My focus is generative and agentic AI: retrieval and reranking, RAG, tool-use orchestration, evaluation harnesses, and safety guardrails, paired with systems-level optimization (batching and caching, quantization and distillation, GPU efficiency, and cost-aware routing) to hit strict latency, reliability, and cost targets.


Technical Skills

Education

Experience

Open-Source Contributions

Projects

Volunteer Activities
