Final-year BTech AI/ML student building production-ready machine learning systems. I don't just train models — I deploy them. My focus is the full ML lifecycle: data pipelines, training, inference optimisation, API serving and monitoring.
Currently building local LLM inference servers with vLLM, quantised model deployment with GGUF, and RAG evaluation pipelines with RAGAS metrics.
“Structured feedback that improved prompt acceptance by 18%.”
I reply within 24 hours. Open to ML/AI engineering roles, internships and collaborations.