Hritvik Taneja

I am a fourth-year Ph.D. student at Georgia Tech, advised by Prof. Moinuddin Qureshi in the FAST Lab. My research focuses on improving the performance of CPU and GPU workloads by exploiting the characteristics of modern memory systems and proposing architectural enhancements that overcome memory bottlenecks. Currently, I’m working on mitigating the bottlenecks that arise when serving large language models (LLMs) in ultra-low-latency settings.

News