LLM Optimization: How to Make AI Inference Faster and Pocket-Friendly!

LLM Optimization: How to Make AI Inference Faster and Pocket-Friendly!

25 Sep 202413:09pm - 25 Sep 202414:09pm

LLM Optimization: How to Make AI Inference Faster and Pocket-Friendly!

About the Event

Join us for an in-depth session on optimizing Large Language Models (LLMs) for faster and more cost-effective AI inference. Discover advanced techniques using NVIDIA’s cutting-edge tools like TensorRT-LLM, Triton Inference Server, and NVIDIA Inference Microservices to significantly reduce latency, memory consumption, and operational costs. Learn how to streamline deployments, boost performance, and improve resource efficiency through real-world examples and case studies. This session will equip you with the skills to scale AI solutions profitably while maximizing return on investment.

  1. Best articles get published on Analytics Vidhya’s Blog Space
  2. Best articles get published on Analytics Vidhya’s Blog Space
  3. Best articles get published on Analytics Vidhya’s Blog Space
  4. Best articles get published on Analytics Vidhya’s Blog Space
  5. Best articles get published on Analytics Vidhya’s Blog Space

Who is this DataHour for?

  1. Best articles get published on Analytics Vidhya’s Blog Space
  2. Best articles get published on Analytics Vidhya’s Blog Space
  3. Best articles get published on Analytics Vidhya’s Blog Space

About the Speaker

Sunil Patel

Sunil Patel

Senior Solutions Architect - Deep Learning at NVIDIA

Sunil Patel has 8 years of experience working with deep learning. Sunil has worked on NLP and computer vision model development. Sunil is currently working on optimizing computer vision and LLM models on the GPU. Sunil has contributed to Nvidia Tensorrt and Nvidia Deepstream SDKs being used by millions of developers worldwide, Sunil has worked on end-to-end application deployments scaling over 1000+ GPUs. Sunil also works on Stable Diffusion and other Generative AI model training & optimization. Sunil holds several publications and four patents in the area of deep learning. You can reach him on LinkedIn.

Participate in discussion

Registration Details

2219

Registered

Become a Speaker

Share your vision, inspire change, and leave a mark on the industry. We're calling for innovators and thought leaders to speak at our event

  • Professional Exposure
  • Networking Opportunities
  • Thought Leadership
  • Knowledge Exchange
  • Leading-Edge Insights
  • Community Contribution