A ChatBot and Multi-Model RAG on your Laptop
A ChatBot and Multi-Model RAG on your Laptop
21 May 202413:05pm - 21 May 202414:05pm
A ChatBot and Multi-Model RAG on your Laptop
About the Event
What if we could run state-of-the-art open-source LLMs and RAG on a typical personal computer? Did you think it was a lost cause? Well, it's not!
In this session, thanks to Hugging Face Optimum, OpenVino and Intel Labs, We will demonstrate how to apply different optimization techniques such as 4-bit quantization, speculative decoding to the 2.7B MFST Phi-2 and LLaVA Phi-2 model , and chat with your MM docs on mid-range laptop powered by an Intel Corporation Meteor Lake CPU.
In this DataHour, Moshe will cover the following points in detail:
- Why local LLM inference is desirable and now possible
- Intel Meteor Lake
- The Microsoft Phi-2 model
- Quantization with Intel OpenVINO and Optimum Intel
- ChatBot on your Laptop – Demo
- Introduction to fastRAG
- Chat with your MM docs on your Laptop – Demo
- Best articles get published on Analytics Vidhya’s Blog Space
- Best articles get published on Analytics Vidhya’s Blog Space
- Best articles get published on Analytics Vidhya’s Blog Space
- Best articles get published on Analytics Vidhya’s Blog Space
- Best articles get published on Analytics Vidhya’s Blog Space
Who is this DataHour for?
- Best articles get published on Analytics Vidhya’s Blog Space
- Best articles get published on Analytics Vidhya’s Blog Space
- Best articles get published on Analytics Vidhya’s Blog Space
About the Speaker
Participate in discussion
Registration Details
Registered
Become a Speaker
Share your vision, inspire change, and leave a mark on the industry. We're calling for innovators and thought leaders to speak at our event
- Professional Exposure
- Networking Opportunities
- Thought Leadership
- Knowledge Exchange
- Leading-Edge Insights
- Community Contribution
