LLM as Judge: Evaluation Methods and Tools

LLM as Judge: Evaluation Methods and Tools

10 Oct 202413:10pm - 10 Oct 202414:10pm

LLM as Judge: Evaluation Methods and Tools

About the Event

Join us for an insightful session on leveraging Large Language Models (LLMs) as semi-automated evaluators. We’ll explore cutting-edge techniques and research on using LLMs to streamline the evaluation process, focusing on practical tools like Ragas and Deepeval within the RAG framework. This session will guide you through building a basic LLM evaluation framework, while also addressing potential biases in these approaches. Whether you're new to LLMs or experienced, you'll gain valuable insights into optimizing AI-driven evaluations.

  1. Best articles get published on Analytics Vidhya’s Blog Space
  2. Best articles get published on Analytics Vidhya’s Blog Space
  3. Best articles get published on Analytics Vidhya’s Blog Space
  4. Best articles get published on Analytics Vidhya’s Blog Space
  5. Best articles get published on Analytics Vidhya’s Blog Space

Who is this DataHour for?

  1. Best articles get published on Analytics Vidhya’s Blog Space
  2. Best articles get published on Analytics Vidhya’s Blog Space
  3. Best articles get published on Analytics Vidhya’s Blog Space

About the Speaker

Sangeetha Venkatesan

Sangeetha Venkatesan

NLP Engineer at Chubb

Sangeetha Venkatesan is an NLP Engineer who strongly focuses on Information Retrieval, Retrieval-Augmented Generation (RAG), and evaluation frameworks and methodologies. She is passionate about researching and integrating these concepts into product development. As a learning-driven engineer, she explores new insights every day. Sangeetha also shares her knowledge and experiences through her Substack blog. You can reach her on LinkedIn.

Participate in discussion

Registration Details

2198

Registered

Become a Speaker

Share your vision, inspire change, and leave a mark on the industry. We're calling for innovators and thought leaders to speak at our event

  • Professional Exposure
  • Networking Opportunities
  • Thought Leadership
  • Knowledge Exchange
  • Leading-Edge Insights
  • Community Contribution