WatchTower - Securing LLMs against Jailbreak Attempts

WatchTower - Securing LLMs against Jailbreak Attempts

17 Jul 202413:07pm - 17 Jul 202414:07pm

WatchTower - Securing LLMs against Jailbreak Attempts

About the Event

As large language models (LLMs) become integral to various applications, the risk of these models being manipulated through jailbreaking attempts is a significant concern. Jailbreaking involves exploiting model vulnerabilities to produce harmful, inappropriate, or unintended outputs. For example, attackers might bypass safety filters to generate offensive content, posing ethical and legal risks and undermining user trust. Addressing these vulnerabilities is crucial for maintaining the integrity and reliability of AI systems.

In this webinar, we will delve into the pain points associated with LLM security and explore how to effectively mitigate these threats. We will discuss how to identify the right datasets for learning, and demonstrate the process of fine-tuning the Mistra 7B model to detect and prevent jailbreak attempts.

  1. Best articles get published on Analytics Vidhya’s Blog Space
  2. Best articles get published on Analytics Vidhya’s Blog Space
  3. Best articles get published on Analytics Vidhya’s Blog Space
  4. Best articles get published on Analytics Vidhya’s Blog Space
  5. Best articles get published on Analytics Vidhya’s Blog Space

Who is this DataHour for?

  1. Best articles get published on Analytics Vidhya’s Blog Space
  2. Best articles get published on Analytics Vidhya’s Blog Space
  3. Best articles get published on Analytics Vidhya’s Blog Space

About the Speaker

Sagar Patel

Sagar Patel

Senior Data Scientist at Visa

Sagar Patel brings over seven years of experience in designing ML and AI solutions across various domains. He boasts extensive expertise in developing text analytics platforms and services utilizing NLP. Currently pioneering Visa’s internal ChatGPT platform, his work focuses on adapting the latest developments in GenAI while upholding the highest levels of security. Passionate about teaching, he has delivered over 1,200 hours of training in ML and AI to both classroom and corporate audiences.

Participate in discussion

Registration Details

5364

Registered

Become a Speaker

Share your vision, inspire change, and leave a mark on the industry. We're calling for innovators and thought leaders to speak at our event

  • Professional Exposure
  • Networking Opportunities
  • Thought Leadership
  • Knowledge Exchange
  • Leading-Edge Insights
  • Community Contribution