Building Responsible AI Agents with Guardrails and Safety in Action

About

In this practical session, participants will learn how to build autonomous AI agents using open-source LLMs and apply responsible AI principles through real-world guardrailing techniques. We will walk through the full pipeline — from creating a task-specific agent using LLaMA or Mistral-based models, to integrating NVIDIA NeMo Guardrails, Llama Guard, and prompt-based safety strategies.

We’ll cover critical safety challenges such as:

Prompt injection and jailbreaks
Toxicity and bias mitigation
Controlling agent autonomy and output

This session will include:

Setting up an AI agent for a real-world use case (e.g., customer support, knowledge
assistant)
Injecting common adversarial prompts to test vulnerabilities
Applying NVIDIA NeMo Guardrails and Llama Guard to detect and prevent harmful
outputs
Using prompt-based guardrailing as a first line of defense
Discussing practical limitations and failure cases in alignment and safety

Key Takeaways:

Understand responsible AI concepts in the context of agentic systems
Gain hands-on experience in building agents and securing them
Learn the strengths and limits of guardrail tools like NeMo Guardrails and Llama Guard
Walk away with a working demo, GitHub repo, and safety test checklist

Speaker

Anuj Saini

Director Data Science

Download Brochure

Phone Number

Email Id

I Agree to the Terms & Conditions

Send WhatsApp Updates

Building Responsible AI Agents with Guardrails and Safety in Action

About

Key Takeaways:

Speaker

Anuj Saini

Download agenda

Analytics Vidhya (4)

brahmaid

csrftoken

Identityid

sessionid

Google (1)

g_state

Microsoft (7)

MUID

_clck

_clsk

SRM_I

SM

CLID

SRM_B

Google (7)

_gid

_ga_#

_gat_#

collect

AEC

G_ENABLED_IDPS

test_cookie

Webengage (2)

_we_us

WebKlipperAuth

LinkedIn (16)

ln_or

JSESSIONID

li_rm

AnalyticsSyncHistory

lms_analytics

liap

visit

li_at

s_plt

lang

s_tp

AMCV_14215E3D5995C57C0A495C55%40AdobeOrg

s_pltp

s_tslv

li_theme

li_theme_set

Google (11)

_gcl_au

SID

SAPISID

__Secure-#

APISID

SSID

HSID

DV

NID

1P_JAR

OTZ

Facebook (2)

_fbp

fr

LinkedIn (6)

bscookie

lidc

bcookie

aam_uuid

UserMatchHistory

li_sugr

Microsoft (2)

MR

ANONCHK

04

10

19

48