NyRAG: Building Production-Ready RAG Applications with Zero Code

Riya Bansal. Last Updated : 09 Jan, 2026

4 min read

Retrieval-Augmented Generation (RAG) technology almost immediately became the standard in intelligent applications. This was a result of the quickly developing field of artificial intelligence that combined large language models and external knowledge bases with different real-time access methods. RAG implementation of the traditional kind poses major difficulties: complex vector database setups, intricate embedding pathways, orchestration of infrastructure, and the necessity for pulling in the DevOps specialists.

Here are some of the main drawbacks of RAG’s traditional development:

Infrastructure setup and configuration can take weeks.
Vector database solutions can be extremely costly.
There is a need for integration of multiple tools, which creates complexity.
Developers will face a steep learning curve.
Challenges come up regarding production deployment.

Radically new RAG development technique NyRAG, a significant advance in RAG development that simplifies the entire process into a simple, configuration-driven workflow, is now presented. Regardless of whether you are creating AI-enabled customer support bots, internal knowledge management systems, or semantic search engines, NyRAG is going to speed up your journey from idea to production.

What is NyRAG?
- Key Features of NyRAG
- How NyRAG Works: The 5-Stage Pipeline
Getting Started with NyRAG
- Web Crawling Mode
- Document Processing Mode
Hands-On Task 1: Web-based Knowledge Base
Comparison with other Frameworks
Use Cases of NyRAG
Conclusion

What is NyRAG?

NyRAG is a Python-based open-source framework that redefines the development of Retrieval-Augmented Generation (RAG). It takes away the burden of complicated infrastructure setup and makes it possible for you to roll out smart chatbots and semantic search systems in no time at all. Sometimes, as quickly as within minutes.

Key Features of NyRAG

No-code configuration technique
Web crawling + document processing
Local Docker or Vespa Cloud deployment
Integrated chat interface
Hybrid search with Vespa engine

How NyRAG Works: The 5-Stage Pipeline

Stage 1: Query Enhancement

Initially, an AI model produces several different searches based on your question to enhance retrieval coverage.

Stage 2: Embedding Generation

Then, the queries undergo a transformation into vector embeddings with the use of SentenceTransformer models.

Stage 3: Vespa Search

After that, the system carries out the nearest-neighbor searches on the indexed chunks.

Stage 4: Chunk Fusion

Consequently, the outputs are combined, deduplicated, and ranked according to their relevance score.

Stage 5: Answer Generation

Finally, the leading chunks are transferred to an AI model (through OpenRouter) to produce justified answers.

Getting Started with NyRAG

The Prerequisites for NyRAG are:

Python with 3.10 version or higher
Docker Desktop (if you are working in local mode)
An OpenRouter API key

The commands to install NyRAG are:

Using the pip command:

pip install nyrag

Using uv command (recommended)

uv pip install -U nyrag

Now, let’s try to understand the dual modes of NyRAG, namely, web crawling and document processing.

Web Crawling Mode

Honors robots.txt
Subdomains included by default
URL exclusion lists
User agents are customizable (Chrome, Firefox, Safari, Mobile)

Document Processing Mode

Saves PDF, DOCX, TXT, Markdown
Folder scanning in a recursive way
Filtering based on file size and type
Capabilities of managing intricate document architectures

Hands-On Task 1: Web-based Knowledge Base

In this task, we’ll be building a chatbot that will answer our questions using documentation from the company website.

Step 1: Setting up the environment

Follow the commands below to set up the environment for your local system

mkdir nyrag-website-demo
cd nyrag-website-demo
uv venv
source .venv/bin/activate
uv pip install -U nyrag

Step 2: Create Configuration

Using the file ‘company_docs_config.yml’, we’ll define the configurations:

name: company_knowledge_base  

mode: web  

start_loc: https://docs.yourcompany.com/  

exclude:  

- https://docs.yourcompany.com/api-changelog/*  

- https://docs.yourcompany.com/legacy/*  

crawl_params:  

respect_robots_txt: true  

follow_subdomains: true  

aggressive_crawl: false  

user_agent_type: chrome  

rag_params:  

embedding_model: sentence-transformers/all-MiniLM-L6-v2  

embedding_dim: 384  

chunk_size: 1024  

chunk_overlap: 100

Step 3: Crawl & Index

Using the commands below, we’ll crawl the website, extract the text content, split it into chunks, generate the embeddings, which are then indexed into Vespa.

export NYRAG_LOCAL=1
nyrag --config company_docs_config.yml

Step 4: Launch Chat Interface

Now, use the commands and launch the chat interface.

export NYRAG_CONFIG=company_docs_config.yml
export OPENROUTER_API_KEY=your-api-key
export OPENROUTER_MODEL=anthropic/claude-sonnet-4

uvicorn nyrag.api:app –host 0.0.0.0 –port 8000

Step 5: Test your bot

You can try the following queries:

“How do I authenticate API requests?”

“What are the rate limits?”

“Explain the webhook configuration process.”

Comparison with other Frameworks

Let’s compare NyRAG with other frameworks to see what it’s best suited for:

Framework	Pros	Cons	Best For
NyRAG	Zero-code, integrated pipeline	Less flexible architecture	Quick deployment
LangChain	Highly customizable	Requires coding	Complex workflows
LlamaIndex	Great documentation	Manual DB setup	Custom integrations
Haystack	Modular design	Steeper learning curve	Enterprise apps

Use Cases of NyRAG

Customer Support Chatbots: It is used to get instant responses and the most accurate ones. It also helps in lowering the number of support tickets.
Internal Knowledge Management: It offers a faster and smoother introduction of new employees and provides a way to get information about employees through different departments.
Research Assistants: It helps researchers in researching documents, pulling insights from them, and asking questions related to academic literature, providing concise accounts of extensive texts.
Code Documentation Search: It increases the overall productivity of developers as both Virtual guides and API docs are indexed.

Conclusion

The division separating idea and production-ready RAG applications has become very thin. By using NyRAG, you are not simply incorporating a library; you’re obtaining a full RAG development platform that manages crawling, embedding, indexing, retrieval, and chat interfaces by default.

Whether you are making your first AI application or scaling your hundredth one, NyRAG is the provider of the success foundation. The issue is not whether RAG will change your application. Rather, it is how fast you are able to set it up.

Riya Bansal.

Gen AI Intern at Analytics Vidhya
Department of Computer Science, Vellore Institute of Technology, Vellore, India

I am currently working as a Gen AI Intern at Analytics Vidhya, where I contribute to innovative AI-driven solutions that empower businesses to leverage data effectively. As a final-year Computer Science student at Vellore Institute of Technology, I bring a solid foundation in software development, data analytics, and machine learning to my role.

Feel free to connect with me at [email protected]

Free Courses

4.7

Foundations of Vector Database

Learn vector database fundamentals and build a mini RAG MVP with FAISS.

4.8

Building Scalable Industry Applications with RAG and Agents

Learn RAG, agentic systems, and scalable LLM deployment strategies.

4.8

Mastering Multimodal RAG & Embeddings with Amazon Nova & Bedrock

Master multimodal RAG and embeddings using Amazon Nova and Bedrock.

4.5

Framework to Choose the Right LLM for Your Business

Select the best LLM for your business. Understand LLM performance.

4.8

Navigating LLM Tradeoffs: Techniques for Speed, Cost, Scale & Accuracy

Master LLM optimization for the best balance of performance & cost.

Reading list

NyRAG: Building Production-Ready RAG Applications with Zero Code

Table of contents

What is NyRAG?

Key Features of NyRAG

How NyRAG Works: The 5-Stage Pipeline

Stage 1: Query Enhancement

Stage 2: Embedding Generation

Stage 3: Vespa Search

Stage 4: Chunk Fusion

Stage 5: Answer Generation

Getting Started with NyRAG

Web Crawling Mode

Document Processing Mode

Hands-On Task 1: Web-based Knowledge Base

Step 1: Setting up the environment

Step 2: Create Configuration

Step 3: Crawl & Index

Step 4: Launch Chat Interface

Step 5: Test your bot

Comparison with other Frameworks

Use Cases of NyRAG

Conclusion

Login to continue reading and enjoy expert-curated content.

Free Courses

Foundations of Vector Database

Building Scalable Industry Applications with RAG and Agents

Mastering Multimodal RAG & Embeddings with Amazon Nova & Bedrock

Framework to Choose the Right LLM for Your Business

Navigating LLM Tradeoffs: Techniques for Speed, Cost, Scale & Accuracy

Recommended Articles

Responses From Readers

Become an Author

Flagship Programs

Free Courses

Popular Categories

Generative AI Tools and Techniques

Popular GenAI Models

AI Development Frameworks

Data Science Tools and Techniques

Reading list

Introduction to Generative AI

Introduction to Generative AI applications

No-code Generative AI app development

Code-focused Generative AI App Development

Introduction to Responsible AI

LLMS

Prompt Engineering

Finetuning LLMs

Training LLMs from Scratch

Langchain

RAG

LlamaIndex

Stable Diffusion

NyRAG: Building Production-Ready RAG Applications with Zero Code

Table of contents

What is NyRAG?

Key Features of NyRAG

How NyRAG Works: The 5-Stage Pipeline

Stage 1: Query Enhancement

Stage 2: Embedding Generation

Stage 3: Vespa Search

Stage 4: Chunk Fusion

Stage 5: Answer Generation

Getting Started with NyRAG

Web Crawling Mode

Document Processing Mode

Hands-On Task 1: Web-based Knowledge Base

Step 1: Setting up the environment

Step 2: Create Configuration

Step 3: Crawl & Index

Step 4: Launch Chat Interface

Step 5: Test your bot

Comparison with other Frameworks

Use Cases of NyRAG

Conclusion

Login to continue reading and enjoy expert-curated content.

Free Courses

Foundations of Vector Database

Building Scalable Industry Applications with RAG and Agents

Mastering Multimodal RAG & Embeddings with Amazon Nova & Bedrock

Framework to Choose the Right LLM for Your Business

Navigating LLM Tradeoffs: Techniques for Speed, Cost, Scale & Accuracy

Recommended Articles

Responses From Readers

Become an Author

Flagship Programs

Free Courses

Popular Categories

Generative AI Tools and Techniques

Popular GenAI Models

AI Development Frameworks

Data Science Tools and Techniques