India’s AI Leap 🇮🇳 : 10 LLMs that are Built in India

Himanshi Singh 07 Jul, 2024

Introduction

In big-league tech, where global giants usually lead the AI race, India is making some exciting moves. A new wave of Indian-made Large Language Models (LLMs) and AI tools is starting to shine, each with its own flair. We’re here to put these local heroes under the spotlight and show off their features and progress. Ready for a tour of India’s own AI creations? Let’s jump in and look at 10 LLMs built in India.

Navarasa 2.0

Telugu LLM Labs presents Navarasa 2.0, a set of models fine-tuned from the Gemma series of language models. Available in instruction-tuned 7B and 2B configurations, it supports 15 Indian languages plus English, building on a predecessor that was fine-tuned for 9 Indian languages.

Navarasa 2.0 is designed to be versatile and suitable for various applications, including content generation, translation, customer support, and educational resources, particularly in local languages. Its capability to function across multiple Indian languages substantially increases its utility for businesses and developers targeting India’s linguistically diverse population. This broad language support is crucial in enhancing digital inclusivity, allowing more individuals to access technology in their native languages.

Key Features

  • Base Models: Utilizes the Gemma 7B/2B models as the foundation for fine-tuning.
  • Expanded Language Portfolio: This Indian AI model includes additional languages such as Marathi, Urdu, Konkani, Assamese, Nepali, and Sindhi, bringing the total to 16 languages, including English.
  • Data Enrichment: This Indian AI model employs a translated version of the alpaca-cleaned-filtered dataset, now extended to cover six more Indian languages, enhancing the training’s breadth and depth.
  • Enhanced Generative Capabilities: The model has been specifically enhanced to bolster its generative abilities, promoting more effective and context-aware text generation across multiple languages.
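To make the instruction-tuning data more concrete, here is a minimal sketch of how one Alpaca-style record could be turned into a training prompt. The field names (instruction, input, output) follow the public Alpaca dataset; the helper name `format_alpaca_prompt` and the exact section headers are assumptions for illustration, not the template Telugu LLM Labs is confirmed to use.

```python
# Hypothetical helper: formats one Alpaca-style record into a single prompt string.
# The template below is an assumption, not Navarasa 2.0's confirmed format.

def format_alpaca_prompt(record: dict) -> str:
    """Build a prompt from a record with instruction / input / output fields."""
    instruction = record["instruction"].strip()
    context = record.get("input", "").strip()
    response = record.get("output", "").strip()

    parts = [f"### Instruction:\n{instruction}"]
    if context:
        # The input section is included only when the record carries extra context.
        parts.append(f"### Input:\n{context}")
    parts.append(f"### Response:\n{response}")
    return "\n\n".join(parts)


# Illustrative record; a translated dataset would hold the same fields
# with text in the target Indian language.
example = {
    "instruction": "Translate the sentence into Hindi.",
    "input": "Good morning.",
    "output": "सुप्रभात।",
}
prompt = format_alpaca_prompt(example)
print(prompt)
```

Keeping the same three fields across every language is what lets a single translated dataset cover many languages with one formatting routine.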

Click here to explore Navarasa 2.0.

Languages Supported by Navarasa 2.0

This Indian LLM Model supports Hindi, Telugu, Tamil, Kannada, Malayalam, Marathi, Gujarati, Bengali, Punjabi, Odia, Urdu, Konkani, Assamese, Nepali, Sindhi, and English.

With Navarasa 2.0, Telugu LLM Labs underscores its commitment to reducing linguistic barriers and fostering a more inclusive digital environment in India. This model exemplifies the potential of AI to cater to and enrich the multilingual fabric of the Indian subcontinent.

Dhenu 1.0

Expanding its portfolio of AI solutions for agriculture, KissanAI has introduced Dhenu, a series of Large Language Models (LLMs) inspired by Kaamdhenu, the wish-fulfilling cow of Hindu mythology. Dhenu aims to marry tradition with cutting-edge technology and is designed specifically to serve the agricultural sector.

Dhenu-vision-lora-v0.1, a part of this series, is an open-source agricultural disease detection model that has been fine-tuned using the Qwen-VL-chat model. This model is crafted to assist farmers in identifying diseases in three major crops—rice, maize, and wheat—through a conversational interface, integrating advanced Low-Rank Adaptation techniques for cost-effective fine-tuning on specialized agricultural datasets.

Also Read: Plant Disease Classification using AlexNet

Key Features

  • Model Base: This LLM utilizes the “Qwen/Qwen-VL-Chat” as its base, with enhancements through the LoRA methodology.
  • Specific Focus: Tailored to enhance disease detection capabilities in agriculture, significantly outperforming the base model with a 2X improvement.
  • Crop Diseases Addressed: The model can identify various diseases in rice, maize, and wheat, such as Leaf Blight, Leaf Spot, and Wheat Loose Smut.
  • Training and Dataset: The model was trained in March 2024 on a synthetic dataset of approximately 9,000 images highlighting common crop diseases.
  • Evaluation: Tested on 500 images, Dhenu-vision-lora-v0.1 achieved a 36.13% accuracy rate, demonstrating substantial advancements over the base model.
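Why is Low-Rank Adaptation (LoRA) considered cost-effective? The sketch below illustrates the general idea rather than KissanAI's actual training code: instead of updating a full weight matrix W, LoRA trains two small matrices B and A and applies W + (alpha / r) * B A. The layer size, rank, and alpha values are assumptions chosen only for the arithmetic.

```python
# Toy, dependency-free illustration of the LoRA idea.
# All shapes and hyperparameters here are illustrative assumptions.

def matmul(a, b):
    """Multiply two matrices given as nested lists."""
    rows, inner, cols = len(a), len(b), len(b[0])
    return [[sum(a[i][k] * b[k][j] for k in range(inner)) for j in range(cols)]
            for i in range(rows)]


def lora_param_counts(d_out, d_in, rank):
    """Return (full fine-tuning params, LoRA params) for one weight matrix."""
    full = d_out * d_in
    lora = rank * (d_out + d_in)  # B is d_out x rank, A is rank x d_in
    return full, lora


def lora_effective_weight(w, b, a, alpha, rank):
    """Compute W' = W + (alpha / rank) * (B @ A) without modifying W."""
    delta = matmul(b, a)
    scale = alpha / rank
    return [[w[i][j] + scale * delta[i][j] for j in range(len(w[0]))]
            for i in range(len(w))]


# Parameter savings for a hypothetical 4096 x 4096 layer with rank 8.
full, lora = lora_param_counts(4096, 4096, 8)
print(f"full: {full:,}  lora: {lora:,}  ratio: {lora / full:.2%}")

# Tiny worked example: a 3 x 2 frozen weight with a rank-1 update.
w = [[1.0, 2.0], [3.0, 4.0], [5.0, 6.0]]
b = [[1.0], [0.0], [2.0]]   # 3 x 1
a = [[0.5, -1.0]]           # 1 x 2
adapted = lora_effective_weight(w, b, a, alpha=2, rank=1)
print(adapted)
```

Only B and A are trained while W stays frozen, which is why the adapter can be stored and shared separately from the base model.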

Click here to access Dhenu 1.0

Odia Llama

The OdiaGenAI team has released a fine-tuned Llama2 model dedicated to the Odia language, addressing Odisha’s linguistic nuances and cultural specifics. This Indian LLM Model enhances the digital presence of the Odia language, which has historically been underrepresented in AI applications.

Key Features

  • Rich Training Dataset: This Indian AI model incorporates diverse domain-specific knowledge, covering topics from local cuisine to historical sites.
  • Advanced Fine-Tuning: The model is fine-tuned with low-rank adaptation (LoRA) tailored for Odia, improving its performance on native content.
  • Cultural Relevance: Trained to respect and reflect the cultural heritage of Odisha, ensuring that the generated text resonates with local users.
  • Accessible and Open-Source: Available for research and non-commercial use, promoting further academic and practical exploration.

Explore the full discussion at OdiaGenAI.

Kannada Llama

Tailored for the Kannada-speaking community, Kannada Llama enhances AI’s linguistic capabilities in handling the Kannada language. This Indian LLM Model is meticulously engineered to support diverse applications, from conversational AI to text analysis.

Key Features

  • Extensive Training: Pre-trained on over 600 million Kannada tokens to capture nuances of the language.
  • Advanced Techniques: Utilizes Low-Rank Adaptation (LoRA) for efficient training and fine-tuning.
  • Optimized Datasets: Fine-tuned on specialized datasets to improve conversational capabilities and text comprehension.
  • Open-Source Contribution: Facilitates wider access and collaboration within the tech community, promoting further research and development in Indic language AI.

Explore more details on Kannada Llama at Tensoic Blog.

OpenHathi

OpenHathi takes its name from “hathi”, the Hindi word for elephant, and is presented not just as a large language model but as a symbol of the growing power of Indian languages in the AI landscape. This 7B parameter model, developed by Sarvam AI, marks the first release in the OpenHathi series, designed to empower diverse applications in the Indian market. As the first publicly available Hindi Large Language Model (LLM), OpenHathi represents a pivotal moment in India’s AI evolution.

Key Features

  • Bilingual Training: OpenHathi, an Indian AI model, leverages not just Hindi but also English and Hinglish data during training, enhancing its comprehension and generation capabilities across both languages.
  • Custom Tokenization: A unique sentence-piece tokenizer with a 16K Hindi vocabulary merges with the Llama2 tokenizer to significantly reduce tokenization overhead for Hindi text.
  • Phased Training: The Indian LLM Model undergoes a three-phase training process:
    • Phase 1: Bilingual text translation using low-rank adapters, fostering cross-lingual understanding.
    • Phase 2: Bilingual next-token prediction with low-rank adapters, enabling context-aware language generation.
    • Phase 3: Supervised fine-tuning on internal datasets for specific tasks, tailoring the model’s ability to handle diverse applications.
  • Open-source Accessibility: The OpenHathi base model after phase 2 is publicly available via HuggingFace, allowing developers and researchers to fine-tune it for their specific needs and tasks.
  • Cross-lingual Potential: OpenHathi’s bilingual training opens doors for applications in cross-lingual translation, information retrieval, and other tasks requiring seamless interaction between Hindi and English.
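The custom tokenization point can be illustrated with a toy example. The sketch below is not SentencePiece and not Sarvam AI's code: it merges a tiny base vocabulary with a few Hindi tokens and uses a simple greedy longest-match tokenizer with a per-character fallback, which is an assumption made purely to show why a larger Hindi vocabulary reduces the number of tokens per sentence.

```python
# Toy illustration of vocabulary merging; real tokenizers (SentencePiece, BPE)
# work differently. All names and vocabularies here are hypothetical.

def merge_vocab(base_vocab, new_tokens):
    """Return a new token -> id mapping with unseen tokens appended."""
    merged = dict(base_vocab)
    next_id = max(merged.values()) + 1 if merged else 0
    for token in new_tokens:
        if token not in merged:  # existing tokens keep their original ids
            merged[token] = next_id
            next_id += 1
    return merged


def greedy_tokenize(text, vocab):
    """Greedy longest-match tokenization with single-character fallback."""
    max_len = max(len(token) for token in vocab)
    tokens = []
    i = 0
    while i < len(text):
        for length in range(min(max_len, len(text) - i), 0, -1):
            piece = text[i:i + length]
            # A single character is always accepted so the loop makes progress.
            if piece in vocab or length == 1:
                tokens.append(piece)
                i += length
                break
    return tokens


base_vocab = {"the": 0, " ": 1, "cat": 2}
hindi_tokens = ["हाथी", "भारत", "the"]  # "the" already exists and is skipped
merged_vocab = merge_vocab(base_vocab, hindi_tokens)

text = "भारत हाथी"
tokens_base = greedy_tokenize(text, base_vocab)
tokens_merged = greedy_tokenize(text, merged_vocab)
print(len(tokens_base), tokens_base)
print(len(tokens_merged), tokens_merged)
```

With the base vocabulary every Hindi character becomes its own token, while the merged vocabulary covers each word with a single token; fewer tokens per sentence means less compute and more usable context for Hindi text.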

Click here to explore OpenHathi.

Tamil-LLAMA

Tamil-LLAMA is a large language model specifically designed for the Tamil language. Developed by Abhinand Balachandran, this Indian AI model builds upon the foundation of the LLaMA model but significantly enhances its capabilities in handling Tamil text.

Key Features

  • Enhanced vocabulary: The model’s vocabulary expands upon the original 32,000 tokens by incorporating an additional 16,000 Tamil-specific tokens, enabling more nuanced and accurate processing of the Tamil language.
  • Efficient training: Leveraging the LoRA methodology, Tamil-LLAMA achieves optimal training efficiency while maintaining model robustness.
  • Multiple variations: This Indian AI model has four variations: Tamil LLaMA 7B, 13B, 7B Instruct, and 13B Instruct. Each variation offers a different parameter size or fine-tuning approach, catering to diverse needs and computational resources.
  • Fine-tuning with focused datasets: To refine its Tamil comprehension and generation abilities further, the model undergoes additional training with a Tamil-translated version of the Alpaca dataset and a subset of the OpenOrca dataset, specifically chosen for their relevance to Tamil language tasks.
  • Open-source availability: The code, models, and datasets are all publicly available, fostering further research and development in Tamil language processing.

Overall, Tamil-LLAMA represents a significant leap forward in Tamil language AI. Its combination of enhanced vocabulary, efficient training methods, focused fine-tuning, and open-source accessibility makes it a valuable tool for researchers, developers, and anyone interested in leveraging the power of AI for Tamil language applications.

Click here to explore this LLM built in India.

Krutrim

Krutrim AI is a generative AI assistant that converses in 10+ languages, including Hindi, English, Tamil, Telugu, Malayalam, Bengali, Marathi, Kannada, and Gujarati. It was founded by Bhavish Aggarwal, who also founded Ola Cabs, and is positioned as India’s own AI. Krutrim AI has been built natively for India’s more than 1.4 billion people, with the aim of providing contextually relevant responses. The company aims to change how Indians interact with technology by breaking down the linguistic and cultural barriers that often hinder AI adoption. Krutrim AI is currently in public beta and is aimed at, among other uses, AI-powered chatbots for customer service.

Click here to explore this LLM built in India.

Project Indus

Tech Mahindra has unveiled Project Indus, an effort to make computers understand Hindi and its many dialects. The initiative is developing a Hindi-focused Large Language Model (LLM). The model is notable for its scale: 539 million parameters and a corpus of 10 billion tokens from Hindi and its dialects. The project’s ambitious goal is to build an open-source LLM that meets the language needs of a quarter of the world’s population. This endeavor will create extensive language repositories, promising significant benefits for the rural finance, retail, and logistics sectors, thereby contributing to growth across India.

The initial phase of Project Indus focuses on Hindi and its 37 dialects, laying a solid foundation for future expansion. Over time, the project will incorporate additional languages and dialects, broadening its scope and impact. This initiative by Tech Mahindra is more than just a technological advancement. It’s a step towards bridging language barriers and fostering inclusivity on a global scale.

Click here to explore this LLM built in India.

Bhashini

The Government of India launched Bhashini to bridge the digital divide by democratizing access to digital services across various Indian languages. This national public digital platform aims to develop services and products by leveraging artificial intelligence and other emerging technologies. Bhashini’s efforts focus on developing Large Language Models (LLMs) and creating a comprehensive ecosystem that supports language technology through various projects.

Bhashini encompasses a diverse landscape of language technology projects, with LLM development as a crucial element. This holistic approach extends beyond individual languages, seeking to create bridge points between technology and India’s rich linguistic heritage. By dismantling language barriers, Bhashini envisions digital inclusivity as a lived reality for all citizens.

One of the key components of Bhashini is the Universal Language Contribution API (ULCA), an open-source platform used to collect, curate, and discover datasets in Indian languages. The platform supports speech recognition, text-to-speech, and machine translation, advancing Indian language processing.

While still in its beta phase, the Bhashini app marks a significant milestone in the program’s journey. Available for download on both the Apple App Store and the Google Play Store, the app offers a glimpse into the potential of Bhashini. As the program grows, it is expected to have an impact on education, healthcare, governance, and economic development.

Click here to explore this made in India LLM.

BharatGPT

BharatGPT, by CoRover.ai, is a Generative AI platform tailored for the Indian market. It supports over 14 languages across various modalities and formats. Aligned with the Indian government’s initiative, BharatGPT addresses data sovereignty and security by keeping all data within the country. It integrates with ERP/CRM systems and features an inbuilt payment gateway for real-time transactions.

BharatGPT’s multi-layered query processing reduces computational load, improving efficiency and scalability for diverse organizational requirements. It is used across sectors by major organizations such as IRCTC and LIC for varied functions.

BharatGPT offers customizable experiences, including adding custom knowledge bases, appealing to enterprises seeking tailored AI solutions.

Click here to access this LLM

Conclusion

India is making big strides in artificial intelligence, particularly in Large Language Models and AI tools. We’ve looked at various exciting projects, from Navarasa 2.0, which supports many Indian languages, to Dhenu, which helps farmers detect crop diseases, and Odia Llama, which focuses on the Odia language. These projects show India’s dedication to using AI to help different regions and people.

We’ve also seen innovative projects like OpenHathi and Tamil-LLAMA pushing the boundaries of what AI can do in India. On top of these, ambitious initiatives like Project Indus and the Bhashini program are making technology accessible to people across India. We’d love to hear about more projects as India grows in AI.

If you’ve created a homegrown LLM or know of one that deserves to be on the above list, let me know in the comments section. Let’s talk about the exciting world of AI in India!

Frequently Asked Questions

Q1. Which AI is made in India?

A. Many AI systems are made in India. One is Vyasa, an AI model developed in India that focuses on advanced natural language processing and AI-based analytics. There are also Indian startups like Gupshup and Haptik that specialize in AI-driven conversational platforms.

Q2. What is LLM in generative AI?

A. AI researchers designed Large Language Models (LLMs) to understand and generate human-like text, using vast datasets to perform various language tasks, such as text generation, translation, and summarization.

Q3. Who is the father of AI in India?

A. Dr. R. Narasimhan is regarded as the father of AI in India. His pioneering work in computer science and artificial intelligence earned him renown and contributed significantly to the development of the field in the country.

Q4. What is the biggest AI company in India?

A. The biggest AI company in India is Tata Consultancy Services (TCS). It is known for its extensive AI solutions and innovations and offers AI services across many industries.

I am a data lover and I love to extract and understand the hidden patterns in the data. I want to learn and grow in the field of Machine Learning and Data Science.
