Analytics Vidhya

tanishq

Author since 05 Oct, 2020. Articles: 12. Claps: 14.

Writes on: Computer Vision, Deep Learning, Python, Image, Image Analysis, NLP, Technique, Unstructured Data, Analytics Vidhya, Libraries, Supervised, AVbytes, Classification, Artificial Intelligence, Audio Processing, Object Detection, Object Tracking, PyTorch, Reinforcement Learning, Advanced
  • Advanced Computer Vision Image Image Analysis

    Understanding Taming Transformers for High-Resolution Image Synthesis

    In this article, we examine Taming Transformers for High-Resolution Image Synthesis, covering the VQGAN and VQ-VAE approaches.

    tanishq 22 Apr, 2024
  • Advanced Analytics Vidhya Artificial Intelligence AVbytes Computer Vision

    Inverse Reinforcement Learning from Visual Demonstration to Train AI Systems

    In this article, learn about Inverse Reinforcement Learning from Visual Demonstration to Train Artificial Intelligence Systems

    tanishq 05 Jul, 2021
  • Advanced Deep Learning Libraries Python

    PyTorch 1.9 – Towards Distributed Training and Scientific Computing

    What does PyTorch 1.9, the newest release, have to offer? Let's explore what is new in Facebook's PyTorch 1.9.

    tanishq 02 Jul, 2021
  • Advanced Computer Vision Deep Learning Libraries

    Introduction to Tensorflow 3D for 3D Scene Understanding by Google AI

    Google AI has introduced TensorFlow 3D, a library for state-of-the-art 3D semantic segmentation, 3D object detection, and related tasks.

    tanishq 15 Feb, 2021
  • Advanced Analytics Vidhya Audio Processing Computer Vision Deep Learning

    Introduction to Hugging Face’s Transformers v4.3.0 and its First Automatic Speech Recognition Model – Wav2Vec2

    Hugging Face has released Transformers v4.3.0 and it introduces the first Automatic Speech Recognition model to the library: Wav2Vec2

    tanishq 15 Feb, 2021
  • Advanced Analytics Vidhya AVbytes Classification Computer Vision

    Self Supervised Learning Models to Predict Early COVID-19 Deterioration by Facebook AI

    Facebook AI has developed a self-supervised learning model that can help doctors predict how a patient’s condition may develop.

    tanishq 30 Jan, 2021
  • Advanced Computer Vision Deep Learning Image Image Analysis

    Implementation of Attention Mechanism for Caption Generation on Transformers using TensorFlow

    In this article, let's look at the implementation of an attention mechanism for caption generation with Transformers using TensorFlow.

    tanishq 20 Jan, 2021
  • Advanced Computer Vision Deep Learning

    OpenAI’s Future of Vision: Contrastive Language Image Pre-training (CLIP)

    CLIP lets us design our own classifiers without any task-specific training data while still achieving SOTA results regardless of the CV task.

    tanishq 13 Jan, 2021
  • Advanced Computer Vision Deep Learning

    OpenAI’s Future of Vision with DALL-E: Creating Images from Text

    DALL-E is a neural network that successfully turns text into an appropriate image for a wide range of concepts expressible in natural language

    tanishq 13 Jan, 2021
  • Advanced Computer Vision Deep Learning Image Image Analysis

    A Hands-on Tutorial to Learn Attention Mechanism For Image Caption Generation in Python

    Attention is a complex cognitive ability that human beings possess. Let us see how to implement an attention mechanism for image captioning.

    tanishq 20 Nov, 2020
