Building a Sentiment Classification Pipeline with DistilBERT and Airflow

05 Nov 2024, 13:11 - 05 Nov 2024, 14:11

About the Event

Join us for a hands-on session in which we build an end-to-end sentiment classification pipeline on the Goodreads Reviews dataset. We will train the classifier with DistilBERT, orchestrate the workflow with Apache Airflow, and expose the predictions through a Streamlit interface. The entire setup runs locally, so no cloud infrastructure is needed. The session is a practical, approachable introduction to sentiment analysis for beginners and experienced data practitioners alike.
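As a rough illustration of the data-cleaning step, here is a minimal sketch using only the Python standard library. It assumes reviews arrive as (text, star rating) pairs and that ratings of 4 or 5 count as positive, 1 or 2 as negative, and 3 is dropped as neutral. The thresholds, the input layout, and the function names are hypothetical and not taken from the session.

```python
import html
import re

# Hypothetical thresholds: the session does not say how ratings map to labels.
POSITIVE_MIN = 4
NEGATIVE_MAX = 2

_TAG_RE = re.compile(r"<[^>]+>")   # crude HTML tag matcher, good enough for a sketch
_WS_RE = re.compile(r"\s+")


def clean_text(text):
    """Decode HTML entities, strip tags, and collapse runs of whitespace."""
    text = html.unescape(text)
    text = _TAG_RE.sub(" ", text)
    return _WS_RE.sub(" ", text).strip()


def rating_to_label(rating):
    """Map a star rating to a sentiment label, or None for neutral ratings."""
    if rating >= POSITIVE_MIN:
        return "positive"
    if rating <= NEGATIVE_MAX:
        return "negative"
    return None


def prepare(rows):
    """Return (cleaned_text, label) pairs, skipping neutral or empty reviews."""
    examples = []
    for text, rating in rows:
        label = rating_to_label(rating)
        cleaned = clean_text(text)
        if label is not None and cleaned:
            examples.append((cleaned, label))
    return examples
```

The resulting pairs could then be tokenized and passed to a DistilBERT classifier; that step is omitted here because it depends on libraries and settings the announcement does not specify.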


Key Takeaways:

  • Build a complete sentiment classification pipeline on Goodreads Reviews, from data cleaning to predictions.
  • Train a sentiment classifier efficiently with DistilBERT.
  • Orchestrate the entire workflow with Apache Airflow.
  • Display sentiment predictions in a Streamlit interface, with everything running locally.
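
The orchestration idea in the takeaways above, running each stage only after the stages it depends on, can be sketched without Airflow itself by using Python's standard-library graphlib module. This is not Airflow code, and the stage names and the run_pipeline helper are hypothetical placeholders for whatever the session actually defines.

```python
from graphlib import TopologicalSorter

# Hypothetical stages: each name maps to the set of stages it depends on.
STAGES = {
    "clean_reviews": set(),
    "train_model": {"clean_reviews"},
    "predict": {"train_model"},
    "serve_ui": {"predict"},
}


def execution_order(stages):
    """Return the stage names in an order that respects every dependency."""
    return list(TopologicalSorter(stages).static_order())


def run_pipeline(stages, actions):
    """Call each stage's action in dependency order and collect the results."""
    return [actions[name]() for name in execution_order(stages)]
```

An orchestrator such as Airflow adds scheduling, retries, and monitoring on top of this kind of dependency ordering.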

About the Speaker

Priyanka Asnani

Senior Machine Learning Engineer at Fidelity Investments

Priyanka is a Senior Machine Learning Engineer at Fidelity Investments with over 7 years of experience. She specializes in building end-to-end machine learning pipelines, focusing on recommender and ranking systems. Her expertise spans large language models, deep learning, and time-series forecasting. Priyanka excels at applying machine learning techniques to solve complex problems across industries. An active community contributor, she shares her knowledge through public speaking, webinars, and technical content, helping aspiring data scientists stay updated with industry trends. You can reach her on LinkedIn.

Registration Details

216 registered
