DataHour: Topic Modeling using Transformer Based Embeddings

26 Dec 2022 13:12pm - 26 Dec 2022 14:12pm

About the Event

Customer reviews and comments help businesses understand how users feel about their products and services. To provide efficient customer assistance, however, this data must be analyzed to assess the sentiment associated with specific topics or aspects. LDA and LSA fail to capture semantic relationships between words and are not specific to any domain. BERTopic is a novel method that generates topics using sentence embeddings; in this session it is applied to Consumer Financial Protection Bureau (CFPB) data.

In this DataHour, Bharath will show that BERTopic is flexible and produces more meaningful and diverse topics than LDA and LSA. He will also explain how domain-specific pre-trained embeddings (FinBERT) yield even better topics.
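
For readers new to embedding-based topic models, here is a minimal sketch of class-based TF-IDF, the weighting BERTopic uses to pick topic words once documents have been embedded and clustered. It is an illustration only, not code from the talk: the embedding and clustering steps are skipped, the function name `class_tfidf` is hypothetical, and the toy clusters are made-up complaint snippets rather than CFPB data.

```python
import math
from collections import Counter


def class_tfidf(clusters, top_n=3):
    """Return the top_n highest-weighted words for each cluster.

    clusters: dict mapping a cluster label to a list of documents (strings).
    Assumes documents were already grouped, e.g. by clustering embeddings.
    """
    # Treat each cluster as one long document and count its words.
    counts = {
        label: Counter(" ".join(docs).lower().split())
        for label, docs in clusters.items()
    }

    # Frequency of each word across all clusters.
    total = Counter()
    for cluster_counts in counts.values():
        total.update(cluster_counts)

    # Average number of words per cluster.
    avg_words = sum(total.values()) / len(counts)

    topics = {}
    for label, cluster_counts in counts.items():
        n_words = sum(cluster_counts.values())
        # Normalised term frequency times log(1 + average words / global frequency).
        scores = {
            word: (freq / n_words) * math.log(1 + avg_words / total[word])
            for word, freq in cluster_counts.items()
        }
        # Highest score first; ties are broken alphabetically for stable output.
        ranked = sorted(scores, key=lambda w: (-scores[w], w))
        topics[label] = ranked[:top_n]
    return topics


# Hypothetical toy clusters standing in for the output of embedding + clustering.
toy_clusters = {
    0: ["credit card fee charged twice", "late fee on credit card"],
    1: ["mortgage payment not applied", "mortgage escrow payment error"],
}
topics = class_tfidf(toy_clusters)
```

Words that are frequent inside one cluster but rare across the others get the highest weight, which is why each topic ends up described by its most distinctive terms. In a real pipeline the cluster labels would come from clustering sentence embeddings rather than being written by hand.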


Prerequisites: 
Basic understanding of NLP and interest in learning Data Science.

About the Speaker

Bharath Kumar Bolla

Senior Data Scientist at Salesforce

Bharath Kumar is a seasoned data scientist with over ten years of professional experience across fields such as telecom, marketing, edtech, and healthcare. His expertise includes semi-supervised learning and deep learning architectures in NLP and computer vision. At Salesforce, he focuses on product analytics recommendation systems. He received the 40 Under 40 Data Scientist award for 2022 and has published over ten articles in journals and conferences.

Registration Details

6833 registered
