DataHour: Topic Modeling using Transformer Based Embeddings
26 Dec 2022, 13:12 - 26 Dec 2022, 14:12
About the Event
Customer reviews and comments help businesses understand how users feel about their products and services. To provide efficient customer assistance, however, this data must be analyzed to assess the sentiment associated with specific topics or aspects. LDA and LSA fail to capture semantic relationships between words and are not specific to any domain. BERTopic is a novel method that generates topics using sentence embeddings; in this session it is applied to Consumer Financial Protection Bureau (CFPB) data.
In this DataHour, Bharath will show how BERTopic stays flexible while producing more meaningful and diverse topics than LDA and LSA. He will also explain how domain-specific pre-trained embeddings (FinBERT) yield even better topics.
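To make the idea concrete, here is a minimal sketch of the topic-word step that BERTopic performs after documents have been embedded and clustered: class-based TF-IDF (c-TF-IDF), which scores a term in a cluster as its frequency there times log(1 + A / f_t), where A is the average number of words per cluster and f_t is the term's frequency across all clusters. This is not the session's code. The embedding and clustering stages are skipped, so the cluster labels are assigned by hand, and the function name `c_tf_idf`, the whitespace tokenizer, and the sample complaints are all assumptions made for illustration.

```python
import math
from collections import Counter, defaultdict


def c_tf_idf(docs, labels, top_n=3):
    """Return the top_n highest-scoring words for each cluster label."""
    # Merge all documents of a cluster into a single bag of words.
    class_counts = defaultdict(Counter)
    for doc, label in zip(docs, labels):
        class_counts[label].update(doc.lower().split())

    # f_t: frequency of each term across all clusters.
    total_counts = Counter()
    for counts in class_counts.values():
        total_counts.update(counts)

    # A: average number of words per cluster.
    avg_words = sum(total_counts.values()) / len(class_counts)

    topics = {}
    for label, counts in class_counts.items():
        scores = {
            term: tf * math.log(1 + avg_words / total_counts[term])
            for term, tf in counts.items()
        }
        # Sort by descending score, breaking ties alphabetically.
        ranked = sorted(scores, key=lambda t: (-scores[t], t))
        topics[label] = ranked[:top_n]
    return topics


# Made-up complaints in the spirit of CFPB data (assumption).
# In BERTopic the labels would come from clustering sentence embeddings;
# here they are assigned by hand.
docs = [
    "card fee charged twice on my card",
    "late fee on card statement",
    "mortgage payment lost by servicer",
    "servicer misapplied mortgage payment",
]
labels = [0, 0, 1, 1]
topics = c_tf_idf(docs, labels, top_n=2)
```

With this toy data, the first cluster is summarized by "card" and "fee" and the second by "mortgage" and "payment". A real pipeline would also remove stop words and let an embedding model such as a general sentence encoder or FinBERT decide which documents belong together.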
Prerequisites: Basic understanding of NLP and interest in learning Data Science.
- Best articles get published on Analytics Vidhya’s Blog Space
Who is this DataHour for?
About the Speaker
