DataHour: Exploring the Fundamentals of DeepMatch

DataHour: Exploring the Fundamentals of DeepMatch

02 Feb 202313:02pm - 02 Feb 202314:02pm

DataHour: Exploring the Fundamentals of DeepMatch

About the Event

The quintessential pre-task of most data-driven analysis is that of “stitching” multiple data sources together. Traditionally, in an analyst’s language, this is achieved through “joins”. They “stitch” datasets together based on a commonality in terms of shared entries within common columns across datasets. In many modern settings, however, this does not work because two datasets may lack a shared column(s)or have mismatched entries or many-to-one relationships. 

Typically, this arises because of the following reasons:

  • Lack of centralized design across first-party and third-party datasets
  • The datasets not adhering to a standardized format
  • Errors and missing values in the data
  • Many-to-one and many-to-many fuzzy relationships; or all of the above and more. 

This challenge is addressed currently through a mix of manual work and point solutions across industries and verticals including SKU mapping in retail and supply chain for demand planning; reconciliation in account receivable and payable, trade reconciliation in Banking and Financial services; auditing in insurances; and entity resolution across industries. 

In this DataHour, Devavrat will introduce DeepMatch, an AI-powered matching or joining of data with easy-to-interact humans in the loop component. He will also provide a few demonstrations of how it has been used for SKU mapping in Retail and Supply Chain for demand planning, transaction reconciliation in Banking and Financial Services and Auditing in Insurances.


Prerequisites: 
Zeal of learning Data Science and Artificial Intelligence

  1. Best articles get published on Analytics Vidhya’s Blog Space
  2. Best articles get published on Analytics Vidhya’s Blog Space
  3. Best articles get published on Analytics Vidhya’s Blog Space
  4. Best articles get published on Analytics Vidhya’s Blog Space
  5. Best articles get published on Analytics Vidhya’s Blog Space

Who is this DataHour for?

  1. Best articles get published on Analytics Vidhya’s Blog Space
  2. Best articles get published on Analytics Vidhya’s Blog Space
  3. Best articles get published on Analytics Vidhya’s Blog Space

About the Speaker

Devavrat Shah

Devavrat Shah

Andrew (1956) and Erna Viterbi Professor at MIT

Devavrat Shah is an Andrew (1956) and Erna Viterbi professor of Computer Science and AI at MIT since 2005 where he founded MIT’s Statistics and Data Science Center and currently directs Deshpande Center for Tech Innovation. Previously, he co-founded Celect, focused on inventory optimization using AI (acquired by Nike in 2019). Currently, he serves as the CTO of Ikigai Labs which he co-founded in 2019, with the mission of building self-driving organization by empowering data business operators to make data-driven decisions with ease of spreadsheets. He received his B.Tech. degree from IIT Bombay and his Ph.D. degree from Stanford University, both in Computer Science. He is a Kavli Fellow of National Academy of Science. He has received paper awards from INFORMS Applied Probability Society, INFORMS Management Science and Operations Management, NeurIPS, ACM Sigmetrics and IEEE Infocom. He has received the Erlang Prize from INFORMS Applied Probability Society and Rising Star Award from ACM Sigmetrics. He has received multiple Test of Time paper awards from ACM Sigmetrics. He is a distinguished alumni of his alma mater IIT Bombay.

Participate in discussion

Registration Details

4560

Registered

Become a Speaker

Share your vision, inspire change, and leave a mark on the industry. We're calling for innovators and thought leaders to speak at our event

  • Professional Exposure
  • Networking Opportunities
  • Thought Leadership
  • Knowledge Exchange
  • Leading-Edge Insights
  • Community Contribution