{"id":2849,"date":"2023-07-18T18:25:43","date_gmt":"2023-07-18T12:55:43","guid":{"rendered":"https:\/\/www.analyticsvidhya.com\/datahack-summit-2023\/?page_id=2849"},"modified":"2023-07-27T19:47:15","modified_gmt":"2023-07-27T14:17:15","slug":"end-to-end-ml-pipeline-to-perform-predictive-maintenance-using-ocr-nlp-2","status":"publish","type":"page","link":"https:\/\/www.analyticsvidhya.com\/dhs-2023\/session\/end-to-end-ml-pipeline-to-perform-predictive-maintenance-using-ocr-nlp-2\/","title":{"rendered":"End to end ML Pipeline to perform predictive maintenance using OCR &#038; NLP"},"content":{"rendered":"<p>Ever-growing unstructured data holds a lot of information and insight for the business. Industry experts have come up with various ways to make sense of unstructured data, one of the most recent innovations being large language models (LLMs). Even before unstructured data can be processed, a lot of data still exists only in physical format, and many engineers still prefer the classic pen-and-paper method to document and note their observations.<br \/>\nOptical Character Recognition (OCR) is the process that converts an image of text into a machine-readable text format. 
There are multiple ways to convert this data into a digital format that can be used to provide actionable insights, helping the business increase profit margins and optimize operational cost.<br \/>\nThis hack session will involve designing an end-to-end MLOps framework that showcases how an OCR model, Natural Language Processing (NLP) &amp; Machine Learning models can help solve real-world problems.<\/p>\n<p><strong>Key Takeaways:<\/strong><\/p>\n<ol>\n<li>Understand the methodologies to convert physical data to digital data using OCR techniques like DocumentAI, and know other methodologies like Google Vision API<\/li>\n<li>Understand, with a real-time example, how NLP can help extract information to derive business insights<\/li>\n<li>Understand how Machine Learning can help with proactive maintenance of spare parts<\/li>\n<li>Design an end-to-end MLOps pipeline<\/li>\n<\/ol>\n","protected":false},"excerpt":{"rendered":"<p>&#8220;Ever growing unstructured data brings a lot of information and insight for the business. However, industry experts have come up with various ways to make sense out of unstructured data and one of the recent innovations being LLM models. 
Even before getting unstructured data to process, there is lot of data that is still in [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":2850,"parent":1126,"menu_order":0,"comment_status":"closed","ping_status":"closed","template":"session-details.php","meta":[],"acf":[],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v20.7 - https:\/\/yoast.com\/wordpress\/plugins\/seo\/ -->\n<title>End to end ML Pipeline to perform predictive maintenance using OCR &amp; NLP - DataHack Summit 2023<\/title>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/www.analyticsvidhya.com\/dhs-2023\/session\/end-to-end-ml-pipeline-to-perform-predictive-maintenance-using-ocr-nlp-2\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"End to end ML Pipeline to perform predictive maintenance using OCR &amp; NLP - DataHack Summit 2023\" \/>\n<meta property=\"og:description\" content=\"&#8220;Ever growing unstructured data brings a lot of information and insight for the business. However, industry experts have come up with various ways to make sense out of unstructured data and one of the recent innovations being LLM models. 
Even before getting unstructured data to process, there is lot of data that is still in [&hellip;]\" \/>\n<meta property=\"og:url\" content=\"https:\/\/www.analyticsvidhya.com\/dhs-2023\/session\/end-to-end-ml-pipeline-to-perform-predictive-maintenance-using-ocr-nlp-2\/\" \/>\n<meta property=\"og:site_name\" content=\"DataHack Summit 2023\" \/>\n<meta property=\"article:modified_time\" content=\"2023-07-27T14:17:15+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/www.analyticsvidhya.com\/dhs-2023\/wp-content\/uploads\/2023\/07\/End-to-end-ML-Pipeline-to-perform-predictive-maintenance-using-OCR-NLP-100.jpg\" \/>\n\t<meta property=\"og:image:width\" content=\"500\" \/>\n\t<meta property=\"og:image:height\" content=\"250\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/jpeg\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"Est. reading time\" \/>\n\t<meta name=\"twitter:data1\" content=\"1 minute\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"WebPage\",\"@id\":\"https:\/\/www.analyticsvidhya.com\/dhs-2023\/session\/end-to-end-ml-pipeline-to-perform-predictive-maintenance-using-ocr-nlp-2\/\",\"url\":\"https:\/\/www.analyticsvidhya.com\/dhs-2023\/session\/end-to-end-ml-pipeline-to-perform-predictive-maintenance-using-ocr-nlp-2\/\",\"name\":\"End to end ML Pipeline to perform predictive maintenance using OCR & NLP - DataHack Summit 
2023\",\"isPartOf\":{\"@id\":\"https:\/\/www.analyticsvidhya.com\/dhs-2023\/#website\"},\"datePublished\":\"2023-07-18T12:55:43+00:00\",\"dateModified\":\"2023-07-27T14:17:15+00:00\",\"breadcrumb\":{\"@id\":\"https:\/\/www.analyticsvidhya.com\/dhs-2023\/session\/end-to-end-ml-pipeline-to-perform-predictive-maintenance-using-ocr-nlp-2\/#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/www.analyticsvidhya.com\/dhs-2023\/session\/end-to-end-ml-pipeline-to-perform-predictive-maintenance-using-ocr-nlp-2\/\"]}]},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/www.analyticsvidhya.com\/dhs-2023\/session\/end-to-end-ml-pipeline-to-perform-predictive-maintenance-using-ocr-nlp-2\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\/\/www.analyticsvidhya.com\/dhs-2023\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Session\",\"item\":\"https:\/\/www.analyticsvidhya.com\/dhs-2023\/session\/\"},{\"@type\":\"ListItem\",\"position\":3,\"name\":\"End to end ML Pipeline to perform predictive maintenance using OCR &#038; NLP\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/www.analyticsvidhya.com\/dhs-2023\/#website\",\"url\":\"https:\/\/www.analyticsvidhya.com\/dhs-2023\/\",\"name\":\"DataHack Summit 2023\",\"description\":\"\",\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/www.analyticsvidhya.com\/dhs-2023\/?s={search_term_string}\"},\"query-input\":\"required name=search_term_string\"}],\"inLanguage\":\"en-US\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. 
-->","yoast_head_json":{"title":"End to end ML Pipeline to perform predictive maintenance using OCR & NLP - DataHack Summit 2023","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/www.analyticsvidhya.com\/dhs-2023\/session\/end-to-end-ml-pipeline-to-perform-predictive-maintenance-using-ocr-nlp-2\/","og_locale":"en_US","og_type":"article","og_title":"End to end ML Pipeline to perform predictive maintenance using OCR & NLP - DataHack Summit 2023","og_description":"&#8220;Ever growing unstructured data brings a lot of information and insight for the business. However, industry experts have come up with various ways to make sense out of unstructured data and one of the recent innovations being LLM models. Even before getting unstructured data to process, there is lot of data that is still in [&hellip;]","og_url":"https:\/\/www.analyticsvidhya.com\/dhs-2023\/session\/end-to-end-ml-pipeline-to-perform-predictive-maintenance-using-ocr-nlp-2\/","og_site_name":"DataHack Summit 2023","article_modified_time":"2023-07-27T14:17:15+00:00","og_image":[{"width":500,"height":250,"url":"https:\/\/www.analyticsvidhya.com\/dhs-2023\/wp-content\/uploads\/2023\/07\/End-to-end-ML-Pipeline-to-perform-predictive-maintenance-using-OCR-NLP-100.jpg","type":"image\/jpeg"}],"twitter_card":"summary_large_image","twitter_misc":{"Est. 
reading time":"1 minute"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"WebPage","@id":"https:\/\/www.analyticsvidhya.com\/dhs-2023\/session\/end-to-end-ml-pipeline-to-perform-predictive-maintenance-using-ocr-nlp-2\/","url":"https:\/\/www.analyticsvidhya.com\/dhs-2023\/session\/end-to-end-ml-pipeline-to-perform-predictive-maintenance-using-ocr-nlp-2\/","name":"End to end ML Pipeline to perform predictive maintenance using OCR & NLP - DataHack Summit 2023","isPartOf":{"@id":"https:\/\/www.analyticsvidhya.com\/dhs-2023\/#website"},"datePublished":"2023-07-18T12:55:43+00:00","dateModified":"2023-07-27T14:17:15+00:00","breadcrumb":{"@id":"https:\/\/www.analyticsvidhya.com\/dhs-2023\/session\/end-to-end-ml-pipeline-to-perform-predictive-maintenance-using-ocr-nlp-2\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/www.analyticsvidhya.com\/dhs-2023\/session\/end-to-end-ml-pipeline-to-perform-predictive-maintenance-using-ocr-nlp-2\/"]}]},{"@type":"BreadcrumbList","@id":"https:\/\/www.analyticsvidhya.com\/dhs-2023\/session\/end-to-end-ml-pipeline-to-perform-predictive-maintenance-using-ocr-nlp-2\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/www.analyticsvidhya.com\/dhs-2023\/"},{"@type":"ListItem","position":2,"name":"Session","item":"https:\/\/www.analyticsvidhya.com\/dhs-2023\/session\/"},{"@type":"ListItem","position":3,"name":"End to end ML Pipeline to perform predictive maintenance using OCR &#038; NLP"}]},{"@type":"WebSite","@id":"https:\/\/www.analyticsvidhya.com\/dhs-2023\/#website","url":"https:\/\/www.analyticsvidhya.com\/dhs-2023\/","name":"DataHack Summit 2023","description":"","potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/www.analyticsvidhya.com\/dhs-2023\/?s={search_term_string}"},"query-input":"required 
name=search_term_string"}],"inLanguage":"en-US"}]}},"_links":{"self":[{"href":"https:\/\/www.analyticsvidhya.com\/dhs-2023\/wp-json\/wp\/v2\/pages\/2849"}],"collection":[{"href":"https:\/\/www.analyticsvidhya.com\/dhs-2023\/wp-json\/wp\/v2\/pages"}],"about":[{"href":"https:\/\/www.analyticsvidhya.com\/dhs-2023\/wp-json\/wp\/v2\/types\/page"}],"author":[{"embeddable":true,"href":"https:\/\/www.analyticsvidhya.com\/dhs-2023\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.analyticsvidhya.com\/dhs-2023\/wp-json\/wp\/v2\/comments?post=2849"}],"version-history":[{"count":5,"href":"https:\/\/www.analyticsvidhya.com\/dhs-2023\/wp-json\/wp\/v2\/pages\/2849\/revisions"}],"predecessor-version":[{"id":3250,"href":"https:\/\/www.analyticsvidhya.com\/dhs-2023\/wp-json\/wp\/v2\/pages\/2849\/revisions\/3250"}],"up":[{"embeddable":true,"href":"https:\/\/www.analyticsvidhya.com\/dhs-2023\/wp-json\/wp\/v2\/pages\/1126"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.analyticsvidhya.com\/dhs-2023\/wp-json\/wp\/v2\/media\/2850"}],"wp:attachment":[{"href":"https:\/\/www.analyticsvidhya.com\/dhs-2023\/wp-json\/wp\/v2\/media?parent=2849"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}