DataHour: DIY Semantic Code Search

15 Apr 2024 14:04pm - 15 Apr 2024 15:04pm

About the Event

Imagine a new contributor who is trying to solve a simple beginner task but is overwhelmed by the complexity of the repository. Working out which file to change, and where in that file to make the change, can be time-consuming.

Popular code search tools rely on keyword matching rather than natural language, so they do not semantically understand the codebase. This talk will explore building a DIY Semantic Code Search tool that simplifies navigating complex codebases, which is particularly beneficial for new contributors onboarding to open-source GitHub projects.

We will demonstrate how to build a locally deployable, natural-language code search tool using open-source LLMs that can help beginners understand any repository and start contributing. Leveraging the RAG (Retrieval Augmented Generation) paradigm, we show a two-step process: first, retrieve the files relevant to the task using vector search; then, use an open-source LLM to generate answers to natural language questions about the task at hand.

The result is a DIY solution that anyone can build, test, and deploy locally for their own repositories using only open-source technologies.
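The two-step retrieve-then-generate process described above can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the speaker's implementation: the `REPO` contents, the function names, and the `llm` callable are all hypothetical, and the bag-of-words `embed` function is a toy stand-in for a real embedding model so that the sketch runs with only the Python standard library.

```python
import math
import re
from collections import Counter

# Hypothetical in-memory "repository": file path -> file contents.
REPO = {
    "auth/login.py": "def login(user, password): check password hash and create session token",
    "db/connection.py": "def connect(url): open database connection pool with retry",
    "ui/theme.py": "def set_theme(name): change color palette and font for dark mode",
}


def embed(text):
    """Toy embedding (assumption): word counts instead of a learned vector."""
    return Counter(re.findall(r"[a-z]+", text.lower()))


def cosine(a, b):
    """Cosine similarity between two sparse word-count vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def retrieve(question, repo, k=2):
    """Step 1: rank files by similarity to the question and keep the top k."""
    q = embed(question)
    ranked = sorted(repo, key=lambda path: cosine(q, embed(repo[path])), reverse=True)
    return ranked[:k]


def build_prompt(question, repo, paths):
    """Assemble the retrieved files into a context block for the LLM."""
    context = "\n\n".join(f"# {path}\n{repo[path]}" for path in paths)
    return f"Answer using only these files:\n\n{context}\n\nQuestion: {question}"


def answer(question, repo, llm, k=2):
    """Step 2: pass the retrieved context and the question to an LLM callable."""
    paths = retrieve(question, repo, k=k)
    return llm(build_prompt(question, repo, paths))


if __name__ == "__main__":
    # Placeholder "LLM" that just echoes its prompt; a locally hosted
    # open-source model would be called here instead.
    print(answer("where is the dark mode color palette set", REPO, llm=lambda p: p))
```

In a real tool, `embed` would call an embedding model, the per-file vectors would be computed once and stored in a vector index rather than recomputed on every query, and `llm` would wrap a locally running open-source model.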

About the Speaker

Raghavan Muthuregunathan

Senior Engineering Manager at LinkedIn

Raghavan Muthuregunathan is a distinguished Senior Engineering Manager renowned for his leadership in the domain of artificial intelligence, particularly within the realm of search technologies. Currently stationed in San Francisco, California, he serves as the driving force behind the LinkedIn Search AI team, steering its endeavors towards innovation and excellence.

Prior to his tenure at LinkedIn, Raghavan honed his expertise at Microsoft Bing, where he played a pivotal role in shaping the search landscape. With a rich background in engineering and a passion for leveraging technology to solve complex challenges, he has left an indelible mark on the industry.

Registration Details

5403 registered
