DataHour: Making Data Pipelines Easy with Dataproc and Composer

DataHour: Making Data Pipelines Easy with Dataproc and Composer

17 Feb 202315:02pm - 17 Feb 202316:02pm

DataHour: Making Data Pipelines Easy with Dataproc and Composer

About the Event

In this DataHour Julian will walk you through best practices for Data engineering teams and discuss why and how to use serverless Spark options. Dataproc on its own is a managed Apache Spark and Apache Hadoop service that lets you take advantage of open source data tools for batch processing, querying, streaming and machine learning. However we will also understand how to go about with automating processes and making life simpler. We do this with the use of Airflow and go quite in depth with the operators used for different Dataproc services. Composer is a managed Airflow service provided by Google Cloud and its ease of setup will be clearly shown in a demo during this webinar. Any aspiring Data or ML engineer stands to benefit from this webinar in understanding best practices while running a Data pipeline in Cloud.


Prerequisites: 
Interest in learning the application of Data Science.

  1. Best articles get published on Analytics Vidhya’s Blog Space
  2. Best articles get published on Analytics Vidhya’s Blog Space
  3. Best articles get published on Analytics Vidhya’s Blog Space
  4. Best articles get published on Analytics Vidhya’s Blog Space
  5. Best articles get published on Analytics Vidhya’s Blog Space

Who is this DataHour for?

  1. Best articles get published on Analytics Vidhya’s Blog Space
  2. Best articles get published on Analytics Vidhya’s Blog Space
  3. Best articles get published on Analytics Vidhya’s Blog Space

About the Speaker

Julian Sara Joseph

Julian Sara Joseph

Developer Advocate: Data Analytics, Data Science

Julian is a highly skilled AI and ML Product Leader with a proven track record in building and designing AI products. As a current Google employee, she is at the forefront of cutting-edge technology, contributing to open-source libraries and showcasing innovative workflows for data-driven AI user journeys, MLOps, and serverless transformations. As a former Product Manager, she has a strong background in creating proposals for internal tools to support Data to AI workloads and implementing plans for the design and development phases of innovative products.In addition to her technical expertise, Julian is also an engaging speaker and mentor, sharing her knowledge and experiences with developer communities and guiding others in the AI field.In addition to all this, she is also a dedicated advocate for diversity and inclusivity in the tech industry. As a Women in Data Science Ambassador for 5 years, she has successfully led chapters in Mumbai and Kerala, and is currently organizing WiDS events in Vancouver. Her passion for supporting women in tech extends beyond her professional life, as she also hosts her own podcast, "Women-Led Businesses" where she shares inspiring stories of startups created and led by women. Julian's commitment to empowering and amplifying underrepresented voices in the tech world makes her a valuable asset to any team.

Participate in discussion

Registration Details

4849

Registered

Become a Speaker

Share your vision, inspire change, and leave a mark on the industry. We're calling for innovators and thought leaders to speak at our event

  • Professional Exposure
  • Networking Opportunities
  • Thought Leadership
  • Knowledge Exchange
  • Leading-Edge Insights
  • Community Contribution