We are looking for an experienced Big Data Engineer with vast experience in collecting, storing, processing, and analyzing huge sets of data. The primary focus will be on creating ways to share this knowledge with our community and creating ways to assess people on these skills through Analytics Vidhya platform

 

Responsibilities

  • Creating Tutorials, hackathons, challenges and open trainings for our community members

 

Skills and Qualifications

  • Proficient understanding of distributed computing principles
  • Management of Hadoop cluster, with all included services
  • Proficiency with Hadoop v2, MapReduce, HDFS
  • Experience with building stream-processing systems, using solutions such as Storm or Spark-Streaming
  • Good knowledge of Big Data querying tools, such as Pig, Hive, and Impala
    Experience with Spark
  • Experience with integration of data from multiple data sources
  • Experience with NoSQL databases, such as HBase, Cassandra, MongoDB
  • Experience with Big Data ML toolkits, such as Mahout, SparkML, or H2O would be beneficial
  • Good understanding of Lambda Architecture, along with its advantages and drawbacks
  • Experience with Cloudera/MapR/Hortonworks