Designation – Data – Associates
Location – Bangalore
About employer– KMPG
Data & Analytics (D&A) has been identified as a key focus and strategic investment area for KPMG globally. KGS is poised to be at the center of the D&A initiative, and we are in the process of creating comprehensive capabilities aligned to the global D&A vision. This will include technical and functional capabilities across the D&A spectrum comprising Advanced Analytics, Modeling, Data Management and Big Data. We are looking to hire the following roles across the D&A spectrum
- Analyze and model structured data using advanced statistical methods
- Implement algorithms and software needed to perform analyses.
- Build recommendation engines, spam classifiers, sentiment analyzers, classifiers for unstructured and semi-structured data
- Analyze data using R, Python, Java, open source packages and commercial/enterprise applications.
- Cluster large amount of user generated content
- Process data in large-scale environments, in Amazon EC2, Storm, Hadoop, Spark
- Interface with databases (SQL, NO SQL, HDFS) to extract, transform and load data
- Perform machine learning, natural language, and statistical analysis methods, such as classification, collaborative filtering, association rules, sentiment analysis, topic modeling, time-series analysis, regression, statistical inference, and validation methods.
- Drive client engagements focused on Big Data and Advanced Business Analytics, in diverse domains such as product development, marketing research, public policy, optimization, and risk management.
- Communicate results and educate others through reports and presentations.
- Performance explanatory data analyses, generate and test working hypotheses, prepare and analyze historical data and identify patterns.
Qualification and Skills Required
- Ability to break down complex problems, and develop strategies
- Masters degree or PhD in Computer Science, Statistics, Mathematics, Engineering, Bioinformatics, Physics, Operations Research, or related fields, with 2+ years of relevant experience
- Expertise in at least one of the following fields: machine learning, data visualization, statistical modeling, data mining, or information retrieval
- Develop and apply machine learning, and statistical analysis methods, such as classification, collaborative filtering, association rules, time-series analysis, advanced regression methods and hypothesis testing Experience working with large datasets and problems.
- Strong data extraction and processing, using MapReduce, Pig, and/or Hive preferred
- Experience with command-line scripting, data structures and algorithms
- Knowledgeable with search engines, spam detection, recommendation systems, and/or social networks
- Ability to work in a Linux environment, and process large amounts of data in a cloud environment
- Modern programming language such as Ruby, Python, Java, C++, etc.
- Strong mathematical background with ability to understand algorithms and methods from a mathematical viewpoint and an intuitive viewpoint.
- Proficiency in analysis (e.g. R, SAS, Matlab) packages, and programming languages (e.g. Java, Python, Ruby).
- Ability to implement, maintain, and troubleshoot big data infrastructure, such as distributed processing paradigms, stream processing, and databases, such as Hadoop, Storm, SQL, Solr.
- Additionally, have broad understanding of the various commercial distributions of the Apache Hadoop framework, e.g., MapR, Cloudera, Horton works, etc
Interested people can apply for this job at this PAGE