Potential candidates should have excellent depth and breadth of knowledge in basic stats, data mining, machine learning, and statistical modeling. They should possess the ability to translate a business problem into an analytical problem, identify the relevant data sets needed for addressing the analytical problem, recommend, implement and validate the best suited analytical algorithm(s), and generate/deliver insights to stakeholders. Candidates will be working with variety of data sources from flat files, RDBMS, and Hadoop HDFS data sources and must be familiar with Hadoop eco system. The role is that of an individual contributor; however the candidate is expected to work in small project teams and interact with Business partners on regular basis.
Key Roles and Responsibilities:
- Use data mining, machine learning, and statistical techniques to solve business problems
- Formulate analytical problems with appropriate assumptions
- Interact with internal stakeholders to understand the business problems
- Collaborate with business users for manipulating massive data sets and mash-up of data from multiple sources & formats
- Hands on experience in Big Data tools and technologies on Hadoop (Hive, Spark etc.)
- Programming background and experience in one or more of the languages - Java, Scala, Python, R
- Exposure to descriptive analytics and creating interactive visualizations (using Tableau, Qlikview etc.) will be advantageous
- Working knowledge on GCP or any other Cloud platform is a plus
- Advanced knowledge or specializations on statistical techniques and machine learning would be a plus
Qualifications:
- Post graduate degree in CS/IT, Data Science/Analytics, Statistics, Applied mathematics etc.
- Degree or certifications in Analytics is a plus
- Hands-on experience in Predictive and Prescriptive Analytics
- Experience in Hadoop ecosystem (Hive, Spark, Spark SQL, Spark Streaming)
- Hands-on with a programming (e.g., Java/R/Python) languages & visualization tools
|