Location: Gurgaon
Roles and Responsibilities:
- Be a key component of the Big Data practice, working directly with clients to solve very important problems using internal and external data
- Lead a consulting team of data science engineers and consultants and coordinate work on specific projects
- Build solutions for real-time contexts, such as recommendation engines, classification and typing systems, and process management applications, using machine learning tools, near-AI processes, statistics, scripting, and data integration
- Work closely with Account Management and Business Development to design and advocate solutions
Skills and Experience:
- PhD in a relevant field, or Masters with substantial experience
- 7+ years of relevant industry experience
- Experience managing and leading a team
- Extensive experience solving analytical problems using quantitative approaches
- Knowledge of the standard Hadoop/MongoDB/Aster/HDFS/MapR/Hive/Pig tools
- Excellent SQL skills; comfortable using various data access tools.
- Good coding, scripting and prototyping skills covering some procedural as well as statistical or data-oriented languages (such as Java, C++, Scala, and Python, as well as R and SQL)
- Experience with streaming algorithms and practical analysis of real-time data streams
- Familiar with parallel and distributed approaches to Data Analytics and Text Mining
- Prior experience in data mining, real-time, adaptive, and probabilistic machine learning, predictive analysis, and SVM, plus some knowledge of NLP and text mining using Big Data
- Expert in algorithm development & implementation
- Statistical and predictive modeling experience
Experience with most of the following Machine Learning algorithms, methods, and techniques:
- Bayesian Inference (MAP, MLE, EM, MCMC, etc.)
- Random forest and other decision-tree algorithms
- Social graphs, graph theory, graph theoretic inference
- Familiarity with MATLAB, Mathematica, R, Julia, or another scientific computing language
- Experience working with Mahout, Weka, scikit-learn, mlpy, MALLET, GSL, or other third-party machine learning tools and platforms
- Extensive analytical toolset including an advanced understanding of statistics (time series analysis, cluster analysis, multivariate analysis), discrete simulation, and linear and nonlinear optimization
- Experience with data analytics in the cloud
- Familiarity with BI platforms such as Tableau and Qlikview