Industry - Logistics / SCM / Freight / Shipping
- As Lead Data Engineer, you will lead and assist business and technical stakeholders to transform data into a format that can be easily analysed by developing, maintaining, and testing infrastructures for data generation
- Connect offline and online data to continuously improve overall understanding of customer behaviour and journeys for personalisation
- Data pre-processing including collecting, parsing, managing, analysing and visualising large sets of data.
- Drive standards define and implement/improve data governance strategies and enforce best practices to scale data analysis across platforms
- Expert use of the most current and emerging technologies to evaluate trends and develop actionable insights and recommendations to management, via understanding of the business model and the information available for analysis.
- Mentors less senior staff. Lead cross functional projects and programs formally preparing and presenting to management. Routinely work on multiple highly complex assignments concurrently
- Collaborate with other data scientists, subject matter experts, and business team/s around the globe to assist in strategic advanced data analytics projects from design to execution
- Supports multiple systems simultaneously and provides leadership for large project efforts. Provides consultation to Sr. Leadership on a normal basis.
- Strong programming skills in Python/R/SAS
- Proven experience with large data sets and related technologies - SQL, NoSQL, Google / AWS
- Cloud, Hadoop, Hive, Spark
- Excellent understanding of computer science fundamentals, data structures, and algorithms
- Data pipeline software - Airflow, RJ Metrics, Segment, Amazon Data Pipeline, Apache Pig
- ETL software's - Amazon RedShift, CA Erwin Data Modeller, Oracle Warehouse Builder, SAS Data
- Integration Server, Pentaho Kettle, Apatar
- Hands-on experience and knowledge of the Data Lake technology
Didn’t find the job appropriate? Report this Job