Data Scientist
Bengaluru, India
Digital Data & Analytics
Site Name: Bengaluru Luxor North Tower
GSK is one of the world's foremost pharmaceutical and healthcare companies, and we are proud to be part of an industry that improves the lives of others. We are embarking on a significant transformation journey that will support GSK in becoming a top-quartile data-enabled organization.
This is an exciting time to join GSK. The world of master data management is changing, and it is no longer just about managing data. You will be part of a team that is building a robust master data management framework and service that will allow GSK to drive higher value by placing data at the core of its strategic and operational decisions. We will be embracing new data technology that will improve the development, manufacture, and distribution of GSK's vital products to patients and consumers around the world.
As a Data Scientist, you are responsible for:
- Working with multidisciplinary teams to identify, validate, and source required data and tools to develop and mature ML capabilities
- Performing data mining to discover non-obvious relationships, building training data, and implementing and retraining ML solutions
- Conducting discovery workshops with business partners to identify business problems
- Building visualization dashboards to present metrics, statistical findings, and progress tracking
Your Responsibilities
- Collaborate and work across business units to identify and source the data and technology to build and mature ML capabilities
- Conduct discovery workshops with business partners to identify business problems and to obtain and validate training data
- Design, build, implement, and retrain ML models to solve specific business problems
- Perform data mining and apply findings to increase modeling accuracy
- Clearly and concisely communicate business needs, computational findings, and recommendations to all audiences, regardless of their background or technical understanding
- Build, implement, and maintain visualization dashboards that clearly present both historical and real-time operational metrics, research findings, and progress updates
Basic Qualifications
- Bachelor's degree in Applied Mathematics, Mathematical Engineering, Computer Science, Information Technology, or a related field
- 10+ years of overall technical experience
- 2+ years of machine learning and/or data mining experience
- 2+ years of experience building visualization dashboards
- 2+ years working with and manipulating large data sets
- 2+ years of hands-on development experience using multiple languages
Preferred Qualifications
- Master's degree in Applied Mathematics, Mathematical Engineering, Computer Science, Information Technology, or a related field
- Experience with tools such as Tableau and Power BI
- Experience manipulating large data sets using technologies such as Hadoop, Azure, Hive, Spark, Element, Databricks, MongoDB, and BigQuery
- Strong development experience using multiple languages such as Python, R, SQL, C#, Java, JavaScript, and Julia
- Successful completion of two or more end-to-end projects involving the migration and merging of large data sets from multiple sources, using statistical analysis to identify meaningful and significant data relationships in order to build, train, and retrain ML models
- Experience with and/or exposure to the cybersecurity industry, including working with a SIEM such as Splunk or Google Chronicle
Our goal is to be one of the world's most innovative, best-performing, and trusted healthcare companies. We believe that we all bring something unique to GSK, and when we combine our knowledge, experiences, and styles, the impact is incredible. Come join our adventure at GSK, where you will be inspired to do your best work for our patients and consumers: a place where you can be you, feel good, and keep growing.