About the Role:
Contify collects data by crawling diverse online platforms, including news, corporate websites, customer reviews, forums, and government portals. This data undergoes a sophisticated processing pipeline, which leverages various NLP and Machine Learning technologies to refine it and derive actionable insights. At a high level, our process involves filtering non-business data, clustering duplicates, and employing Named Entity Recognition (NER) for tagging alongside various topic classifications.
We are looking for a senior leader to manage data operations and ensure the highest data quality and process efficiency. This person will also work closely with the business teams to grow Contify's data business (APIs). This role is pivotal in delivering high-quality data to our users. You must be passionate about data: understand its business value, be comfortable with large volumes of unstructured data, use statistical techniques, and implement standards and best practices to maintain data integrity.
Job Description:
You'll be required to:
DATA OPERATIONS:
- Ensure data accuracy, completeness, and timeliness align with existing and future customer requirements.
- Work closely with the sales team to understand customers' data requirements and help the team close deals.
- Optimize the data collection and processing pipelines across the entire data lifecycle to maximize system efficiency and throughput.
- Design and develop new datasets by working closely with the product and business teams.
- Implement automated tests (statistical process controls) for validating data integrity, ensuring statistical variance stays within acceptable limits, coupled with real-time status, warning, and failure alerts.
- Participate in the scoping of data processing enhancements and work with Product Managers to turn business requirements into technical specifications.
SUPPORT DATA SCIENCE TEAM:
- Collaborate closely with the data science team to refine ML algorithms and models for specific applications.
- Provide curated datasets to the ML team and software engineers for ML model development.
PEOPLE MANAGEMENT:
- Oversee a remote team of data analysts responsible for data collection, processing, and delivery to the customers.
- Set clear objectives and KPIs for the data operations team.
- Lead, mentor, and support data team members in achieving their objectives and provide frequent reviews and guidance.
- Identify training and development needs for department staff and organize or conduct training with the support of HR.
- Set individual goals and provide performance evaluations for team members.
Requirements:
- Proven expertise in managing unstructured data pipelines, with hands-on experience in relevant technologies and tools.
- Minimum of 10 years in a data operations role, including 3 to 5 years in a leadership capacity.
- Proficiency in BI tools such as Tableau, Power BI, or Excel, with a good understanding of relational databases.
- Competence in tools and technologies related to statistical analysis, machine learning, and unstructured data management.
- Bachelor's or Master's degree in Computer Science, Business, Statistics, Finance, Mathematics, Information Technology, Data Science, or Data Engineering.
- Experience in B2B information products, knowledge management, search technologies, and platforms will be an added benefit.