We are looking for a Program Specialist to join our data labeling operations team and act as a key technical enabler across multiple project workflows. In this role, you’ll bridge our internal labeling operations, external clients, and engineering/product teams to ensure smooth and scalable delivery of data annotation projects.
You’ll be hands-on in building scripts, creating lightweight tools, and proposing data workflows that help move projects forward quickly and accurately. This is an execution-focused role ideal for someone who thrives in cross-functional environments, enjoys solving practical data problems, and is passionate about building efficient systems at scale.
This role sits within the Delivery team, and while it’s not a traditional engineering position, you’ll be expected to have strong technical skills to support delivery teams, work closely with ML teams, and enable successful project outcomes.
Key Responsibilities
- Act as the technical liaison between data labeling operations, clients, and engineering/product teams.
- Propose and implement data pipelines for preparing, validating, and delivering labeling data across various projects.
- Write custom scripts to convert data between different formats (e.g., COCO, YOLO, custom JSON schemas).
- Pull and transform data from cloud storage, APIs, or client systems as needed to support project requirements.
- Build quick tool prototypes or utilities to support internal teams or client-specific proof-of-concepts (POCs).
- Provide technical input during project planning, including data structuring, annotation schema design, and delivery formats.
- Analyze project performance data to surface operational bottlenecks and propose efficiency metrics and improvements.
- Collaborate with engineering and product teams to translate recurring needs into product requirements or reusable tools.
- Maintain internal documentation on workflows, scripts, and tool usage to support knowledge sharing and handover.
- Support quality assurance processes by building or running data sanity checks and validation scripts
Requirements
Must-Have:
- 2–4 years of experience in a technical role involving scripting, data manipulation, or ML operations support.
- Proficiency in Python and working with structured data formats (e.g., JSON, CSV).
- Experience writing ad hoc scripts to process, clean, and transform data.
- Strong problem-solving skills and ability to self-direct in ambiguous scenarios.
- Excellent communication skills and comfort working cross-functionally with technical and non-technical stakeholders.
- Comfortable working in a fast-paced, iterative environment where flexibility and adaptability are key.
Nice-to-Have:
- Experience with ML data formats like COCO, YOLO, or working knowledge of LLM dataset structures.
- Familiarity with cloud storage systems (e.g., AWS S3, GCS) and basic data retrieval via APIs or SDKs.
- Experience working on or supporting machine learning annotation or evaluation projects.
- Familiarity with product delivery workflows or agile tools (e.g., Jira, Notion, Linear).
- Experience analyzing project or operations data using simple dashboards or Python-based analysis (e.g., Pandas, matplotlib).
- Basic familiarity with databases, ideally with experience using AWS RDS (PostgreSQL or MySQL) and Amazon DynamoDB (NoSQL) for querying or structuring project-related data.
Who You Are
- You’re curious and fast-learning, especially when working with unfamiliar data or tools.
- You enjoy being the person who unblocks others and solves problems that span teams.
- You balance speed with quality and know when to hack versus when to scale.
- You communicate clearly and proactively, especially when coordinating across teams.
Benefits
AI is reshaping the world of work—and Chemin sits at the heart of that transformation. We build the systems that train and evaluate today’s most advanced AI models, partnering with global tech leaders to deliver world-class data labeling, automation, and quality infrastructure.
At Chemin, you’ll:
- Work on real problems that power the future of AI
- Build deep skills in data, operations, and automation
- Take ownership, grow fast, and lead with purpose
- Join a remote-first, Southeast Asia–rooted team punching well above its weight
- Be part of the wider TDCX Group network, a global digital customer experience leader with presence across Asia, Europe, and the Americas
We’re still small enough that your work truly matters—and growing fast enough that you’ll never stop learning.
You’ll experience a flexible work environment designed to help you thrive:
- A modern co-working space in Damansara Heights, with easy access to Semantan MRT
- A hybrid setup with two in-office days each week and adaptable core work hours
- Structured on-the-job training to sharpen your skills and accelerate your growth
- Regular performance reviews and personalized career development planning
If you’re serious about thriving in the age of AI, there’s no better place to start.