ML Data Specialist
ChatGPT JobsFull Description
Job Description
Job Position: Machine Learning (ML) Data Specialist
Machine Learning (ML) Data Specialist
Allen Control Systems
Location
Austin, TX
* (Hybrid Role)
Machine Learning & Artificial Intelligence
Position Overview
Allen Control Systems is seeking a Machine Learning (ML) Data Specialist to own the flow of data through computer vision and machine learning pipelines. The successful candidate will act as the steward of dataset quality and labeling throughput, overseeing incoming video and label data, reviewing outputs for correctness, and identifying distribution gaps or procedural issues to report to the ML Platform team.
Key Responsibilities
* Own end-to-end data and label flow for ML training and testing, from raw inputs through label review to final dataset ingest.
* Review labeled data for quality, correctness, and conformance to specifications.
* Manage data routed to third-party labelers, including selecting/prioritizing batches and tracking throughput and quality.
* Audit incoming data for coverage and balance; flag procedural issues with data collection (e.g., over-collection of specific conditions or gaps in edge cases).
* Partner with the ML Platform team to analyze dataset composition, identify distribution gaps, and define key metrics.
* Utilize dashboards, spreadsheets, and Linux terminals to inspect data and maintain operational visibility.
* Document procedures and contribute to the continuous improvement of data and labeling workflows.
Required Qualifications
* Education & Experience: Bachelor's Degree in a relevant field and 1+ years of experience in data operations, dataset curation, annotation operations, QA, test engineering, video/imagery analysis, or robotics/autonomy data ops. (New graduates with relevant project/internship experience are also welcome).
* Data Quality Mindset: Detail-oriented ability to review imagery and sensor data, spot label inconsistencies, and reason about dataset coverage.
* Operational Ownership: Proven experience running end-to-end operational processes, including managing throughput, vendor coordination, and QA pipelines.
* Linux Proficiency: Basic command-line skills (navigating filesystems, running scripts, reading logs).
* Communication: Clear written communication to surface trends and align with engineering teams and external vendors.
Preferred Skills
* Domain Exposure: Familiarity with computer vision concepts (object detection, tracking, segmentation) or experience in AV, robotics, or drone industries.
* Scripting: Basic knowledge of Python, Bash, or SQL (or strong motivation to learn).
Benefits
* Competitive Salary
* ACS Equity Package
* Health, Dental, and Vision Insurance
* Paid Time Off