Data Platform Engineer
SWARM BiotacticsAt SWARM Biotactics, we’re creating biorobots — living, intelligent systems based on insects. Each insect carries a custom-designed backpack that enables wireless control, sensing, and secure communication.
Biorobot swarms combine the agility of biology with AI, sensors, and swarm coordination — unlocking capabilities where conventional machines fail.
We are the only company in the world doing this — and we’re pushing the edge of what’s possible in every domain: from biology to embedded hardware to swarm intelligence.
🚀 Join us at the frontier — and help create the first generation of living machines.
💡 Your Role
You will design and operate the data backbone of our biorobotic systems.
Our biorobots generate large volumes of data across the entire system: from insect-mounted sensors and wireless telemetry, to motion tracking systems in experimental arenas, to operational interfaces used in the field.
You will turn this raw system data into reliable data products used by:
* AI/ML models training swarm intelligence
* Data scientists and researchers
* Engineering teams improving system performance
* Operational analytics and monitoring
You will build scalable pipelines that transform complex real-world system data into clean, structured, and trustworthy datasets that drive the next generation of biorobotic capabilities.
What you’ll do
* Design and maintain data pipelines from edge → cloud → data platform.
* Process telemetry data from biorobot backpacks and wireless systems.
* Build ETL pipelines using AWS Glue and cloud-native data services.
* Integrate data streams from event-based edge systems.
* Process and structure motion tracking data from camera-based arena systems.
* Develop and maintain datasets for ML model training and evaluation.
* Build reliable data products and curated datasets for researchers and engineers.
* Support ad-hoc analysis, experimentation, and exploration by data scientists.
* Ensure data quality, observability, lineage, and reproducibility across pipelines.
* Collaborate with embedded, edge, ML, and software teams to ensure high-quality data capture.
* Improve the data infrastructure used to analyze and optimize swarm behavior and system performance.
🧠 Your Profile
You enjoy turning complex system data into clean, reliable datasets that power real-world AI systems.
Must-haves:
* 3+ years experience as a Data Platform Engineer or in a similar role.
* Strong experience with Python and SQL.
* Experience building cloud-based data pipelines.
* Familiarity with AWS data services (S3, Glue, Athena, etc.).
* Experience working with streaming or event-driven data systems.
* Understanding of data modeling and data product design.
* Experience supporting ML workflows and data scientists.
* Strong attention to data quality, validation, and reproducibility.
* Comfortable working with complex multi-system datasets (hardware, sensors, logs, telemetry).
Nice-to-haves:
* Experience processing IoT, robotics, or telemetry data.
* Experience with time-series or sensor data pipelines.
* Familiarity with SageMaker or ML data platforms.
* Experience with computer vision datasets or motion tracking systems.
* Experience designing data lakes or lakehouse architectures.
🌍 Why SWARM Biotactics
* We’re well-funded and expanding fast in a space no one else is in.
* You’ll join a world-class team working on the frontier of science and engineering.
* Your work will directly shape the foundation of biorobotic systems that redefine the interface between life and technology.
* [OPTIONAL: department / role specific reasons]
⚡ Join the Swarm
If you’re ready to create the impossible — hands-on, every day — we want to meet you.