Back to jobs

Data Engineer

Proxy Foods AI
Washington, DC
Full-time
AI tools:
ChatGPT
Applications go directly to the hiring team

Full Description

Full-Time · Hybrid ·  based in US or Europe 

ABOUT PROXY FOODS 

Proxy Foods is an advanced AI-powered product development and recipe formulation platform trusted by leading R&D teams to create, reformulate, and optimize food & beverage products at scale and in record time. 

We help food and beverage companies move from idea to viable formulation faster by unifying fragmented R&D data: ingredients, specs, cost, nutrition, processing constraints and compliance. As a result, teams launch new products with fewer failed trials, reformulate faster based on consumer feedback and margin pressure, and stay ahead on trends and regulatory requirements. 

THE ROLE 

We are looking for a motivated, hands-on Data Engineer with 3+ years of experience to join our team. 

You will build and maintain the data foundation behind our platform: pipelines, integrations, transformation logic, and data models that power internal analytics and customer-facing product experiences. You will work closely with engineering, product, and food science teams to make complex R&D data reliable, usable, and scalable. 

This is a strong fit for someone who enjoys working in a startup environment, takes ownership end-to-end, and is excited to work near AI-enabled products without losing sight of solid data engineering fundamentals. 

In this role, you will: 

* Build and maintain reliable ETL/ELT pipelines across APIs, databases, files, and external data sources 

* Clean, normalize, and structure complex datasets related to ingredients, specifications, nutrition, costs, and compliance 

* Design and improve data models and storage layers for application features, analytics, and reporting 

* Collaborate with product, engineering, and domain experts to translate messy real-world workflows into scalable data systems 

* Implement data quality checks, validation rules, monitoring, and alerting across critical pipelines 

* Leverage AI/LLM-assisted workflows where appropriate to improve data cleansing, normalization, enrichment, and structuring of complex datasets, with strong validation and quality controls 

* Improve performance, maintainability, and observability of our data stack. 

WHAT WE’RE LOOKING FOR 

* 3+ years of experience in data engineering, analytics engineering, or backend/data-focused software roles 

* Strong SQL skills and solid Python experience in production environments 

* Hands-on experience building and maintaining ETL/ELT pipelines 

* Experience working with relational databases, data warehouses, and API-based integrations 

* Strong understanding of data modeling, transformation, and data quality practices 

* Comfort working with semi-structured data such as JSON, CSV, and external supplier/customer datasets 

* Ability to move quickly, prioritize well, and work independently in a startup environment 

* Clear communicator who enjoys collaborating across technical and non-technical teams 

* Exposure to AI/LLM applications for data extraction, normalization, enrichment, or retrieval-style workflows is a plus 

WHY JOIN PROXY 

* Work on meaningful product and data problems at the intersection of AI and the food industry 

* Help build the data backbone of a platform used to accelerate real-world R&D decisions 

* Join a small, ambitious team where you can have direct ownership and visible impact 

* Collaborate closely with engineers, product builders, and food scientists 

* Grow with a company shaping a new category in AI-powered product development

Applications go to the hiring team directly