1. About Our Client:

The organization operates in the technology sector, focusing on supporting the US federal government. It addresses challenges in defense, national security, public safety, civilian, and military health by delivering reliable, private, and safe technology solutions. The program emphasizes operationalizing advanced AI technologies for confidential federal missions and serves millions through production-grade applications. The team prioritizes rapid delivery and measurable performance in latency, reliability, safety, and cost across high-impact federal programs.

2. About the Opportunity:

The Generative AI Platform Engineer role is responsible for building and maintaining production AI services that ground responses in critical mission data with low error rates. This position ensures AI systems meet strict federal guidelines through governance, observability, and performance optimization. The engineer will develop reusable platform components and apply practical metrics to improve system quality. This role is vital for operationalizing best-in-class AI models in secure, compliance-heavy environments, enabling the organization to deliver dependable AI capabilities that support federal missions.

3. Responsibilities:

• Operationalize large language models and retrieval-augmented generation services in cloud or on-prem environments

• Implement governance including prompt/version management, policy filtering, access controls, audit trails, and safety checks aligned with federal standards

• Define and monitor SLIs/SLOs for quality, latency, safety, and cost; lead observability efforts, incident response, and postmortems

• Optimize system performance and costs using caching, batching, autoscaling, metering, and FinOps principles

• Develop reusable SDKs, CI/CD templates, and infrastructure-as-code modules to support multiple mission teams

• Apply information retrieval metrics and queueing theory to improve retrieval quality and meet latency targets

• Conduct evaluations of AI helpfulness, safety, and factuality both offline and online

4. Requirements:

• Experience owning end-to-end production AI systems including integration, deployment, observability, and incident response

• Proficiency in Python programming

• Experience with retrieval or vector search technologies such as pgvector, Milvus, or OpenSearch and grounding AI in enterprise data

• Strong understanding of service level objectives and dashboarding to improve reliability, latency, safety, and cost

• Ability to communicate effectively with engineers, product managers, and security teams in regulated environments

• US Citizenship required

5. Pay Range and Compensation Package:

• The pay range for this role in specified locations including California, Colorado, Hawaii, Illinois, Maryland, Massachusetts, Minnesota, New Jersey, New York, Washington, Vermont, the District of Columbia, and the city of Cleveland is $100,200 to $203,400 USD

• Compensation varies based on location, role, skills, and experience

6. Benefits & Perks:

• The program offers a variety of benefits, including opportunities for professional growth through certifications and industry training

Equal Opportunity Statement: Our client is an equal opportunity employer. They celebrate diversity and are committed to creating an inclusive environment for all employees. All qualified applicants will receive consideration for employment without regard to race, color, religion, gender, gender identity or expression, sexual orientation, or national origin.

Note:

RemoteHunter is not the Employer of Record (EOR) for this role. Our purpose in this opportunity is to connect exceptional candidates with leading employers. We help job seekers worldwide discover roles that match their goals and guide them to complete their full application directly through the hiring company’s career page or ATS.

Generative AI Platform Engineer

Skills & Expertise

Key Responsibilities

Full Description