Title: Senior Engineer – Model Evaluation & Generative AI

Industry: Generative AI and Applied ML

Location: Morrisville, NC

Responsibilities

* Design, implement, and maintain evaluation pipelines for large generative AI models.

* Benchmark and compare public and proprietary models, analyzing trade-offs and performance characteristics.

* Perform deep error analysis to identify model failure patterns and generate actionable insights.

* Develop methods to detect and mitigate bias, improving fairness and equity in model outputs.

* Conduct robustness and adversarial testing to assess resilience to noise, edge cases, and real-world variation.

Requirements

* Bachelor’s or Master’s degree in Computer Science, Machine Learning, or related field.

* 12+ years of software or ML engineering experience.

* Strong proficiency in Python and deep learning frameworks (e.g., PyTorch).

* Deep understanding of ML evaluation methodologies and metrics (BLEU, ROUGE, F1, perplexity, etc.).

* Proven ability to design rigorous experiments, analyze results, and draw statistically sound conclusions.

Skills

* Model Evaluation

* Generative AI

* Machine Learning

* Robustness Testing

* Bias Mitigation

* Python

* Deep Learning

* Evaluation Methodologies

* Experiment Design

* AI Safety

Hourly rate is commensurate with experience and is an estimated range provided by WorkGenius.

ref_id: f2b02ee7-6adf-41c0-ab77-c7fbde6abca6