Senior Engineer – Model Evaluation & Generative AI
WorkGenius GroupTitle: Senior Engineer – Model Evaluation & Generative AI
Industry: Generative AI and Applied ML
Location: Morrisville, NC
Responsibilities
* Design, implement, and maintain evaluation pipelines for large generative AI models.
* Benchmark and compare public and proprietary models, analyzing trade-offs and performance characteristics.
* Perform deep error analysis to identify model failure patterns and generate actionable insights.
* Develop methods to detect and mitigate bias, improving fairness and equity in model outputs.
* Conduct robustness and adversarial testing to assess resilience to noise, edge cases, and real-world variation.
Requirements
* Bachelor’s or Master’s degree in Computer Science, Machine Learning, or related field.
* 12+ years of software or ML engineering experience.
* Strong proficiency in Python and deep learning frameworks (e.g., PyTorch).
* Deep understanding of ML evaluation methodologies and metrics (BLEU, ROUGE, F1, perplexity, etc.).
* Proven ability to design rigorous experiments, analyze results, and draw statistically sound conclusions.
Skills
* Model Evaluation
* Generative AI
* Machine Learning
* Robustness Testing
* Bias Mitigation
* Python
* Deep Learning
* Evaluation Methodologies
* Experiment Design
* AI Safety
Hourly rate is commensurate with experience and is an estimated range provided by WorkGenius.
ref_id: f2b02ee7-6adf-41c0-ab77-c7fbde6abca6