
Remote | Engineering Assessment Specialist — $35–$75/hour

24-MAG
San Francisco, CA
Contract
Applications go directly to the hiring team

Full Description

We are sharing a specialised part-time consulting opportunity for expert engineers with strong academic judgment, deep subject matter expertise, and the ability to author and review rigorous assessment content across advanced engineering domains.

This role supports a collaboration with leading AI companies to improve advanced AI systems through high-quality academic question design, solution verification, and benchmark development across core areas of engineering.

Selected professionals will create or review challenging multiple-choice questions, assess clarity and rigor, rate difficulty, write step-by-step solutions, provide academic references, and help improve overall model quality. This opportunity is especially well-suited to detail-oriented engineering experts who are comfortable translating advanced technical knowledge into precise, self-contained assessment content that supports frontier model evaluation.

Key Responsibilities

Professionals in this role may contribute to:

Question Authoring for AI Evaluation

Create original, challenging multiple-choice questions in assigned areas of engineering expertise

Ensure that questions test deep conceptual understanding rather than surface-level recall

Help ensure that prompts are unambiguous, self-contained, and precisely defined

Question Verification & Quality Review

Review pre-written questions for accuracy, clarity, rigor, completeness, and solvability

Edit question content where needed and document any changes made

Help maintain high standards for precision, quality, and consistency across benchmark tasks

Solution Writing & Benchmark Support

Rate each question's difficulty as medium, hard, or expert

Provide one correct answer and nine plausible but subtly incorrect alternatives

Write step-by-step solutions in markdown format with clear, concise intermediate steps

Supply academic references from reputable sources to support question quality and correctness

Ideal Profile

Strong candidates may have:

A PhD or doctoral candidacy in Engineering or a closely related field

Strong command of graduate-level engineering principles, applied mathematics, and domain-specific standards

Excellent written English and the ability to express complex ideas clearly and concisely

Deep expertise in one or more areas such as mechanical and thermal engineering, electrical engineering, aerospace engineering, materials science and engineering, industrial and systems engineering, civil engineering, or related engineering domains

Preferred Qualifications

A Master's degree with exceptional depth in a specific engineering subdomain

Professional engineering licensure or industry experience

A consistent record of writing rigorous academic content and verifying technical accuracy

Ability to evaluate both conceptual depth and assessment design quality across repeated tasks

Why This Opportunity

Contribute specialised engineering expertise to a cutting-edge AI collaboration

Help establish gold-standard academic benchmarks used to advance AI capabilities

Work on high-impact assessment design and verification tasks with strong technical relevance

Flexible remote work with competitive hourly compensation

Contract Details

Independent contractor role

Fully remote with flexible scheduling

Hourly compensation of $35–$75 per hour

Expected commitment of 10+ hours per week

Asynchronous work format

Assignments may involve either question authoring or question verification depending on project needs

Projects may be extended, shortened, or concluded early depending on project needs and performance

Weekly payments via Stripe or Wise

Work will not involve access to confidential or proprietary information from any employer, client, or institution

Please note: We are unable to support H-1B or STEM OPT candidates at this time

Start date: Immediate

About The Platform

This opportunity is available through a leading AI-driven work platform that connects domain experts with frontier AI research projects.

Experts contribute to improving advanced AI systems by providing specialised expertise across real-world workflows, structured evaluation, model training support, and domain-specific content validation.

By submitting this application, you acknowledge that your information may be processed by 24-MAG LLC for recruitment and opportunity matching in accordance with our Privacy Policy: https://www.24-mag.com/privacy-policy
