Operations Research Model Prompt Evaluator Job at SaidGig, Remote

ZUVFMFVhdGpRbGdmYlZuZ1NmTHFiYUsyRFE9PQ==
  • SaidGig
  • Remote

Job Description

Role Overview

As an expert in operations research, you will play a crucial role in crafting and verifying high-quality open-ended prompts for AI model evaluation. Your work will involve creating and reviewing complex optimization and decision-science problems, assessing AI reasoning quality, and helping to establish rigorous evaluation standards for advanced language models.

You will be assigned one of two task types:

  • Authoring Task — Create 5 original, open-ended prompts from your assigned subdomain at varying difficulty levels (undergraduate, advanced undergraduate, or graduate/professional). Prompts should require human judgment to evaluate the quality of the AI''s response, such as optimization modeling, algorithmic analysis, or stochastic reasoning.
  • Verification Task — Review 5 authored prompts for clarity, scope alignment, difficulty accuracy, and uniqueness. Edit prompts and difficulty ratings where needed.
Operations Research Subdomains Covered

Linear & Integer Programming, Network Optimization & Graph Theory, Stochastic Models & Queuing Theory, Game Theory & Decision Analysis, Supply Chain & Logistics Optimization, Simulation & Metaheuristics.

Key Responsibilities
  • Author clear, unambiguous, open-ended operations research prompts that elicit evaluable AI responses.
  • Verify prompts are within the scope of the assigned subdomain and correctly rated for difficulty.
  • Ensure all 5 prompts in a task are sufficiently distinct from one another with varying difficulty levels.
  • Apply expert judgment to assess the depth and quality of quantitative reasoning required.
  • Edit prompts and difficulty assignments where standards are not met.
Ideal Qualifications
  • Master''s degree or higher in Operations Research, Industrial Engineering, Applied Mathematics, or a closely related field.
  • 2–6 years of professional or research experience in optimization, logistics, or decision science.
  • Strong command of mathematical programming, probabilistic modeling, and algorithmic methods.
  • Experience with solvers (Gurobi, CPLEX) or simulation tools is a strong plus.
  • Excellent written English and ability to craft precise, well-scoped technical questions.
More About the Opportunity

Expected commitment: 10+ hours/week. This position offers asynchronous, fully remote work.

Job Tags

Remote job

Similar Jobs

TALENIQUE INC

Machine Operator - Night Shift Job at TALENIQUE INC

 ...Job Type: Full-time Salary : From $22.00 per hour Schedule : Hours: Sunday-Thursday, 11:00pm-7:30am 3rd shift (Must be available to train on 1st shift) We are looking for an experience skilled Machine Operator with at least 2 years of experience, to set up, maintain... 

Riot Games

Principal Concept Artist (Character) - VALORANT Job at Riot Games

 ...As a Character Concept Artist at Riot , you will bring characters and creatures to life. Drawing inspiration from as little as a napkin...  ...within our genre and gameplay needs. Youll partner with the Art Director and character art leaders as you work across disciplines... 

Mercy

Physician - Consult Liaison Psychiatrist, $100k Bonus with Mercy Hospital St Louis, Missouri Job at Mercy

 ...Mercy Psychiatry is currently seeking a board certified or board eligible Consult Liaison Psychiatrist to join our team at Mercy Hospital St. Louis, MO. $100K Sign on Bonus! This Position Offers: ~7 on 7 off schedule ~10-hour shifts ~ Competitive compensation... 

University of Cincinnati

Adjunct Instructor, Anthropology, College of Arts and Sciences Job at University of Cincinnati

Current UC employees must apply internally via SuccessFactors You are invited to apply to be included in the general pool of candidates from which qualified faculty roles will be filled. The number of positions varies depending on the needs of the department...

Walmart

(USA) Backroom Associate Sam's Club Job at Walmart

Position Summary... What youll do... Maintains and processes shipments for the Club in accordance with Company policies and procedures by developing and posting delivery schedules compiling and organizing receiving reports verifying merchandise counts routing...