Operations Research Model Prompt Evaluator Job at SaidGig, Remote

ZUVFMFVhdGpRbGdmYlZuZ1NmTHFiYUsyRFE9PQ==
  • SaidGig
  • Remote

Job Description

Role Overview

As an expert in operations research, you will play a crucial role in crafting and verifying high-quality open-ended prompts for AI model evaluation. Your work will involve creating and reviewing complex optimization and decision-science problems, assessing AI reasoning quality, and helping to establish rigorous evaluation standards for advanced language models.

You will be assigned one of two task types:

  • Authoring Task — Create 5 original, open-ended prompts from your assigned subdomain at varying difficulty levels (undergraduate, advanced undergraduate, or graduate/professional). Prompts should require human judgment to evaluate the quality of the AI''s response, such as optimization modeling, algorithmic analysis, or stochastic reasoning.
  • Verification Task — Review 5 authored prompts for clarity, scope alignment, difficulty accuracy, and uniqueness. Edit prompts and difficulty ratings where needed.
Operations Research Subdomains Covered

Linear & Integer Programming, Network Optimization & Graph Theory, Stochastic Models & Queuing Theory, Game Theory & Decision Analysis, Supply Chain & Logistics Optimization, Simulation & Metaheuristics.

Key Responsibilities
  • Author clear, unambiguous, open-ended operations research prompts that elicit evaluable AI responses.
  • Verify prompts are within the scope of the assigned subdomain and correctly rated for difficulty.
  • Ensure all 5 prompts in a task are sufficiently distinct from one another with varying difficulty levels.
  • Apply expert judgment to assess the depth and quality of quantitative reasoning required.
  • Edit prompts and difficulty assignments where standards are not met.
Ideal Qualifications
  • Master''s degree or higher in Operations Research, Industrial Engineering, Applied Mathematics, or a closely related field.
  • 2–6 years of professional or research experience in optimization, logistics, or decision science.
  • Strong command of mathematical programming, probabilistic modeling, and algorithmic methods.
  • Experience with solvers (Gurobi, CPLEX) or simulation tools is a strong plus.
  • Excellent written English and ability to craft precise, well-scoped technical questions.
More About the Opportunity

Expected commitment: 10+ hours/week. This position offers asynchronous, fully remote work.

Job Tags

Remote job

Similar Jobs

CBH Homes

Experienced HVAC Installer Job at CBH Homes

 ...furnaces, and read Manual J someone who takes pride in doing clean, professional installs and can keep jobs running smoothly. Requirements What Were Looking For: 1+ years of HVAC install experience (residential preferred) Able to gas pipe, set furnaces,... 

Avero

Safety and Recruitment Specialist Job at Avero

 ...Strong communication and organizational skills ~ Experience with recruiting and sourcing candidates ~ Familiarity with Microsoft Office and applicant tracking systems ~ First Aid/CPR certification (or ability to obtain)~ Bilingual English/Spanish is a plus... 

Yeehe

Gaming translators from Chinese to Burmese (part-time) Job at Yeehe

Requirements 1) Good command of Chinese and Burmese, responsible, an eye for detail, and good written skills. 2) Burmese is your mother-language3) True passion for gaming. 4) Great language and organizational skills. 5) Good computer skills and capable of using common... 

VareCo

Assistant Property Manager Job at VareCo

 ...Assistant Property Manager This is not a passive management role. The Assistant Property Manager is expected to take ownership of outcomes in a fast-moving, high-accountability environment. The work is demanding, conditions are imperfect, and pressure is constant... 

Rivian

Lead, Protection Security Job at Rivian

 ...our team shares a love of the outdoors and a desire to protect it for future generations. Role Summary The Protection Security lead is entrusted with safeguarding Rivian Executives through the delivery of comprehensive, discreet, and proactive protective services...