1. 𝗥𝗼𝗹𝗲 𝗢𝘃𝗲𝗿𝘃𝗶𝗲𝘄
Mercor is collaborating with a leading AI research lab to develop a next-generation evaluation dataset for frontier AI models. We are seeking experts with advanced domain knowledge across diverse fields to design extremely challenging prompts that cannot be solved by existing AI systems without internet search or browsing capabilities. The goal is to create a benchmark dataset that pushes the limits of current AI reasoning and retrieval. This is a short-term research engagement with significant impact on AI evaluation.2. 𝗞𝗲𝘆 𝗥𝗲𝘀𝗽𝗼𝗻𝘀𝗶𝗯𝗶𝗹𝗶𝘁𝗶𝗲𝘀
• Create original, expert-level prompts that require tool use (e.g., search, browse, or code execution).
• Ensure prompts are objective, self-contained, and yield clear, unambiguous answers.
• Test prompts against advanced AI models and document failures/successes.
• Provide reasoning steps and solutions for each prompt.
• Classify prompts into subject domains for dataset organization.
• Collaborate with reviewers for expert validation and prompt refinement.
3. 𝗜𝗱𝗲𝗮𝗹 𝗤𝘂𝗮𝗹𝗶𝗳𝗶𝗰𝗮𝘁𝗶𝗼𝗻𝘀
• Advanced academic or professional expertise in a specialized subject (STEM, law, finance, history, cultural studies, etc.).
• Strong ability to design precise, high-difficulty questions requiring deep knowledge and external references.
• Experience in academic research, benchmarking, or test question design preferred.
• Attention to detail and ability to provide concise reasoning explanations.
• Familiarity with AI models and their limitations is a plus.
4. 𝗠𝗼𝗿𝗲 𝗔𝗯𝗼𝘂𝘁 𝘁𝗵𝗲 𝗢𝗽𝗽𝗼𝗿𝘁𝘂𝗻𝗶𝘁𝘆
• Remote and asynchronous — set your own hours.
• Expected commitment: ~10–20 hours/week.
• Project duration: ~2 months, with possible extensions based on dataset needs.
5. 𝗖𝗼𝗺𝗽𝗲𝗻𝘀𝗮𝘁𝗶𝗼𝗻 & 𝗖𝗼𝗻𝘁𝗿𝗮𝗰𝘁 𝗧𝗲𝗿𝗺𝘀
• Competitive hourly compensation based on expertise.
• Independent contractor engagement.
• Payments for services rendered are processed weekly via Stripe Connect.
6. 𝗔𝗽𝗽𝗹𝗶𝗰𝗮𝘁𝗶𝗼𝗻 𝗣𝗿𝗼𝗰𝗲𝘀𝘀
• Submit your resume or CV highlighting your subject matter expertise.
• Complete a brief questionnaire about your background and areas of specialization.
• Selected applicants may be asked to draft a short test prompt.
• You’ll receive a follow-up within a few days regarding next steps.
7. 𝗔𝗯𝗼𝘂𝘁 𝗠𝗲𝗿𝗰𝗼𝗿
• Mercor is a talent marketplace that connects top experts with leading AI labs and research organizations.
• Our investors include Benchmark, General Catalyst, Adam D’Angelo, Larry Summers, and Jack Dorsey.
• Thousands of professionals across domains like law, creatives, engineering, and research have joined Mercor to work on frontier projects shaping the next era of AI.
We consider all qualified applicants without regard to legally protected characteristics and provide reasonable accommodations upon request.
Here is how to apply:
Mercor
Mercor
Tidak ada komentar:
Posting Komentar