𝗥𝗼𝗹𝗲 𝗢𝘃𝗲𝗿𝘃𝗶𝗲𝘄
Mercor is partnering with an AI research organization to engage independent evaluation contractors who can assess agentic tool-use quality, specifically whether a model calls search appropriately and rewrites user prompts into effective queries. This short-term engagement focuses on high-accuracy judgments, clear rationales, and consistency across a large volume of model–rater traces. The work is well-suited for experts in information retrieval, prompt engineering, and product QA who prefer remote, asynchronous projects.
𝗞𝗲𝘆 𝗥𝗲𝘀𝗽𝗼𝗻𝘀𝗶𝗯𝗶𝗹𝗶𝘁𝗶𝗲𝘀
• Review model interaction logs and decide if invoking the search tool was appropriate given the initial prompt and context.
• Evaluate the rewritten search query for clarity, specificity, and fidelity to the user’s intent.
• Provide concise, evidence-based rationales tied to rubric criteria; label edge cases and ambiguities.
• Score query quality (e.g., intent capture, keyword selection, operator use) and overall tool-use timing.
• Calibrate against gold examples; surface rubric gaps and propose improvements.
• Track decisions in a task portal; maintain high inter-rater agreement and throughput targets.
• Flag potentially sensitive content according to the provided safety guidelines.
𝗜𝗱𝗲𝗮𝗹 𝗤𝘂𝗮𝗹𝗶𝗳𝗶𝗰𝗮𝘁𝗶𝗼𝗻𝘀
• Excellent written communication; able to justify decisions succinctly with references to instructions/rubrics.
• Meticulous attention to detail; comfortable working independently with minimal oversight.
• Nice to have: familiarity with annotation tools, basic scripting (Python/SQL), and multilingual proficiency.
𝗠𝗼𝗿𝗲 𝗔𝗯𝗼𝘂𝘁 𝘁𝗵𝗲 𝗢𝗽𝗽𝗼𝗿𝘁𝘂𝗻𝗶𝘁𝘆
• Remote and asynchronous—contractors set their own hours.
• Expected commitment: ~10–20 hours/week; flexible, project-based workload.
• Duration: initial 6–10 weeks with potential for additional task batches.
• Shared resources and best-practice guides are provided; a support team is available for inquiries.
𝗖𝗼𝗺𝗽𝗲𝗻𝘀𝗮𝘁𝗶𝗼𝗻 & 𝗖𝗼𝗻𝘁𝗿𝗮𝗰𝘁 𝗧𝗲𝗿𝗺𝘀
• Compensation for completed work: estimated $𝟰𝟱/𝗵𝗼𝘂𝗿 equivalent or calibrated per-task rates based on complexity and geography (final rates confirmed before work begins).
• Payment for services rendered is made through the platform (e.g., weekly via Stripe Connect, where available).
• Independent contractor engagement; project-based statement of work; no employment relationship or benefits implied.
𝗜𝗱𝗲𝗮𝗹 𝗖𝗮𝗻𝗱𝗶𝗱𝗮𝘁𝗲:
• A self-starter who loves to create.
• Light on their feet, with the ability to produce high-quality work in short timeframes.
• Excited to experiment, innovate, and deliver fresh ideas.
𝗔𝗽𝗽𝗹𝗶𝗰𝗮𝘁𝗶𝗼𝗻 𝗣𝗿𝗼𝗰𝗲𝘀𝘀:
• Submit a brief profile (CV or LinkedIn) and note relevant evaluation/search experience.
• Complete a short skills check and sample grading exercise to demonstrate rubric alignment.
• If matched, you’ll sign a simple contract/NDA and receive task access details.
• Typical follow-up is within a few days after the sample review.
𝗖𝗼𝗺𝗽𝗹𝗶𝗮𝗻𝗰𝗲 𝗻𝗼𝘁𝗲: This listing avoids employment-implying terms (e.g., “employee,” “hire,” “join our team”) and emphasizes independent, project-based services. If any materials you provide contain such language, we will propose compliant alternatives.
We consider all qualified applicants without regard to legally protected characteristics and provide reasonable accommodations upon request.
Here is how to apply:
Mercor