
Job Overview
Location
Remote (USA)
Job Type
Contract
Category
Software Engineering
Date Posted
May 16, 2026
Full Job Description
đź“‹ Description
- • Design and evaluate high-difficulty Computer Engineering prompts that challenge the reasoning limits of large language models (LLMs) in areas including computer architecture, embedded systems, digital logic design, computer networks, and hardware-software integration.
- • Identify reasoning errors, logical inconsistencies, conceptual misunderstandings, and failure modes in AI-generated responses to Computer Engineering prompts.
- • Apply adversarial prompting techniques to surface gaps in model knowledge, particularly around edge cases, numerical precision, timing constraints, and system-level interactions.
- • Provide expert-level, detailed feedback on LLM outputs, correcting inaccuracies in technical content, terminology, schematics, and design methodologies specific to Computer Engineering.
- • Collaborate with project leads to uphold consistent quality standards across AI training datasets, ensuring technical fidelity and pedagogical accuracy in model responses.
- • Analyze model responses for adherence to industry-standard practices in hardware design, software-hardware co-design, and embedded system validation.
- • Detect and document subtle errors in chain-of-thought reasoning, such as incorrect assumptions about register allocation, pipeline stalls, memory hierarchy behavior, or bus protocol violations.
- • Contribute to the refinement of evaluation rubrics and scoring criteria for AI model performance in Computer Engineering contexts.
- • Maintain rigorous attention to numerical accuracy, units of measurement, and technical specificity in all feedback, ensuring no misleading or ambiguous information is propagated.
- • Participate in periodic alignment sessions to review model performance trends and suggest improvements to prompt design and feedback protocols.
- • Work independently to deliver high-quality evaluations with minimal supervision, adhering to project timelines and quality benchmarks.
- • Ensure all feedback is clear, actionable, and technically precise, enabling downstream model fine-tuning and performance enhancement.
- • Engage with evolving LLM capabilities and limitations in real-time, adapting prompt strategies to maintain challenge and diagnostic rigor.
- • Uphold data integrity by flagging inconsistent, outdated, or incorrect information in model outputs that could mislead learners or developers using the AI system.
- • Contribute to the creation of benchmark datasets that test advanced understanding of pipelining, cache coherence, interrupt handling, RTL design, and Verilog/VHDL implementation.
- • Communicate complex technical concepts with clarity, ensuring feedback is interpretable by both AI systems and human review teams.
- • Follow documented protocols for categorizing and tagging errors by type (e.g., conceptual, computational, syntactic, architectural) to support systematic model improvement.
- • Maintain confidentiality and data security for all proprietary prompt datasets and model outputs provided by Handshake AI.
- • Adapt evaluation approaches based on model updates or new releases, ensuring feedback remains relevant and effective across varying AI capabilities.
- • Work within a fully remote, asynchronous environment with flexible weekly hours (capped at 40 hours/week), contributing as much or as little as desired.
🎯 Requirements
- • PhD or MS in Computer Engineering or closely related fields (Computer Science, System Engineering, etc.)
- • Solid understanding of Computer Engineering principles, system design practices, and hardware/software development tools
- • Prior experience with prompt engineering and stumping expert-level models (max reasoning/tokens, Chain-of-thought)
- • Prior experience inducing/identifying reasoning-related errors
- • Excellent attention to detail, especially in technical communication and numerical accuracy
- • Based out of the US, Canada, Mexico, UK, or Spain
🏖️ Benefits
- • Fully remote work with no geographic restrictions beyond approved countries
- • Flexible hours — no minimum weekly requirement, capped at 40 hours/week
- • Opportunity to contribute to cutting-edge AI development in Computer Engineering
- • Potential for ongoing contract work based on performance and project needs
Skills & Technologies
About Handshake Technologies, Inc.
Handshake Technologies provides a cloud-based career-services platform that connects university students, recent graduates, and employers. The software enables institutions to manage job postings, career fairs, on-campus interviews, and employer relations while giving students tools to discover internships and entry-level roles and giving employers access to early-career talent across a network of partner colleges and universities.
Subscribe to the weekly newsletter for similar remote roles and curated hiring updates.
Newsletter
Weekly remote jobs and featured talent.
No spam. Only curated remote roles and product updates. You can unsubscribe anytime.
Similar Opportunities

Nivoda Limited
3 months ago

FluidStack Inc.
3 months ago

Lavendo Inc.
3 months ago

LiveKit, Inc.
3 months ago