
Job Overview
Location
United States
Job Type
Part-time
Category
Software Engineering
Date Posted
March 13, 2026
Full Job Description
📋 Description
- • As an AI Red-Teamer for adversarial AI testing, you will play a critical role in enhancing the safety, reliability, and robustness of cutting-edge conversational AI systems.
- • Your primary responsibility will be to proactively challenge AI models through meticulously designed adversarial evaluations, uncovering potential risks and vulnerabilities before they manifest in real-world applications.
- • This involves systematically probing AI systems to ensure they respond safely, accurately, and responsibly across a diverse spectrum of scenarios and potential misuse cases.
- • You will be instrumental in identifying weaknesses by testing various attack vectors, including jailbreak attempts, prompt injections, and other exploit strategies designed to push AI models beyond their intended operational boundaries.
- • A key aspect of your role will be generating high-quality human evaluation data. This includes annotating instances where AI models fail, classifying the nature of identified vulnerabilities, and pinpointing systemic risks that could impact AI performance or safety.
- • You will apply structured testing methodologies, leveraging established taxonomies, benchmarks, and playbooks to ensure a consistent and comprehensive evaluation process.
- • The ability to document your findings clearly and reproducibly is paramount. You will be expected to produce detailed reports, curated datasets, and well-defined adversarial test cases that development teams can readily act upon to implement improvements.
- • This position requires you to work across multiple projects concurrently, supporting a variety of AI systems and adapting to different evaluation objectives and client requirements.
- • You will be expected to think adversarially, naturally exploring unconventional ways to test the limits of AI systems and expose their hidden weaknesses.
- • A preference for structured methodologies, utilizing frameworks and benchmarks over ad-hoc testing, will be essential for effective and scalable evaluation.
- • You must possess the ability to communicate identified risks and vulnerabilities with clarity and precision, effectively conveying complex technical issues to both technical and non-technical stakeholders.
- • Adaptability is key, as you will be expected to work across diverse projects and readily adapt to new and evolving evaluation challenges within the rapidly advancing field of AI.
- • The work is entirely text-based, focusing on analyzing and generating textual inputs and outputs.
- • While the role involves reviewing AI outputs that may reference sensitive topics such as bias, misinformation, or harmful behaviors, participation in higher-sensitivity projects is entirely optional.
- • For optional higher-sensitivity projects, clear guidelines and comprehensive wellness resources will be provided to ensure your well-being and support.
- • Success in this role means uncovering critical vulnerabilities and failure modes that automated testing methods might overlook.
- • Your contributions will lead to the creation of reproducible artifacts and datasets that demonstrably improve the resilience of AI systems.
- • You will contribute to expanding evaluation coverage by testing more realistic adversarial scenarios prior to AI system deployment.
- • Ultimately, your rigorous testing and insightful analysis will directly contribute to making AI systems safer, more reliable, and more trustworthy for end-users.
- • This is a unique opportunity to contribute directly to frontier work in AI safety and adversarial testing, shaping the future of responsible AI development.
- • You will gain invaluable hands-on experience working with human data-driven AI evaluation methodologies, a critical component of modern AI development and safety assurance.
- • The compensation for this role is variable, ranging from $50-$111 per hour, and may depend on the specific project, customer requirements, your level of expertise, and the content sensitivity involved in each engagement.
- • This engagement is structured as an independent contractor role, offering flexibility and autonomy.
- • The position is fully remote, allowing you to complete the work on your own schedule from any location.
- • Project durations may vary, with possibilities for extensions, shortenings, or early conclusion based on project needs and your performance.
- • Importantly, the work performed will not involve access to any confidential or proprietary information from any employer, client, or institution, ensuring data privacy and security.
- • Payments are processed weekly via Stripe or Wise, based on the services rendered, providing a reliable and timely payment structure.
Skills & Technologies
About Weekday Technologies Inc.
Weekday Technologies operates a hiring platform that connects tech companies with pre-vetted software engineers through community referrals. The product crowdsources candidate recommendations from existing engineering teams, verifies skills, and offers employers a searchable talent pool for contract and full-time roles. Founded in 2021 and headquartered in San Francisco, the company focuses on reducing time-to-hire for startups and scale-ups by leveraging trusted peer networks rather than traditional recruiting pipelines.
Subscribe to the weekly newsletter for similar remote roles and curated hiring updates.
Newsletter
Weekly remote jobs and featured talent.
No spam. Only curated remote roles and product updates. You can unsubscribe anytime.
Similar Opportunities

NextGen Healthcare, Inc.
1 month ago


