
Job Overview
Location: USA - Remote
Job Type: Full-time
Category: Data Science
Date Posted: June 14, 2026
Full Job Description
📋 Description
- Work at the intersection of Artificial Intelligence and Threat Research to build next-generation agentic systems for cybersecurity.
- Collaborate with cybersecurity subject-matter experts to understand analyst workflows and security operations procedures.
- Post-train large language models (LLMs) and AI agents using supervised fine-tuning and reinforcement learning techniques including RLHF, RLAIF, PPO, GRPO, and DPO.
- Design and implement AI agent architectures featuring planning loops, reasoning, tool and function calling, retrieval systems, and memory management.
- Research and prototype state-of-the-art agentic planning methods from academic literature to enhance system performance.
- Establish objective, statistically rigorous benchmarking frameworks for agentic systems using evals, LLM-as-judge pipelines, and trajectory-level metrics.
- Optimize prompts and inference strategies to maximize model efficiency and output reliability on real-world security tasks.
- Partner with Engineering, Data Science, and Managed Services teams to transition prototypes into production-grade systems.
- Track advancements in AI research and help identify, define, and prioritize new areas for innovation in agentic systems.
- Maintain reproducible research engineering practices using clean Python and disciplined experiment tracking for team-wide collaboration.
- Work independently on ambiguous, complex objectives while communicating clearly within cross-functional teams.
🎯 Requirements
- Excellent foundations in machine learning, probability, and statistics with strong instincts for uncertainty, variance, and experimental design
- PhD-level depth of understanding in modern machine learning research, including ability to read, critique, implement, and improve upon current papers
- Experience training generative models with command of LLM fundamentals: architecture, optimization, tokenization, data, and scaling behavior
- Core expertise in reinforcement learning and post-training: RLHF/RLAIF, policy optimization (PPO/GRPO/DPO), reward modeling, and building RL environments for agents
- Experience building agentic systems: ReAct, planning, reflection, tool/function calling, and retrieval/memory/context management
- Fluency with GPUs, PyTorch, and the LLM training/serving stack (Hugging Face Transformers/TRL/PEFT, DeepSpeed/FSDP, vLLM/TGI/SGLang)
- Strong, reproducible research engineering skills with disciplined Python and experiment tracking
🏖️ Benefits
- Market leader in compensation and equity awards
- Comprehensive physical and mental wellness programs
- Competitive vacation and holidays to recharge
- Paid parental and adoption leave
- Professional development opportunities for all employees regardless of level or role
- Employee Networks, geographic neighborhood groups, and volunteer opportunities to build connections
About CrowdStrike Holdings, Inc.
CrowdStrike Holdings, Inc. provides cloud-delivered cybersecurity through the Falcon platform, combining next-generation antivirus, endpoint detection and response, threat hunting, and IT hygiene. Its AI-driven analytics correlate trillions of events weekly to identify malware-free intrusions, nation-state actors, and insider threats across endpoints, workloads, and identities. The company sells subscriptions, professional services, and threat intelligence to enterprises worldwide.