AI Security & Governance
Secure AI systems, prevent prompt injection, manage model governance and compliance.
0
Open Positions
Core Skills
Prompt Injection DefenseModel SecurityAdversarial MLAI Governance FrameworksDifferential PrivacyFederated Learning
Active Positions (41)
Senior Threat and Attack Research Engineersenior
Anduril·Washington, District of Columbia, United States
threat actor tracking systemsintelligence data integration toolingcyber threat campaign analysissupply chain threat analysisinfrastructure threat analysisoffensive security red team engagements
Senior BSA/AML Investigatorsenior
xAI·New York, NY
BSA/AML (Bank Secrecy Act/Anti-Money Laundering) ComplianceSuspicious Activity Report (SAR) FilingTransaction Monitoring Systems
Staff Data Scientist - Trust and Safetystaff
Databricks·San Francisco, California
fraud and abuse detection using MLstatistical techniques for securitymachine learning for trust and safetysecurity and compliance data analysisdata-driven security program analysisstate-of-the-art fraud detection methods
Safeguards Analyst, Account Abusemid
Anthropic·San Francisco, CA | New York City, NY
Graph-based data infrastructureAccount-linking signalsScaled Abuse DetectionThird-party vendor signal integrationBehavioral indicatorsEnforcement tooling
Safeguards Enforcement Analyst, Safety EvaluationsmidRemote
Anthropic·Remote-Friendly (Travel-Required) | San Francisco, CA | Washington, DC; San Francisco, CA | New York City, NY
Safety EvaluationsModel Launch ReadinessPolicy and domain expertise integrationThreat vector analysisCross-Functional Collaboration
Security Architect, Applied AI mid
Anthropic·New York City, NY; New York City, NY | Seattle, WA; San Francisco, CA | New York City, NY
MCP (Model Context Protocol)Autonomous agent securityEnterprise security architectureRegulatory compliance (financial services)Data protection architecturePre-sales technical security
Technical Cyber Threat Investigator midRemote
Anthropic·Remote-Friendly (Travel-Required) | San Francisco, CA | Washington, DC
Cyber threat intelligenceInfluence operations detectionMalware development misuse detectionSocial engineering misuse detectionThreat actor TTPs analysisLLM system vulnerabilities
Technical Policy Manager, Cyber Harms managerRemote
Anthropic·Remote-Friendly (Travel-Required) | San Francisco, CA | Washington, DC
Cyber threat modelingAI capability evaluations (evals)Exploit chain analysisAI safety system designAI misuse prevention policiesDual-use cybersecurity knowledge
Head of National Security Policy - APACdirector
OpenAI·Singapore
National security policy engagementASEAN security frameworksARF (ASEAN Regional Forum)ADMM-PlusAUKUS partnership engagementQuad partnership engagement
Researcher, Frontier Cybersecurity Risksmid
OpenAI·San Francisco
Preparedness Frameworkfrontier cybersecurity riskscatastrophic riskscapability assessmentinternal red teamingAGI preparedness
TLM, Codex Securitymid
OpenAI·San Francisco
Codex Securitysecurity agentagentic security systemsAI-driven security researchsource code analysisvulnerability validation
Director of Trust & Safety Engineeringdirector
Vercel·Hybrid - San Francisco
Synthetic account detectionAutomated Misuse PreventionMalicious Content Detection for AI ToolsSpam content detection in AI deploymentsBilling abuse protection systemsCredit abuse prevention
Head of Security Engineeringdirector
Suno·Boston
Secure Cloud Architecture DesignIdentity Access Management (IAM) GovernanceSecurity Operations (SecOps)Product SecurityAgentic AI
Engineering Manager, Trust & Safety manager
Suno·San Francisco
Bot Detection SystemsContent Moderation SystemsAnomaly DetectionFraud Prevention SystemsTrust & Safety Data PipelinesTrust & Safety Operations Dashboards
Senior Engineering Manager - Trust and Safetysenior
Databricks·Bellevue, Washington
Security Foundation EngineeringRegulatory Compliance FrameworksSecure System Design
Explosives System Safety Engineermid
Anduril·Costa Mesa, California, United States
munition certificationwarhead certificationfuzing certificationexplosive ordnance designmunitions systems safetysystem safety engineering
Anthropic AI Security FellowmidRemote
Anthropic·London, UK; Ontario, CAN; Remote-Friendly, United States; San Francisco, CA
AI Security ResearchCybersecurity AI ApplicationsVulnerability DiscoveryClaude Code
Information Systems Security Managermanager
Anduril·Costa Mesa, California, United States
Risk Management Frameworkclassified deploymentsair-gapped environmentsgovernment accreditation processessecurity controls documentation
Enforcement Operations Leadsenior
Anthropic·San Francisco, CA | New York City, NY | Washington, DC
AI Safety EvaluationsAI GovernanceContent moderation vendor management for AIModel Behavior Mitigation StrategiesCross-Functional Collaboration
Systems Security Engineer, Anti-Tampermid
Anduril·Costa Mesa, California, United States
Lattice OSAgile Systems EngineeringSystems Engineering Management Plan (SEMP)Technical Performance Measures (TPMs)Omen CapabilityGroup 3 platform
Affiliate Services Licensing Specialist, Regulatory Affairs (NORAM)midRemote
Stripe·New York, San Francisco, US Remote
Next.jsTailwind CSSAI-powered frontend appsLLM copilotself-serve landing page prototyper
Senior AI Governance Solutions Consultant - Financial Services & Insurancesenior
Dataiku·Singapore
Mistral AIopen-source AI ecosystementerprise AI use casesenterprise DevRel strategiesenterprise AI transformationtechnical evangelism
Manager I, Engineering - Platform Trust & Safety manager
Datadog·New York, New York, USA
Trust and Safety engineeringThreat detectionPlatform Abuse MitigationSecurity Research
Senior Security Researcher - GenAIsenior
Datadog·Madrid, Spain; Paris, France
Bits AI for SecurityAI agentsretrieval strategiesplanning strategiesguardrailsprompt management
Staff Software Safety Engineerstaff
Anduril·Costa Mesa, California, United States
software safetyhazard analysessafety processesverification strategiessafety architecture design
Systems Security Engineer, Programsmid
Anduril·Costa Mesa, California, United States
system security engineeringembedded systems securitysecurity architecture assessmentthreat intelligencesecurity test strategyproduct lifecycle security
Senior Software Engineer, Trust & SafetyseniorRemote
Vercel·Remote - United States
Trust & Safety Engineeringplatform abuse detectionfraud detection pipelinesAI-generated activity detectionsynthetic accountsbilling abuse protection
Manager I, Engineering - Bits AI Security Analystmanager
Datadog·Paris, France
Bits AI for SecurityAI Agents (LLM-powered)retrieval-augmented generation (RAG) pipelinesGenAI for securityLLM-driven security analysisAI-powered security assistant
Senior AI Engineer - Bits AI Security Analystsenior
Datadog·Lisbon, Portugal
Bits AI for SecurityAI agentsretrieval strategiesplanning strategiesguardrailshuman‑in‑the‑loop paths
Senior Threat and Attack Research Engineersenior
Anduril·Seattle, Washington, United States
Threat actor tracking systemsSupply chain attack analysisInfrastructure attack analysisRed team engagements
Information Systems Security Officermid
Anduril·Costa Mesa, California, United States
NIST 800-53Continuous Monitoringair-gapped environmentsclassified deploymentssecurity controls documentationgovernment accreditation processes
Technical CBRN-E Threat Investigator midRemote
Anthropic·Remote-Friendly (Travel-Required) | San Francisco, CA | Washington, DC
CBRN-E threat detectionCBRN-E weapons misuse investigationBiodefense threat modelingChemical defense threat modelingAI misuse detection techniquesThreat actor behavior analysis
Software Engineer, Robotics Securitymid
Google DeepMind·Mountain View, California, US
robotics securitysecure API ecosystem for roboticson-device security for AI agentsdeveloper API security for roboticsGemini Robotics agentic modelsrobotics SDK security
Senior Manager, Technical Security Systems Architecturesenior
Anduril·Costa Mesa, California, United States
Lattice OSAI-powered operating systemsautonomous systems securitysensor fusion3D command and controlphysical-digital security integration
Fraud Specialist, Trust & SafetymidRemote
Vercel·Remote - United States
GTM domainsFinance domainscross-functional strategy planning
Technical Influence Operations Threat InvestigatormidRemote
Anthropic·Remote-Friendly, United States
Influence operations detectionDisinformation campaign investigationCoordinated inauthentic behavior detectionAI-generated synthetic content detectionNarrative manipulation detectionAstroturfing detection
Threat Collections EngineermidRemote
Anthropic·Remote-Friendly (Travel-Required) | San Francisco, CA | Washington, DC
YARA rule infrastructureMCP (Model Context Protocol) ServersVirusTotalCensysUrlscanDBT-Based Frameworks
Engineering Manager, Safeguards Data Infrastructuremanager
Anthropic·London, UK; New York City, NY
Safeguards Data InfrastructureHIPAA CompliancePrivacy-preserving Data APIsPII StorageOffline Data StackML and Training Workflows
Privacy Research Engineer, Safeguardsmid
Anthropic·San Francisco, CA
privacy-preserving machine learningdifferential privacy for MLprivacy-first training algorithmsprivacy evaluation and auditing techniquesk-anonymityl-diversity
Security Program Manager, AI Assurancemanager
Ramp·New York, NY (HQ)
SOC 2ISO 27001PCI-DSSSOXISO 42001AIUC-1
Research Engineer, Cybersecurity Reinforcement Learningmid
Anthropic·San Francisco, CA | New York City, NY
Reinforcement Learning (RL)Cybersecurity RLSecure CodingVulnerability RemediationRL Environments