Senior Detection & Resilience Engineer
This is a systems-building role, not an alerting role. You will design and implement the telemetry, detection, and recovery systems that monitor and protect a high-performance AI compute environment, working at the intersection of security engineering, infrastructure observability, and incident response automation.
Detection, Containment, Recovery — At AI Scale
We are working with a confidential AI research organization building advanced machine learning systems on highly secure private infrastructure. They are hiring a Senior Detection & Resilience Engineer to design and implement systems that monitor, detect, and respond to threats within a high-performance AI compute environment.
What You’ll Do
- Design and implement logging, monitoring, and signal collection across distributed infrastructure: compute clusters, orchestration layers, and networking systems.
- Create detection mechanisms that identify suspicious activity using behavioral analysis and anomaly detection techniques.
- Partner with teams to simulate attack scenarios and validate detection systems against real-world attacker behavior.
- Build tools and workflows that enable fast investigation and response with minimal disruption to production systems.
- Design systems for rapid containment, isolation, and recovery during compromise or failure scenarios.
- Work closely with platform, security, and infrastructure teams to ensure full visibility across the environment.
What They’re Looking For
Strong candidates typically bring experience in several of the following:
- Infrastructure monitoring and observability platforms
- Security detection engineering
- Incident response automation
- Linux systems and distributed infrastructure
- Kubernetes and container telemetry
- Log pipelines and large-scale telemetry processing
- Programming in Go, Python, Rust, or similar languages
- Adversarial thinking or threat modeling (a plus)
About the Opportunity
This role is with a confidential AI research organization building secure, high-performance infrastructure for next-generation machine learning systems. You will work alongside engineers in distributed systems, infrastructure security, and applied machine learning.
Only shortlisted candidates will be contacted. Selected applicants may be asked to sign an NDA prior to the interview process. This search is conducted exclusively by AIONIA on behalf of the client.
