Member of Technical Staff
Most AI roles focus on building AI systems. This one focuses on something more fundamental — understanding how AI systems actually perform in the real world, and defining how that performance is measured. You’ll operate at the intersection of systems, analysis, AI, and strategy.
Not Just Building AI — Understanding It
This is not a typical AI role. You won’t just build AI systems — you’ll work on the problems that define how modern AI is evaluated, understood, and deployed. You’ll shape the frameworks, datasets, and metrics that the industry uses to make sense of model and agent behavior at scale.
This Role Tends to Resonate With
This role attracts a specific type of person. You might be a fit if you:
What You’ll Do
- Design and build AI evaluation and benchmarking systems
- Analyze how models and agents perform across real-world use cases
- Develop frameworks, datasets, and metrics to measure AI capabilities
- Translate complex system behavior into clear, actionable insights
- Work closely with engineers, product teams, and external partners
- Contribute directly to product direction and overall strategy
What Matters Most
- Strong Python proficiency with recent, hands-on production work
- Experience with data analysis and building analytical frameworks
- Ability to operate as a technical generalist across systems and domains
- Clear, structured communication — you translate complexity into insight
- High ownership and comfort operating in ambiguous environments
- Product-minded software engineers with meaningful AI exposure
- Engineers who’ve built systems involving LLMs, pipelines, or automation
- Technical professionals from top-tier consulting with real coding ability
- Founding or early engineers with broad, cross-functional ownership
The following profiles are unlikely to thrive in this role.
- Focused exclusively on training models or academic research
- Removed from hands-on coding and implementation work
- Purely management-focused without technical execution
Why This Role Is Different
The work here shapes the frameworks and metrics the industry relies on to understand model and agent behavior — a rare, foundational problem space.
You’ll work with frontier models and real-world deployment contexts that most engineers don’t have visibility into.
The team is expected to scale rapidly. Joining now means meaningful equity upside and the ability to shape how the function is built.
This isn’t a pure engineering role or a pure strategy role. It’s both — for someone who can operate at that intersection.
