About Anthropic
Anthropic’s mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole. Our team is a quickly growing group of committed researchers, engineers, policy experts, and business leaders working together to build beneficial AI systems.
About the Role
As the Engineering Manager for Agent Platform at Anthropic, you’ll lead the strategic development of Claude’s autonomous capabilities, managing teams that are fundamental to expanding how Claude handles complex, multi-step workflows. You’ll oversee the infrastructure that enables Claude to acquire specialized capabilities and domain expertise, as well as the evaluation frameworks that measure and improve Claude’s performance on extended, agentic tasks.
This role sits at the intersection of cutting-edge AI research and practical product development. You’ll partner with enterprise customers and research teams to translate breakthrough capabilities into production-ready features, while building the systems that ensure quality and safety at scale. Your teams will enable both Anthropic and our customers to extend Claude’s capabilities for sophisticated real-world tasks.
The Agent Platform group is part of Frontier Apps, focused on productizing research breakthroughs and enabling Claude to tackle increasingly sophisticated activities. You’ll work closely with research teams developing foundational infrastructure, ensuring smooth productization pathways from research to customer-facing features.
Responsibilities
- Engineering Leadership & Team Management
  - Lead and grow multiple high-impact engineering teams focused on agent capabilities
  - Partner with technical leads to set technical direction, prioritize roadmaps, and ensure delivery of high-quality, scalable systems
  - Foster a culture of rapid iteration, technical excellence, and cross-functional collaboration
  - Manage team processes, from planning to implementation to incident response

- Product & Strategic Execution
  - Drive the productization of agent capabilities, enabling both internal teams and external customers to use Claude in more powerful ways
  - Work with cross-functional teams, customers, and partners to bring your teams' work to market and ensure adoption of new functionality
  - Evolve evaluation infrastructure to support continuous quality measurement throughout model development and deployment
  - Partner with enterprise customers across financial services, life sciences, and other verticals to identify high-value use cases and translate them into reusable capabilities

- Cross-Functional Collaboration
  - Collaborate with API and platform teams to define strategies for broad access to new capabilities, performance, and enterprise governance
  - Partner closely with research teams on alignment, capability evaluation, and productization of experimental features
  - Work with Product teams to ensure agent infrastructure supports the broader product roadmap
 
To apply for this job, please visit job-boards.greenhouse.io.
