Full Time Job

Engineering Director, Cloud Operations

Epic Games

Amsterdam, Netherlands 09-12-2022
 
  • Paid
  • Full Time
Job Description
Take a look at the Epic Games on AWS case study here.

ENGINEERING - INFRASTRUCTURE

What We Do

We enable Epic's online service teams to build, deploy, and manage services that are used by more than half a billion players around the world. Our mission is to provide world-class tools and platforms that improve our developers' experience and make it easier, faster, and safer to build, operate, and scale their applications. We operate at massive scale as one of the largest cloud computing users in the world.

What You'll Do

Your primary responsibility will be leading our Cloud Operations team, with a focus on building a strong team with a growth mindset, setting and holding a high bar for quality, providing context and vision, and delivering impactful results. You will participate in strategic forums directing the future of Epic's cloud services. You will also provide operational engineering expertise to the Cloud Operations and Data organization as we navigate the problems typical of a fast-growing, successful online business, with an inevitable long tail to pull in as growth continues.

In this role, you will:
• Hire, grow, and retain a world-class Cloud Operations organization staffed with top-notch engineers whose expertise spans infrastructure, systems engineering, software engineering, service resilience, and observability.
• Work with Epic application development leadership in an operations engineering capacity focused on scalability, performance, failure modes, graceful degradation and other operational characteristics of services and applications in production environments.
• Lead discussions on infrastructure service roadmaps and solution designs, as well as service incidents and problem resolution.
• Identify the right measurements to serve as KPIs that guide teams toward successful outcomes across all of their efforts.
• Provide technical leadership across Epic for Cloud Operations patterns, best practices, solutions, and processes, with the goal of reducing cycle-time friction and improving overall application developer productivity.

What We're Looking For
• Prior experience building and managing global teams in a cloud environment.
• Strong understanding of AWS system architecture patterns and best practices for building high-performance services that adapt to operating conditions and maintain runtime operations efficiently, including resilience, fail fast, repairability, high availability, disaster recovery, and data protection.
• In-depth understanding of the HTTP request-response cycle, including related technologies and protocols such as DNS, CDN caching, TCP/IP fundamentals, API gateways, and load balancing, and how they relate to network design.
• Experience applying an agile methodology at organizational scale, with a focus on dependency management and unblocking delivery.
• Development experience (Go, Python, JavaScript, Java), progressing towards ownership of the subsequent SDLC and toolchains, including but not limited to CI and CD pipelines.
• Systems engineering experience in SOA, microservice, or event-driven environments.
• Experience participating in compliance programs, from translating controls into real-world production environments to participating in audits and helping resolve findings that block progress or compromise the outcome of the audit.

Jobcode: Reference SBJ-rnjvw5-18-216-94-152-42 in your application.

Company Profile
Epic Games

Founded in 1991, Epic Games is a leading interactive entertainment company and provider of 3D engine technology. Epic operates Fortnite, one of the world’s largest games with over 350 million accounts and 2.5 billion friend connections. Epic also develops Unreal Engine, which powers the world’s leading games and is also adopted across industries such as film and television, architecture, automotive, manufacturing, and simulation.