Senior Software Engineer - Telemetry Engineering (metrics)
Who We Are
We deploy hundreds of microservices across multiple regions, run tens of thousands of instances, and have millions of client devices with our software installed. That's a lot of infrastructure and software working together to give customers an end product they love to use. Ensuring availability and reliability at that scale is a task our amazing Netflix engineers take on through full operational ownership of their software.
Our team, Telemetry Engineering, builds software to provide real-time operational observability to thousands of engineers and teams across Netflix. This requires collecting, streaming, and persisting operational data, HUGE amounts of it, and making that data accessible through flexible APIs and visualizations.
Our team is looking for a Senior Distributed Systems/Database Engineer to help build and manage our metrics infrastructure, which is built on our very own open-source time-series database, Atlas. We currently collect up to 20 billion time series per day and expect that number to grow significantly as Netflix scales from 200 million members today to our next milestone of 500 million members.
This is an opportunity to work on one of Netflix's largest and most critical distributed systems, and solve deep technical challenges around how to design and scale our database and metrics infrastructure cost-effectively. If designing and implementing high-performance data systems that are highly critical to the entire company excites you, this is the role for you!
You will also have a lot of ownership and freedom in this role, thanks to Netflix's unique culture of Freedom & Responsibility. You will solve hard database and distributed systems problems and see the effect of your changes in production at massive scale.
What You Bring to the Table
• Proven expertise in building and operating performant, large-scale databases
• Proven expertise in building and operating scalable distributed services for real-world use cases
• Deep experience using JVM technologies such as Java and Scala
• Experience handling time-sensitive and/or large data sets
• Experience with observability is a plus
• Proven ability to collaborate closely with other engineers on the team
• A customer-focused attitude
Sharing Is Caring
In this group, you'll have a chance to create software that is state of the art and foundational. Because of Netflix's desire to share technology and concepts, you'll be in the rare position of both working on this and sharing this knowledge with your peers outside Netflix. We believe this is unique to Netflix, and if it sounds amazing to you, we should talk.
More Background on our Work
A recent talk on what we do as a team:
AtlasDB, Netflix's open-source Time-Series Database
A recent presentation on how we use Atlas/Metrics at Netflix: https://docs.google.com/presentation/d/1nfHw4GGI5dZ31cPD8IR59V8g34rikXjc1TtkpNiWLLs/edit
Jobcode: Reference SBJ-gpm1zx-35-172-136-29-42 in your application.