Full Time Job

Senior Software Engineer - Site Reliability Engineering

Warner Bros. Discovery

Bangalore, India 03-21-2024

Apply @ Employer

Details

Paid
Full Time
Senior (5-10 years) Experience

Job Description

Senior Software Engineer - Site Reliability Engineering (SRE)

Senior Software Engineer, Site Reliability Engineering, Bangalore, India.

About Warner Bros. Discovery:

Warner Bros. Discovery (WBD), a premier global media and entertainment company, offers audiences the world's most differentiated and complete portfolio of content, brands, and franchises across television, film, streaming, and gaming. The new company combines WarnerMedia's premium entertainment, sports, and news assets with Warner Bros Discovery's leading non-fiction and international entertainment and sports businesses.

For more information, please visit www.wbd.com

Meet Our Team:

DTC's many software engineering teams build applications for the web, mobile, tablets, connected TVs, consoles, and other streaming devices. DTC's Site Reliability Engineering (SRE) teams are responsible for ensuring these systems stay up and operational. Warner Bros. Discovery's approach to SRE is proactive vs. reactive. The team focuses on building and supporting prevention systems and processes over incident response. SRE builds systems that assess Warner Bros. Discovery's resilience risk posture, enable auto-remediation and failover, and partner directly with service development teams to ensure the robustness of their architectures.

We are a fast-growing, global engineering group crucial to Warner Bros. Discovery's future. We need a technical lead who is comfortable turning the needs of our engineering community into thoughtful design and architecture, serving as a senior resource for other developers as they build software components. The SDE II needs to be both a strong individual contributor in coding and a technical mentor within their team for all things SRE.

The SDE II needs to be a strong individual contributor to code, automate and implement solutions to improve software reliability and scalability.

Roles & Responsibilities:
• You are a senior software engineer focusing on development and implementation of SRE capabilities to improve reliability and scalability of application services through load testing and fault injection testing/chaos engineering.
• You will work with service engineering teams to understand customer journeys, their impact to business and implement practices and develop self-heal capabilities to reduce MTTD and MTTR.
• You will work with the Platform engineering teams to develop and integrate appropriate reliability and scalability governance into the workflows.
• You will work with the Load Testing team to build improvements to the Load Testing platform and ensure stability of the platform.
• You will own the development and implementation of automation to reduce operational toil.
• You have made significant contributions to high-performance, stable, scalable systems that have been successfully shipped to customers in production.
• Ensure uptime and performance at peak workflow hours by assisting with system capacity planning and ensuring appropriate instance type allocation.
• You can describe your design, code, and architectural decisions within your team and local leadership. You embrace constructive feedback from these groups and adapt to align with standards and objectives.
• You offer enhancements to best practices, document, design, and architecture improvements, and as feedback through code reviews to other engineers without continuous prompting or recurring encouragement. Other engineers appreciate your feedback.
• You can troubleshoot a production issue by reviewing source code, logs, operational metrics, stack trace etc. to pinpoint a specific problem and then resolve it.

What to Bring:
• 5+ years of software engineering experience. Proficiency in technologies on our stack including containerized Kubernetes-based microservice architectures, messaging and DB services, API gateways, service mesh, repeatable self-serve gitops-based deployment pipelines and infrastructure-as-code technologies/processes.
• You have extensive experience in working on Kubernetes platform.
• You have strong experience in development of tools and automation using Python/JAVA.
• You have strong experience working with service engineering teams and have developed and implemented solutions to improve deployment quality, software reliability and performance engineering for large scale applications.
• You have strong experience in troubleshooting applications and infrastructure issues in production environment and have solid exposure to industry standard observability practices and tools.
• You have worked with the platform/infrastructure teams to develop and integrate solutions into their workflows.
• You have contributed to developer productivity through automation and building self-serve tools.
• You have supported systems operated at large scale with a global footprint.
• You have contributed to systems and frameworks that validate the scalability of large-scale systems (load/stress and fault-injection testing).
• You have experience establishing and raising the bar for technical best practices for your team.
• You have acquired subject-matter expertise over one or more production software systems and demonstrated a pattern of shipping high-quality feature changes in that or those system(s).
• You have strong experience supporting large scale applications on public cloud infrastructure.
• You solve scalability and reliability problems with simple and straightforward solutions, applying appropriate technologies and software engineering best practices.
• You deliver high-quality results the first time and improve code, documentation, and results with each iteration. Your team trusts your work.

What We Offer:
• A Great Place to work
• Equal opportunity employer
• Fast track growth opportunities

If you're a qualified candidate and you require adjustments or accommodations to search for a job opening or apply for a position, please contact us at recruitadmin@wbd.com.

Jobcode: Reference SBJ-d9yomz-18-216-34-146-42 in your application.

Full Time Job

Senior Software Engineer - Site Reliability Engineering

Warner Bros. Discovery

Job Description

Find More Jobs Like This

Location

Similar Listings

Senior Software Engineer - Kafka

Manager, Software Engineering - Kafka

Manager, Software Engineering- Backend Development

Principal Engineer- Backend