company_logo

Full Time Job

Machine Learning Operations Engineer

Penguin Random House

New York, NY 04-04-2022
 
  • Paid
  • Full Time
  • Mid (2-5 years) Experience
Job Description
Penguin Random House wouldn't perhaps be the first company that comes to mind when you think of a career in Technology but here's a number of reasons why we think you should change your mind: we're the number 1 Publishing firm in the USA, likely globally, fresh off the back of an incredibly strong performance last year; we're going through a lot of change, particularly in technology, and have embraced and thrived in an environment new to us; we consider ourselves to be the pioneers of Publishing and are always looking for ways to operate more efficiently with creativity in mind; we feel like we're akin to a startup environment, but within an established business that's excited to embrace new processes and technologies – you have all the fun and excitement within a company that's tried and tested!

Data is at the core of Penguin Random House's success and has fast become an integral part of how we drive our business model. As the number 1 largest company in the Publishing industry, and off the back of an incredible performance in 2021, we're adding several positions to the Data Science team, starting with a Machine Learning Operations Engineer.

As a Machine Learning Operations Engineer on the team, and reporting into a director, you will be responsible for designing, building and maintaining our machine learning deployment infrastructure supporting the Data Science team. We're proud to be able to say that we're at the start of an exciting new stage within our journey, where you'll gain a LOT of exposure to more parts of the business than most in what is a highly visible group. Our directors report into the Vice President, who reports into the COO, so you'll regularly be exposed to C-level positions!

You will ideally come with a collaborative, R&D-oriented, and analytical mindset. The role emphasizes written communication and a continuous documentation of learnings, as well as the ability to convey complex technical results to a nontechnical audience.

Responsibilities:

At Penguin Random House, you will:
• Collaborate with data scientist(s) to create and implement testing strategy to test model readiness for deployment
• Optimize and package code written by data scientists for model deployment
• Deploy ML models, creating and orchestrating automated deployment and retraining pipelines where appropriate
• Manage and monitor ML models leveraging MLOPs best practices
• Continuously shorten the analytic product lifecycle without compromising accuracy

Requirements:
• 3-5+years of experience with containers (Docker, Kubernetes), CI/CD tools (Jenkins, Travis, Circle, etc.) and Git
• Experience designing and deploying end to end ML workflows and 4+ years of professional experience programming in R or Python
• Aptitude with shell scripting, debugging tools, containerized environments, and any flavor of Linux
• Familiarity with automated feature engineering, data imputation, and working with large datasets
• Ability to communicate complex technical concepts to a business audience
• Data munging skills; experience working with relational databases and fluency in SQL

Full-time employees are eligible for our comprehensive benefits program. Our range of benefits include, but are not limited to, Medical/Prescription drug insurance, Dental, Vision, Health Care/Dependent Care Flexible Spending Account, Health Savings Account, Pre-Tax and Roth 401(k), Short and Long-Term Disability Insurance, Life/AD&D Insurance, Commuter Benefits, Student Loan Repayment Program, Educational Assistance & generous paid time off.

Jobcode: Reference SBJ-gm1bwx-3-144-251-72-42 in your application.

Company Profile
Penguin Random House

Penguin Random House is the leading adult and children’s publishing house in North America, the United Kingdom and many other regions around the world. In publishing the best books in every genre and subject for all ages, we are committed to quality, excellence in execution, and innovation throughout the entire publishing process: editorial, design, marketing, publicity, sales, production, and distribution.