Site Reliability Engineer 5 - Live Encoding SRE
USD 388kβ558k
Job Description
At Netflix, our mission is to entertain the world. Together, we are writing the next episode - pushing the boundaries of storytelling, global fandom and making the unimaginable a reality. We are a dream team obsessed with the uncomfortable excitement of discovering what happens when you merge creativity, intuition and cutting-edge technology. Come be a part of whatβs next.
About the role
In this role, you will support our live streaming pipeline team and day-to-day live-streaming operations for Netflix. As a Live Streaming Pipeline SRE, you will be responsible for the reliability of our live streaming pipeline (transmission, encoding, packaging, origin). Instrumenting end to end observability and visualizing the data to achieve the desired availability at scale.
Working with cross functional teams in the preparation, validation, and execution of live streaming focused initiatives.You will impact multiple areas of the live event lifecycle, from the planning phase through testing and event launch days. You will be leading innovation initiatives, driving new features that will enhance our live streaming services, encoding & content delivery.
Responsibilities
Drive continual improvement in resilience, observability, monitoring, instrumentation, and automation with the primary goal to maintain highly scalable and reliable services worldwide
Implement, automate, execute, and analyze the results from a broad range of live streaming delivery focused functional, performance, resilience, and fault injection testing
Coordination, collaboration, and partnership across multiple stakeholders for the smooth execution of live-streaming events
Aggregate, analyze, and correlate large amounts of server and application performance data. Use the innovative Netflix Big Data platform as a highly flexible, specialized and efficient toolset for service delivery optimization and system reliability improvements
Participate in an on-call rotation and be able to work with flexible hours based on the live events schedule
Qualifications
5+ years service reliability/operational experience running large scale, high performance systems & internet services with focus on live-streaming and video-on-demand (VOD) delivery
Experience with video transport protocols such as RTP, RTMP, SRT, UDP, Zixi, RIST, HLS, MPEG-DASH
Knowledge of and proven experience with HTTP cache/proxy technologies. Experience supporting live-streaming delivery at scale
Expert-level knowledge of Unix or Linux system engineering fundamentals (networking, storage, operating systems) at scale.
Proficient understanding of networking principles, transport, and application protocols, especially TCP/IP, BGP, DNS, TLS, and HTTP/S
Experience with using distributed analytic processing technologies (Hive, Presto/Trino, Spark SQL, etc)
Proficient in a programming language such as Python or Go
Ability to work in a highly collaborative environment and to communicate effectively with internal and external partners
Preferred - B.S. in Computer Science, Electrical or Computer Engineering (or equivalent professional experience)
Netflix provides comprehensive benefits including Health Plans, Mental Health support, a 401(k) Retirement Plan with employer match, Stock Option Program, Disability Programs, Health Savings and Flexible Spending Accounts, Family-forming benefits, and Life and Serious Injury Benefits. We also offer paid leave of absence programs. Full-time hourly employees accrue 35 days annually for paid time off to be used for vacation, holidays, and sick paid time off. Full-time salaried employees are immediately entitled to flexible time off. See more details about our Benefits here.