跳至主要內容

Be advised, we are currently experiencing a maintenance outage that may inhibit your ability to apply to some of our jobs. If the role you are interested in is impacted, please check back later today.

Staff Site Reliability Engineer (Staff SRE)

工作 ID 10135906 地點 溫哥華, 加拿大 有意工作的公司 Walt Disney Animation Studios 日期已公佈 Oct. 31, 2025
申請

工作概要:

Walt Disney Animation Studios’ world-class filmmakers, artists, and technical collaborators create the magic of animation. Bring your unique talents, passion and ideas to our team and prepare to play in a creative, artist-friendly environment.

We are seeking a Staff SRE with expertise in systems administration skills in Linux platforms, and also has experience with software development (e.g. Python, Go, Java, Node), CI Pipeline tools (e.g. Jenkins), Git source management, cloud hosting (AWS, GCP & Azure), container computing (e.g. Docker, OCI), and web technologies. The ideal candidate will enjoy the diversity and challenges of working at various levels in the foundational deployment stack, from defining configuration management, to developing CI/CD infrastructure and processes.

This role resides within the Platform and Infrastructure team at Walt Disney Animation Studios (WDAS), and we build the tools and manage the infrastructure that artists use daily to create our celebrated animated content. The SRE team within Platform Engineering is focused on optimizing service deployments  and improving the availability, latency, performance, efficiency, and observability of systems at WDAS. All projects have in common pursuit of simple and performant solutions to complex problems using Agile and DevOps methodologies as part of high-energy, proficient teams. 

Critical to success in this role is an aptitude for working collaboratively with a technical team. You will help to develop and drive requirements and strategies while also supporting services and core services infrastructure.

Our studio thrives from a wide variety of technical backgrounds and experiences, so we encourage applicants to apply even if they have experiences not specified below. Bring your unique talents, passion and ideas to our team, and be a part of Disney’s creative legacy! 

 

Responsibilities

As Staff SRE, you will translate ideas into tangible products that shape experiences by focusing on a systematic approach to automation, resiliency, efficiency, stability, security, performance, and capacity management, as well as documentation. You will serve as a subject matter expert in multiple areas and be looked at by your fellow team members as a 'go to' individual; you are someone who has a clear understanding of, and can thoroughly elaborate on SRE principles and best practices to a given audience. To be successful in this role you will continuously uphold and improve all the relevant reliability aspects for our services, with an increased focus on SLIs and SLOs, while raising the reliability of a variety of large scale user facing and internal services. As Staff SRE, you will maintain a strong understanding of stakeholder workflows and requirements, and then be able to translate the targeted solutions into an end-to-end architectural design.

 

You will work with engineering, creative and production teams in an extremely collaborative and high-energy environment to brainstorm, architect, gather requirements, troubleshoot, and provide stellar customer support.  You are passionate about constantly learning, applying technology to solve complex problems, and is a highly motivated, optimistic, proactive, creative thought leader and project manager.

Additional Responsibilities Include:

  • Support a wide range of on-premises and cloud deployments  using infrastructure-as-code, self-healing, and security automation patterns and can facilitate others to use the Infrastructure as Code paradigm

  • Deploy and manage a wide array of on-premises and cloud deployments 

  • Develop useful telemetry, alerts, and response to reduce Mean Time To Repair (MTTR).

  • Collaborate and provide technical excellence within and across teams.

  • Consult on best practices and develop tools to enable smooth adoptions of good service reliability practices and methods.

  • Identify areas of improvement in reliability, efficiency, and operations.

  • Build tools to help your SRE team quickly pinpoint, isolate and resolve issues related to infrastructure, platform services and applications.

  • Continuously refine monitoring processes, configurations, and thresholds.

  • Practice and promote sustainable incident response and blameless postmortems

  • Develop runbooks and tools to streamline processes and shorten problem resolution time.

  • Write code that improves scalability, performance, maintainability, and security.

  • Add, tune and maintain alert configurations and documentation as needed.

  • Develop and improve CI/CD processes to improve release cadence and success.

  • Use Chaos Engineering principles and methodologies to test what you build under real-world conditions.

  • Mentor SREs, Sysadmins, and Systems Engineers  in technical and non-technical SRE responsibilities.

Required Education

  • BS in Computer Science, Computer Engineering, Electrical Engineering or related field

Key Qualifications:

  • 7+ years of experience in SRE, devops, technical operations, systems engineering, software engineering or related discipline

  • Proficient, collaborative, & experienced in building reliable, scalable, enterprise systems

  • Excellent communication skills, both verbal and written

  • Passionate and curious about ways to leverage technology while continually learning

  • Efficiently skilled with the use of containers and container orchestration systems  in enterprise production environments (e.g. Docker, Kubernetes, Rancher, AWS ECS and EKS)

  • Experience with configuration management and infrastructure as code  (e.g. Terraform, Helm, Cloud Formation, Ansible, Puppet, and Ansible)

  • Comfortable in one or more of the following languages (Python, Java, Scala, Go, Rust, Ruby, or similar)

  • Skilled in Cloud/PaaS/SaaS Environments (e.g. AWS, Azure, Google Cloud Compute)

  • Hands-on experience using source control (Git, GitHub) and feature branching strategies

  • Experience with continuous integration tools (e.g. Jenkins, Gitlab CI/CD, AWS CodeBuild, CodeDeploy, Spinnaker)

  • Knowledge of best practices and IT operations in an always-up, always-available service

  • Possess expertise in scalable testing, automation, continuous integration frameworks and best practices

  • Experience in SDLC, distributed systems, networking, hardware, logistics and operations or capacity planning

  • UNIX/Linux administration, troubleshooting, performance tuning, and security

  • Experience with DevOps methodologies and/or SRE

  • Experience with monitoring and observability tooling such as Datadog, Prometheus, and Grafana

  • Experience with automating infrastructure, deployment and testing using tools like Cloudformation, Ansible or Terraform.

  • Experience with Service Level Objectives and Error Budgets

  • Understanding of the principles and methodologies behind Chaos Engineering

Bonus Qualifications:

  • Expertise in web server administration

The Walt Disney Company is an Equal Opportunity Employer.


The hiring range for this position in British Columbia, Canada is C$124,200 to C$166,700 CAD per year. The base pay actually offered will take into account internal equity and also may vary depending on the candidate’s geographic region, job-related knowledge, skills, and experience among other factors. A full range of medical, financial, and/or other variable pay or benefits, may be offered dependent on the level and position offered.

申請

關於Walt Disney Animation Studios:

將精湛的藝術和故事與突破性技術融合在一起,Walt Disney Animation Studios 是由一位電影製作人驅動的動畫工作室,負責創作一些最受歡迎的電影。從 1937 年第一部全動畫故事片 Snow White and the Seven Dwarfs,即將推出的 2024 年秋季功能, Moana 2,Disney Animation 將源源不絕的創新與創意延續傳承。這個動畫工作室的不朽創作包括 Pinocchio、Sleeping Beauty、The Jungle Book、The Little Mermaid、The Lion King、Frozen、Big Hero 6、Zootopia 以及 Encanto。

關於 The Walt Disney Company:

Walt Disney Company 連同其子公司和聯營公司,是領先的多元化國際家庭娛樂和媒體企業,其業務主要涉及三個範疇:Disney Entertainment、ESPN 及 Disney Experiences。Disney 在 1920 年代的起步之初,只是一間卡通工作室,至今已成為娛樂界的翹楚,並昂然堅守傳承,繼續為家庭中每位成員創造世界一流的故事與體驗。Disney 的故事、人物與體驗傳遍世界每個角落,深入人心。我們在 40 多個國家/地區營運業務,僱員及演藝人員攜手協力,創造全球和當地人們都珍愛的娛樂體驗。

這個職位隸屬於 Walt Disney Pictures,其所屬的業務部門是 Walt Disney Animation Studios。

招聘流程

  • 您的故事從哪裡開始?

    探索 Disney 職位空缺和 The Life at Disney 網誌,了解華特迪士尼公司有待發掘的所有精彩機會。

  • 迪士尼的故事裏,有你更精彩成就迪士尼故事

    有許多不同品牌和業務可供探索。當您找到適合您的機會後,請填寫您的申請,進行下一步。

  • 下一章

    申請後,您將收到一封電子郵件,讓您可存取應徵者控制面板。建立您的登入資料,並確保經常檢視您的控制面板,以查看申請進度。

探索此地點 US & Canada

The Walt Disney Company 運用精采故事的非凡力量,為世界各地獻上頂級娛樂、豐富資訊及靈感啟發,締造出使我們成為全球頂尖娛樂公司的知名品牌、創意理念及創新科技。

我們的文化

相關內容

  • 行政領導

    我們的高級主管為公司的日常營運帶來了豐富經驗、遠見思維和對卓越、創意和創新的共同承諾。

    了解更多 
  • 多元、公平與包容

    在 Disney,我們致力於創造一個更美好的世界。整個世界充滿歸屬感,讓每人都覺得備受重視、傾聽和理解。整個世界滿載希望和承諾。

    了解更多 

登記收取職缺通知

即時收到最新的工作機會的資訊。

關注我們的職位

星號表示必填欄位。

興趣要求從選項列表中選擇工作類別。從選項列表中選擇工作地點。最後,點擊「添加 (Add)」以建立你的職缺通知。

一經建立帳戶,即代表本人同意使用條款(在新視窗中開啟),並確認已閱讀私隱政策(在新視窗中開啟)

一經點擊「提交」,即同意我們的使用條款(在新視窗中開啟),並確認已閱讀我們的私隱政策(在新視窗中開啟)。如果本人選擇接收營銷訊息或電子通訊,本人可以隨時撤回對這些營銷訊息的同意。

一經點擊「提交」,即同意我們的使用條款(在新視窗中開啟),並確認已閱讀私隱政策(在新視窗中開啟)Cookie 政策(在新視窗中開啟)歐盟私隱權內容(在新視窗中開啟)

我們如何使用您的個人資料以及您的權利:

  1. 你的個人資料由 The Walt Disney Company Limited 控制,公司地址為:3 Queen Caroline Street, London, W6 9PE, United Kingdom。
  2. 當你遊覽 Disney、在 Disney 購物或使用任何 Disney 產品、服務或流動應用程式,The Walt Disney Company Family of Companies 亦可能使用你的資料,以向你提供此等服務、度身定制你的體驗,並向你發送有關服務的最新消息及通訊資料。
  3. 你擁有多項權利,包括有權要求存取、更改或移除你的個人資料,或更改你的營銷偏好設定(包括隨時撤回同意)。請參閱我們的私隱政策(在新視窗中開啟),以進一步了解如何管理你的營銷偏好設定或刪除你的帳戶。
  4. 如欲聯絡我們的資料保護專員,可發送電郵至:dataprotection@disney.co.uk
  5. 你有權向英國資訊專員的辦事處投訴:https://ico.org.uk/(在新視窗中開啟)
  6. 有關 Disney 資料收集和使用方式的更多資料,請見 Disney 的私隱政策(在新視窗中開啟)

點擊「提交」,即表示你同意我們的使用條款(在新視窗中開啟),並確認你已經閱讀我們的私隱政策(在新視窗中開啟)收集聲明(在新視窗中開啟)

如要進一步了解我們的一般資料收集、用途及做法,包括如何管理你的喜好設定,請參閱我們的私隱政策(在新視窗中開啟)。本人已閱讀和同意使用條款(在新視窗中開啟)

Privacy Policy Agreement

Privacy Policy Agreement

Privacy Policy Agreement

Privacy Policy Agreement

Privacy Policy Agreement