You are using the web browser we don't support. Please upgrade or use a different browser to improve your experience.
"icon arrow top"

Senior DevOps Engineer - Kubernetes


Bethesda Softworks



United States

Programming and Development




Mid-Senior Level

Permanent | Full Time


Senior DevOps Engineer - Kubernetes
Bethesda Softworks - Austin @ Austin, TX, US

Bethesda Softworks seeks a Senior DevOps Engineer with extensive Kubernetes experience to work in a team environment that will help build, automate, and monitor the infrastructure platform for our games. Your mission will be to help make possible an operating environment for our development teams that provides the very best gaming experience to our players no matter where in the world they choose to play.
The ideal candidate is a technology generalist (who may have demonstrated significant specialization in previous positions) with deep knowledge of Kubernetes both in the datacenter and the cloud who can comfortably and confidently work with and/or lead an experienced team comprised of veterans from both inside and outside of the game industry.

You must have:

  • At least 7 years of experience keeping online production environments operating at peak performance on bare metal, in the cloud, and in containers with at least 3 years of that experience focused on DevOps and service-based systems engineering.
  • At least 2 years of experience managing production Kubernetes infrastructure with exposure to the use of cloud vendor Kubernetes solutions such as EKS, AKS, and GKE.
  • Extensive experience automating system tasks and infrastructure using a scripting language (Python and Bash preferred), with configuration management tools (Salt, Chef), service registry and discovery tools (Consul and Vault, etcd), infrastructure orchestration tools (Terraform, CloudFormation), and automated imaging tools (Packer)
  • Experience automating public cloud deployments (AWS, Azure, Google Cloud).
  • Experience using and optimizing monitoring and trending systems (Prometheus, Grafana), log aggregation systems (ELK, Splunk), and their agents.
  • Deep understanding of the Software Development Life Cycle including Continuous Integration and Continuous Deployment (CICD) pipeline architecture (Jenkins, GitLab, Drone) including deployments into Kubernetes environments using tools such as Flux or Keel.
  • Desire to be part of an incredible team. That means being someone the team can count on, who communicates honestly and respects others and their views, who adapts quickly to changing priorities, drives positive change, and is committed to giving 110%.
  • True passion for video games because that’s why we do what we do.
  • Prior experience in a 24/7 on call rotation supporting business critical applications or systems.
You will do this as part of a team of system administrators, network engineers, and database analysts building an IaaS (Infrastructure as a Service) platform. You will also work closely with other engineering teams to better understand their software enabling you to adapt the platform to their needs.

As part of this self-guided team, you will have total ownership of the solutions we provide for our customers making availability, performance, security, and reliability critical to our success. We need to do this all at global scale, as well, so continual, iterative improvements in our process, technology, tools, and automation will be essential to allowing us to meet those goals as well as focus on the things that add value for our players.
Preferred Skills
  • Experience building and managing online game infrastructure
  • Experience with alternative Prometheus data stores like Cortex, Thanos, or M3DB
  • Experience architecting, deploying, and managing private cloud infrastructure (OpenStack, Eucalyptus)
  • Experience programming in additional languages like Go, JavaScript, etc.
  • Experience with developing microservice-based HTTP applications and servers
  • A passion for security tools and best practices: OS hardening, vulnerability scanning, system auditing
  • Experience performing full stack load testing in order to identify and address hardware, network, system, or application bottlenecks
  • Networking experience both within Linux and with Cisco or Juniper network routers and switches

Keep Calm and waka waka...