Staff Software Engineer, Infrastructure
Docker · Remote — Worldwide
Job Description
Docker is a globally recognized brand in developer tooling, trusted by millions of developers worldwide. We are building the future of software development and delivery, with a strong focus on AI-assisted workflows and secure, reliable infrastructure. As we launch new products and expand our platform, we are investing heavily in the underlying systems that support hundreds of engineers and high-scale production traffic.
We are seeking a Staff Software Engineer to join our remote-first infrastructure team. This role is critical in evolving our platform from expert-driven support to robust, self-service systems that empower our development teams. You will play a key role in setting technical direction, driving adoption of new standards, and ensuring our platform is reliable, secure, and scalable.
About the Role
This is a Staff-level position where your impact will be measured by your leverage and ability to influence technical direction. You will be hands-on with the codebase while also leading strategic initiatives, establishing pragmatic standards, and driving platform investments to successful adoption.
Key Responsibilities
- Translate ambiguous infrastructure challenges into actionable proposals, driving them through RFCs and cross-team architecture reviews.
- Design and implement self-service capabilities and platform APIs, primarily in Go, for onboarding, provisioning, deployment, observability, and day-2 operations.
- Define and implement delivery standards using Terraform and GitOps with Argo CD, including building robust continuous deployment pipelines with testing and progressive rollout strategies.
- Enhance our multi-tenant EKS foundations for improved reliability, security, scale, and cost-efficiency, including managing ingress with Envoy Gateway and evolving multi-region, cross-account networking.
- Improve Service Level Objectives (SLOs), alerting, and incident follow-up processes, potentially using tools such as Grafana Cloud, so that production stability relies less on heroic efforts.
- Contribute to the development and integration of AI-assisted workflows for operational tasks, focusing on safety, auditability, and human review.
- Participate in the on-call rotation and actively work to improve the on-call experience through better tooling, runbooks, and blameless postmortems.
Qualifications
- 8+ years of professional software engineering experience in backend, infrastructure, or platform engineering.
- Strong software engineering skills in Go or a similar language, with a focus on design, testing, debugging, and maintainability.
- Proven track record of designing, shipping, and operating cloud services or infrastructure platforms in production.
- Deep expertise in at least one of the following areas: Kubernetes, cloud platforms, reliability engineering, or developer platforms, complemented by solid Linux, networking, and production operations fundamentals.
- Demonstrated experience in setting technical direction and leading work that requires cross-team alignment.
- Excellent written and verbal communication skills, essential for a remote environment.
- Bachelor's degree in Computer Science, Engineering, or a related field, or equivalent practical experience.
Nice to Have
- Experience with EKS, ingress controllers, CNIs, or service meshes.
- Proficiency in observability tools (OpenTelemetry, Prometheus, Grafana).
- Experience with CI/CD and progressive delivery (e.g., GitHub Actions, Argo CD).
- Experience leading migrations or adoption programs across multiple teams.
What We Offer
- Freedom and flexibility to fit your work around your life.
- Designated quarterly "Whaleness Days" and an end-of-year break.
- Home office setup allowance.
- 16 weeks of paid Parental Leave (after 6 months of employment).
- Technology stipend equivalent to $100 USD net per month.
- Generous Paid Time Off (PTO) plan.
- Training stipend for conferences, courses, and classes.
- Equity in a growing startup.
- Docker Swag.
- Medical benefits, retirement plans, and holidays, which vary by country.
- A remote-first culture with offices in Seattle and Paris.
Docker is committed to diversity and equal opportunity. We strive to build a team that represents a variety of backgrounds, perspectives, and skills.