We are looking for a Senior DevOps Engineer to lead the design, automation, and scaling of our hybrid cloud infrastructure spanning public cloud and private/on\-premises environments. You will partner closely with software engineering, security, and product teams to build reliable, secure, and high\-performance systems that support rapid product delivery. This is a hands\-on role with significant influence over our infrastructure strategy, deployment workflows, and engineering culture.
*Key Responsibilities**
Architect, deploy, and maintain scalable, highly available infrastructure across both public cloud (AWS, Azure, GCP) and private cloud platforms (OpenStack, VMware vSphere/Tanzu, Nutanix, or similar).
Operate and maintain on\-premises infrastructure: hypervisors, compute, storage (Ceph, NetApp, SAN/NAS), networking (SDN, VLANs, BGP, MPLS), and hardware capacity planning, alongside their public cloud equivalents.
Design and own CI/CD pipelines that deploy seamlessly across public and private environments.
Implement and manage Infrastructure as Code (Terraform, Ansible, Pulumi) with strong version control and review practices, using providers for both public and private cloud platforms.
Manage container orchestration (Kubernetes, ECS, OpenShift, Rancher) across managed cloud services and self\-managed/bare\-metal clusters, including upgrades, autoscaling, and workload reliability.
Build observability into all systems through logging, metrics, tracing, and alerting (Prometheus, Grafana, Datadog, ELK, or similar) with unified visibility across hybrid environments.
Champion security best practices: secrets management, IAM hardening, network segmentation, vulnerability scanning, and compliance (SOC 2, ISO 27001, HIPAA, or data\-sovereignty requirements).
Lead incident response, root\-cause analysis, and post\-mortems; drive long\-term reliability improvements and SLO/SLA adherence.
Optimize cost, capacity, and resource utilization across public cloud spend and on\-premises hardware without compromising performance or availability.
Partner with data center operations and network providers on hardware provisioning, firmware management, MPLS circuit management, and lifecycle planning.
Mentor junior DevOps and software engineers; promote DevOps culture, automation\-first thinking, and shared ownership of production.
Evaluate and introduce new tools, platforms, and processes that improve developer productivity and system reliability.
*Required Qualifications**
5\+ years of experience in DevOps, SRE, or Platform Engineering roles, with at least 2 years at a senior level.
Deep expertise with at least one major public cloud provider (AWS, Azure, or GCP) in production.
Hands\-on experience operating private cloud or virtualization platforms (OpenStack, VMware, Nutanix, or equivalent) in production.
Strong experience with virtualization, storage systems, and enterprise networking in on\-premises environments.
Strong hands\-on experience with Kubernetes in production, including both managed cloud and self\-managed/bare\-metal clusters.
Proficiency in Infrastructure as Code (Terraform and Ansible strongly preferred).
Solid scripting and programming skills in Python, Go, Bash, or similar.
Experience designing and operating CI/CD pipelines using tools such as GitHub Actions, GitLab CI, Jenkins, CircleCI, or ArgoCD.
Strong Linux systems administration and networking fundamentals (TCP/IP, DNS, load balancing, VPNs, firewalls, routing, MPLS).
Experience with monitoring and observability stacks (Prometheus, Grafana, Datadog, New Relic, ELK, or OpenTelemetry).
Proven track record of leading incident response and improving system reliability.
Excellent communication skills and the ability to collaborate across engineering, security, infrastructure, and product teams.
*Preferred Qualifications**
Experience designing hybrid and multi\-cloud architectures, including secure connectivity (Direct Connect, ExpressRoute, MPLS, VPN, SD\-WAN) between public and private environments.
Familiarity with service meshes (Istio, Linkerd), API gateways, and GitOps workflows (ArgoCD, Flux).
Background in security\-focused or regulated environments and exposure to compliance frameworks.
Experience with database administration (PostgreSQL, MySQL, Redis, MongoDB) in cloud\-managed and self\-hosted setups.
Contributions to open\-source DevOps or cloud infrastructure tooling.