A successful Sr Site Reliability Engineers (SREs) to lead and scale our SRE function in India. This role is pivotal in ensuring the reliability, scalability, and performance of our production systems while fostering a culture of operational excellence and continuous improvement.
You will lead a team of SREs responsible for designing and implementing robust infrastructure, automating operations, and driving incident management and service reliability across our platforms.
- **Leadership \& Strategy**
+ Lead and mentor a high\-performing team of SREs across multiple time zones.
+ Execute the SRE roadmap aligned with business and engineering goals.
+ Collaborate with engineering, product, and infrastructure teams to ensure system reliability and performance.
- **Reliability Engineering**
+ Drive the adoption of SRE best practices including SLIs, SLOs, and error budgets.
+ Lead efforts in capacity planning, performance tuning, and disaster recovery.
+ Oversee incident response, root cause analysis, and postmortems.
- **Automation \& Tooling**
+ Champion automation to reduce toil and improve operational efficiency.
+ Build and maintain CI/CD pipelines, observability tools, and infrastructure\-as\-code solutions.
- **Operational Excellence**
+ Establish and monitor key metrics for system health and team performance.
+ Ensure compliance with security, privacy, and regulatory standards.
+ Foster a culture of blameless postmortems and continuous learning
- *What You Will Need to be Successful:**
- Bachelor’s degree in computer science, Engineering, or related field, or equivalent practical experience
- 10\+ years of experience in software engineering, DevOps, or SRE roles, with at least 3 years in a leadership capacity
- *What You May Need to be Successful:**
- Proven experience managing large\-scale, distributed systems in a cloud\-native environment (AWS, Azure).
- Strong expertise in monitoring (AppD, Splunk), automation (Terraform, Ansible), and CI/CD (Jenkins, GitHub Actions).
- Deep understanding of Linux systems, networking, containers (Docker, Kubernetes), and modern infrastructure practices.
- Excellent communication, collaboration, and stakeholder management skills.
- *Preferred Qualifications**
- Experience working in a global, matrixed organization.
- Contributions to open\-source SRE or DevOps tools.
- *We have great people here and are looking for more. Come join us!**
Follow us
- Facebook
- Instagram
- LinkedIn
- X
- YouTube
- **Equal Employment Opportunities at First Advantage***
- First Advantage is an equal opportunity employer. We are committed to providing a workplace and recruitment process that is free from unlawful discrimination, harassment, and retaliation. Employment decisions at First Advantage are based solely on qualifications, merit, and business needs. We do not discriminate in any aspect of employment on the basis of race, color, national origin, ancestry, citizenship, religion, creed, sex, gender identity, gender expression, sexual orientation, marital or family status, pregnancy, age, physical or mental disability, medical condition, genetic information, veteran or military status, or any other characteristic protected by applicable law.*