We are looking for an experienced Senior Kafka Administrator to lead the design implementation and management of highly scalable and resilient Kafka ecosystems
This role requires deep expertise in distributed systems strong troubleshooting capabilities and the ability to architect enterprise grade streaming platforms
You will play a key role in ensuring system reliability optimizing performance and guiding teams on best practices for real time data streaming
*Key Responsibilities:**
------------------------
Design deploy and manage large scale Kafka clusters across on premise and cloud environments
Lead architecture decisions for high availability scalability and fault tolerance
Monitor troubleshoot and optimize Kafka clusters to achieve optimal performance and uptime
Handle capacity planning cluster sizing and performance tuning
Manage advanced configurations including multi cluster replication MirrorMaker tiered storage and KRaft mode
Configure and enforce security best practices SASL SSL TLS RBAC ACLs
Implement robust backup disaster recovery and failover strategies
Collaborate with engineering teams to design efficient event driven architectures and streaming pipelines
Resolve complex production issues and lead root cause analysis RCA
Automate infrastructure provisioning and operational tasks using tools like Terraform Ansible or scripts
Lead Kafka upgrades migrations and platform enhancements with minimal downtime
Establish and maintain monitoring alerting and logging frameworks
*Technical Requirements:**
--------------------------
Primary skills Technology Java Apache
5 9 years of IT experience with at least 3 years in Kafka administration engineering
Strong expertise in Apache Kafka internals brokers partitions replication ISR
Deep understanding of distributed systems messaging frameworks and event streaming