OCI is constructing the world's largest AI clusters and bringing them to market swiftly. Within the AI Infrastructure organization, you will serve as a distributed systems engineer tasked with scaling and optimizing essential components like the GPU control plane and GPU data plane, which allocate computing resources to AI workloads. This involves working with cutting-edge hardware to maximize performance, efficiency, reliability, and scalability, enabling customers to expand from minimal to massive GPU deployments seamlessly. You will engage with pioneering technologies and play a significant role in the organization's success by building infrastructure that powers the AI revolution. A solid background in distributed systems and a passion for solving complex scaling challenges are imperative.
Lead I - Python Kafka Development, docker, oauth2
UST · Trivandrum, Kerala, India