Working at Citi is far more than just a job. A career with us means joining a team of more than 230,000 dedicated people from around the globe. At Citi, you’ll have the opportunity to grow your career, give back to your community and make a real impact.
We are building an **A\-team** of highly skilled, autonomous, and AI\-first engineers, and we are looking for an ambitious Full Stack Data Engineer to join our focused squads in Pune. This role is designed for a hands\-on engineer who is passionate about leveraging data, proficient in building end\-to\-end data solutions, and deeply committed to using AI tools to maximize productivity. The ideal candidate will be instrumental in designing, developing, and optimizing robust data pipelines, from ingestion to consumption, using Python, PySpark, and other big data technologies. We seek an individual with strong domain understanding who can contribute to our AI\-first culture and help shape the future of our data platforms.
+ Expert proficiency in Python, with proven experience in developing scalable data processing applications.
+ Strong understanding and hands\-on experience with Apache Spark, particularly PySpark, for large\-scale data processing.
+ Solid experience with Hive for data warehousing and querying large datasets.
+ Familiarity with distributed computing fundamentals and components like HDFS.
+ Proficiency in SQL and experience with data warehousing concepts.
+ Experience with data storage formats (e.g., Parquet, ORC, Avro) and cloud\-based data lake solutions (e.g., S3\).
+ Experience with Apache Kafka for building real\-time data pipelines and event\-driven architectures.
+ Knowledge of data visualization tools like Tableau is beneficial but not mandatory.
+ **Proven effectiveness with AI coding tools (e.g., Claude Code, Codex, Antigravity) is expected; a strong willingness to adopt and maximize their usage is essential.**
+ An "AI\-first thinker" mindset, demonstrating how to leverage and integrate AI tools into the development workflow for continuous improvement.
+ Strong ability to articulate the functional domain being worked in, understanding the business context, and explaining the "why" behind the technical data solutions.
+ Strong understanding of data structures, algorithms, and performance optimization techniques for large\-scale data processing.
+ Experience with RESTful API design and development for data ingestion or exposure points.
+ Familiarity with containerization technologies (e.g., Docker, Kubernetes) for deploying data applications is a plus.
+ Expert proficiency with version control systems, especially Git.
+ Exceptional problem\-solving, analytical, and debugging skills in complex, distributed data environments.
+ Superior communication and interpersonal skills, with the ability to work effectively and autonomously within small, high\-performing teams, and to collaborate with various stakeholders.
+ Demonstrated high autonomy and agency in tackling complex challenges and delivering impactful data solutions.
This job description provides a high\-level review of the types of work performed. Other job\-related duties may be assigned as required.
\-
Technology
\-
Applications Development
\-
Full time
\-
Please see the requirements listed above.
\-
For complementary skills, please see above and/or contact the recruiter.
\-
Full Stack Developer Intern (Remote) | MERN Stack | Web Applications | API Development
Inficore Soft · Remote
API Architect
Virtusa · Gurgaon, Haryana, India
API Developer
Zenwork, Inc · Hyderabad, Telangana, India