Required Skills

API, CI/CD, data visualization, Docker, Git, Kafka, Kubernetes, NoSQL, Python, SQL, Tableau

Job Description

**Discover your future at Citi**
-------------------------------

Working at Citi is far more than just a job. A career with us means joining a team of more than 230,000 dedicated people from around the globe. At Citi, you’ll have the opportunity to grow your career, give back to your community and make a real impact.

**Job Overview**
---------------

We are building an **A-team** of highly skilled, autonomous, and AI-first engineers, and we are looking for an ambitious Full Stack Data Engineer to join our focused squads in Pune. This role is designed for a hands-on engineer who is passionate about leveraging data, proficient in building end-to-end data solutions, and deeply committed to using AI tools to maximize productivity. The ideal candidate will be instrumental in designing, developing, and optimizing robust data pipelines, from ingestion to consumption, using Python, PySpark, and other big data technologies. We seek an individual with strong domain understanding who can contribute to our AI-first culture and help shape the future of our data platforms.

**Responsibilities:**

+ Operate end-to-end in the design, development, and implementation of full-stack data solutions, ensuring optimal performance, scalability, data quality, and security across the data lifecycle.
+ Collaborate closely within small, co-located squads (4-7 person teams), fostering high communication and low coordination overhead, to translate complex business requirements into technical specifications for data engineering solutions.
+ Develop, maintain, and optimize data ingestion, processing, and transformation pipelines using Python and PySpark for large-scale datasets.
+ Implement data storage solutions using big data technologies such as Hive, distributed file systems (e.g., HDFS, S3), and potentially NoSQL databases.
+ Design and implement data models and schemas optimized for analytics and reporting, ensuring data integrity and accessibility.
+ Work with data consumers (e.g., analysts, data scientists) to understand their needs and provide efficient access to processed data, potentially involving reporting tools like Tableau.
+ Implement and manage real-time data streaming and event-driven architectures using technologies like Apache Kafka.
+ Champion best practices in data engineering and software development, including rigorous code reviews, comprehensive testing, and support for continuous integration and continuous deployment (CI/CD) pipelines.
+ Demonstrate high autonomy and agency in driving data projects forward, making informed technical decisions, and proactively identifying areas for data quality and efficiency improvements.

**Required Skills & Experience:**

**Experience:** 4-5 years of hands-on experience as a Data Engineer, with a strong focus on building end-to-end data solutions and big data technologies.
**Programming Languages:**

+ Expert proficiency in Python, with proven experience in developing scalable data processing applications.

**Big Data Frameworks/Technologies:**

+ Strong understanding and hands-on experience with Apache Spark, particularly PySpark, for large-scale data processing.

+ Solid experience with Hive for data warehousing and querying large datasets.

+ Familiarity with distributed computing fundamentals and components like HDFS.

**Data Storage & Management:**

+ Proficiency in SQL and experience with data warehousing concepts.

+ Experience with data storage formats (e.g., Parquet, ORC, Avro) and cloud-based data lake solutions (e.g., S3).

**Messaging & Event Streaming:**

+ Experience with Apache Kafka for building real-time data pipelines and event-driven architectures.

**Reporting & Visualization (Nice to Have):**

+ Knowledge of data visualization tools like Tableau is beneficial but not mandatory.

**AI-Powered Development & Productivity:**

+ **Proven effectiveness with AI coding tools (e.g., Claude Code, Codex, Antigravity) is expected; a strong willingness to adopt and maximize their usage is essential.**

+ An "AI-first thinker" mindset, demonstrating how to leverage and integrate AI tools into the development workflow for continuous improvement.

**Domain Understanding:**

+ Strong ability to articulate the functional domain being worked in, understand the business context, and explain the "why" behind the technical data solutions.

**Other Essential Skills:**

+ Strong understanding of data structures, algorithms, and performance optimization techniques for large-scale data processing.

+ Experience with RESTful API design and development for data ingestion or exposure points.

+ Familiarity with containerization technologies (e.g., Docker, Kubernetes) for deploying data applications is a plus.

+ Expert proficiency with version control systems, especially Git.

+ Exceptional problem-solving, analytical, and debugging skills in complex, distributed data environments.

+ Superior communication and interpersonal skills, with the ability to work effectively and autonomously within small, high-performing teams, and to collaborate with various stakeholders.

+ Demonstrated high autonomy and agency in tackling complex challenges and delivering impactful data solutions.

**Education:**

Bachelor's or Master's degree in Computer Science, Engineering, Data Science, or a related quantitative field is required. Equivalent practical experience with a demonstrable track record of excellence will also be considered.

This job description provides a high-level review of the types of work performed. Other job-related duties may be assigned as required.

**Job Family Group:**
--------------------

Technology

**Job Family:**
--------------

Applications Development

**Time Type:**
-------------

Full time

**Most Relevant Skills**
-----------------------

Please see the requirements listed above.

**Other Relevant Skills**
------------------------

For complementary skills, please see above and/or contact the recruiter.

*Citi is an equal opportunity employer, and qualified candidates will receive consideration without regard to their race, color, religion, sex, sexual orientation, gender identity, national origin, disability, status as a protected veteran, or any other characteristic protected by law.*


Job Overview

Job type: Full-time
Work mode: On-site
Location: Mumbai
Posted: 1d ago
Source: Indeed
