Job Description: We’re looking for a strong Python engineer to help modernize legacy data workflows into production\-grade Python pipelines running in a cloud\-based data environment.
The role focuses on building reliable, scalable, and highly validated data pipelines with structured configs, logging, parquet\-based processing, and lightweight Streamlit interfaces. A major part of the work involves translating existing workflow logic into clean, vectorized Python while ensuring data accuracy and consistency across large datasets.
Clean engineering practices: reusable code, documentation, testing, and backward compatibility
Experience with PySpark, Polars, DuckDB, or large\-scale data processing is a plus.
You should be comfortable reading complex data flows, debugging silent data issues, optimizing slow transformations, and shipping maintainable pipeline code.
Qualifications: Graduate/ Post Graduate
Sen. Mobile App Tester
Testvox · Mumbai
GenAI / AI-ML Engineer
Premier IT Solutions · Ghaziabad
Network SME (Subject Matter Expert)
TalentNest Solutions · Mumbai