Description:
Key Responsibilities: Design, develop, and maintain ETL pipelinesusing AWS services, Python, and Spark. Optimize data ingestion, transformation, and storage processes for high-performance data processing. Work with structured and unstructured data, ensuring data integrity, quality, and governance. Develop SQL queriesto extract and manipulate data efficiently from relational databases. Implement data validation and testingframeworks using Pytest to ensure data accuracy and reliability. Colla
Apr 7, 2025;
from:
dice.com