Description:
Key Responsibilities:
- Design, develop, and maintain ETL pipelines using AWS services, Python, and Spark.
- Optimize data ingestion, transformation, and storage processes for high-performance data processing.
- Work with structured and unstructured data, ensuring data integrity, quality, and governance.
- Develop SQL queries to extract and manipulate data efficiently from relational databases.
- Implement data validation and testing frameworks using Pytest to ensure data accuracy and reliability.
- C
Mar 28, 2025; from: dice.com