Avenue Code
Senior Data Engineer
Feb. 2023
Rio de Janeiro, Brazil
Digital Transformation Project: migration of SSIS packages from on-premises infrastructure to Azure for an international client (Canada).
Phase 1: Lift & Shift
- Migrated SSIS packages to Azure, using Azure Data Factory for orchestration, Azure SQL for data storage, and Databricks for scalable data processing.
Phase 2: Refactoring
- Ported the codebase from C# to Python, improving performance and maintainability.
- Developed an extensive Python library within the Azure DevOps repository to standardize and streamline processes.
- Implemented scheduled jobs in Databricks, automating workflows and ensuring timely data processing.
- Designed CI/CD pipelines in Azure DevOps, incorporating unit tests and integration tests to ensure that updates to the main branch do not disrupt the process.
Data Handling & Integration:
- Handled diverse data formats, including CSV, TXT, JSON, Parquet, and XML, ensuring seamless integration across the data pipeline.
- Integrated data from a variety of sources, including APIs, SFTP, and direct database queries, maintaining a high standard of data accuracy and availability.
Data Platform Creation:
- Architected and implemented a data platform following the Medallion Architecture, using Bronze, Silver, and Gold layers to manage the data lifecycle and data quality.
  - Bronze Layer: Ingested raw data from multiple sources in its original format, ensuring data integrity.
  - Silver Layer: Conducted data cleaning and transformation to produce a curated dataset, ready for analytical processing.
  - Gold Layer: Created business logic-driven BI tables (DIM & FACT), ensuring data was ready for reporting and analytics.
- Established QA workflows to maintain data quality and reliability across all layers.