Excella
Senior Data Engineer
Mar. 2019, Arlington, VA
Build and manage 20+ data pipelines using Airflow, AWS (EC2, S3, RDS, Athena, Lambda, Glue), SQL (Postgres), and Python (Pandas, IRSx) for a team of auditors performing data anomaly analysis. Curate data sets from multiple sources for Looker reports; use Python tests to alert on data issues and improve data quality. Manage a document analysis pipeline that uses Kafka streaming to ingest PDFs. Troubleshoot and resolve data issues for external and internal users. Team lead for a legacy government system that used Oracle PL/SQL and the GoldenGate migration tool.