SessionM
Data Engineer
Nov. 2019 - Mar. 2021
United States
• Worked on complex Extract, Transform, and Load (ETL) processes using Apache Airflow, deployed with Jenkins on an Elastic MapReduce (EMR) cluster. Designed and implemented efficient data pipelines to move and transform large datasets, and handled monitoring and troubleshooting of the ETL processes.
• Focused on ML model development and validation, as well as feature creation and selection, to provide recommendations based on user characteristics and behavior.
AWS, EMR, Hadoop, Sqoop, Spark, Scala, Python, Jenkins, Airflow, Luigi