Morocco, Casablanca >> Improvement and optimization of data pipelines:
• Optimized and improved production data pipelines
• Reduced execution time by redesigning Spark cluster configuration
• Improved churn prediction model performance
• Reviewed and enhanced ETL, training, and inference logic
• Defined Spark infrastructure requirements for DEV and PROD
Tools: PySpark, Python, Jira, Hue, Airflow