Governo do Estado de São PauloCientista de Dados | Secretaria Estadual da Educação
May. 2024- Extensive Exploratory Data Analysis (EDA) on student performance, attendance, and assessment indicators to uncover patterns and support policy decisions. - Developed and monitored educational KPIs to track learning outcomes, dropout risks, school infrastructure demands, and program effectiveness. - Applied statistical techniques (correlation, hypothesis testing, regression) to evaluate the impact of educational interventions and resource allocation. - Conducted time series analysis to model trends in longitudinal educational indicators. - Designed data pipelines in Databricks using PySpark to process large-scale educational data from raw to refined and gold layers. -Designed and implemented data quality strategies leveraging DQX and DLT to monitor and enforce schema integrity, value consistency, and reliability of large-scale educational data. -Built machine learning models (Random Forest, PCA, clustering) to evaluate students performance and school clusters. - Collaborated with multidisciplinary teams (pedagogical, infrastructure, and IT departments) to translate educational needs into actionable, data-driven insights. - Built dashboards and analytical reports for stakeholders using Power BI, automating recurring analyses and visual storytelling.