I am a data and AI engineering professional with over eight years of solid experience in Data Analysis, Engineering, and Visualization projects within complex corporate environments. I hold a degree in Industrial Engineering, an MBA in Project Management, and a specialization in Data Science and Artificial Intelligence, complemented by Lean Six Sigma Black Belt certification.
I possess proficiency in Python, SQL, and PySpark, with extensive experience developing scalable data pipelines in cloud environments (mainly AWS and Azure). My technical expertise spans modern data architectures and orchestration tools including Apache Spark, Apache Kafka, Kubernetes, Terraform, and Databricks (data platform). I've implemented end-to-end Data Lake and Data Warehouse solutions using open-source technologies such as MinIO, Apache Iceberg, Trino, DBT, Apache Airflow, and Metabase, with strong expertise in data governance, versioning, and automation of complex analytical processes.
Most recently, I've been deepening my expertise in AI Engineering and multi-agent systems architecture. I've designed and implemented sophisticated agent orchestration frameworks, including specialized subagent systems for data analytics and workflow automation. This work demonstrates my ability to architect complex AI systems that leverage foundation models to autonomously solve specialized domain problems while maintaining seamless inter-agent collaboration. I've also developed agent-based architectures where autonomous agents handle distinct responsibilities—from pipeline optimization to quality assurance—while working cohesively on enterprise-scale problems. This includes designing multi-agent collaboration patterns, managing agent specialization through domain expertise encapsulation, and orchestrating parallel execution for complex workflows.
Beyond technical foundations, I bring strategic experience in Supply Chain Management, Digital Transformation, Fraud Prevention, and Internal Audit, enabling a systemic perspective focused on generatingMore...
Reinaldo Duzanski Jr.
Lead Data Engineer@Thoughtworks
Verified
Lead Data Engineer with 9+ years of experience building and scaling cloud data platforms on GCP and AWS. I design and implement robust ETL pipelines using tools like PySpark, Airflow, Databricks, and Snowflake, supporting both batch and real-time data processing.
Recent highlights:
• Built and scaled a GCP data platform supporting a large analytics team using a medallion architecture (Bronze/Silver/Gold).
• Improved performance and reliability of data pipelines through optimized PySpark processing.
• Led the development of data infrastructure for audit and analytics use cases, enabling better visibility into business operations.
• Implemented real-time data streaming to support operational dashboards and time-sensitive insights.
Experienced in collaborating with global, cross-functional teams (USA, UK, India, China), with a strong ability to translate business needs into scalable data solutions.More...