Data Engineer | Databricks Developer | ETL Developer
Reliable data engineer with 10 years of proven industry experience in data lake development, data analytics, real-time streaming, and back-end application development. My work is used by millions of people in the legal and entertainment industries. I have built exceptionally stable solutions for high-traffic, high-visibility projects, and understand what it takes to ensure products are robust and dependable. I also have expertise in the Apache Spark ecosystem, Elastic Search, ETL, AWS Glue, DMS, Athena, EMR, Data Lake, AWS Big Data, Apache Kafka, Java, and NoSQL.
Specific Experience
1. Databricks : 5+ years of experience
2. Unity Catalog: 2+ years of experience
3. Apache Spark: 8+ years of experience
4. ETL: 8+ years of experience
5. SQL: 9+ years of experience
6. AWS: 8+ years of experience
7. Azure and GCP: 5+ years of experience
I am a data professional, worked with many companies, and delivered some of the enormous data engineering and data science projects in the past. My focus is always on scalable, sustainable, and robust software building.
As a data scientist, I will use data modeling, programming, analysis, visualization, and writing skills to help people have the insight to develop products, customers, and impact. As a data scientist, I care deeply about the data from beginning to end—I am actively involved in all aspects of data analysis, from data modeling tasks to writing reports and making visualizations.
Python/Scala Programming, Linux Admin, Data Wrangling, Data Cleansing & Data Extraction services utilizing Python 3 or Python 2 Programming or Scala/Spark on Linux or Windows.
I slice, dice, extract, transform, sort, calculate, cleanse, collect, organize, migrate, and otherwise handle data management for clients.
Services Provided:
- Big data processing using Spark Scala
- Building large Scale ETL
- Could Management
- Distributed platform development
- Machine learning
- Python Programming
- Algorithm Development
- AWS glue
- Pyspark
- DatMore...