New York, New York, United StatesI’m a Data Scientist working in client-facing environment where analytics, applied AI, and system design overlap.
At IBM, I design and implement Python-based analytics, automated workflows, and cloud-native data pipelines that support Medicaid fraud-risk detection for state agencies operating at scale. Most of my work involves taking large, messy healthcare data and turning it into usable sig...
New York, New York, United StatesI collaborated cross-functionally with engineering, analytics, and product teams to support the development and ongoing enhancement of an enterprise business automation platform. My work involved leveraging SQL for large-scale data querying, validation, and transformation across complex datasets, ensuring data integrity and consistency within automated analytics and decision-support workflows...
Jun. 2021 - Aug. 2021
TakedaData Engineering and Artificial Intelligence Intern
Boston, Massachusetts, United StatesI designed a data pipeline to ingest large amounts of data quickly and provide exploratory data analysis for each patient and to get the data ready to be put into a neural network to predict target symptoms. I designed this data pipeline using PySpark and partioned the data, which was 1.8 terabytes of data, using a Resilient Distributed Dataset to drastically decrease the run time of the pipe...