Work Background
Senior Consultant
GridBeyond
Jan. 2023
Dublin, County Dublin, Ireland
• Acquire data for energy markets in Japan, Australia, the USA, and Ireland from open-source and paid websites using APIs and real-time scraping.
• Develop robust Python scraping scripts to extract data and keep it up to date.
• Schedule and automate data extraction using VMs and Azure Function Apps.
• Manage data loading and database maintenance to ensure high data quality and availability.
• Oversee the Azure environment, including Azure Data Factory (ADF), Azure Function App, container registry, Azure Data Lake Storage Gen2 (ADLS2), and Azure Blob Storage.
• Coordinate with the team to ensure the reliability, scalability, and security of the infrastructure components.
• Implement Airflow through Azure Data Factory to optimize workflow scheduling and management.
• Use the collected data to build and fine-tune classifier models that forecast energy demand in target markets.
• Collaborate with data scientists to integrate predictive models into the organization's built-in optimizer software.
• Validate and improve prediction accuracy through continuous evaluation and model refinement.
• Lead the creation of a unified data model that captures the inputs and outputs of the organization's optimizer.
• Work closely with cross-functional teams, including data scientists and software developers, to ensure seamless integration and data consistency.
Senior Data Analyst
Codec Ireland
Nov. 2020 - Jan. 2023
Dublin, County Dublin, Ireland
• Worked with a cross-functional team of internal and external ICT stakeholders to establish the technical scope and prerequisites of the environment, enabling cost-effective and scalable solutions tailored to client requirements.
Solution Documentation and Integration
• Wrote high-level and detailed solution documents for incoming projects, most of which integrated new source systems into the existing Enterprise Data Warehouse (EDW).
• Designed BI conceptual data flows and logical data models for integrating new source systems into the existing EDW framework.
• Provided inputs to the development team for building efficient data pipelines, breaking complex high-level tasks into manageable low-level builds aligned with enterprise and business process architecture.
• Managed all release activities in Azure DevOps, including creating build pipelines (primarily one-time tasks), setting up automated release pipelines, and deploying code between environments.
• Reviewed and improved existing BI processes across the entire BI lifecycle, from build tasks to BI application refinement.
• Developed periodic proofs of concept (PoCs) for stakeholders to inform decisions on acquiring new tools (PaaS) within the Azure platform.
• Engaged with business units and stakeholders to explore opportunities for applying machine learning algorithms to the Business Intelligence data stored in the EDW and derive actionable insights from it.
Data Science Intern
Health Vectors
Aug. 2020 - Nov. 2020
Dublin, Ireland
• Derived actionable insights from large and complex datasets using statistical techniques (Python).
• Predicted units, high range values, and low range values for various hematology and thyroid parameters from historical data using classification algorithms.
• Mined data from static pathology lab reports using pytesseract.
Postgraduate Student
Dublin City University
Sep. 2019 - Aug. 2020
Dublin, Ireland
• Developed a bar chart race using Matplotlib to visualize the evolution of the medal tally of the top 10 countries from 1920 to 2016.
• Used Plotly and Bokeh to visualize the number of Olympic medals won against GDP per capita for all countries every four years from 1920, with a slider option.
• Created a product recommendation system using the Apriori algorithm and deployed the application on Shinyapps.io and Google App Engine.
• Developed a regression model to predict house prices on the Kansas City housing dataset from Kaggle.
• Predicted long-term and short-term video memorability of short video clips using semantic features (XGBoost regressor).
• Clustered and classified Spotify DB data by genre (classifiers: SVM, Naïve Bayes, Random Forest).
• Increased the efficiency of large matrix multiplication by 150% compared to the traditional method using OpenMP.
• Thesis: Investigated the correlation between Lexical Diversity and Evaluation Metrics in training of Neural Machine Engines.
• Completed coursework and projects in Statistics, Machine Learning, Data Visualization, Data Analysis, Data Mining, Cloud Technologies, Mathematical Methods & Computation, and Concurrent Programming.
Data Scientist
PricewaterhouseCoopers - Service Delivery Center (PwC SDC)
Oct. 2018 - Aug. 2019
Kolkata Area, India
• Collaborated with data modelers on physical database design to model and create data pipelines per business requirements.
• Extracted, transformed, and loaded data from different data sources and automated the complete ETL process, adhering to the specified job dependencies.
• Automated the retrieval of structured and semi-structured files from an AWS S3 server and their transfer to a Unix box using shell scripts, for use by Alteryx workflows.
• Retrieved apportionment rates from tax sheets in PDF files based on keywords, using OpenCV and PyTesseract (Python), and loaded the data into an Excel file.
• Classified employees into specified buckets per client requirements using HR data for a US client.
• Implemented a chatbot to assist the employees of a US client with company policies.
Data Engineer
Cognizant
Sep. 2016 - Oct. 2018
Kolkata metropolitan area, West Bengal, India
• Worked as an L2 developer for a US-based energy organization, handling modification requests for Informatica ETL tasks and monitoring ETL jobs running in production.
• Implemented the ETL flow for adding a new source (Great Plains ERP) to the existing data warehouse architecture for a US client.
• Built a data mart for an Australia-based energy and utilities organization, used by the Spotfire reporting tool to present dashboards to business and end users.
• Designed AWS Data Pipelines that process source files uploaded to AWS S3 and load them into the client data warehouse.
• Developed Informatica mappings and workflows for a US-based banking and financial services company for their new commercial landing zone (CL3).
• Developed a Tableau dashboard from dimension and fact tables in the client data warehouse to support business decisions.
• Carried out performance tuning to reduce the time and resource usage of various ETL jobs and SQL scripts using optimization techniques, indexes, and partitions.
• Used Python pandas to read and pre-process TSV files and carried out an extensive statistical study using sklearn to find clusters in the data (using PCA and t-SNE).
Big Data Analyst
Cognizant
Feb. 2015 - Aug. 2016
Kolkata metropolitan area, West Bengal, India
ETL Developer / Data Engineer
Programmer Analyst Trainee
Cognizant
Feb. 2014 - Feb. 2015
Kolkata Area, India
Data warehouse and ETL developer