ProvectusSenior Machine Learning Engineer
Aug. 2019 - Aug. 2021Toronto, Ontario, Canada· Led the development of a text extraction and QA system employing OpenAI GPT-3, achieving a 75% accuracy (EM) on
domain-specific data sets
· Optimized data preprocessing pipelines resulting in a 15% increase in overall efficiency
· Led document multi-class classification efforts using transformer architectures, achieving an F1 score of 087 on extensive
datasets
· Developed a multi-label text classification pipeline using Transformers and Pytorch, handling over 100 labels and achieving
a 70% micro-average precision
· Pioneered the transformation of clinical trial descriptions into SQL-like logic using BERT-based models, reducing error rates
by 20% compared to conventional methods
· Designed and executed benchmarking tests for the QA task within the biomedical domain, specifically focusing on RAG
requests across various LLMs
· Increased model training process efficiency by 30% through the development of a scalable Amazon SageMaker ML pipeline
· Curated a comprehensive dataset from reputable biomedical sources and implemented evaluation metrics, including BLEU
score, ROUGE-L, and EM, ensuring a holistic assessment of model performance in real-world scenarios
· Compiled and optimized YOLOv4 models on Nvidia Jetson Xavier and Nano, achieving a real-time frame rate of 30 FPS on
HD video feeds
· Architected an end-to-end video processing pipeline from edge devices (using Gstreamer) to AWS Cloud for real-time
analytics and report generation using AWS Lambda and S3
· Trained the YOLO model, achieving real-time object detection and tracking, with velocity estimation using mapping to GPS
coordinates
· Developed a real-time industrial appliance monitoring system using machine learning for predictive maintenance
· Applied TF-IDF and Word2Vec embeddings along with encoders with advanced statistical techniques for code segmentation,
resulting in a 30% improvement in feature relevance for item matching