NANO Web Group | AI Venture StudioMachine Learning and Natural Language Processing Director
Mar. 2018 - May. 2019Greater New York City AreaAs a Lead ML and NLP engineer, I am responsible of applying the research and engineering skills to develop proprietary technology. I have several responsibilities such as: *Programming and Scripting Languages: Python *Database Systems: SQL (MySQL, SQLite), NoSQL(MongoDb), Postgresql, Flask *Frameworks and Toolkits : NLTK, SciPy, Numpy, Gensim, pytest, textblob, Stanford CoreNLP, Boost, igraph, Weka, Wordnet, Spacy, scikit-learn, pytorch *Problem Solving, Data Modeling and Analysis, Machine learning, Information Retrieval, NLP, Data Mining, Complex Networks Sampling techniques, Word Embeddings, *preprocessing data to transform and change raw feature vectors into a representation that is more suitable for our models *extract linguistic features like part-of-speech tags (POS), dependency labels and named entities (NER), *customising the stopwords, tokenizer *trained statistical models for spaCy named entity recognizer *creating rule-based matcher to find patterns *working with word vectors including tfidf, LSI,LSA, LDA *semantic similarity, sentiment analysis : Predicting similarity is useful for building recommendation systems and flagging duplicates. *clustering and classification :running a comparison of the clustering algorithms in scikit-learn to select the best model for the data