Senior Research Engineer · NLP & ML

Hi, I'm Emad. I build intelligent systems.

I've developed strong expertise in Natural Language Processing, Machine Learning, Deep Learning, and Software Engineering across academic and industrial settings. I'm highly motivated to tackle challenging AI problems — from LLM-powered products to search and knowledge graph systems.

Jul 2021 - Present

Senior Research Engineer

Thomson Reuters

Toronto, Canada

Details & Impact

Delivering software components and ML Pipelines for experimentation & deployment of AI Models (ML/DL/LLM-based models) to improve customers' flagship products, and add new AI features related to NLP, search, knowledge graph, and recommendation problems.

Jul 2018 - Jun 2021

NLP & Machine Learning Engineer

INAGO INC.

Toronto, Canada

Details & Impact

Deep neural network development, managing datasets, training, and deployment for Language Understanding Engines and Automated Text Generation Models. [Python, PyTorch, Word Embedding, AWS EC2, SpaCy, NLTK, BERT, T5 models]. Managing collaborative research projects with universities related to Automated Text Generation and Linguistics.

Education

2014 - 2018

M.Sc. in Computer Science

York University

Toronto, Canada

GPA: 8.17 / 9

Thesis: Interactive Question Answering Using Frame-based Knowledge Representation

2010 - 2014

B.Sc. in Computer Engineering

Amirkabir University of Technology

Tehran, Iran

GPA: 17.18 / 20

Technical Skills

Python PyTorch Scikit-Learn Numpy Pandas SpaCy NLTK Transformers Large Language Models NLP Machine Learning Deep Learning AWS Docker PostgreSQL Elasticsearch Opensearch Java Scala Bash Agile Development

Projects

GenAI Summary for Business Entity Reports

2024

Using LLMs prompting to generate summaries with customer-focused insights to facilitate the use of long business reports. Improving the current data extraction pipeline from reports.

PythonOpenAI GPT-4Claude Sonnet

Design and Building New Entity Matching System

2022 - 2024

Worked on new generation of flagship product for Due Diligence and Risk Management.

SparkPythonJavaScikit-LearnAWS RDSOpensearchEMRSageMaker

New Search Improvement Feature to Promote More Relevant Documents

2021

Delivered new search functionality to detect more general type of queries and questions and promoting the more suitable content for these queries for a Tax Research web application (in-production).

PythonJavaNLTKSentence EmbeddingElasticsearchAWS

Improving Automated Question Generation from Documents

2020

Fine-tuning T5 Transformer Language Model, experimenting for improving model input, using BLEURT evaluation. Auto-generated questions resulted in 40% reduction in manual data curation efforts.

PyTorchSpaCyHuggingFace TransformersPython

Improved Language Understanding Engine Models

2019

Training domain-specific Word2Vec word embeddings and adding detailed model testing for LSTMs to improve NLU model interpretability.

PyTorchPythonWord2VecLSTM

Conversational Question Answering

2018

Creating a question answering dialogue system powered by syntactic and semantic analysis of documents and ontology generation. Domain-specific question answering using a dialogue interface. Part of a collaborative project with an industrial partner.

SpaCyPython

Publications

Question-worthy sentence selection for question generation

Canadian AI 2020 2020 · co-authored

Interactive Question Answering Using Frame-based Knowledge Representation

York University M.Sc. Thesis 2018

Time aware topic based recommender system

Big Data & Information Analytics 2016 · co-authored

A study on prediction of user's tendency toward purchases in websites based on behavior models

Information and Knowledge Technology (IKT), 6th Conference, IEEE 2014 · co-authored