Mohammed Khalilia

Mohammed Khalilia

Lead Data Scientist

Aramco

Professional Summary

Mohammed Khalilia (محمد عبد الستار قاسم) is a researcher, computer scientist, and data scientist with a PhD in Computer Science from the University of Missouri. Following his doctorate, he joined Georgia Tech’s Computational Science and Engineering school and Emory University as a Postdoctoral Fellow, where his research spanned predictive modeling, relational cluster analysis, and health and nursing informatics.

He then spent nearly five years at Amazon, working across Amazon Web Services (AWS) and Amazon Studios in natural language processing (NLP), speech synthesis, and computer vision. In 2018, he was part of the team that launched Comprehend Medical, Amazon’s NLP service for clinical and biomedical text. At Qualtrics, he developed the company’s first fine-tuned large language model, trained synthetic sampling model, and worked on conversational machine learning, and active learning.

He is also an adjunct professor at Birzeit University, where he teaches NLP courses for doctoral students.

Education

PhD Computer Science

2007-01-01
2014-05-15

University of Missouri-Columbia • USA

BS Computer Science

2001-08-31
2006-12-31

University of Missouri-Columbia • USA

Interests

Machine learning Natural Language Processing Biomedical/health informatics Predictive modeling Large Language Models
Featured Publications
Multi-Channel Volume Level Equalization Based on User Preferences featured image

Multi-Channel Volume Level Equalization Based on User Preferences

Systems, devices, and methods are provided for multi-stem volume equalization, wherein the volume levels of each stem may be adjusted non-uniformly. Audio may be diarized into a …

mohammed-khalilia
Service architecture for entity and relationship detection in unstructured text featured image

Service architecture for entity and relationship detection in unstructured text

Techniques for entity and relationship detect from unstructured text as a service are described. A service may receive a request to identify entities within a provided unstructured …

thiruvarul-selvan-senthivel
Facial Feature Location-based Audio Frame Replacement featured image

Facial Feature Location-based Audio Frame Replacement

Played audio frames included in first audio content may be received over one or more networks. The first audio content may further include a replaced audio frame. The first audio …

mohammed-khalilia
Video Frame Replacement Based on Auxiliary Data featured image

Video Frame Replacement Based on Auxiliary Data

The popularity of videoconferencing has increased rap idly in recent years. Video conferencing tools may allow multiple people at multiple different locations to interact by …

gregory-johnson
Wojood: Nested Arabic Named Entity Corpus and Recognition using BERT featured image

Wojood: Nested Arabic Named Entity Corpus and Recognition using BERT

This paper presents Wojood, a corpus for Arabic nested Named Entity Recognition (NER). Nested entities occur when one entity mention is embedded inside another entity mention. …

mustafa-jarrar
Recent Publications
(2025). $mathrmWojood^Relations$: Arabic Relation Extraction Corpus and Modeling. Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing.
(2025). Active Learning for Multidialectal Arabic POS Tagging. Findings of the Association for Computational Linguistics: EMNLP 2025.
(2025). Konooz: Multi-domain Multi-dialect Corpus for Named Entity Recognition. Findings of the Association for Computational Linguistics: ACL 2025.