Mohammed Khalilia

Mohammed Khalilia

Lead Data Scientist

Aramco

Professional Summary

Mohammed Khalilia (محمد عبد الستار قاسم) is a researcher, computer scientist, and data scientist with a PhD in Computer Science from the University of Missouri. Following his doctorate, he joined Georgia Tech’s Computational Science and Engineering school and Emory University as a Postdoctoral Fellow, where his research spanned predictive modeling, relational cluster analysis, and health and nursing informatics.

He then spent nearly five years at Amazon, working across Amazon Web Services (AWS) and Amazon Studios in natural language processing (NLP), speech synthesis, and computer vision. In 2018, he was part of the team that launched Comprehend Medical, Amazon’s NLP service for clinical and biomedical text. At Qualtrics, he developed the company’s first fine-tuned large language model, trained synthetic sampling model, and worked on conversational machine learning, and active learning.

He is also an adjunct professor at Birzeit University, where he teaches NLP courses for doctoral students.

Education

PhD Computer Science

2007-01-01
2014-05-15

University of Missouri-Columbia

BS Computer Science

2001-08-31
2006-12-31

University of Missouri-Columbia

Interests

Machine learning Natural Language Processing Biomedical/health informatics Predictive modeling Large Language Models
Featured Publications

Multi-Channel Volume Level Equalization Based on User Preferences

Systems, devices, and methods are provided for multi-stem volume equalization, wherein the volume levels of each stem may be adjusted non-uniformly. Audio may be diarized into a …

mohammed-khalilia

Service architecture for entity and relationship detection in unstructured text

Techniques for entity and relationship detect from unstructured text as a service are described. A service may receive a request to identify entities within a provided unstructured …

thiruvarul-selvan-senthivel

Facial Feature Location-based Audio Frame Replacement

Played audio frames included in first audio content may be received over one or more networks. The first audio content may further include a replaced audio frame. The first audio …

mohammed-khalilia

Video Frame Replacement Based on Auxiliary Data

The popularity of videoconferencing has increased rap idly in recent years. Video conferencing tools may allow multiple people at multiple different locations to interact by …

gregory-johnson
Wojood: Nested Arabic Named Entity Corpus and Recognition using BERT featured image

Wojood: Nested Arabic Named Entity Corpus and Recognition using BERT

This paper presents Wojood, a corpus for Arabic nested Named Entity Recognition (NER). Nested entities occur when one entity mention is embedded inside another entity mention. …

mustafa-jarrar
Recent Publications
(2024). Event-Arguments Extraction Corpus and Modeling using BERT for Arabic. Association for Computational Linguistics, Proceedings of the Second Arabic Natural Language Processing Conference.
PDF
(2024). NLU-STR at SemEval-2024 Task 1: Generative-based Augmentation and Encoder-based Scoring for Semantic Textual Relatedness. Association for Computational Linguistics, Proceedings of the 18th International Workshop on Semantic Evaluation (SemEval-2024).
PDF
(2024). SinaTools: Open Source Toolkit for Arabic Natural Language Processing. Procedia Computer Science, 6th International Conference on AI in Computational Linguistics.
PDF
(2023). Arabic Fine-Grained Entity Recognition. Proceedings of The First Arabic Natural Language Processing Conference (ArabicNLP 2023, co-hosted with EMNLP 2023).