Jason Lucas

Jason Lucas

Ph.D. Candidate in Informatics

I am a PhD candidate in Informatics in the College of IST at Penn State University, where I conduct research at the PIKE Research Lab under the guidance of Dr. Dongwon Lee. I specialize in AI/ML research focused on Information Integrity, Safe and Ethical AI, including combating harmful content across multiple languages and modalities. My research spans low-resource multilingual NLP, generative AI, and adversarial machine learning, with work extending across 79 languages. I have published 12 papers with 260+ citations in premier venues including ACL, EMNLP, IEEE, and NAACL.

My doctoral research focuses on bridging the digital language divide through transfer learning, classification (NLU), generation (NLG), adversarial attacks, and developing end-to-end AI pipelines using RAG and Agentic AI workflows for combating multilingual threats. Drawing from my Grenadian background and knowledge of local Creole languages, I bring a global perspective to AI challenges, working to democratize state-of-the-art AI capabilities for underserved linguistic communities worldwide. My mission is to develop robust multilingual multimodal systems and mitigate evolving security vulnerabilities while enhancing access to human language technology through cutting-edge solutions.

As an NSF LinDiv Fellow, I conduct transdisciplinary research advancing human-AI language interaction for social good. I actively mentor 5+ research interns and teach Applied Generative AI courses. Through industry experience at Lawrence Livermore National Lab, Interaction LLC, and Coalfire, I bridge academic research with practical applications in combating evolving security threats and enhancing global AI accessibility. I see multilingual advances and interdisciplinary collaboration as a competitive advantage, not a communication challenge. Beyond research, I stay active through dance, fitness, martial arts, and community service.

Dagstuhl Seminar 26252 — From Speech Translation to Multilingual Communication featured image

Dagstuhl Seminar 26252 — From Speech Translation to Multilingual Communication

Invitation-only seminar on "From Speech Translation to Multilingual Communication – New Research Challenges" bringing together researchers from speech translation, interpretation …

avatar
Jason Lucas
Ethical Use of AI in Research and Teaching featured image

Ethical Use of AI in Research and Teaching

Invited Lunch & Learn session exploring responsible, transparent, and ethical uses of AI in research and teaching, with emphasis on upholding academic integrity and maintaining …

avatar
Jason Lucas
DIA-HARM: Harmful Content Detection Robustness Across 50 English Dialects featured image

DIA-HARM: Harmful Content Detection Robustness Across 50 English Dialects

DIA-HARM evaluates 16 harmful content detection models across 50 English dialects using 195K+ samples, revealing 1.4–3.6% F1 drops for fine-tuned models and up to 27% for zero-shot …

avatar
Jason Lucas
BLUFF: Benchmarking in Low-resoUrce Languages for detecting Falsehoods and Fake news featured image

BLUFF: Benchmarking in Low-resoUrce Languages for detecting Falsehoods and Fake news

BLUFF is the largest multilingual fake news detection benchmark, spanning 79 languages with 202K+ samples. It introduces AXL-CoI for adversarial generation and mPURIFY for quality …

avatar
Jason Lucas
Program Committee Member — LoResLM 2026 Workshop (EACL) featured image

Program Committee Member — LoResLM 2026 Workshop (EACL)

Program Committee Member for LoResLM 2026, the Workshop on Low-Resource Languages and Multilingual NLP, co-located with EACL in Rabat, Morocco.

avatar
Jason Lucas
Beyond speculation: Measuring the Growing Presence of LLM-generated texts in Multilingual Disinformation featured image

Beyond speculation: Measuring the Growing Presence of LLM-generated texts in Multilingual Disinformation

This IEEE article provides empirical measurements of LLM-generated texts in multilingual disinformation, moving beyond speculation to analyze the growing presence and …

dominik-macko
Chain-of-Interactions: Iterative ICL Framework for Abstractive Task-Oriented Dialogue Summarization of Conversational AI Interactions featured image

Chain-of-Interactions: Iterative ICL Framework for Abstractive Task-Oriented Dialogue Summarization of Conversational AI Interactions

Chain-of-Interactions (CoI) introduces a novel multi-step framework that leverages LLMs' in-context learning capabilities for abstractive task-oriented dialogue summarization. …

avatar
Jason Lucas
Organizing Committee Member — 12th MASC-SLL featured image

Organizing Committee Member — 12th MASC-SLL

Organizing Committee Member for the 12th Mid-Atlantic Student Colloquium on Speech, Language & Learning (MASC-SLL) at Penn State.

avatar
Jason Lucas
Graph-based Molecular In-context Learning Grounded on Morgan Fingerprints featured image

Graph-based Molecular In-context Learning Grounded on Morgan Fingerprints

GAMIC introduces a novel self-supervised learning approach for molecular in-context learning that combines graph neural networks with Morgan fingerprints to better capture …

ali-al-lawati
Graph-based Molecular In-context Learning Grounded on Morgan Fingerprints featured image

Graph-based Molecular In-context Learning Grounded on Morgan Fingerprints

GAMIC introduces a novel self-supervised learning approach for molecular in-context learning that combines graph neural networks with Morgan fingerprints to better capture …

ali-al-lawati