Jason Lucas

Ph.D. Candidate in Informatics

I am a PhD candidate in Informatics in the College of IST at Penn State University, where I conduct research at the PIKE Research Lab under the guidance of Dr. Dongwon Lee. My AI/ML research focuses on information integrity and safe, ethical AI, including combating harmful content across multiple languages and modalities. It spans low-resource multilingual NLP, generative AI, and adversarial machine learning, with work extending across 79 languages. I have published 12 papers with 260+ citations in premier venues including ACL, EMNLP, IEEE, and NAACL.

My doctoral research focuses on bridging the digital language divide through transfer learning, classification (NLU), generation (NLG), and adversarial attacks, and on developing end-to-end AI pipelines that use RAG and agentic AI workflows to combat multilingual threats. Drawing on my Grenadian background and knowledge of local Creole languages, I bring a global perspective to AI challenges, working to democratize state-of-the-art AI capabilities for underserved linguistic communities worldwide. My mission is to develop robust multilingual, multimodal systems and mitigate evolving security vulnerabilities while enhancing access to human language technology.

As an NSF LinDiv Fellow, I conduct transdisciplinary research advancing human-AI language interaction for social good. I actively mentor 5+ research interns and teach Applied Generative AI courses. Through industry experience at Lawrence Livermore National Lab, Interaction LLC, and Coalfire, I bridge academic research with practical applications in combating evolving security threats and enhancing global AI accessibility. I see multilingual advances and interdisciplinary collaboration as competitive advantages, not communication challenges. Beyond research, I stay active through dance, fitness, martial arts, and community service.

Beemo - Benchmark of Expert-edited Machine-generated Outputs

In this talk, we present Beemo, one of the first multi-author benchmarks for machine-generated text (MGT) detection that includes expert-edited responses. Our benchmark comprises …

Jason Lucas

Penn State connections lead doctoral student to interdisciplinary College of IST

Doctoral student Jason Lucas's academic journey from the West Indies to Penn State illustrates the power of mentorship and interdisciplinary thinking. With guidance from Professor …

Jason Lucas

Beemo: Benchmark of Expert-edited Machine-generated Outputs

Beemo introduces a novel benchmark featuring 6.5k expert-edited machine-generated texts across diverse domains from creative writing to summarization. Through comprehensive …

Ekaterina Artemova

Semantic Captioning: Benchmark Dataset and Graph-Aware Few-Shot In-Context Learning for SQL2Text

This work introduces semantic captioning for SQL queries, addressing the reverse operation of semantic parsing by translating SQL code into natural language explanations. Using …

Ali Al-Lawati

Leadership Examples in Tech Rarely Look Like Me

Informatics PhD student Jason Lucas reflects on his journey as a Black man in tech, sharing how his experiences have shaped his commitment to mentoring underrepresented students …

Jason Lucas

Generative AI Disproportionately Harms Long Tail Users

This Computer article examines how generative AI disproportionately harms long-tail users, focusing on the structural inequalities that emerge when AI systems are deployed without …

Barani Maung Maung

How is artificial intelligence being used in journalism now? What's coming next?

Jason Lucas joins WXXI's "Connections" to discuss the dangers of AI-driven disinformation campaigns during crises, alongside Oxford researcher Felix Simon's insights on AI …

Jason Lucas

Program Committee Member — DAGGenC Workshop (COLING 2025)

Program Committee Member for the Workshop on Detecting AI Generated Content, co-located with COLING 2025 in Abu Dhabi, UAE.

Jason Lucas

The Longtail Impact of Generative AI on Disinformation: Harmonizing Dichotomous Perspectives

This IEEE Intelligent Systems article examines the "longtail" impact of Generative AI on disinformation in high-impact events and resource-limited settings. We analyze four …

Jason Lucas

Graduate Student Bill of Rights Committee — College of IST

Served on the College of IST Graduate Student Bill of Rights Committee at Penn State, developing student advocacy frameworks.

Jason Lucas