Dongwon Lee

BLUFF: Benchmarking in Low-resoUrce Languages for detecting Falsehoods and Fake news featured image

BLUFF: Benchmarking in Low-resoUrce Languages for detecting Falsehoods and Fake news

BLUFF is the largest multilingual fake news detection benchmark, spanning 79 languages with 202K+ samples. It introduces AXL-CoI for adversarial generation and mPURIFY for quality …

avatar
Jason Lucas
Beyond speculation: Measuring the Growing Presence of LLM-generated texts in Multilingual Disinformation featured image

Beyond speculation: Measuring the Growing Presence of LLM-generated texts in Multilingual Disinformation

This IEEE article provides empirical measurements of LLM-generated texts in multilingual disinformation, moving beyond speculation to analyze the growing presence and …

dominik-macko
Generative AI Disproportionately Harms Long Tail Users featured image

Generative AI Disproportionately Harms Long Tail Users

This Computer article examines how Generative AI disproportionately harms longtail users, focusing on the structural inequalities that emerge when AI systems are deployed without …

barani-maung-maung
The Longtail Impact of Generative AI on Disinformation: Harmonizing Dichotomous Perspectives featured image

The Longtail Impact of Generative AI on Disinformation: Harmonizing Dichotomous Perspectives

This IEEE Intelligent Systems article examines the "longtail" impact of Generative AI on disinformation in high-impact events and resource-limited settings. We analyze four …

avatar
Jason Lucas
Authorship Obfuscation in Multilingual Machine-Generated Text Detection featured image

Authorship Obfuscation in Multilingual Machine-Generated Text Detection

This research from Penn State and KiNiT, benchmarks the effectiveness of 10 authorship obfuscation (AO) techniques against 37 machine-generated text (MGT) detection methods across …

dominik-macko

Fighting Fire with Fire - EMNLP 2023

The Dual Role of LLMs in Crafting and Detecting Elusive Disinformation

jason-lucas
MULTITuDE: Large-Scale Multilingual Machine-Generated Text Detection Benchmark featured image

MULTITuDE: Large-Scale Multilingual Machine-Generated Text Detection Benchmark

This research from Penn State and KiNiT introduces MULTITuDE, a novel multilingual dataset for detecting machine-generated text. Comprised of over 74,000 authentic and …

dominik-macko
Fighting Fire with Fire: The Dual Role of LLMs in Crafting and Detecting Elusive Disinformation featured image

Fighting Fire with Fire: The Dual Role of LLMs in Crafting and Detecting Elusive Disinformation

This research project is a collaboration with Penn State and MIT Lincoln Lab. Our study demonstrates the dual capacity of LLMs for offensive misuse and defense detection against …

avatar
Jason Lucas
Detecting False Claims in Low-Resource Regions: A Case Study of Caribbean Islands featured image

Detecting False Claims in Low-Resource Regions: A Case Study of Caribbean Islands

This paper is the first attempt to detect COVID-19 misinformation (in English, Spanish, and Haitian French) populated in the Caribbean regions, using the fact-checked claims in the …

avatar
Jason Lucas