Dominik Macko

BLUFF: Benchmarking in Low-resoUrce Languages for detecting Falsehoods and Fake news

BLUFF is the largest multilingual fake news detection benchmark, spanning 79 languages with 202K+ samples. It introduces AXL-CoI for adversarial generation and mPURIFY for quality …

Jason Lucas

Beyond speculation: Measuring the Growing Presence of LLM-generated texts in Multilingual Disinformation

This IEEE article provides empirical measurements of LLM-generated texts in multilingual disinformation, moving beyond speculation to analyze the growing presence and …

Dominik Macko

Authorship Obfuscation in Multilingual Machine-Generated Text Detection

This research from Penn State and KiNiT benchmarks the effectiveness of 10 authorship obfuscation (AO) techniques against 37 machine-generated text (MGT) detection methods across …

Dominik Macko

MULTITuDE: Large-Scale Multilingual Machine-Generated Text Detection Benchmark

This research from Penn State and KiNiT introduces MULTITuDE, a novel multilingual dataset for detecting machine-generated text. Comprising over 74,000 authentic and …

Dominik Macko