Jason S. Lucas
Benchmarking
Beemo: Benchmark of Expert-edited Machine-generated Outputs
Beemo introduces a novel benchmark featuring 6.5k expert-edited machine-generated texts across diverse domains from creative writing to summarization. Through comprehensive evaluation of 33 MGT detector configurations, we demonstrate that expert editing effectively evades detection while LLM-edited texts remain distinguishable from human writing, highlighting critical gaps in current detection methods for multi-author scenarios.
Ekaterina Artemova, Jason Lucas, Saranya Venkatraman, Jooyoung Lee, Sergei Tilga, Adaku Uchendu, Vladislav Mikhailov