Cohere For AI Invited Talk

Image credit: Cohere

Abstract

In today’s world, the rapid advancement and widespread use of large language models (LLMs) have brought about both opportunities and challenges. While these models have tremendous potential, they also pose risks, particularly in generating harmful and misleading content. Our research introduces an innovative “Fighting Fire with Fire” (F3) strategy to address this issue by leveraging the capabilities of modern LLMs. Our approach utilizes GPT-3.5-turbo to generate both authentic and deceptive content through advanced paraphrase and perturbation techniques. Furthermore, we apply zero-shot in-context semantic reasoning to differentiate genuine from deceptive posts and news articles. Our extensive experiments demonstrate that GPT-3.5-turbo achieves a superior accuracy of 68-72% in detecting disinformation, outperforming previous models. This research underscores the potential of using advanced AI to combat the very problems it may create. In another study, we address the spread of COVID-19 misinformation in low-resource regions, focusing on the Caribbean. Given the lack of abundant fact-checking resources in these areas, we transferred knowledge from fact-checked claims in the US to detect misinformation in English, Spanish, and Haitian French. Our findings highlight the challenges and limitations of current fake news detection methods in low-resource settings and emphasize the need for further research in multilingual detection.

Date
Apr 11, 2024 9:00 AM — Apr 13, 2024 9:20 AM
Location
Online Presentation
University Park, Pensylvania 16802-2440
Click on the Slides button above to view the built-in slides feature.

Slides can be added in a few ways:

  • Create slides using Hugo Blox Builder’s Slides feature and link using slides parameter in the front matter of the talk file
  • Upload an existing slide deck to static/ and link using url_slides parameter in the front matter of the talk file
  • Embed your slides (e.g. Google Slides) or presentation video on this page using shortcodes.

Further event details, including page elements such as image galleries, can be added to the body of this page.

Jason Lucas
Jason Lucas
Ph.D. Candidate in Informatics