Misinformation Detection | Complex Data Lab

📄️ Overview

Misinformation can have serious harms to public health, the economy, and more. To help counter it, we studied detection methods. We found simple language model baselines are competitive with or can even out-perform state-of-the-art methods. This is particularly the case when train and test data have similar content, corresponding to the real-world task of detecting and containing misinformation on an already known topic or conspiracy. We also found some existing datasets have flaws due to their collection process, particularly when examples of different classes (e.g. real and fake) are collected at different times.