Overview
Misinformation can have serious harms to public health, the economy, and more. To help counter it, we studied detection methods. We found simple language model baselines are competitive with or can even out-perform state-of-the-art methods. This is particularly the case when train and test data have similar content, corresponding to the real-world task of detecting and containing misinformation on an already known topic or conspiracy. We also found some existing datasets have flaws due to their collection process, particularly when examples of different classes (e.g. real and fake) are collected at different times.
For more information, please contact Kellin.Pelrine@mila.quebec.
Publications
The Surprising Performance of Simple Baselines for Misinformation Detection. Kellin Pelrine, Jacob Danovitch (equal contribution), Reihaneh Rabbany. The Web Conference 2021. https://arxiv.org/pdf/2104.06952.pdf