Uploaded: 2021-09-08
Languages: English
Collected from: 2021
Access category: Controlled
Email: Not available
To: 2021

Summary is a popular UK-based online discussion forum for parents seeking advice, support and reviews of parenthood-related products and services.

Subject keywords: corpus, forum
Data types: Written
Funders: N/A
Associated AIFL centres: Centre for Forensic Text Analysis (FTA)
License: Unsure


The present corpus consists of a little over 51 million posts, authored by more than 59 000 registered users of the forum. Most of the posts included are relatively short and highly interactive in style. The data collected spans over 16 years of the forum's history. Corpus size is just over 2 billions tokens

Data Donors


Information: This dataset contains highly sensitive material or data that come from a third party and have heavy constraints on access and use. This dataset is therefore stored not on the FoLD web server but on an air-gapped, offline computer in our secure data lab at the Aston Institute for Forensic Linguistics. Users who wish to access this dataset must make a detailed application to FoLD and the researcher, as well as potentially gain additional agreement from an external organisation before they can be approved for access.

Request Item