US murder trial transcript
The dataset includes PDF scans and DOCX versions of transcripts from 5 US murder cases from the early 2000s. The documents are substantial, ranging from 98 to 342 pages of text. An Excel file with metadata for all documents is also included in the dataset.
Data types: Spoken - transcript
Associated AIFL centres: None
Licence: Non-Commercial Government Licence for public sector information
This dataset includes the entire transcripts of 5 murder trials from the early 2000s. The documents are shared in their raw form, as scanned PDFs and MS Word transcriptions, with the intention of making them available to researchers. However, the dataset still requires transcription and anonymisation of names and dates. Any researcher interested in accessing these data for a project must also agree to (1) anonymise any personal names and information in the documents and (2) share with FoLD the anonymised transcriptions they produce.