Fictionality
Contains LIWC feature tables for all ~27,000 documents used in this study, R and Python code used to generate statistical results, and all supporting tables
CR4Interact: Citizen Readers for Character Interactions Dataset
This data is drawn from the "Social Lives of Literary Characters Project" as part of the Citizen Readers for Citizen Science Initiative. Please cite: Andrew Piper, Michael Xu, Derek Ruths, "The Social Lives of Literary Characters: Combining citizen science and language models to understand narrative social networks" NLP4DH (2024)
Replication Data for "Biodiversity is not declining in fiction"
This repository provides data and code to support the replication of the paper "Biodiversity is not declining in fiction."
Towards a perspectival moral history of the novel using LLMs
Data for the article "Towards a perspectival moral history of the novel using LLMs" (JCLS, 2025)
Mini Worldlit: A dataset of contemporary fiction from 13 countries, 9 languages, and 5 continents
Metadata for the Mini Worldlit dataset
Replication data for "Cultural Capitals: Modeling 'Minor' European Literature"
Data and code to support replication of the above article
CR4-NarrEmote: An Open Vocabulary Dataset of Narrative Emotions Derived Using Citizen Science
"Citizen Readers for Narrative Emotions" (CR4-NarrEmote) is a large-scale, open-vocabulary dataset of narrative emotions derived through a citizen science initiative called "Reading Emotions." Over a four-month period, 3,738 volunteers contributed more than 200,000 emotion annotations across 43,000 passages from long-form fiction and non-fiction, spanning 150 years, twelve genres, and multiple Anglophone cultural contexts
Evaluating Large Language Models for Narrative Topic Labeling (NLP4DH 2025)
Human annotated data for evaluating LLMs on the task of topic labeling
