Skip to content

sTechLab/AIRules

Repository files navigation

AI Rules Datasets

Datasets accompanying the paper "AI Rules? Characterizing Reddit Community Policies Towards AI-Generated Content" (CHI25).

Datasets Overview

  • Longitudinal Subreddit Set: Subreddit metadata and rules for english-language subreddits that were seen in both July 2023 and Novemeber 2024.
  • Broad Subreddit Set: Subreddit metadata and rules for english-language subreddits gathered in Novemeber 2024.
  • Rules Subreddit Set: Subreddit metadata, rules, and labels for english-language subreddits with non-empty rules gathered in Novemeber 2024.
  • AI Rules Subreddit Set: Subreddit metadata, rules, and labels for english-language subreddits with non-empty rules gathered in Novemeber 2024 that were identified as having a rule about AI.
  • AI Rules Set: Subreddit rules and labels for rules gathered in Novemeber 2024 that were identified as being about AI.

Using this data

You are free to use this data for research purposes. If you do, please include a citation to our paper:

Travis Lloyd, Jennah Gosciak, Tung Nguyen, and Mor Naaman. 2025. AI Rules? Characterizing Reddit Community Policies Towards AI-Generated Content. In Proceedings of the 2025 CHI Conference on Human Factors in Computing Systems (CHI '25). Association for Computing Machinery, New York, NY, USA, Article 9, 1–19. https://doi.org/10.1145/3706598.3713292

See the paper for additional details about how the paper was gathered and labeled. Feel free to reach out with any questions or issues with the data.

About

Dataset accompanying AI Rules? Characterizing Reddit Community Policies Towards AI-Generated Content

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published