GitPedia

Arctic shift

Making Reddit data accessible to researchers, moderators and everyone else. Interact with the data through large dumps, an API or web interface.

From ArthurHeitmann · Updated June 12, 2026

The project is written primarily in TypeScript and was first published in 2023. It has 1,120 stars and 70 forks on GitHub. Key topics: archived, pushshift, reddit.

Latest release: 2026_05 (May 2026) · June 11, 2026

Project Arctic Shift

Downloads

All download links are organized here. Once a new dump is available, it will
also be added on the releases page.

Alternatively, to download data for individual users or smaller subreddits, you can use this tool.

For information on how the data was collected and modified, see here.

API

Depending on your use case, you can try my (limited) API. For manual queries, you can use this tool.

Usage

First, download one or more dumps from the links above.

Generally, I'd recommend working with the compressed files instead of unpacking them, unless of
course you have seemingly infinite disk space.
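
To illustrate what working on the compressed data can look like, here is a minimal sketch that streams a .zst dump line by line without unpacking it to disk. It assumes the file holds newline-delimited JSON and relies on the zstandard package (pip install zstandard). The function names iter_json_lines and iter_zst_json_lines are hypothetical and not part of this repository, and the 2 GiB window size is an assumption about how the dumps were compressed.

```python
import io
import json
from typing import BinaryIO, Iterator


def iter_json_lines(stream: BinaryIO) -> Iterator[dict]:
    """Yield one parsed JSON object per non-empty line of a binary stream."""
    text = io.TextIOWrapper(stream, encoding="utf-8", errors="replace")
    for line in text:
        line = line.strip()
        if line:
            yield json.loads(line)


def iter_zst_json_lines(path: str) -> Iterator[dict]:
    """Stream-decompress a .zst file of newline-delimited JSON (hypothetical helper)."""
    # Imported here so iter_json_lines stays usable without the dependency.
    import zstandard

    with open(path, "rb") as fh:
        # Assumption: the dump was compressed with a long window, so the
        # decompressor is allowed a window of up to 2 GiB.
        dctx = zstandard.ZstdDecompressor(max_window_size=2**31)
        reader = dctx.stream_reader(fh)
        yield from iter_json_lines(reader)
```

Because the generator yields one object at a time, memory use stays roughly constant regardless of the dump size, apart from the decompression window.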

With the helper scripts in this repository you can quickly get started.

(You need at least Python 3.10)

  1. Clone this repository and its submodules:

     ```bash
     git clone --recursive https://github.com/ArthurHeitmann/arctic_shift.git
     cd arctic_shift
     ```

  2. Install the zstandard library:

     ```bash
     pip install zstandard
     ```

  3. Open scripts/processFiles.py in your editor. That script can process .zst, .zst_blocks, .jsonl/ndjson and .json files.

  4. Enter the path to a file or folder in fileOrFolderPath (since it is a raw string, you don't have to escape
    backslashes). If you enter a folder, all files in that folder will be processed.

  5. Add your code to the processFile function.

  6. Run the file and be (very) patient.
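
What goes into processFile depends entirely on your use case. As a rough sketch of the kind of per-row logic you might add there, the following counts entries per subreddit. It assumes each row is a dict parsed from one JSON line and that rows carry Reddit's "subreddit" field. The helper count_by_subreddit is hypothetical and not part of the repository; how rows reach your code is defined by the script itself.

```python
from collections import Counter
from typing import Iterable, Mapping


def count_by_subreddit(rows: Iterable[Mapping]) -> Counter:
    """Count rows per subreddit (hypothetical helper, for illustration only)."""
    counts: Counter = Counter()
    for row in rows:
        # Rows without a "subreddit" field are grouped under a placeholder key.
        counts[row.get("subreddit", "<unknown>")] += 1
    return counts


if __name__ == "__main__":
    # Made-up sample rows standing in for objects read from a dump.
    sample = [
        {"id": "a", "subreddit": "x"},
        {"id": "b", "subreddit": "x"},
        {"id": "c", "subreddit": "y"},
    ]
    print(count_by_subreddit(sample).most_common(2))
```

Accepting any iterable keeps the counting logic separate from how the file is read, so the same function works on a small test list and on a streamed dump.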

Contact & Removal requests

Removal requests and generic support requests can be submitted here.
To check if your data is in the dataset, search for your username here.

Removal forms of other archives: Pushshift | PullPush | potentially archive.org.

If you have questions, you can DM me on Discord (raiderbv if the link doesn't work) or email me.
Alternatively, open an issue or pull request.

This article is auto-generated from ArthurHeitmann/arctic_shift via the GitHub API. Last fetched: 6/13/2026