Bulk Data Mirror — complete, byte-exact copies of large federal databases, preserved on independent storage. 2,308 files, 26.7 GB total, refreshed monthly (last run 2026-07-02).

Every file below is an unmodified original as served by the source agency. Each dataset ships with a manifest.json recording the source URL, size, SHA-256 checksum, and retrieval time of every file, plus a SHA256SUMS file you can check with sha256sum -c. All content is U.S. Government work in the public domain.

Federal Register

GPO / GovInfo2000 – present319 files · 3.55 GB

Every Federal Register issue as full-text XML, 2000–present, packaged as monthly ZIPs. The daily journal of the U.S. government: rules, proposed rules, notices, and presidential documents.

Code of Federal Regulations

GPO / GovInfo1996 – present1,442 files · 9.01 GB

Annual XML editions of the complete CFR — all 50 titles of codified federal regulation, 1996–present.

Congressional Bills

GPO / GovInfo2013 – present192 files · 1.06 GB

Every version of every bill and resolution introduced in Congress, as XML, 113th Congress (2013) to present.

Public & Private Laws

GPO / GovInfo2013 – present10 files · 47.5 MB

Enacted public and private laws as XML, 113th Congress (2013) to present.

U.S. Statutes at Large

GPO / GovInfo1789 – present241 files · 4.32 GB

The permanent bound record of every law ever enacted by Congress — complete volumes from 1789 to present.

EPA Toxics Release Inventory

EPA1987 – 202438 files · 2.49 GB

TRI Basic Data Files — facility-level toxic chemical releases, one CSV per year for every year since reporting began in 1987.

EPA EJScreen / EJAM

EPA (via Public Environmental Data Partners)All published releases66 files · 6.19 GB

The environmental justice screening datasets EPA removed from its website in February 2025 — block-group environmental and demographic indicators, preserved from the community-maintained mirror.