Bulk Data Mirror — complete, byte-exact copies of large federal databases, preserved on independent storage. 2,308 files, 26.7 GB total, refreshed monthly (last run 2026-07-02).
Every file below is an unmodified original as served by the source agency. Each dataset ships with a manifest.json recording the source URL, size, SHA-256 checksum, and retrieval time of every file, plus a SHA256SUMS file you can check with sha256sum -c. All content is U.S. Government work in the public domain.
Federal Register
Every Federal Register issue as full-text XML, 2000–present, packaged as monthly ZIPs. The daily journal of the U.S. government: rules, proposed rules, notices, and presidential documents.
Code of Federal Regulations
Annual XML editions of the complete CFR — all 50 titles of codified federal regulation, 1996–present.
Congressional Bills
Every version of every bill and resolution introduced in Congress, as XML, 113th Congress (2013) to present.
Public & Private Laws
Enacted public and private laws as XML, 113th Congress (2013) to present.
U.S. Statutes at Large
The permanent bound record of every law ever enacted by Congress — complete volumes from 1789 to present.
EPA Toxics Release Inventory
TRI Basic Data Files — facility-level toxic chemical releases, one CSV per year for every year since reporting began in 1987.
EPA EJScreen / EJAM
The environmental justice screening datasets EPA removed from its website in February 2025 — block-group environmental and demographic indicators, preserved from the community-maintained mirror.