Database Replication for Organizations and Researchers
Overview
The CourtListener Database Replication service is for researchers and organizations that need high-speed, granular access to the CourtListener database on an ongoing basis.
This service works by using PostgreSQL to create and maintain a table-based logical replica of our database allocated specifically for an individual or organization.
This creates dedicated hardware for your team that is updated in real time and that allows unlimited data access at any speed and scale you may need.
This provides an incredibly powerful way to access one of the largest open collections of American legal data.
The following types of data are currently available as part of this service:
- Case Law
- RECAP Archive of PACER data
- The FJC Integrated Database civil data
- Oral Arguments
- Financial Disclosures
- Judge and Appointer data
User data will never be shared as part of this service.
If you are replicating case law data, note that opinion clusters may be merged, deduplicated, or sealed over time. See the Opinion Cluster Redirections help page for details on how to handle these cases in your replica.
If you are interested in this service, please get in touch to learn more. We will be happy to discuss the technical details.
How Logical Replication Works
Logical replication is a system supported by recent versions of PostgreSQL. Unlike older methods of database replication, which worked by shipping the database binary files across the network (so-called physical replication), logical replication works by streaming SQL commands from a "publisher" server to a "subscriber" server.
This form of replication provides a number of benefits over physical replication including the ability to have slightly different versions of the database installed, and the ability to only replicate certain tables from the publisher to the subscriber.
Our Process
When we engage with you to set up logical replication, we will set up a new server for you in our cloud, and we will provide access to you from a specific IP address in your network.
Once it's set up, your server will always be in sync and we can work together to scale the server however you need.
In general, once a contract is signed, you'll be connected within a few days.
More Questions?
If you have more questions, please reach out. This service is a complicated one and we look forward to working with organizations and researchers to make it a success.
Get In Touch [FAQs][faqs]{button}