Data on Patent Law: Sources and Uses Explained

By Adam J. Feldman February 1, 2024

Two professionals look at a tablet in a hallway; man points at screen, woman smiles, another person walks behind them.

Sometimes the most useful litigation tools are ones you assemble on your own – that way you can tailor them to your needs, and occasionally they are even free.  Here is an example of resources for the federal circuit, PTAB, and other trial level patent litigation. These resources can give you a sense of judicial behavior which can help generate expectations for case outcomes and timelines.  The three resources that I will quickly run through in this post are the the Compendium of Federal Circuit Decisions compiled by the University of Iowa Law School (what I will call the “Iowa Database”), the USPTO’s datasets and case resources , and CourtListener’s RECAP Archive. Each of these resources is free, and each can significantly assist you in developing a patent rights strategy.

Federal Circuit Database

This Iowa Database is comprehensive of Federal Circuit decisions since 2004 and has multiple pieces of information for each case.  The Database contains 19,761 cases and is consistently updated.  The types of information that one can derive from this dataset are invaluable. Anything from the likelihood of a granted en-banc (136 granted and 13,237 denied for a rate of approximately 1%) to the number of appeals adjudicated from PTAB (1,777) is readily available.  

Since the Iowa Database contains information on all decisions from the Federal Circuit, some sorting is required to isolate particular types of appeals like those relating to patents.  If you have a software application that can easily create crosstabs like Tableau (my favorite) you can organize and synthesize the information to derive useful outputs. The 19,761 records, for instance, can be sorted by dispute type. Although many of the records records relate to orders that aren’t connected to several of the case outcome variables, among cases that are labeled by type, 3,270 deal with patent infringements, 1,059 deal with inter partes review, 517 deal with contract claims, etc.

The patent cases are coded for whether they relate to code sections 102 or 103 along with other issues like claim construction and definiteness.  Once a specific area is nailed down, let’s say patent infringement, then more specific analyses can be performed.  If we isolate the cases from 2015 forward for example, we can see which judges have been the most frequent majority authors (Stoll with 99, then Prost with 93, and Lourie with 87). We can also look to see who authored dissents most frequently (Newman with 31 and Reyna with12). Or perhaps we want to know about the most frequent lower courts (District Court for the District of Delaware with 246, District Court for the Eastern District of Texas with 163, and the District Court for the Northern District of California with 158). Maybe we even want to know who is or was most likely to dissent from an opinion authored by Judge Stoll. In such instances, Judges Dyk, Hughes, Lurie, and Newman each dissented twice.

USPTO Website

The USPTO also has a treasure trove of free resources for the legal data enthusiast. Some of the information is quite helpful for legal practitioners moving forward while other data are mostly historic. Even the backwards looking data though can aid with current decisions to the extent that they are based on litigation before active judges.

The historic element is quite fascinating. While unfortunately only updated through 2016, the Patent Litigation Docket Reports have case level information from 81,350 district court cases filed between 1963 and 2016.  A few nice feature of the Docket Reports is that they track litigation timing and this can be parsed by on other variables like the judge or court of interest.  There are also multiple datasets that correlate to one another so you can look at observations based on the attorneys on the cases, patents, case names, or documents. 

We might, for instance, be interested in the magistrate judges who these cases were referred to in order to gauge how long proceedings end up taking in their courts. Here is an output of magistrate judges with over 200 proceedings in this dataset.

Judge Roy S. Payne for the Eastern District of Texas has the lion’s share of these cases with all other judges only deciding a fraction of Judge Payne’s count. Let’s say we are interested in the time it takes these judges to move from an opened to a closed case, we can use the time parameters in the dataset to run this calculation for each individual case, and then generate averages by judge.  Here are what the averages look like for these judges.

Judge Payne cleared his cases the quickest of the group at just under 250 days while, at the other end of the spectrum, Judge Trumbull of the District Court for the Northern District of California averaged over 535 days per case.

There are also other datasets available on the USPTO site as well including the Patent Examination Research Dataset (PatEx) which covers “13 million publicly-viewable provisional and non-provisional patent applications to the USPTO and over 1 million Patent Cooperation Treaty (PCT) applications.” 

CourtListener’s RECAP Archive

The RECAP Archive is a freely accessible tool that compiles PACER records.  It is an extremely useful resource and was used to derive some of the datapoints for the USPTO measures.

RECAP is generally more of a qualitative data source that can be used to put together quantitative statistics. One of the nice parts of RECAP though is that you can dive into case dockets and in some instances you can view documents filed in cases. 

One of the nice features of the RECAP archive is that you can filter by PACER codes, so, if for instance you were interested in patent cases, you could plug in nature of suit code 830 and find that since the beginning of 2015 there are 28,257 cases that fit under this code and 1,867,132 docket entries. If you were interested in the cases referred to Judge Roy S. Payne in the Eastern District of Texas you could refine your search by judge and find there are 2,611 relevant cases since the beginning of 2015.

A nice feature of RECAP that was presumably used in the creation of the USPTO dataset is the RECAP metadata that correlates with the variables in the USPTO site. These variables include the judge assigned to and referred to the case, the citation, date filed and terminated, date of last known filing, cause of action and nature of suit, jury demand, and jurisdiction type. There are also data on the parties and attorneys where available through PACER.

The upside to these data is that they allow for updating beyond the numbers currently available from the USPTO dataset which only run through 2016 and provide additional information not provided in the dataset. The downside though is that it takes either scraping and parsing skills to put it into a useable format or taking the time to input the data manually. If you have specific information you are trying to assemble rather than raw general data though, this is a good place to begin.

Concluding Thoughts

Legal data help with generating predictions, following trends, and understanding changes in the legal landscape.  The data described in this article are all readily available and relatively easy to use and navigate. These are great starting points for research and comparisons and provide context to those interested in specific cases. Another big upside is that these resources are free.

While the resources I described generally relate to patent law, this is just an example of the legal data that are freely available on the web. There are many other resources for other areas. If you already understand the value of data, then the raw data available to put together novel datasets abound. Furthermore, there are experts in legal data analysis that can help you develop the skills to make use of these resources and to ascertain answers and solutions to complex legal questions that are not answerable through doctrine alone. For claimholders, litigators, litigation funders, and insurers, such data provide the additional benefit of oftentimes lending themselves to probabilistic determinations that can help individuals forecast potential outcomes and generate likelihood intervals that relate to the probability that certain outcomes will come to fruition.

Adam Feldman  is the editor of  Empirical SCOTUS , a blog that conducts data analysis of the United States Supreme Court, and the Principal of Optimized Legal, a legal data/statistical consultancy. He is also an adjunct professor of political science and public law at California State University, Northridge. You can reach  Adam  for specific data and analyses related to your own litigation questions in this and other areas.

Certum Group Can Help

Get in touch to start discussing options.

Subscribe to Our Newsletter

Newsletter

Recent Content

By Certum Team May 19, 2026
MLex, a LexisNexis publication covering global regulatory intelligence, recently interviewed and quoted Certum Group’s William Marra in an article examining the U.S. International Trade Commission’s proposed rule that would require disclosure of third-party litigation funding in Section 337 patent investigations. The proposed rule, published in the Federal Register on April 30, 2026, would require parties and intervenors in Section 337 investigations to disclose certain entities that provide funding or hold approval rights over litigation or settlement decisions. The ITC stated that the proposal is intended to identify conflicts of interest, clarify whose rights are at issue, and promote settlement and transparency. Comments are due June 29, 2026. Marra expressed concerns about the asymmetrical nature of the proposed disclosure requirements. While the rule would reach third-party litigation funding, it would not require disclosure of personal loans, bank loans, insurance funding, or contingent fee arrangements. “If you want to have a rule requiring the disclosure of third-party finance… it is more appropriate to have a rule requiring the disclosure of any and all forms of third-party finance,” Marra told MLex, including contingency-fee arrangements. Marra argued that selectively targeting only certain forms of funding creates an uneven playing field. “To the extent that you have disclosure rules that are targeted only at specific forms of third-party funding and not others, you are going to give certain parties a strategic advantage or disadvantage,” he said. “We have nothing to hide. We don’t want to give the other side of litigation a strategic advantage.”  Marra also highlighted the outsized burden that overly broad disclosure requirements can impose on smaller parties. “TPLF disclosure tends to impose a burden disproportionately on small- and medium-sized enterprises,” he said, drawing on arguments he made in a recent co-authored article in the Southern California Law Review . The full MLex article is available here .
By W. Tyler Perry May 14, 2026
We tend to view regulation and litigation as wholly separate enterprises. But federal regulatory agencies have always operated alongside private civil litigation, with each supplying functions the other cannot. Agencies set prospective standards and monitor compliance at scale. Litigation responds to concrete harm, remedying often unanticipated—or minimized—risks. Prior posts in this series traced the procedural mechanics of mass aggregation —from the equitable origins of representative litigation through Rule 23 to the modern MDL—and explained why those mechanisms exist as a structural response to the access failures of bilateral litigation . This post addresses a related but distinct question: Why private enforcement matters not just as a substitute for bilateral litigation, but as a necessary complement to public regulation. This symbiotic dynamic has held for decades, and an examination of that history underscores the importance of mass tort litigation as a regulatory backstop. The Structural Limits of Administrative Oversight The relationship between regulatory agencies and private litigation is complementary rather than redundant. Even at full capacity, administrative agencies face structural constraints that limit their effectiveness as enforcement mechanisms. The resource gap is the most straightforward. Regulated industries consistently outspend the agencies that oversee them. The pharmaceutical industry employs scientists, lawyers, and regulatory specialists whose collective depth of knowledge exceeds what any federal agency can match across its full portfolio of regulated products. An agency charged with monitoring thousands of products and reviewing hundreds of new applications annually necessarily operates with inherent informational disadvantages relative to the firms it oversees. The capture problem is more subtle but no less significant. Regulatory agencies are staffed, in significant part, by individuals who move between government service and the industries they regulate . This is not an indictment of those individuals—it reflects the reality that domain expertise concentrates in the private sector. But it nonetheless creates structural pressures that shape enforcement priorities in ways that do not always align cleanly with public interests. The latency problem is perhaps the most consequential. Pre-market approval is a snapshot, not an ongoing guarantee. An agency that approves a pharmaceutical compound based on clinical trial data cannot know what population-scale, long-term use will reveal. Post-market surveillance is resource-intensive and chronically underfunded . Harms that emerge years or decades after initial regulatory clearance may never trigger administrative enforcement action. These are not new problems. They have characterized the administrative state for decades, and they are precisely why private litigation has long served as a necessary counterpart to administrative enforcement. The Opioid Crisis: What Happens When Regulation Falls Short The opioid epidemic illustrates—at enormous human cost—what happens when regulatory oversight fails to keep pace with private-sector harm, and what private enforcement can accomplish when it fills the gap. The FDA approved OxyContin in 1995 based on clinical data that did not capture the addiction potential of mass-market, long-duration prescribing. Regulators, empowered to act against manufacturers and distributors flooding suspicious channels, were slow to exercise that authority at scale. State medical boards, operating in an environment shaped by industry-funded campaigns redefining pain management standards, did not flag prescribing patterns that, in hindsight, were plainly problematic. By the time the regulatory apparatus mobilized a meaningful response, hundreds of thousands of Americans had died. The tens of billions of dollars in settlements and judgments that followed came not through administrative action but through litigation— state attorneys general, municipalities, and private plaintiffs coordinated in MDL proceedings—that forced production of internal documents demonstrating what manufacturers and distributors knew and when they knew it. That information entered the public record through discovery. It informed subsequent regulatory responses, shaped public health policy, and produced one of the largest coordinated public health settlements in American history. PFAS and the Limits of Pre-Market Review Per- and polyfluoroalkyl substances—PFAS, or “forever chemicals”—illustrate a different dimension of the same structural problem. Manufacturers possessed internal research suggesting health risks associated with certain PFAS compounds for decades before that information became public. The EPA, constrained by the evidentiary standards of the Toxic Substances Control Act and facing significant industry opposition, did not set enforceable drinking water limits for the most common PFAS compounds until 2024 —roughly seventy years after their widespread industrial introduction. Private litigation, brought by communities near manufacturing facilities, military bases, and industrial sites, has produced more actionable information about PFAS health effects than decades of administrative process. Discovery in PFAS proceedings has surfaced internal documents , epidemiological data, and risk assessments that were never voluntarily disclosed. Those materials have informed subsequent regulatory action and generated the factual record on which ongoing public health policy depends. This is the information function of private litigation operating precisely as it should: Reaching into corporate decision-making in ways that administrative oversight either cannot compel or has not yet prioritized. Social Media and the Enforcement Frontier The current mass tort litigation against social media platforms for harms to adolescent mental health illustrates how private enforcement operates at the frontier of regulatory capacity. Congress has repeatedly attempted and failed to pass legislation governing platform design, algorithmic amplification, and the targeting of minors. The FTC’s authority is potentially applicable but has not been deployed at scale. The regulatory frameworks needed to establish clear standards remain, years into public awareness of the problem, largely unbuilt. Into that gap have stepped coordinated proceedings in federal MDL and state courts, alleging that platform features were designed with internal knowledge of their addictive potential and their disproportionate effects on adolescent development. Whatever the ultimate resolution of those cases, the litigation has already begun forcing into the public record information about internal product decisions and user research that no regulatory proceeding has yet reached. In March 2026, a California jury found Meta and YouTube liable for negligent platform design, rejecting both Section 230 and First Amendment defenses—the first bellwether verdict to hold platforms accountable for design-based harms to adolescents. Private enforcement is not a substitute for thoughtful legislation. But it is filling the gap that legislation has not occupied. The social media cases are, it should be noted, the most legally contested example in this series. Unlike pharmaceutical or chemical exposure litigation, platform liability claims must navigate Section 230’s broad immunity provisions and First Amendment questions that the opioid and PFAS cases did not present. The ultimate merits of these cases may differ from the prior examples. But even litigation that does not ultimately succeed forces into the public record information that regulatory silence cannot reach—and that distinction matters regardless of outcome. The Practical Consequence of a Smaller Administrative Footprint The structural argument for private enforcement as a complement to regulation is well-established. What fluctuations in agency capacity add is urgency.  Regulation and private litigation each supply what the other cannot. Regulation operates ex ante , setting prospective standards based on information available at approval. Litigation operates ex post , responding to harm that has materialized with discovery tools that can reach information never voluntarily shared. Regulation generalizes across industries; litigation develops facts specific to individual defendants and affected populations. Where these functions operate in tandem, the enforcement system is more complete. Where one contracts, the other must bear more weight. When agency enforcement capacity declines—whether through budget reductions, staff attrition, or shifts in enforcement priorities—the civil justice system is not simply one option among several. For many categories of diffuse harm, it becomes the only remaining mechanism capable of generating accountability. Companies that externalize costs onto the public face reduced administrative scrutiny. The deterrence effect of potential enforcement weakens. The information that litigation forces into the public record, and that regulators themselves have often relied upon, is no longer generated. One need not have a settled view on the optimal scope of the administrative state to recognize this dynamic. The practical question is not whether federal agencies should be larger or smaller. It is whether, given the enforcement landscape that actually exists, the civil justice system is equipped to do the work that system requires. Conclusion The debate over federal regulatory scope will continue, as it should. Reasonable people hold genuine disagreements about the appropriate role of administrative agencies, and those disagreements deserve serious engagement. But the institutions available to enforce safety norms and produce corporate accountability do not wait for that debate to resolve. When the administrative footprint contracts, courts and private litigation occupy the space. Mass tort aggregation, as this series has argued from the beginning, is not a procedural anomaly or an artifact of plaintiff-side opportunism. It is a structural feature of how diffuse harm gets addressed in a system where regulation has never been sufficient on its own. That function does not become less important when regulatory capacity declines. It becomes more so. Oliver Wendell Holmes once observed that “[t]he life of the law has not been logic: it has been experience.” The Common Law 1 (1881). The experience of the opioid epidemic, the decades of PFAS contamination, and the accumulating evidence of adolescent harm from platform design all point to the same structural lesson: Regulation and private enforcement are not competitors in an institutional zero-sum game. They are partners in an enforcement system that neither can sustain alone. The debate about their proper balance will continue. But dismissing private enforcement as mere opportunism ignores what experience has consistently shown: When private enforcement is absent, no one else fills the gap.
By Ross Weiner May 5, 2026
Class action litigators who practice in the BIPA space received clarity in April 2026 following the Seventh Circuit Court of Appeals’ decision in Clay v. Union Pacific Railroad Co. (“Clay”).[1] In a concise 17-page opinion, the court held that the Illinois General Assembly’s 2024 BIPA amendments, which established that BIPA damages should be evaluated on a per-person basis, should be applied retroactively to cases pending at the time of enactment. This decision is a setback for plaintiffs’ counsel who had invested heavily—in time and resources—in BIPA litigation as the next major vehicle for class action recovery. An overview of how we got here is below followed by a summary of the decision. History of BIPA In 2008, Illinois enacted the Biometric Information Privacy Act to respond to the “increasing use of biometric data in commerce.”[2] BIPA was intended to give individuals the right to control their biometric identifiers and information while providing a right of action and meaningful damages against entities that mishandled them. But one question quickly came to the fore: was a new claim accruing each and every time an employer collected the same information from the same employee? As one defendant argued, such a per-scan theory of claim accrual would create “potentially crippling financial liability” for employers who violate BIPA by “repeatedly collecting the same information in the same way.”[3] Recognizing the question’s importance, the Seventh Circuit, in Cothron v. White Castle System, Inc., certified the question of claim accrual to the Supreme Court of Illinois. During briefing, the defendant invoked Section 20—which sets the damages a plaintiff can recover “for each violation”—to dissuade the court from adopting its per-scan reading of Section 15, citing potentially astronomical awards. In a 2023 decision, the Illinois Supreme Court sided with the plaintiffs and held that pursuant to Section 15, claims accrue “with every scan or transmission” of biometric information.[4] The Illinois Supreme Court acknowledged the prospect of “potentially excessive damage awards,” but noted that concern is “best addressed by the legislature.”[5] Accordingly, the court concluded its opinion by “respectfully suggest[ing] that the legislature review these policy concerns and make clear its intent regarding the assessment of damages under the Act.”[6] The Illinois General Assembly Acts Less than a year and a half after Cothron, the Illinois General Assembly heeded the court’s call and passed an amendment that added two clauses to Section 20. The first provided that any entity that collects biometric information “in more than one instance… from the same person using the same method of collection in violation of subsection (b) of Section 15 has committed a single violation…for which the aggrieved person is entitled to, at most, one recovery under this Section.[7] The second added the same operative language for violations of Section 15(d).[8] Going forward, it was now clear that only “one recovery” was available per person (regardless of how many scans there were), transforming potentially excessive damages into more modest ones. But the legislature left one question open: should the amendments apply retroactively to cases already in progress? The Clay Decision According to the Seventh Circuit, Illinois courts have a simple decision tree when it comes to assessing retroactivity. First, did the legislation expressly indicate the temporal reach of the amendment? If yes, case closed. If not, then the court must assess whether the amendment in question constituted a substantive or procedural change to the law. Under Illinois law, a substantive amendment “prescribes the rights, duties, and obligations of persons to one another as to their conduct or property and … determines when a cause of action for damages or other relief has arisen.”[9] Conversely, a procedural amendment involves the “rules that prescribe the steps for having a right or duty judicially enforced, as opposed to the law that defines the specific rights or duties themselves.”[10] While the Clay court acknowledged that the distinction between the two can, in many different contexts, “be unclear,”[11] the court had no trouble deciding the case at bar for one simple reason: the “amendment to BIPA Section 20 is a remedial change,”[12] and “the Supreme Court of Illinois treats remedial changes as procedural, not substantive.”[13] Two features of the amendments were critical: First, the legislature located the amendments in Section 20, which governs liquidated damages, rather than Section 15, which sets the substantive standards for liability under the Act. Second, the amendments’ plain language “focuses on remedies,”[14] indicating that an “aggrieved person is entitled to, at most, one recovery under this Section.”[15] The court’s analysis was straightforward. For those BIPA litigants involved in currently pending cases, the litigation terrain just got bumpier for plaintiffs and more favorable for defendants. Plaintiffs’ settlement leverage in these cases has been significantly reduced. Nevertheless, with enough putative class members, BIPA cases could still be worth bringing, even if they are no longer as valuable. We will continue to monitor the ramifications of this decision. Notes: [1] No. 25-2185 (7th Cir. Apr. 1, 2026). [2] Id. at 3. [3] Id. [4] Cothron v. White Castle System, Inc., 216 N.E.3d at 921 (Ill. 2023). [5] Id. at 929. [6] Id. [7] 740 ILCS 14/20(b). [8] Id. at 14/20(c). [9] Perry v. Dept. of Fin. & Prof. Regulation, 106 N.E.3d 1016, 1034 (Ill. 2018). [10] Id. [11] Clay at 8. [12] Id. at 9. [13] Id. at 8. [14] Id. at 10. [15] 740 ILCS 14/20(b), (c) (emphasis added).