The number of trade agreements has increased significantly since the early 1990s. Trade agreements also cover more and more subjects, and the average treaty text is now about ten times longer than it was twenty-five years ago. This makes it increasingly difficult to analyse the content of trade agreements and their impact on international trade and welfare. Big data and text-based methods can help researchers, policymakers and other stakeholders cope with this growing complexity.

Modern computational methods, however, require machine-readable texts. While several databases provide the texts of preferential trade agreements (PTAs), they are usually optimized for reading rather than for computer analysis. As part of a year-long effort, this project used the WTO's RTA database to collect the texts and metadata of nearly 450 PTAs and transform them into a machine-readable format for article-, chapter- or treaty-level analysis of PTA texts.
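
As a rough illustration of what analysis at the article, chapter or treaty level could look like once texts are machine-readable, the following Python sketch measures text length at each of the three levels. The `Treaty`, `Chapter` and `Article` classes, the helper functions and the toy text are hypothetical and chosen only for this example; they do not describe the project's actual data format.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Tuple


# Hypothetical container types: a treaty holds chapters, a chapter holds articles.
@dataclass
class Article:
    number: str
    text: str


@dataclass
class Chapter:
    name: str
    articles: List[Article] = field(default_factory=list)


@dataclass
class Treaty:
    name: str
    chapters: List[Chapter] = field(default_factory=list)


def word_count(text: str) -> int:
    """Count whitespace-separated tokens (a deliberately crude length measure)."""
    return len(text.split())


def article_lengths(treaty: Treaty) -> Dict[Tuple[str, str], int]:
    """Article-level view: words per (chapter name, article number)."""
    return {
        (chapter.name, article.number): word_count(article.text)
        for chapter in treaty.chapters
        for article in chapter.articles
    }


def chapter_lengths(treaty: Treaty) -> Dict[str, int]:
    """Chapter-level view: total words per chapter."""
    return {
        chapter.name: sum(word_count(a.text) for a in chapter.articles)
        for chapter in treaty.chapters
    }


def treaty_length(treaty: Treaty) -> int:
    """Treaty-level view: total words in the whole agreement."""
    return sum(chapter_lengths(treaty).values())


# Toy agreement with invented text, used only to exercise the functions.
toy = Treaty(
    name="Toy Agreement",
    chapters=[
        Chapter(
            name="General Provisions",
            articles=[
                Article("1", "The Parties establish a free trade area."),
                Article("2", "Definitions apply throughout this Agreement."),
            ],
        ),
        Chapter(
            name="Trade in Goods",
            articles=[Article("3", "Each Party shall eliminate customs duties.")],
        ),
    ],
)

print(article_lengths(toy))
print(chapter_lengths(toy))
print(treaty_length(toy))
```

The same nested structure lets a length measure, keyword search or similarity score be computed once per article and then aggregated upward, which is the kind of multi-level analysis that reading-oriented formats make difficult.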