Skip to main content
Shaping Europe’s digital future
News article | Publication

Commission publishes open-source software package to facilitate analysis of data in the Digital Services Act Transparency Database

The new software package will provide insight into the world’s largest near real-time dataset on content moderation decisions.

The Commission has published a new open-source software package to help streamline the analysis of the data in the Digital Services Act (DSA) Transparency Database.

The so called dsa-tdb python package was developed by the Commission and is hosted on code.europa.eu, the code development platform for open-source software projects of the European Union. It can now also be accessed via a new dedicated page on the DSA Transparency Database website.

The DSA Transparency Database tracks anonymous content moderation decisions taken by online platforms in almost real-time by recording a standardised set of information, called statement of reasons, for each moderation action submitted by an online platform. Operational since September 2023, the database now contains more than 22 billion entries.

To support the data analysis of this large database at scale, the package enables users to carry out a number of data pre-processing and data aggregation tasks in a computationally efficient manner. It also allows users to create their own visualisations based on the data they are interested in.

This release is part of the Commission’s continuous effort to develop the database infrastructure and analytical tooling in exchange with and based on feedback from its emerging research community. By expanding and enhancing the analytical capabilities of the database, the Commission aims to empower the DSA’s academic and civil society stakeholders with the tools to contribute to the DSA’s enforcement as efficiently as possible.

Read more about the Digital Services Act (DSA) package.