DraCor (Drama Corpora) is an open digital infrastructure developed for the computational study of European drama from Greco-Roman antiquity to the 20th century. The platform hosts plays encoded in the TEI format across various languages, supporting comparative and computational methods in drama studies. As of 2025, the collection comprised over 4,000 texts in more than 20 languages. Data provided by DraCor has seen widespread use in digital humanities research. The project received the Rahtz Prize for TEI Ingenuity by the TEI Consortium in 2022.
DraCor aims to create reliable, expandable, and interoperable corpora of dramatic literature. The project emphasises the concept of Programmable Corpora, where the data is not only accessible but also designed for computational analysis through APIs and integration with other tools. The platform strives to adhere to FAIR data principles (Findability, Accessibility, Interoperability, Reusability).
DraCor's collection of corpora is continuously growing and covers plays in Dutch, English, French, German, Ancient Greek, Hungarian, Italian, Latin, Polish, Russian, Spanish, Swedish, Ukrainian, and other languages. Each corpus is curated by individual scholars or teams and provides rich metadata alongside TEI-encoded texts, supporting analyses of dramatic structures, character interactions, and related topics.
The DraCor platform includes basic visualisation tools, particularly for network analysis. It also supports programmatic access to the corpora, enabling integration into computational research workflows. This facilitates various types of analyses, including:
DraCor is developed through collaboration among researchers from multiple institutions. The DraCor platform is jointly run at the Freie Universität Berlin and the University of Potsdam. As an open-source project, it actively encourages community contributions and feedback. The DraCor community presented its corpora and associated research projects at the DraCor Summit, a five-day event in Berlin in September 2025.