Here you can find the datasets that have been officially published online so far. If you have any additional questions, please contact us.
EXCEPTIUS main dataset v.1.0
You can find the data on Dataverse.nl.
The current codebook for the 1st wave EXCEPTIUS project dataset (v.1.0) tracks all exceptional measures in the field of democratic governance/ human rights and international cooperation issued by national authorities in European Union (EU) countries (including UK and Switzerland, excluding Iceland) in order to tackle the COVID-19 pandemic from January to June 2020. The dataset also provides data on the implementation modalities of such measures. The data collection started in September 2020; since then, the codebook was revised twice due to the increasing complexity and granularity of the measures that countries had been implementing. The current version of the codebook is v.3.2. Note that new categories were added to several existing variables (subtypevents) over time; all of them are included in the current version. Numerous new variables and categories require back-coding, the process of which is ongoing. (2022-01-27)
Corpus dataset v.1.0
EXCEPTIUS Corpus v1.0 contains the following data:
- raw documents for 21 countries at national level
- pre-processed data with spacy-udpipe v1.0
- automatically annotated documents for the identification of exceptional measures at sentence level
Country list (ISO 3166-1 alpha-2): AT, BE, HR, CY, CZ, DK, FR, DE, HU, IE, IT, LV, LT, NL, NO, PL, SI, SE, CH, UK
Folder structure: each country has a dedicated folder. Inside each folder you will find the following subfolders:
- raw_text: the raw text data (.txt format)
- processed: the output of the spacy-udpipe v1.0 – each line is a sentence, containing the following info: tokens, lemma, POS, UD dependency relations
- model: the predictions of the trained model (XML pre@36 as reported in Table 4 of the paper). Each line is a sentence, separate by 9 tab – each for a exceptional measure class. 1: signals presence of a class.
You can find the data on Dataverse.nl.