Repository that holds continuously developing R and Python scripts based on the PAPEA paper

HTML 64%
Jupyter Notebook 33.3%
R 2%
Python 0.7%

Find a file

Sebastian Haunss 6988d17eb0 update to current version		2026-03-10 16:08:43 +01:00
data	update to current version	2026-03-10 16:08:43 +01:00
models	update to current version	2026-03-10 16:08:43 +01:00
python_scripts	update to current version	2026-03-10 16:08:43 +01:00
r scripts	update to current version	2026-03-10 16:08:43 +01:00
.gitignore	Initial commit	2026-03-10 15:58:47 +01:00
LICENSE	Initial commit	2026-03-10 15:58:47 +01:00
README.md	update to current version	2026-03-10 16:08:43 +01:00

README.md

PAPEA-Pipeline

This is the development repository for scripts based on the PAPEA paper. It contains current versions of the scripts used in the original PAPEA paper and additional scripts.

The Python and R scripts are located in the folders 'python_scripts' and 'r_scripts'

'python_scripts/1_papea_pipeline_python.ipynb' is a Jupyter notebook for the first part of the PAPEA pipeline
'python_scripts/Actor Identification.ipynb' is a Jupyter notebook actor identification in protest form sentences using LLMs
'python_scripts/counterprotest_prediction.ipynb' is a Jupyter notebook for prdicting whether a protest event is a counter protest
'r_scripts/2_papea_pipeline_R.Rmd' is a R markdown script for the second part of the PAPEA pipeline
'r_scripts/3_papea_evaluate_predictions.Rmd' is a R markdown script replicating the evaulation of the performance of the language models in Table 1b and Table 2
'r_scripts/4_papea_appendix.Rmd' is a R markdown script to replicate all computations in the appendix.