Repository that holds continuously developing R and Python scripts based on the PAPEA paper
  • HTML 64%
  • Jupyter Notebook 33.3%
  • R 2%
  • Python 0.7%
Find a file
2026-03-10 16:08:43 +01:00
data update to current version 2026-03-10 16:08:43 +01:00
models update to current version 2026-03-10 16:08:43 +01:00
python_scripts update to current version 2026-03-10 16:08:43 +01:00
r scripts update to current version 2026-03-10 16:08:43 +01:00
.gitignore Initial commit 2026-03-10 15:58:47 +01:00
LICENSE Initial commit 2026-03-10 15:58:47 +01:00
README.md update to current version 2026-03-10 16:08:43 +01:00

PAPEA-Pipeline

This is the development repository for scripts based on the PAPEA paper. It contains current versions of the scripts used in the original PAPEA paper and additional scripts.

The Python and R scripts are located in the folders 'python_scripts' and 'r_scripts'

  • 'python_scripts/1_papea_pipeline_python.ipynb' is a Jupyter notebook for the first part of the PAPEA pipeline
  • 'python_scripts/Actor Identification.ipynb' is a Jupyter notebook actor identification in protest form sentences using LLMs
  • 'python_scripts/counterprotest_prediction.ipynb' is a Jupyter notebook for prdicting whether a protest event is a counter protest
  • 'r_scripts/2_papea_pipeline_R.Rmd' is a R markdown script for the second part of the PAPEA pipeline
  • 'r_scripts/3_papea_evaluate_predictions.Rmd' is a R markdown script replicating the evaulation of the performance of the language models in Table 1b and Table 2
  • 'r_scripts/4_papea_appendix.Rmd' is a R markdown script to replicate all computations in the appendix.