In the folder data, there are the dataset, train.csv
and test.csv
, that include data from both original and synthetic (original_test.csv
and synthetic_test.csv
)
An exploratory data analysis in greek text appear in eda.ipynb
and the rule based models in RuleBased.ipynb
that represent as pseudo code in the htrec_pseudocode.pdf
.
If you find our work useful to your research, please cite this work as:
@inproceedings{pavlopoulos-2023,
title = "Error Correcting HTR’ed Byzantine Text",
author = "John Pavlopoulos, Vasiliki Kougia, Paraskevi Platanou, Stepan
Shabalin, Konstantina Liagkou, Emmanouil Papadatos, Holger Essler,
Jean-Baptiste Camps, and Franz Fischer"
year = 2023
}