Following the above process, we generated the Englishversion of dataset based on a large corpus of 0.12m news articles (Horne et al. 2018). In this paper, we do not report any results from the English version of the dataset for brevity, yet we release the two datasets together on this github page 1 for the research community.
But I don't find it in this repo. May you open source this version corpus? Thx.๐น