kuroginqin / people_crawler
A simple crawler that collects news text from People.cn (www.people.com.cn). It comes with an example Chinese news corpus, two sets of Chinese word vectors at different scales, and a labeled dataset for Named Entity Recognition (NER).
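
To give a rough idea of the text-collection step, here is a minimal sketch of pulling paragraph text out of an already-downloaded HTML page using only the Python standard library. This is not the repository's actual code: the names `ParagraphExtractor` and `extract_news_text` are hypothetical, and the assumption that article text lives in `<p>` elements is an illustration, not something confirmed for People.cn pages.

```python
from html.parser import HTMLParser


class ParagraphExtractor(HTMLParser):
    """Collects the text found inside <p> elements (hypothetical helper)."""

    def __init__(self):
        super().__init__()
        self._depth = 0        # greater than 0 while inside a <p> element
        self._chunks = []      # text pieces of the paragraph being read
        self.paragraphs = []   # finished, non-empty paragraphs

    def handle_starttag(self, tag, attrs):
        if tag == "p":
            self._depth += 1

    def handle_endtag(self, tag):
        if tag == "p" and self._depth > 0:
            self._depth -= 1
            if self._depth == 0:
                text = "".join(self._chunks).strip()
                if text:  # skip paragraphs that hold only whitespace
                    self.paragraphs.append(text)
                self._chunks = []

    def handle_data(self, data):
        # Keep text only while inside a paragraph; inline tags such as
        # <b> are ignored, but their text content is kept.
        if self._depth > 0:
            self._chunks.append(data)


def extract_news_text(html):
    """Return the list of paragraph strings found in an HTML document.

    Assumption: the article body is marked up with <p> tags.
    """
    parser = ParagraphExtractor()
    parser.feed(html)
    parser.close()
    return parser.paragraphs


if __name__ == "__main__":
    sample = "<html><body><h1>Title</h1><p>First <b>bold</b> text.</p></body></html>"
    for paragraph in extract_news_text(sample):
        print(paragraph)
```

Fetching the page and decoding it with the correct character set would happen before this step and is left out here, since the site's page layout and encoding are not described above.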