sorayutmild / unsupervised-thai-document-clustering-with-sanook-news Goto Github PK
View Code? Open in Web Editor NEWAn unsupervised model to clustering Thai news. Using TD-IDF, SimCSE-WangchanBERTa with weighted by number of named entities as a vector representation, and using k-means as an clustering model.