Rate prediction practice project for learning NLP
- transformData - filters the news to keep only English-language items and joins them with the rates by date
- cleanData - cleans each news item, then tokenizes and lemmatizes it
- processData - computes TF-IDF features from the tokens, reduces dimensionality, and uses Linear Regression to predict rates
- prDataD2V - trains a Doc2Vec model to get features from the tokens and uses Linear Regression to predict rates
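
As a rough illustration of the TF-IDF step that processData performs, here is a minimal standard-library sketch. It is not the project's code: the function name `tfidf` and the sample tokens are made up for this example, and it uses the plain unsmoothed weighting tf * log(N / df). Libraries often apply smoothed or normalized variants, so their numbers can differ.

```python
import math
from collections import Counter


def tfidf(docs):
    """Return one {token: weight} dict per tokenized document.

    Hypothetical helper for illustration; weight = tf * log(N / df).
    """
    n_docs = len(docs)
    # Document frequency: how many documents contain each token.
    df = Counter(token for doc in docs for token in set(doc))
    features = []
    for doc in docs:
        counts = Counter(doc)
        total = len(doc)
        features.append({
            token: (count / total) * math.log(n_docs / df[token])
            for token, count in counts.items()
        })
    return features


# Made-up, already lemmatized headlines (assumed input format).
docs = [
    ["bank", "raise", "rate"],
    ["bank", "cut", "rate"],
    ["market", "rally"],
]
weights = tfidf(docs)
```

Tokens that appear in fewer documents get a larger weight, so in the first headline "raise" outweighs "bank". The resulting sparse dictionaries would then need to be turned into a fixed-width matrix before dimensionality reduction and regression.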