by Parth Mistry
- This repository is a project based on the task given by Talentbook Technological Services Private Limited.
- The task contained 3 directories as train_docs, train_tags, test_docs.
- The aim was to figure out the tags for test_docs.
Python Version: 3.7.6
Packages: pandas, re, nltk, sklearn
- Imported all the documents and tags from the directories and their respective notepad files.
- Distributed the tags according to the appropriate columns.
- Lemmatized the documents for model preparation.