The file “search_engine.cpp” trains the Hash Table and B tree in C++ The processes of my code can be broken down into following steps:
- Defined the loadWordLibrary function to save segmentation lexicon
- Defined the loadStopWords function to save unnecessary chinese characters thesaurus.
- Defined the loadData to save the music lyrics part
- Defined the splitWords to split the lyrics into several words.
- Defined the deleteWords to delete the words in unnecessary chinese characters thesaurus.
- Defined the buildInvertIndex to build an invert index.
- Defined the computeScore to compute the frequency of the words.
- Load the data and save the input from the command line.
- Get the recommandation.