Turned out that the naive bayes model is still better than the rnn. This is unacceptable, and therefore we must tune hyperparameters and experiment so that it can outperform the baseline
Estimate classifiers performance on development and train set using various metrics.
This can be done e.g. using a jupyter notebook. Results can be presented in readme.
Suggestions about how it should be done or presented are very welcome
When trying to run the API, Tensorflow raises a NotImplementedError: Cannot convert a symbolic Tensor (bidirectional/forward_lstm/strided_slice:0) to a numpy array. This error may indicate that you're trying to pass a Tensor to a NumPy call, which is not supported
Offer an english version of the README along with the norwegian version in order to make the code more accessible to people.
This should apply to all documents in the repo. This means that at the time of writing this, the contribution guide and the backend README needs to be translated as well.
NLTK is an old fashioned, awkward (imo). I think we should opt for a more modern classifier API. scikit learn is more modern and elegant. I think we should use this for the naive bayes (and possibly other) classifiers
We should use config files to manage hyperparameter tuning. Yaml is a pretty decent and readable format.
I'll be working on it as part of my mission to beat the bayes model.
Change names like "clf", "M", and "F" to more semantic and easy-to-understand names. It's not obvious to neither people reading the code nor consumers of the API what the values mean.
If we are going through with this, I want keep this change on hold until we've fully rewritten the backend. This makes it so that the finished rewrite will be a 3.0 release and this change the 4.0 release. This creates a consistent correlation between the version numbers of the frontend and the backend. Though, not a requirement it is good to keep it this way for now.
The current soultion requires changing a line of code each time you want to switch it. We should have a .env config or something similar to set react environment variables when switching between dev and build. It might require installing additional dependencies (E.g. webpack or dotenv)