Problem Statement
Determine whether a compound is active (1) or not (0).
Solution
Feature selection method is implemented to reduce dimension and consider only important feature for prediction. Using selected features different classifiers are validated for maximum F1 score as the data is highly imbalanced.
Challenge
Dataset is completely unbalanced
Complete procedure is given in the doc file