The information gain results are consistent with those of the hypothesis test. The features diabetesMed and change (change of medication) show significant information gain and are therefore also included in the final model.
Based on the chart above, the following features were removed with the aim of improving model accuracy:
acarbose, miglitol, glyburide.metformin, glimepiride, chlorpropamide, nateglinide, acetohexamide, tolbutamide, metformin.pioglitazone, glipizide.metformin, tolazamide, troglitazone, citoglipton, examide, metformin.rosiglitazone, and glimepiride.pioglitazone.
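To make the ranking criterion concrete, the following is a minimal sketch of how information gain can be computed for a single nominal feature, as the class entropy minus the class entropy conditioned on the feature. It is not the Weka implementation used in this work; the function names and the toy values are hypothetical and chosen only for illustration.

```python
from collections import Counter
from math import log2


def entropy(labels):
    """Shannon entropy (in bits) of a sequence of class labels."""
    total = len(labels)
    return -sum((n / total) * log2(n / total) for n in Counter(labels).values())


def information_gain(feature_values, labels):
    """H(class) - H(class | feature) for one nominal feature."""
    total = len(labels)
    # Group the class labels by the value the feature takes.
    groups = {}
    for value, label in zip(feature_values, labels):
        groups.setdefault(value, []).append(label)
    # Weighted average of the entropy within each group.
    conditional = sum(len(g) / total * entropy(g) for g in groups.values())
    return entropy(labels) - conditional


# Toy data (hypothetical, not taken from the study).
readmitted = ["yes", "yes", "no", "no"]
informative = ["Ch", "Ch", "No", "No"]    # separates the classes perfectly
uninformative = ["No", "No", "No", "No"]  # constant, carries no information

gain_informative = information_gain(informative, readmitted)
gain_uninformative = information_gain(uninformative, readmitted)
```

A feature whose value barely varies across patients, or varies independently of the class, yields a gain close to zero, which is the kind of feature a ranking like this would flag for removal.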
All models were trained and evaluated in Weka using 10-fold cross-validation, so that performance is measured on held-out data and overfitting is less likely to go unnoticed.
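As an illustration of the evaluation protocol, the sketch below shows how the instances could be partitioned for 10-fold cross-validation: each instance is used for testing exactly once and for training in the other nine folds. The stratification by class, the function names, and the toy class distribution are assumptions made for this example; the sketch does not reproduce Weka's internal procedure.

```python
import random
from collections import defaultdict


def stratified_folds(labels, k=10, seed=0):
    """Assign each instance index to one of k folds with similar class proportions."""
    by_class = defaultdict(list)
    for index, label in enumerate(labels):
        by_class[label].append(index)
    rng = random.Random(seed)
    folds = [[] for _ in range(k)]
    position = 0
    for indices in by_class.values():
        rng.shuffle(indices)
        # Deal the shuffled indices of each class round-robin across the folds.
        for index in indices:
            folds[position % k].append(index)
            position += 1
    return folds


def cross_validation_splits(labels, k=10, seed=0):
    """Yield (train_indices, test_indices) pairs, one per fold."""
    folds = stratified_folds(labels, k, seed)
    for i, test in enumerate(folds):
        train = [idx for j, fold in enumerate(folds) if j != i for idx in fold]
        yield train, test


# Toy class distribution (hypothetical): 30 readmitted, 70 not readmitted.
labels = ["yes"] * 30 + ["no"] * 70
splits = list(cross_validation_splits(labels))
```

Each model would be fitted on the training indices of a split and scored on its test indices, with the reported metric averaged or pooled over the ten folds.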
Given the importance of understanding the factors that help in predicting readmissions, we focus on achieving a higher True Positive Rate for the classifier.
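For reference, the True Positive Rate is the fraction of actual readmissions that the classifier correctly identifies, TP / (TP + FN). The short sketch below computes it from lists of actual and predicted labels; the label values and the example predictions are hypothetical.

```python
def true_positive_rate(actual, predicted, positive="yes"):
    """TP / (TP + FN): share of actual positives that were predicted positive."""
    pairs = list(zip(actual, predicted))
    tp = sum(1 for a, p in pairs if a == positive and p == positive)
    fn = sum(1 for a, p in pairs if a == positive and p != positive)
    if tp + fn == 0:
        raise ValueError("no positive instances among the actual labels")
    return tp / (tp + fn)


# Hypothetical example: three actual readmissions, two of them detected.
actual = ["yes", "yes", "yes", "no", "no"]
predicted = ["yes", "no", "yes", "no", "yes"]
tpr = true_positive_rate(actual, predicted)
```

Note that the false positive in the last position does not affect this metric, which is why a focus on the True Positive Rate favours catching readmissions over avoiding false alarms.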