A. What is the entropy of this collection of training examples with respect to the class attribute?
B. What are the information gains of a1 and a2 relative to these training examples?
C. For a3, which is a continuous attribute, compute the information gain for every possible split.
D. What is the best split (among a1, a2, and a3) according to the information gain?
E. What is the best split (between a1 and a2) according to the misclassification error rate?
F. What is the best split (between a1 and a2) according to the Gini index?