*To see this with example code, check out my Kaggle Notebook.*
Decision Trees are very flexible and can be used in supervised or unsupervised contexts, for both classification and regression.

This post is about the “supervised” “regression” version.
Supervised meaning we use labeled data to train the model.

Regression meaning we predict a numerical value, instead of a “class”.

Decision Trees split our data into subsets, which we then use to predict some output variable.

They shine when working with datasets that have multiple “predictor” variables, since such data is notoriously hard to visualize.

To create these separations, the algorithm tests various splits, attempting to maximize “information gain”. In classification this means reducing “entropy”; in regression it typically means reducing the variance (mean squared error) of the target values within each resulting subset.
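As a rough sketch, here is how one candidate split could be scored by variance reduction (the helper function and toy numbers below are my own illustration, not from the notebook):

```python
import numpy as np

def variance_reduction(y, y_left, y_right):
    """Drop in weighted variance from splitting y into y_left and y_right.
    This is the regression analogue of information gain: node "impurity"
    is measured by the variance of the target values."""
    n = len(y)
    child_var = (len(y_left) / n) * np.var(y_left) + (len(y_right) / n) * np.var(y_right)
    return np.var(y) - child_var

# Toy data: one predictor, one numeric target (values are made up).
x = np.array([1.0, 2.0, 3.0, 10.0, 11.0, 12.0])
y = np.array([5.0, 6.0, 5.5, 20.0, 21.0, 19.5])

# Try every midpoint between consecutive x values as a threshold
# and keep the one with the largest variance reduction.
thresholds = (x[:-1] + x[1:]) / 2
best = max(thresholds, key=lambda t: variance_reduction(y, y[x <= t], y[x > t]))
print(best)  # 6.5: separates the low-x group from the high-x group
```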


Once the algorithm determines the first split, this becomes the “root node”.

Next, it evaluates the resulting subsets of data and creates another split based on information gain.

It continues with this process until hitting some predefined stopping point, such as a maximum depth or minimum sample size per subset.
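In scikit-learn, for example, these stopping points are exposed as hyperparameters (the values below are purely illustrative):

```python
from sklearn.tree import DecisionTreeRegressor

# Stop splitting once the tree is 3 levels deep, or when a split
# would leave fewer than 5 training samples in a leaf.
model = DecisionTreeRegressor(max_depth=3, min_samples_leaf=5)
```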

With the splitting complete, an average of the training target values is calculated for each “leaf node”.

Now we can pass in new examples and predict their output.
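Putting it together, here is a minimal end-to-end sketch with scikit-learn (the toy data is made up): a depth-1 tree learns a single split, stores the average target value in each leaf, and routes new examples to a leaf to produce a prediction.

```python
import numpy as np
from sklearn.tree import DecisionTreeRegressor

# Toy training data: one predictor, one numeric target.
X = np.array([[1.0], [2.0], [3.0], [10.0], [11.0], [12.0]])
y = np.array([5.0, 6.0, 5.5, 20.0, 21.0, 19.5])

# A depth-1 tree learns a single split (here, around x = 6.5).
model = DecisionTreeRegressor(max_depth=1)
model.fit(X, y)

# Each new example lands in a leaf; the prediction is the average of
# the training targets that fell into that leaf.
print(model.predict([[2.5]]))   # ~5.5:   mean of 5.0, 6.0, 5.5
print(model.predict([[11.5]]))  # ~20.17: mean of 20.0, 21.0, 19.5
```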

Overall, Decision Trees are highly flexible and intuitive, but not the most accurate on their own. However, they are the foundation of other incredibly powerful algorithms, such as Random Forests and Gradient Boosted Trees.