Supervised learning is the most widely used branch of machine learning.
data:image/s3,"s3://crabby-images/05e2b/05e2b8f212bbbb653af80c455155642372409fd0" alt=""
It requires that our training data has “labels”.
data:image/s3,"s3://crabby-images/d8073/d80738054a448b4e0955f87b406829bfd0929449" alt=""
A label is the output value we are trying to predict, often called the “y-variable”.
data:image/s3,"s3://crabby-images/4126e/4126eeb25ce9331a142a4d7006a9cc30daf893ec" alt=""
When the label is a numeric value, this is called regression.
data:image/s3,"s3://crabby-images/b7f85/b7f8585cbf6693cfc0a4c22201b35c3dd6d46ed7" alt=""
Regression algorithms work by building a formula to calculate the output based on the input.
data:image/s3,"s3://crabby-images/3a4d0/3a4d0c49fd287e5261c0e324755049a4dac13f24" alt=""
When the label is a “category” that the example belongs to, this is called classification.
data:image/s3,"s3://crabby-images/8a734/8a734414ff32e491f2a75e28911d222c787587ab" alt=""
Classification algorithms build a model that predicts the probabilities of each category, given a training example.
data:image/s3,"s3://crabby-images/8f02c/8f02c51fb71e822dd39cfd1e6eb32347f8176a6c" alt=""
Some algorithms, including those used in ensemble methods and deep learning, can perform classification and regression.
data:image/s3,"s3://crabby-images/3b093/3b09365b624c464fa16a6619e031f37dfccaf839" alt=""
Ensemble methods work by building many models and allowing them to vote on the answer.
data:image/s3,"s3://crabby-images/6e18c/6e18cc3123d3a84dee71aee0976dcb2212c8ed90" alt=""
Deep learning utilizes neural networks, which simulate how biological neurons function.
data:image/s3,"s3://crabby-images/e7da8/e7da81f573c10f9e70d9376f5d678519b2aeba86" alt=""
There are also some types of specialized neural networks which excel at specific tasks.
data:image/s3,"s3://crabby-images/2717d/2717d335b2cb68bb07e593961e44e854c8db5708" alt=""
Deep learning is very widely used today, but it has some downsides. Specifically, large models require massive amounts of training data and significant computational resources.
data:image/s3,"s3://crabby-images/d4da5/d4da51b0d161c5ac546f8306d56be59b58a82ea3" alt=""
“Big-data” and “cloud computing” have mitigated part of this problem. However, obtaining large quantities of high-quality, labeled data, remains one of the biggest barriers in machine learning today.