Unsupervised learning is a major subfield of machine learning.

Its algorithms train on “unlabeled” data, meaning the data does not include a target value we are trying to predict.

This makes it applicable to nearly any dataset, but the resulting models give less “direct” answers that often require additional interpretation or processing.

There are two main applications of these algorithms: clustering and dimensionality reduction.

In clustering, we look for groups of data points that are similar to each other.

This can be applied to a wide variety of problems such as document classification, fraud detection, and even modeling UFO sightings.
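To make clustering concrete, here is a minimal k-means-style sketch in plain NumPy. The two synthetic “blobs” of points are an illustrative assumption, not a real dataset, and picking one seed point from each blob is a simplification for a stable demo (real k-means initializes more carefully):

```python
import numpy as np

# Two obvious blobs of 2-D points -- synthetic data for illustration.
rng = np.random.default_rng(0)
points = np.vstack([
    rng.normal(loc=0.0, scale=0.5, size=(20, 2)),   # blob near (0, 0)
    rng.normal(loc=5.0, scale=0.5, size=(20, 2)),   # blob near (5, 5)
])

# k-means loop: assign each point to its nearest centroid,
# then move each centroid to the mean of its assigned points.
centroids = points[[0, -1]]   # one seed from each blob (demo simplification)
for _ in range(10):
    distances = np.linalg.norm(points[:, None] - centroids[None], axis=2)
    labels = distances.argmin(axis=1)
    centroids = np.array([points[labels == k].mean(axis=0) for k in range(2)])

print(labels)  # each point tagged 0 or 1 -- the discovered groups
```

Note that the algorithm never saw a label: the two groups emerge purely from the geometry of the points.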

In dimensionality reduction, we use mathematical techniques to distill many input variables down to a smaller set of features that retain most of the original information.

This is important for efficient training, but it can also be used to visualize high-dimensional datasets that would otherwise be impossible to display.
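As a sketch of how that distillation can work, here is principal component analysis (PCA) via the SVD in plain NumPy. The 3-D data that secretly lies near a 2-D plane is a constructed example, chosen so the reduction visibly loses almost nothing:

```python
import numpy as np

# Synthetic 3-D points that actually lie near a 2-D plane.
rng = np.random.default_rng(0)
latent = rng.normal(size=(100, 2))            # the hidden 2-D structure
mixing = np.array([[1.0, 0.0],
                   [0.0, 1.0],
                   [0.5, 0.5]])               # embeds it into 3-D
data = latent @ mixing.T + rng.normal(scale=0.01, size=(100, 3))

# Center the data, then take the SVD: the top right-singular vectors
# are the directions of greatest variance (the principal components).
centered = data - data.mean(axis=0)
U, S, Vt = np.linalg.svd(centered, full_matrices=False)
projected = centered @ Vt[:2].T               # 3-D -> 2-D

explained = (S ** 2) / (S ** 2).sum()
print(projected.shape)        # (100, 2)
print(explained[:2].sum())    # nearly all the variance survives in 2-D
```

The projected 2-D points can now be plotted directly, which is exactly the visualization use case described above.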

These unsupervised techniques can be combined with supervised methods to achieve “semi-supervised” learning.

This is particularly useful in cases where unlabeled data is abundant, but labeled data is scarce, such as in pre-training language models.
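One simple flavor of this idea can be sketched in NumPy: cluster a pile of unlabeled points, then let a couple of labeled examples name the clusters. The “ham”/“spam” labels and the two-blob data are illustrative assumptions, and this is only one of many semi-supervised schemes:

```python
import numpy as np

# Plenty of unlabeled points in two blobs -- synthetic, for illustration.
rng = np.random.default_rng(1)
unlabeled = np.vstack([
    rng.normal(0.0, 0.5, size=(30, 2)),
    rng.normal(5.0, 0.5, size=(30, 2)),
])
# ...but only ONE labeled example per class.
labeled_x = np.array([[0.0, 0.0], [5.0, 5.0]])
labeled_y = np.array(["ham", "spam"])

# Unsupervised step: refine a cluster center for each labeled seed
# by repeatedly averaging the unlabeled points nearest to it.
centers = labeled_x.copy()
for _ in range(5):
    d = np.linalg.norm(unlabeled[:, None] - centers[None], axis=2)
    assign = d.argmin(axis=1)
    centers = np.array([unlabeled[assign == k].mean(axis=0) for k in range(2)])

# Supervised step: the two labeled examples name the clusters,
# yielding pseudo-labels for all 60 unlabeled points.
pseudo_labels = labeled_y[assign]
print(pseudo_labels[:3], pseudo_labels[-3:])
```

Two labeled points plus the cluster structure were enough to label the whole dataset, which is the core appeal when labels are expensive to collect.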

It should also be noted that there are unsupervised deep learning algorithms, such as autoencoders and Boltzmann machines, but they are beyond the scope of this post.