“Transfer Learning” is the process of taking a model that was trained to complete one task (e.g., identifying cats) and training it further to complete a similar but different task (e.g., identifying cancer).

This allows us to harness the foundational knowledge learned during “pre-training”, such as identifying lines and shapes in image recognition.

This is especially powerful when we have significantly more data available for pre-training than for the task we actually care about (e.g., millions of labeled cat images versus a small set of cancer images).

Additionally, complex models take a very long time to train. Transfer Learning lets us skip most of that cost and spend our time only on “fine-tuning” the model, making huge, complex models tractable for the everyday user.

In practice, we load the model and all of its parameters, but replace the “output layer” with a new one for our task.
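As a minimal sketch of what this looks like, the snippet below uses PyTorch and torchvision’s ResNet-18 pre-trained on ImageNet (both are my choices for illustration, not something the technique requires), swapping the final layer for a fresh 2-class head:

```python
import torch.nn as nn
from torchvision import models

# Load ResNet-18 with ImageNet pre-trained weights
# (the weights are downloaded on first use).
model = models.resnet18(weights=models.ResNet18_Weights.DEFAULT)

# The original output layer (`fc`) maps 512 features to
# 1000 ImageNet classes; replace it with a new layer
# sized for our task, e.g. 2 classes.
num_features = model.fc.in_features
model.fc = nn.Linear(num_features, 2)
```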

We can also choose to “freeze” specific layers in the pre-trained model, meaning they will not be updated during training.
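Continuing the hypothetical PyTorch example above, freezing amounts to turning off gradient tracking on the pre-trained parameters, so only the new output layer learns:

```python
import torch

# Freeze every pre-trained parameter: no gradients are
# computed for them, so they stay fixed during training.
for param in model.parameters():
    param.requires_grad = False

# The replacement output layer should still learn, so
# re-enable gradients for its parameters only.
for param in model.fc.parameters():
    param.requires_grad = True

# Hand the optimizer just the trainable parameters.
optimizer = torch.optim.Adam(
    (p for p in model.parameters() if p.requires_grad), lr=1e-3
)
```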

Choosing which layers to unfreeze depends on the model, the problem, and the amount of data available for fine-tuning; with more data, it is often safe to unfreeze more of the later layers, as sketched below.
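For instance, sticking with the hypothetical ResNet-18 example, one common choice when more data is available is to also unfreeze the last convolutional block (`layer4`):

```python
# Unfreeze the last convolutional block so its higher-level
# features can adapt to the new task, while earlier layers
# (edges, textures) stay frozen.
for param in model.layer4.parameters():
    param.requires_grad = True
```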

Overall, Transfer Learning is an extremely powerful tool, particularly when faced with limited data for a specific task.