In supervised learning, the goal is to learn from labeled data. The model is trained using a set of inputs where each input is labeled with the correct output. The performance of the model is measured using a loss function, which quantifies the difference between the model's prediction and the actual label. This process involves adjusting the model's parameters to minimize this loss, effectively teaching the model to replicate the labeling in the training data.
We're going to recap some key concepts in unsupervised learning. This is an essential area of machine learning where we don't use labeled data to train our models. Instead, the model identifies patterns and structures within the data independently.
What is Unsupervised Learning?
Unsupervised learning is a type of machine learning where the algorithm is trained on data without labeled responses. The primary goal is to discover the underlying structure of the data. Two main objectives of unsupervised learning are:
Clustering is one of the most common unsupervised learning techniques. It involves grouping a set of objects in such a way that objects in the same group (or cluster) are more similar to each other than to those in other groups.
Examples of Clustering:
Common Algorithms:
Dimensionality reduction techniques reduce the number of random variables under consideration by obtaining a set of principal variables. This is particularly useful when dealing with high-dimensional data.