DLCV - Intro to Neural Nets
Clustering
Given n data points at once, partition them into k clusters
High within-cluster (intra-cluster) similarity
- Images within the same cluster should be as similar to each other as possible
Low between-cluster (inter-cluster) similarity
But Similarity is NOT Always Objective
Similarity
K-Means Clustering
- Input: N examples {x_1, ..., x_N} (x_n ∈ R^D); number of partitions K
- Initialize: K cluster centers μ_1, ..., μ_K. Several initialization options:
- Randomly initialize μ_1, ..., μ_K anywhere in R^D
- Or, simply choose any K examples as the cluster centers
- Iterate:
- Assign each example x_n to its closest cluster center
- Recompute each cluster center μ_k as the mean/centroid of its set C_k
- Repeat until convergence
- Possible convergence criteria:
- Cluster centers do not change anymore
- Max. number of iterations reached
- Output:
- K clusters (with the center/mean of each cluster); see the sketch below
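A minimal NumPy sketch of the loop above (the `k_means` name, array shapes, and convergence checks are my own choices, not from the slides):

```python
import numpy as np

def k_means(X, K, max_iters=100, seed=0):
    """X: (N, D) data matrix; returns (centers, assignments)."""
    rng = np.random.default_rng(seed)
    # Initialize: simply choose K examples as the cluster centers
    centers = X[rng.choice(len(X), size=K, replace=False)]
    assign = np.zeros(len(X), dtype=int)
    for _ in range(max_iters):
        # Assign each example to its closest center (L2 distance)
        dists = np.linalg.norm(X[:, None, :] - centers[None, :, :], axis=2)
        new_assign = dists.argmin(axis=1)
        # Recompute each center as the mean of its assigned points
        new_centers = np.array([
            X[new_assign == k].mean(axis=0) if np.any(new_assign == k) else centers[k]
            for k in range(K)
        ])
        # Converge when assignments and centers stop changing
        if np.array_equal(new_assign, assign) and np.allclose(new_centers, centers):
            break
        assign, centers = new_assign, new_centers
    return centers, assign
```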
Problems that can arise when using L2 distance..
Hence it is sensitive to initialization
- Limitations
- Sensitive to initialization → run multiple trials → majority vote
- Sensitive to outliers → use L1 instead of L2 distance
- Hard assignment only → fuzzy k-means, etc.
Soft assignment
- Each point can be assigned to multiple clusters, with the assignment based on probabilities or weights
- Soft assignment lets each point belong to different clusters to different degrees, rather than to only a single cluster
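A sketch of one common soft-assignment rule, a softmax over negative squared distances with a temperature `beta`; this particular formula is an illustration, not taken from the slides:

```python
import numpy as np

def soft_assign(X, centers, beta=1.0):
    """Return an (N, K) matrix of membership weights that sum to 1 per point."""
    # Squared L2 distance from every point to every center
    d2 = ((X[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)
    logits = -beta * d2                          # closer centers get larger logits
    logits -= logits.max(axis=1, keepdims=True)  # numerical stability
    w = np.exp(logits)
    return w / w.sum(axis=1, keepdims=True)      # each row is a soft membership
```

Larger `beta` makes the memberships sharper (closer to hard assignment); smaller `beta` makes them more uniform.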
Linear Classifier
- Consider that we have 10 object categories of interest
- E.g., CIFAR10 with 50K training & 10K test images of 10 categories; each image is 32 x 32 x 3 pixels
f(x, W) = Wx + b, where x: input image (32 x 32 x 3 = 3072 values), W: classifier weights, b: bias
32 x 32 → spatial resolution, 3 → RGB channels
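A concrete sketch of the score function with the CIFAR10 shapes above (variable names and the random initialization are mine, for illustration only):

```python
import numpy as np

D, C = 32 * 32 * 3, 10            # 3072 input values, 10 classes
W = np.random.randn(C, D) * 0.01  # one weight row per class
b = np.zeros(C)                   # one bias per class

x = np.random.rand(32, 32, 3).flatten()  # flatten the image into a 3072-dim vector
scores = W @ x + b                # f(x, W) = Wx + b -> 10 class scores
print(scores.shape)               # (10,)
```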
The weights let the classifier focus on the important features rather than treating all 3072 values equally
Suppose class 4 is the ground truth; then we expect its score y4 to be especially high
E.g., for a cat image we expect the cat score to be especially high
Each class score is the result of an inner product between the input vector and that class's weight vector
So each row of W ends up looking like a blurry average template of its class
Distance and similarity are opposite, corresponding concepts
If distance is awkward to compute, compute similarity instead, and vice versa
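One way to make the distance/similarity connection precise (a standard identity, not stated in the notes): expanding the squared L2 distance shows that, for vectors of fixed norm, minimizing distance is the same as maximizing the inner-product similarity.

```latex
\|x - w\|_2^2 = \|x\|_2^2 + \|w\|_2^2 - 2\,x^\top w
```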
Loss Function
For a given W, measure how far each prediction f(x_i, W) is from its label y_i (i.e., the distance to the ground-truth answer)
- Softmax
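A minimal sketch of the softmax (cross-entropy) loss for one example, assuming `scores` is the 10-dim output of f(x, W) and `y` is the ground-truth class index (function name is mine):

```python
import numpy as np

def softmax_cross_entropy(scores, y):
    """Cross-entropy loss of the softmax probabilities against label y."""
    shifted = scores - scores.max()               # numerical stability
    probs = np.exp(shifted) / np.exp(shifted).sum()
    return -np.log(probs[y])                      # small when the true class has high probability
```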
Activation Function
- Sigmoid Function
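For reference, the standard sigmoid and its derivative (standard formulas, not copied from the slides); it squashes any real score into (0, 1):

```latex
\sigma(z) = \frac{1}{1 + e^{-z}}, \qquad \sigma'(z) = \sigma(z)\,\bigl(1 - \sigma(z)\bigr)
```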
Training a Single Neuron
Can be cross-referenced with slides p24~p26
Regularization is added to prevent overfitting
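A minimal gradient-descent sketch of training one sigmoid neuron with L2 regularization (binary cross-entropy loss; the hyperparameters `lr` and `reg` and the function names are illustrative assumptions, not from the slides):

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def train_single_neuron(X, y, lr=0.1, reg=1e-3, epochs=1000, seed=0):
    """X: (N, D) inputs, y: (N,) binary labels in {0, 1}."""
    rng = np.random.default_rng(seed)
    w, b = rng.normal(scale=0.01, size=X.shape[1]), 0.0
    N = len(X)
    for _ in range(epochs):
        p = sigmoid(X @ w + b)                 # forward pass: predicted probability
        # Gradient of binary cross-entropy plus the L2 regularization term on w
        grad_w = X.T @ (p - y) / N + reg * w
        grad_b = (p - y).mean()
        w -= lr * grad_w                       # gradient descent update
        b -= lr * grad_b
    return w, b
```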