clustering
A hierarchical clustering can be represented as a dendrogram.
Top-down (divisive): start with all points in one cluster, then split it level by level.
Bottom-up (agglomerative): start with each point as its own cluster, then merge them level by level.

Two kinds of distance functions are needed to define "similar":
1 metric: measures point-to-point similarity, e.g. vector norms (Lp) or the angle between high-dimensional vectors
2 linkage: measures cluster-to-cluster similarity:
 2.1) complete linkage: max{d(x,y): x in A, y in B}
 2.2) single linkage: min{d(x,y): x in A, y in B}
 2.3) average linkage: sum{d(x,y): x in A, y in B} / (|A|*|B|), the mean of all pairwise distances between the two clusters
I don't fully understand the following criteria yet:
- The sum of all intra-cluster variance.
- The increase in variance for the cluster being merged (Ward's criterion).
- The probability that candidate clusters spawn from the same distribution function (V-linkage).
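A minimal sketch of the first three linkage criteria, assuming Euclidean distance as the metric (function and variable names are mine):

```python
from itertools import product

def euclidean(x, y):
    # L2 metric between two points (tuples of floats)
    return sum((a - b) ** 2 for a, b in zip(x, y)) ** 0.5

def complete_linkage(A, B, d=euclidean):
    # max{d(x, y): x in A, y in B}
    return max(d(x, y) for x, y in product(A, B))

def single_linkage(A, B, d=euclidean):
    # min{d(x, y): x in A, y in B}
    return min(d(x, y) for x, y in product(A, B))

def average_linkage(A, B, d=euclidean):
    # sum of all pairwise distances divided by |A| * |B|
    return sum(d(x, y) for x, y in product(A, B)) / (len(A) * len(B))

A = [(0.0, 0.0), (1.0, 0.0)]
B = [(4.0, 0.0), (5.0, 0.0)]
print(single_linkage(A, B))    # 3.0
print(complete_linkage(A, B))  # 5.0
print(average_linkage(A, B))   # 4.0
```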
Each level of the dendrogram corresponds to one clustering result with its number of clusters.
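The bottom-up pass can be sketched as follows, recording each level of the tree; single linkage and Euclidean distance are assumptions here, any metric/linkage pair from above would do:

```python
from itertools import product

def dist(x, y):
    return sum((a - b) ** 2 for a, b in zip(x, y)) ** 0.5

def single_link(A, B):
    return min(dist(x, y) for x, y in product(A, B))

def agglomerative_levels(points, linkage=single_link):
    # start with singleton clusters; every merge yields one level of the dendrogram
    clusters = [[p] for p in points]
    levels = [[list(c) for c in clusters]]
    while len(clusters) > 1:
        # find the closest pair of clusters under the chosen linkage
        pairs = [(i, j) for i in range(len(clusters))
                        for j in range(i + 1, len(clusters))]
        i, j = min(pairs, key=lambda ij: linkage(clusters[ij[0]], clusters[ij[1]]))
        merged = clusters[i] + clusters[j]
        clusters = [c for k, c in enumerate(clusters) if k not in (i, j)]
        clusters.append(merged)
        levels.append([list(c) for c in clusters])
    return levels  # levels[k] holds len(points) - k clusters

pts = [(0, 0), (0, 1), (5, 5), (5, 6)]
levels = agglomerative_levels(pts)
print([len(l) for l in levels])  # [4, 3, 2, 1]
```

Picking the level with the desired cluster count is how one clustering is read off the tree.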
http://en.wikipedia.org/wiki/Hierarchical_clustering
Partition-based
k-means: strengths are that the algorithm is simple and fast, and it can handle large data sets.
Drawbacks: each run may produce a different result, because it depends on the k randomly chosen initial centroids; it minimizes within-cluster variance but does not guarantee the global minimum; and it requires the mean to be well-defined and meaningful, since centroids are computed as means. (When the mean is not meaningful, k-medoids can be used instead: it picks medoids, actual central data points, as cluster centers.)
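A minimal sketch of Lloyd's algorithm for k-means (the seeded RNG and the tiny data set are my own illustration):

```python
import random

def kmeans(points, k, iters=100, seed=0):
    # Lloyd's algorithm; the outcome depends on the k random initial centroids
    rng = random.Random(seed)
    centroids = rng.sample(points, k)
    groups = [[] for _ in range(k)]
    for _ in range(iters):
        # assignment step: each point joins its nearest centroid
        groups = [[] for _ in range(k)]
        for p in points:
            j = min(range(k),
                    key=lambda j: sum((a - b) ** 2 for a, b in zip(p, centroids[j])))
            groups[j].append(p)
        # update step: new centroid = coordinate-wise mean of its group;
        # this is exactly why k-means needs the mean to be well-defined
        new = [tuple(sum(v) / len(g) for v in zip(*g)) if g else centroids[j]
               for j, g in enumerate(groups)]
        if new == centroids:
            break  # assignments stabilized: converged to a local optimum
        centroids = new
    return centroids, groups

pts = [(0, 0), (0, 1), (1, 0), (10, 10), (10, 11), (11, 10)]
centroids, groups = kmeans(pts, 2)
print(sorted(len(g) for g in groups))  # [3, 3]
```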
Fuzzy c-means: a point can belong to several clusters, each with a degree (probability) of membership.
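The membership degrees can be sketched with the standard fuzzy c-means formula u_i = 1 / sum_k (d_i / d_k)^(2/(m-1)), where m is the fuzzifier (m=2 here, an assumption; names are mine):

```python
def memberships(point, centers, m=2.0):
    # degree of membership of one point in each cluster center;
    # the memberships sum to 1
    d = [sum((a - b) ** 2 for a, b in zip(point, c)) ** 0.5 for c in centers]
    if any(di == 0.0 for di in d):
        # the point coincides with a center: full membership there
        return [1.0 if di == 0.0 else 0.0 for di in d]
    e = 2.0 / (m - 1.0)
    return [1.0 / sum((d[i] / d[k]) ** e for k in range(len(d)))
            for i in range(len(d))]

u = memberships((0.5, 0.0), [(0.0, 0.0), (2.0, 0.0)])
print(u)  # approximately [0.9, 0.1]: mostly the left cluster, partly the right
```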
QT clustering (quality threshold), algorithm outline:
- The user chooses a maximum diameter for clusters.
- Build a candidate cluster for each point by iteratively including the point that is closest to the group, until the diameter of the cluster surpasses the threshold.
- Save the candidate cluster with the most points as the first true cluster, and remove all points in the cluster from further consideration. Must clarify what happens if more than 1 cluster has the maximum number of points??
- Recurse with the reduced set of points.
The distance between a point and a group of points is computed using complete linkage, i.e. as the maximum distance from the point to any member of the group (see the "Agglomerative hierarchical clustering" section about distance between clusters).
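The steps above can be sketched as follows (Euclidean distance is an assumption; ties for the largest candidate are broken arbitrarily by seed order, which is the open question noted above):

```python
def dist(x, y):
    return sum((a - b) ** 2 for a, b in zip(x, y)) ** 0.5

def qt_cluster(points, max_diameter):
    points = list(points)
    clusters = []
    while points:
        best = []
        for seed in points:
            # build a candidate cluster around this seed
            cand = [seed]
            rest = [p for p in points if p is not seed]
            while rest:
                # complete linkage: distance from a point to the group is the
                # maximum distance from that point to any member
                nearest = min(rest, key=lambda r: max(dist(r, q) for q in cand))
                if max(dist(nearest, q) for q in cand) > max_diameter:
                    break  # adding it would push the diameter past the threshold
                cand.append(nearest)
                rest.remove(nearest)
            if len(cand) > len(best):  # ties broken by seed order (arbitrary)
                best = cand
        clusters.append(best)
        points = [p for p in points if p not in best]  # recurse on the rest
    return clusters

clusters = qt_cluster([(0, 0), (0, 1), (1, 0), (10, 10), (10, 11)], 2.0)
print(sorted(len(c) for c in clusters))  # [2, 3]
```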
spectral clustering: