[python] kmeans, agglomerative clustering

Notice

Recent Posts

Recent Comments

Link

« 2025/05 »
일	월	화	수	목	금	토
				1	2	3
4	5	6	7	8	9	10
11	12	13	14	15	16	17
18	19	20	21	22	23	24
25	26	27	28	29	30	31

Tags more

Archives

Today

Total

관리 메뉴

지방이의 Data Science Lab

[python] kmeans, agglomerative clustering 본문

Data Analysis/Python

[python] kmeans, agglomerative clustering

[지현] 2019. 9. 3. 01:46

from sklearn.cluster import AgglomerativeClustering
%time
cluster = AgglomerativeClustering(n_clusters=7, affinity='euclidean', linkage='average')
cluster.fit_predict(temp_data)

pd.value_counts(pd.Series(cluster.labels_))

불러오는 중입니다...

from sklearn.cluster import KMeans

km = KMeans(n_clusters=7)
x_names = [x for x in total_activity.columns if x not in ['acc_id']]
km.fit(total_activity[x_names])
pd.value_counts(pd.Series(km.labels_))

불러오는 중입니다...

from scipy.cluster.hierarchy import dendrogram, linkage
import scipy.cluster.hierarchy as spc

from scipy.cluster.hierarchy import cophenet
from scipy.spatial.distance import pdist
import pylab

corr = temp_data.corr()#.values()
Z = linkage(corr, 'average')
c, coph_dists = cophenet(Z, pdist(corr))
c

# pdist = spc.distance.pdist(corr)
# linkage = spc.linkage(pdist, method='average')
# idx = spc.fcluster(linkage, 0.5 * pdist.max(), 'distance')

# assignments = fcluster(linkage(temp_data, method='complete'),4,'distance')
# cluster_output = pandas.DataFrame({'team':df.teamID.tolist() , 'cluster':assignments})

import matplotlib.pyplot as plt
%matplotlib inline
plt.title('Dendrogram')
plt.xlabel('Index Numbers')
plt.ylabel('Distance')
dendrogram(
    Z,
    leaf_rotation=90.,
    leaf_font_size=8.,
)
plt.show()

불러오는 중입니다...

저작자표시 비영리 동일조건

'Data Analysis > Python' 카테고리의 다른 글

[python] 원하는 string포함한 pd.dataframe 필터링 (0)	2020.02.05
[python] key id가 multiple 관측치일때 갯수 일정하게 (1)	2020.02.01
[python] 데이하루씩 미루기 (0)	2019.08.30
[python] minmaxscaler (0)	2019.08.28
[python] 주별, 요일별로 변경 (0)	2019.08.28

'Data Analysis/Python' Related Articles

Comments

지방이의 Data Science Lab

[python] kmeans, agglomerative clustering 본문

[python] kmeans, agglomerative clustering

'Data Analysis > Python' 카테고리의 다른 글

티스토리툴바