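Hybrid Simulated Annealing for Data Clustering

Korean title: 데이터 클러스터링을 위한 혼합 시뮬레이티드 어닐링
Publisher: Society of Korea Industrial and Systems Engineering (한국산업경영시스템학회)
Authors: Sung-Soo Kim (김성수), Jun-Young Baek (백준영), Beom-Soo Kang (강범수)
Journal: Journal of Society of Korea Industrial and Systems Engineering (산업경영시스템학회지), Vol. 40, No. 2, pp. 92-98 (7 pages)
Subject: Economics and Business > Business Administration
File format: PDF
Publication date: 2017.06.30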

Abstract

Data clustering groups the patterns in a dataset according to a similarity measure and is one of the most important and difficult techniques in data mining. Clustering can be formally considered a particular kind of NP-hard grouping problem. The K-means algorithm, although popular and efficient, is sensitive to initialization and can become stuck in a local optimum because it is a hill-climbing clustering method. It is also not computationally feasible in practice, especially for large datasets and large numbers of clusters. A robust and efficient clustering algorithm is therefore needed to find the global optimum rather than a local one, especially now that large amounts of data are collected from many IoT (Internet of Things) devices. The objective of this paper is to propose a new Hybrid Simulated Annealing (HSA) method that combines simulated annealing with K-means for non-hierarchical clustering of big data. Simulated annealing (SA) is useful for diversified search in a large search space, and K-means is useful for converged search in a predetermined search space. The proposed method balances intensification and diversification to find the global optimal solution in big data clustering. The performance of HSA is validated by experiment and analysis on the Iris, Wine, Glass, and Vowel datasets from the UCI Machine Learning Repository, in comparison with previous studies. The proposed KSAK (K-means+SA+K-means) and SAK (SA+K-means) variants perform better than KSA (K-means+SA), SA, and K-means in the simulations. The method significantly improves the accuracy and efficiency of finding the global optimal data clustering solution for complex, real-time, and costly data mining processes.
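
The combination described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not state the objective function, the neighborhood move, or the cooling schedule, so this sketch assumes a sum-of-squared-errors objective, a Gaussian perturbation of one randomly chosen center, and geometric cooling. All function names and parameter values (`sse`, `kmeans`, `anneal`, `sak`, `ksak`, `t0`, `cooling`, `step_size`) are hypothetical. The `sak` and `ksak` functions reflect one reading of the names SA+K-means and K-means+SA+K-means.

```python
import math
import random


def sse(points, centers):
    """Sum of squared distances from each point to its nearest center."""
    return sum(
        min(sum((p - c) ** 2 for p, c in zip(pt, ctr)) for ctr in centers)
        for pt in points
    )


def kmeans(points, centers, iters=20):
    """Lloyd's K-means from the given centers (assumed intensification step)."""
    centers = [list(c) for c in centers]
    for _ in range(iters):
        groups = [[] for _ in centers]
        for pt in points:
            nearest = min(
                range(len(centers)),
                key=lambda k: sum((p - c) ** 2 for p, c in zip(pt, centers[k])),
            )
            groups[nearest].append(pt)
        # An empty cluster keeps its previous center.
        new_centers = [
            [sum(col) / len(g) for col in zip(*g)] if g else centers[k]
            for k, g in enumerate(groups)
        ]
        if new_centers == centers:
            break
        centers = new_centers
    return centers


def anneal(points, centers, t0=1.0, cooling=0.95, steps=300,
           step_size=0.5, rng=None):
    """Simulated annealing over center coordinates (assumed diversification step)."""
    rng = rng or random.Random(0)
    current = [list(c) for c in centers]
    cur_cost = sse(points, current)
    best, best_cost = [list(c) for c in current], cur_cost
    t = t0
    for _ in range(steps):
        # Assumed move: perturb one randomly chosen center with Gaussian noise.
        cand = [list(c) for c in current]
        k = rng.randrange(len(cand))
        cand[k] = [x + rng.gauss(0, step_size) for x in cand[k]]
        delta = sse(points, cand) - cur_cost
        # Metropolis rule: always accept improvements, sometimes accept worse moves.
        if delta <= 0 or rng.random() < math.exp(-delta / t):
            current, cur_cost = cand, cur_cost + delta
            if cur_cost < best_cost:
                best, best_cost = [list(c) for c in current], cur_cost
        t *= cooling  # assumed geometric cooling schedule
    return best


def sak(points, k, seed=0):
    """SA followed by K-means, starting from k randomly sampled points."""
    rng = random.Random(seed)
    return kmeans(points, anneal(points, rng.sample(points, k), rng=rng))


def ksak(points, k, seed=0):
    """K-means, then SA, then K-means again."""
    rng = random.Random(seed)
    start = kmeans(points, rng.sample(points, k))
    return kmeans(points, anneal(points, start, rng=rng))
```

In this sketch the annealing stage may accept worse center positions early on, which lets the search leave the basin of a poor starting point, while the final K-means pass only refines the best centers that annealing found.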
