Geodesic distances for web document clustering

Conference Paper (143.3Kb)
Date
2011
Author
Tekir, Selma
Mansmann, Florian
Keim, Daniel
Abstract
While traditional distance measures are often capable of properly describing similarity between objects, in some application areas there is still potential to fine-tune these measures with additional information provided in the data sets. In this work we combine such traditional distance measures for document analysis with link information between documents to improve clustering results. In particular, we test the effectiveness of geodesic distances as similarity measures under the space assumption of spherical geometry in a 0-sphere. Our proposed distance measure is thus a combination of the cosine distance of the term-document matrix and some curvature values in the geodesic distance formula. To estimate these curvature values, we calculate clustering coefficient values for every document from the link graph of the data set and increase their distinctiveness by means of a heuristic as these clustering coefficient values are rough estimates of the curvatures. To evaluate our work, we perform clustering tests with the k-means algorithm on the English Wikipedia hyperlinked data set with both traditional cosine distance and our proposed geodesic distance. The effectiveness of our approach is measured by computing micro-precision values of the clusters based on the provided categorical information of each article. © 2011 IEEE.
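The building blocks named in the abstract can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the authors' implementation: the function names, the undirected-graph representation, and the majority-label definition of micro-precision are all assumptions made here. The abstract does not give the geodesic distance formula or the heuristic applied to the clustering coefficients, so neither is reproduced.

```python
import math
from collections import Counter
from itertools import combinations


def cosine_distance(u, v):
    """Return 1 minus the cosine similarity of two equal-length term vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0 or norm_v == 0:
        return 1.0  # assumed convention for documents with no terms
    return 1.0 - dot / (norm_u * norm_v)


def local_clustering_coefficient(graph, node):
    """Fraction of pairs of a node's neighbours that are linked to each other.

    `graph` maps each node to the set of its neighbours and is assumed to be
    undirected (hypothetical representation; the paper's link graph may differ).
    """
    neighbours = graph[node] - {node}
    k = len(neighbours)
    if k < 2:
        return 0.0
    links = sum(1 for a, b in combinations(neighbours, 2) if b in graph[a])
    return 2.0 * links / (k * (k - 1))


def micro_precision(cluster_ids, category_labels):
    """Share of documents that carry the majority category of their cluster.

    Assumed definition: sum the majority-category count over all clusters and
    divide by the total number of documents.
    """
    counts_by_cluster = {}
    for cluster, label in zip(cluster_ids, category_labels):
        counts_by_cluster.setdefault(cluster, Counter())[label] += 1
    majority_total = sum(max(c.values()) for c in counts_by_cluster.values())
    return majority_total / len(cluster_ids)


# Toy link graph: a, b, c form a triangle; d hangs off c.
toy_graph = {
    "a": {"b", "c"},
    "b": {"a", "c"},
    "c": {"a", "b", "d"},
    "d": {"c"},
}
coefficients = {n: local_clustering_coefficient(toy_graph, n) for n in toy_graph}
```

In this toy graph, node `a` sits inside a closed triangle and gets a coefficient of 1.0, while `d` has a single neighbour and gets 0.0. Per-document values of this kind are what the abstract describes as rough curvature estimates.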
URI
http://doi.org/10.1109/CIDM.2011.5949449
http://hdl.handle.net/11147/5014
Collections
  • Computer Engineering / Bilgisayar Mühendisliği [191]
  • Scopus İndeksli Yayınlar Koleksiyonu / Scopus Indexed Publications Collection [3276]


Creative Commons License
DSpace@IZTECH, the IYTE institutional repository, is licensed under a Creative Commons Attribution-NonCommercial-NoDerivs 3.0 Unported License.
