CiteULike is a free online bibliography manager. Register
and you can start organising your references online.
| registrieren | anmelden | FAQ | [?] |
Concept decompositions for large sparse text data using clusteringMachine Learning, Vol. 42, No. 1. (2001), pp. 143-175.
|
Reviews
[Write a review of this article]
There are no reviews of this article
Find related articles from these CiteULike users
Find related articles with these CiteULike tags
AbstractUnstructured text documents are becoming increasingly common and available; mining such data sets represents a major contemporary challenge. Using words as features, text documents are often represented as high-dimensional and sparse vectors--a few thousand dimensions and a sparsity of 95 to 99% is typical. In this paper, we study a certain spherical k-means algorithm for clustering such document vectors. The algorithm outputs k disjoint clusters each with a concept vector that is the centroid...
BibTeX record
RIS record