TY - GEN
T1 - Hypergraph-based multilevel matrix approximation for text information retrieval
AU - Fang, Haw Ren
AU - Saad, Yousef
PY - 2010
Y1 - 2010
N2 - In Latent Semantic Indexing (LSI), a collection of documents is often pre-processed to form a sparse term-document matrix, followed by a computation of a low-rank approximation to the data matrix. A multilevel framework based on hypergraph coarsening is presented which exploits the hypergraph that is canonically associated with the sparse term-document matrix representing the data. The main goal is to reduce the cost of the matrix approximation without sacrificing accuracy. Because coarsening by multilevel hy-pergraph techniques is a form of clustering, the proposed approach can be regarded as a hybrid of factorization-based LSI and clustering-based LSI. Experimental results indicate that our method achieves good improvement of the retrieval performance at a reduced cost.
AB - In Latent Semantic Indexing (LSI), a collection of documents is often pre-processed to form a sparse term-document matrix, followed by a computation of a low-rank approximation to the data matrix. A multilevel framework based on hypergraph coarsening is presented which exploits the hypergraph that is canonically associated with the sparse term-document matrix representing the data. The main goal is to reduce the cost of the matrix approximation without sacrificing accuracy. Because coarsening by multilevel hy-pergraph techniques is a form of clustering, the proposed approach can be regarded as a hybrid of factorization-based LSI and clustering-based LSI. Experimental results indicate that our method achieves good improvement of the retrieval performance at a reduced cost.
KW - Latent Semantic Indexing
KW - Low-rank matrix approximation
KW - Multilevel hypergraph partitioning
KW - Text information retrieval
UR - http://www.scopus.com/inward/record.url?scp=78651295428&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=78651295428&partnerID=8YFLogxK
U2 - 10.1145/1871437.1871681
DO - 10.1145/1871437.1871681
M3 - Conference contribution
AN - SCOPUS:78651295428
SN - 9781450300995
T3 - International Conference on Information and Knowledge Management, Proceedings
SP - 1597
EP - 1600
BT - CIKM'10 - Proceedings of the 19th International Conference on Information and Knowledge Management and Co-located Workshops
T2 - 19th International Conference on Information and Knowledge Management and Co-located Workshops, CIKM'10
Y2 - 26 October 2010 through 30 October 2010
ER -