TY - GEN
T1 - Information content measures of semantic similarity perform better without sense-tagged text
AU - Pedersen, Ted
PY - 2010
Y1 - 2010
N2 - This paper presents an empirical comparison of similarity measures for pairs of concepts based on Information Content. It shows that using modest amounts of untagged text to derive Information Content results in higher correlation with human similarity judgments than using the largest available corpus of manually annotated sense-tagged text.
AB - This paper presents an empirical comparison of similarity measures for pairs of concepts based on Information Content. It shows that using modest amounts of untagged text to derive Information Content results in higher correlation with human similarity judgments than using the largest available corpus of manually annotated sense-tagged text.
UR - http://www.scopus.com/inward/record.url?scp=84858422324&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84858422324&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:84858422324
SN - 1932432655
SN - 9781932432657
T3 - NAACL HLT 2010 - Human Language Technologies: The 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics, Proceedings of the Main Conference
SP - 329
EP - 332
BT - NAACL HLT 2010 - Human Language Technologies
T2 - 2010 Human Language Technologies Conference ofthe North American Chapter of the Association for Computational Linguistics, NAACL HLT 2010
Y2 - 2 June 2010 through 4 June 2010
ER -