TY - JOUR
T1 - It's about this and that
T2 - a description of anaphoric expressions in clinical text.
AU - Wang, Yan
AU - Melton, Genevieve B.
AU - Pakhomov, Serguei
PY - 2011
Y1 - 2011
N2 - Although anaphoric expressions are very common in biomedical and clinical documents, little work has been done to systematically characterize their use in clinical text. Samples of 'it', 'this', and 'that' expressions occurring in inpatient clinical notes from four metropolitan hospitals were analyzed using a combination of semi-automated and manual annotation techniques. We developed a rule-based approach to filter potential non-referential expressions. A physician then manually annotated 1000 potential referential instances to determine referent status and the antecedent of each referent expression. A distributional analysis of the three referring expressions in the entire corpus of notes demonstrates a high prevalence of anaphora and large variance in distributions of referential expressions with different notes. Our results confirm that anaphoric expressions are common in clinical texts. Effective co-reference resolution with anaphoric expressions remains an important challenge in medical natural language processing research.
AB - Although anaphoric expressions are very common in biomedical and clinical documents, little work has been done to systematically characterize their use in clinical text. Samples of 'it', 'this', and 'that' expressions occurring in inpatient clinical notes from four metropolitan hospitals were analyzed using a combination of semi-automated and manual annotation techniques. We developed a rule-based approach to filter potential non-referential expressions. A physician then manually annotated 1000 potential referential instances to determine referent status and the antecedent of each referent expression. A distributional analysis of the three referring expressions in the entire corpus of notes demonstrates a high prevalence of anaphora and large variance in distributions of referential expressions with different notes. Our results confirm that anaphoric expressions are common in clinical texts. Effective co-reference resolution with anaphoric expressions remains an important challenge in medical natural language processing research.
UR - http://www.scopus.com/inward/record.url?scp=84872214261&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84872214261&partnerID=8YFLogxK
M3 - Article
C2 - 22195211
AN - SCOPUS:84872214261
SN - 1559-4076
VL - 2011
SP - 1471
EP - 1480
JO - AMIA ... Annual Symposium proceedings / AMIA Symposium. AMIA Symposium
JF - AMIA ... Annual Symposium proceedings / AMIA Symposium. AMIA Symposium
ER -