New supervised learning algorithm for word sense disambiguation

Ted Pedersen; Rebecca Bruce

New supervised learning algorithm for word sense disambiguation

Ted Pedersen, Rebecca Bruce

Computer Science (Duluth)

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

18 Scopus citations

Abstract

The Naive Mix is a new supervised learning algorithm that is based on a sequential method for selecting probabilistic models. The usual objective of model selection is to find a single model that adequately characterizes the data in a training sample. However, during model selection a sequence of models is generated that consists of the best-fitting model at each level of model complexity. The Naive Mix utilizes this sequence of models to define a probabilistic model which is then used as a probabilistic classifier to perform word-sense disambiguation. The models in this sequence are restricted to the class of decomposable log-linear models. This class of models offers a number of computational advantages. Experiments disambiguating twelve different words show that a Naive Mix formulated with a forward sequential search and Akaike's Information Criteria rivals established supervised learning algorithms such as decision trees (C4.5), rule induction (CN2) and nearest-neighbor classification (PEBLS).

Original language	English (US)
Title of host publication	Proceedings of the National Conference on Artificial Intelligence
Editors	Anon
Publisher	AAAI
Pages	604-609
Number of pages	6
State	Published - Dec 1 1997
Event	Proceedings of the 1997 14th National Conference on Artificial Intelligence, AAAI 97 - Providence, RI, USA Duration: Jul 27 1997 → Jul 31 1997

Other

Other	Proceedings of the 1997 14th National Conference on Artificial Intelligence, AAAI 97
City	Providence, RI, USA
Period	7/27/97 → 7/31/97

OpenUrl availability

Full text

Cite this

@inproceedings{6e91e0e434d4478393112f29900e6b5a,

title = "New supervised learning algorithm for word sense disambiguation",

abstract = "The Naive Mix is a new supervised learning algorithm that is based on a sequential method for selecting probabilistic models. The usual objective of model selection is to find a single model that adequately characterizes the data in a training sample. However, during model selection a sequence of models is generated that consists of the best-fitting model at each level of model complexity. The Naive Mix utilizes this sequence of models to define a probabilistic model which is then used as a probabilistic classifier to perform word-sense disambiguation. The models in this sequence are restricted to the class of decomposable log-linear models. This class of models offers a number of computational advantages. Experiments disambiguating twelve different words show that a Naive Mix formulated with a forward sequential search and Akaike's Information Criteria rivals established supervised learning algorithms such as decision trees (C4.5), rule induction (CN2) and nearest-neighbor classification (PEBLS).",

author = "Ted Pedersen and Rebecca Bruce",

year = "1997",

month = dec,

day = "1",

language = "English (US)",

pages = "604--609",

editor = "Anon",

booktitle = "Proceedings of the National Conference on Artificial Intelligence",

publisher = "AAAI",

note = "Proceedings of the 1997 14th National Conference on Artificial Intelligence, AAAI 97 ; Conference date: 27-07-1997 Through 31-07-1997",

}

TY - GEN

T1 - New supervised learning algorithm for word sense disambiguation

AU - Pedersen, Ted

AU - Bruce, Rebecca

PY - 1997/12/1

Y1 - 1997/12/1

N2 - The Naive Mix is a new supervised learning algorithm that is based on a sequential method for selecting probabilistic models. The usual objective of model selection is to find a single model that adequately characterizes the data in a training sample. However, during model selection a sequence of models is generated that consists of the best-fitting model at each level of model complexity. The Naive Mix utilizes this sequence of models to define a probabilistic model which is then used as a probabilistic classifier to perform word-sense disambiguation. The models in this sequence are restricted to the class of decomposable log-linear models. This class of models offers a number of computational advantages. Experiments disambiguating twelve different words show that a Naive Mix formulated with a forward sequential search and Akaike's Information Criteria rivals established supervised learning algorithms such as decision trees (C4.5), rule induction (CN2) and nearest-neighbor classification (PEBLS).

AB - The Naive Mix is a new supervised learning algorithm that is based on a sequential method for selecting probabilistic models. The usual objective of model selection is to find a single model that adequately characterizes the data in a training sample. However, during model selection a sequence of models is generated that consists of the best-fitting model at each level of model complexity. The Naive Mix utilizes this sequence of models to define a probabilistic model which is then used as a probabilistic classifier to perform word-sense disambiguation. The models in this sequence are restricted to the class of decomposable log-linear models. This class of models offers a number of computational advantages. Experiments disambiguating twelve different words show that a Naive Mix formulated with a forward sequential search and Akaike's Information Criteria rivals established supervised learning algorithms such as decision trees (C4.5), rule induction (CN2) and nearest-neighbor classification (PEBLS).

UR - http://www.scopus.com/inward/record.url?scp=0031370146&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=0031370146&partnerID=8YFLogxK

M3 - Conference contribution

AN - SCOPUS:0031370146

SP - 604

EP - 609

BT - Proceedings of the National Conference on Artificial Intelligence

A2 - Anon, null

PB - AAAI

T2 - Proceedings of the 1997 14th National Conference on Artificial Intelligence, AAAI 97

Y2 - 27 July 1997 through 31 July 1997

ER -

New supervised learning algorithm for word sense disambiguation

Abstract

Other

OpenUrl availability

Other files and links

Fingerprint

Cite this