Abstract
The Ngram Statistics Package (NSP) is a flexible and easy– to–use software tool that supports the identification and analysis of Ngrams, sequences of N tokens in online text. We have designed and implemented NSP to be easy to customize to particular problems and yet remain general enough to serve a broad range of needs. This paper provides an introduction to NSP while raising some general issues in Ngram analysis, and summarizes several applications where NSP has been successfully employed. NSP is written in Perl and is freely available under the GNU Public License.
Original language | English (US) |
---|---|
Title of host publication | Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) |
Editors | Alexander Gelbukh |
Publisher | Springer Verlag |
Pages | 370-381 |
Number of pages | 12 |
ISBN (Print) | 3540005323 |
DOIs | |
State | Published - 2003 |
Event | 4th International Conference on Intelligent Text Processing and Computational Linguistics, CICLing 2003 - Mexico City, Mexico Duration: Feb 16 2003 → Feb 22 2003 |
Publication series
Name | Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) |
---|---|
Volume | 2588 |
ISSN (Print) | 0302-9743 |
ISSN (Electronic) | 1611-3349 |
Other
Other | 4th International Conference on Intelligent Text Processing and Computational Linguistics, CICLing 2003 |
---|---|
Country/Territory | Mexico |
City | Mexico City |
Period | 2/16/03 → 2/22/03 |
Bibliographical note
Publisher Copyright:© Springer-Verlag Berlin Heidelberg 2003.