
Showing posts from May, 2009

Report on POS-Tagger for Nepali Text - 06/23/07

INTRODUCTION

Part-of-speech tagging (POS tagging or POST), also called grammatical tagging, morphosyntactic categorization, or syntactic word-class tagging, is the process of marking up each word in a text as corresponding to a particular part of speech, based on both its definition and its context, i.e., its relationship with adjacent and related words in a phrase, sentence, or paragraph. POS analysis is the basic grammatical task of assigning every word in a sentence or text to the correct morphosyntactic category: noun, verb, adjective, adverb, and so on. In POS tagging, a label or tag is added to every word in a text to indicate its category.

While these tags can be assigned manually, it is highly desirable to automate the process, because manually tagging a large corpus is prohibitively labor-intensive.

Some of the available POS taggers are:

● Stanford POS tagger
● TreeTagger
● TnT - A Statistical Part-of-Speech Tag...
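To make the idea of tagging by definition and by context concrete, here is a minimal sketch in Python. It is not taken from the report and is not how any of the taggers listed above work internally. The lexicon, the tag names, and the two context rules are hypothetical and use a few English words purely for illustration; a real tagger for Nepali would need its own tag set and a much larger lexicon or a trained statistical model.

```python
# Toy lexicon (hypothetical data): each word lists the tags it can take,
# with the most likely tag first. This models tagging "by definition".
LEXICON = {
    "the": ["DET"],
    "dog": ["NOUN"],
    "can": ["AUX", "NOUN", "VERB"],
    "run": ["VERB", "NOUN"],
    "rusted": ["VERB"],
}


def tag(words):
    """Assign one tag per word using lexicon lookup plus simple context rules."""
    tagged = []
    prev_tag = None
    for word in words:
        # Unknown words fall back to NOUN (an assumption made for this sketch).
        candidates = LEXICON.get(word.lower(), ["NOUN"])
        choice = candidates[0]
        # Context rules (also assumptions): the previous tag decides
        # between the readings of an ambiguous word.
        if prev_tag == "DET" and "NOUN" in candidates:
            choice = "NOUN"
        elif prev_tag == "AUX" and "VERB" in candidates:
            choice = "VERB"
        tagged.append((word, choice))
        prev_tag = choice
    return tagged


print(tag("The dog can run".split()))
print(tag("The can rusted".split()))
```

The word "can" is tagged AUX in the first sentence and NOUN in the second, because the preceding determiner changes which reading is preferred. Statistical taggers replace hand-written rules like these with probabilities estimated from a tagged corpus.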