Part-of-speech tagging - Factbites
 Factbites
 Where results make sense
About us   |   Why use us?   |   Reviews   |   PR   |   Contact us  

Topic: Part-of-speech tagging


    Note: these results are not from the primary (high quality) database.


Related Topics

In the News (Wed 3 Dec 08)

  
 Part-of-speech tagging - Wikipedia, the free encyclopedia
In part-of-speech tagging by computer, it is typical to distinguish from 50 to 150 separate parts of speech for English (see the POS tags used in the Brown Corpus).
For some time, part-of-speech tagging was considered an inseparable part of natural language processing, because there are certain cases where the correct part of speech cannot be decided without understanding the semantics or even the pragmatics of the context.
Part-of-speech tagging is the process of marking up the words in a text with their corresponding parts of speech.
en.wikipedia.org /wiki/Part-of-speech_tagging   (1363 words)

  
 Part-of-Speech Tagging (OxUni)
Tagging words with their correct part-of-speech (singular proper noun, predeterminer, etc) is an important precursor to further automatic natural language processing.
The task is non-trivial since many (English) words have a number of possible part-of-speech (POS) tags.
Positive and negative examples of when to eliminate a POS tag, including the entire unambiguous left and right context (in terms of tags) for the focus word were used by Progol.
www-ai.ijs.si /~ilpnet2/apps/post.html   (214 words)

  
 Speech and Language Technology Part of Speech Tagging
Parts of Speech Tagging (PoS tagging) is assigning Parts of Speech to the words in a text.
Joakim Nivre Sparse data and smoothing in statistical part-of-speech tagging; Goteborg University
PoS tagging is a kind of word sense disambiguation: the PoS tag gives some information about the sense of the word in the context of use.
wwwhome.cs.utwente.nl /~infrieks/stt/stt.html   (1332 words)

  
 POS Tagging Overview
Full automation of the tagging process addresses the need to accurately tag previously untagged genres and languages in light of the fact that hand tagging of training data is a costly and time-consuming process.
HMM's cannot, however, be used in an automated tagging schema, since they rely critically upon the calculation of statistics on output sequences (tagstates).
Another possibility is to calculate the probability that each tag in the tag set occurs at the end of the n-gram, and to select the path with the highest probability.
www.georgetown.edu /faculty/ballc/ling361/tagging_overview.html   (2639 words)

  
 Part-of-Speech Tagging
This model consists of two parts: an n-gram model for part of speech sequences and a likelihood distribution model of part of speech tags for words.
These parts are combined using Bayes Theorem and the Viterbi algorithm is used to find the most probable part of speech sequence given a set of words.
Apart from size, we do not think that the two corpora are significantly different with respect to POS behaviour.
www.cs.cmu.edu /People/awb/papers/CSL-posbrk/node10.html   (329 words)

  
 Part of speech tagging
Part of speech tagging has become a quite mature field and we simply follow the known technology.
We use a standard HMM-based tagger (as in [4]) which estimates the probability of a part of speech tag sequence given a sequence of words.
Even better results were achieved by using the full tagset to tag the data and then reducing to the smaller set (97.04%).
www.cs.cmu.edu /~awb/papers/ES97pos/node3.html   (208 words)

  
 Universidade da Coruña. UDCDspace: Item 2183/148
One of the most important prior tasks for robust part-of-speech tagging is the correct tokenization or segmentation of the texts.
Nevertheless, this preprocessing step is an indispensable task in practice, and it is particularly dificult to tackle it with scientific precision with-out falling repeatedly in the analysis of the specific casuistry of every phenomenon detected.
In this work, we have developed a scheme of preprocessing oriented towards the disambiguation and robust tagging of Galician.
hdl.handle.net /2183/148   (192 words)

  
 Lexicalized Hidden Markov Models for Part-of-Speech Tagging - Lee, Tsujii, Rim (ResearchIndex)
Part-of-Speech Tagging with Lexicalized HMM - Pla, Molina (2001)
Abstract: Since most previous works for HMM-based tagging consider only part-of-speech information in contexts, their models cannot utilize lexical information which is crucial for resolving some morphological ambiguity.
In this paper we introduce uniformly lexicalized HMMs for partof -speech tagging in both English and Korean.
citeseer.ist.psu.edu /338381.html   (455 words)

  
 Bibliography on Part-of-Speech Tagging
Part of speech tagging using a network of linear separators.
Part-of-speech tagging of Dutch with MBT, a memory-based tagger generator.
This is a selection of articles on part-of-speech tagging.
odur.let.rug.nl /robbert/tagging/tagging.html   (511 words)

  
 UCREL Corpus Annotation
Part-of-speech tagging is often seen as the first stage of a more comprehensive syntactic annotation, which assigns a phrase marker, or labelled bracketing, to each sentence of the corpus, in the manner of a phrase structure grammar.
Part-of-speech (POS) tagging, also called grammatical tagging, is the commonest form of corpus annotation, and was the first form of annotation to be developed at Lancaster.
The ACASD semantic tagging system (Wilson and Rayson, 1993) accepts as input text which has been tagged for part of speech using the CLAWS POS tagging system.
www.comp.lancs.ac.uk /ucrel/annotation.html   (1827 words)

  
 Citebase - Distributional Part-of-Speech Tagging
This paper presents an algorithm for tagging words whose part-of-speech properties are unknown.
Robust part-of-speech tagging using a hidden markov model.
In Proceedings of the DARPA Speech and Natural Language Workshop, pages 275-282.
citebase.eprints.org /cgi-bin/citations?id=oai:arXiv.org:cmp-lg/9503009   (424 words)

  
 Part-of-Speech Tagging Using Progol
A system for `tagging' words with their part-of-speech (POS) tags is constructed.
The system has two components: a lexicon containing the set of possible POS tags for a given word, and rules which use a word's context to \em eliminate possible tags for a word.
Progol was altered to allow the caching of information about clauses generated during the induction process which greatly increased efficiency.
www.cs.bris.ac.uk /~ILPnet2/Tools/Reports/Abstracts/cus97-ilp97.html   (183 words)

  
 TnT -- Statistical Part-of-Speech Tagging
TnT, the short form of Trigrams'n'Tags, is a very efficient statistical part-of-speech tagger that is trainable on different languages and virtually any tagset.
Tagging speed depends on the average ambiguity rate of words and the percentage of unknown words in the text.
We measured tagging accuracy for the different tagsets by dividing the corpora in 90% training and 10% test set.
www.coli.uni-saarland.de /~thorsten/tnt   (622 words)

  
 LTG software: LT POS
The task of POS-tagging is to assign part of speech tags to words reflecting their syntactic category.
Regardless of how the lexicon is implemented, the morphological classifier retrieves for a word its possible parts of speech and other related morpho-syntactic features such as number, case, gender, etc. In the POS-tagging framework, each combination of morpho-syntactic features is unambiguously mapped into a POS-tag.
A short introduction to part-of-speech tagging, what it is, and what it isn't.
www.ltg.ed.ac.uk /software/pos   (1101 words)

  
 CA : Part-of-Speech Tagging - Xerox XRCE
The general purpose of a part-of-speech tagger is to associate each word in a text with its morphosyntactic category (represented by a tag).
Tagging French - Comparing a Statistical and a Constraint-Based Method.
Creating a tagset, lexicon and guesser for a French tagger.
www.xrce.xerox.com /competencies/content-analysis/fsnlp/tagger.en.html   (179 words)

  
 Find in a Library: Automatic lexicon generation for unsupervised part-of-speech tagging using only unannotated text
Automatic lexicon generation for unsupervised part-of-speech tagging using only unannotated text
Find in a Library: Automatic lexicon generation for unsupervised part-of-speech tagging using only unannotated text
WorldCat is provided by OCLC Online Computer Library Center, Inc. on behalf of its member libraries.
worldcatlibraries.org /wcpa/ow/5d535bbbaef81aaba19afeb4da09e526.html   (85 words)

  
 Part of Speech Tagging
This is called a part of speech tagger.
Suppose we want to write a program that can guess the part of speech of each ambiguous word (prior to parsing).
We could just use the method from the previous slide..
www.cee.hw.ac.uk /~alison/nl/l13h/node10.html   (93 words)

  
 Citebase - Part-of-Speech Tagging with Minimal Lexicalization
We use a Dynamic Bayesian Network to represent compactly a variety of sublexical and contextual features relevant to Part-of-Speech (PoS) tagging.
Robust Part-of-Speech Tagging Using a Hidden Markov Model.
Mathematical Foundations of Speech and Language Processing, ed.
citebase.eprints.org /cgi-bin/citations?id=oai:arXiv.org:cs/0312060   (398 words)

  
 Eric Brill's Home Page
We have a few NLP programs that you can download: a supervised part of speech tagger, an unsupervised part of speech tagger, and a prepositional phrase attachment program.
I am on the editorial board of Computational Linguistics, and the Journal for Artificial Intelligence Research.
I have also served as the program committee chair for: the 1996 International Conference on Computational Linguistics (COLING-96) Session on Corpus-Based Language Processing, the 1996 Conference on Empirical Methods in Natural Language Processing (Sponsored by the Association for Computational Linguistics) and the 1995 Speech Recognition Symposium, Corpus-Based NLP Session.
www.cs.jhu.edu /~brill/home.html   (300 words)

  
 Special Circumstances: Part-of-Speech Tagging 'A Verbless Post'
Ooh, one other thought, for computational linguists: What bets on the performance of part-of-speech tagging algorithms on prose such as this?
I reached for Adwait Ratnaparkhi's aging but conveniently handy Maximum Entropy part-of-speech tagger and ran it on Pullum's post.
Pullum's post, of course, contains no verbs, but more to the point for this posting, has the following concluding statement:
www.cs.sfu.ca /~anoop/weblog/archives/000031.html   (682 words)

  
 CLAWS part-of-speech tagger
(This is part of the Manual to accompany The British National Corpus (Version 2) and gives guidelines for the C5 tagset).
There is a similar document for the C7 tagset: BNC sampler corpus - guidelines to wordclass tagging.
These are incorporated in a separate document, the Wordclass Tagging Guidelines.
www.comp.lancs.ac.uk /computing/research/ucrel/claws   (682 words)

  
 Slippery soap by James Cardno and Rebecca Keillor New Zealand Listener
Hurricane Brash (TV1, Monday, 8.35pm) is a lone cameraman's recordings behind the scenes at the National Party, beginning at the time of Brash's appointment, and covering the Orewa speech and the Waitangi mudslinging.
In January, a man delivered a speech to the Orewa Rotary Club that set the nation on fire and gave the DNZ cameraman tagging along something to film.
The public and media outcry that followed was like a gift from God, Presbyterian or other, for John Kier and his Screentime Communicado team, who were filming a documentary on the first 100 days of the Don Brash leadership.
www.listener.co.nz /default,1756.sm   (682 words)

  
 Part-of-speech tagging - Wikipedia, the free encyclopedia
For some time, part-of-speech tagging was considered an inseparable part of natural language processing, because there are certain cases where the correct part of speech cannot be decided without understanding the semantics or even the pragmatics of the context.
Part-of-speech tagging is harder than just having a list of words and their parts of speech, because some words can represent more than one part of speech at different times.
Schools commonly teach that there are 8 parts of speech in English: noun, verb, adjective, preposition, pronoun, adverb, conjunction, and interjection.
en.wikipedia.org /wiki/Part-of-speech_tagging   (682 words)

  
 Speech Synthesis Demo
The preprocessing is based on part-of-speech tagging and a thorough analysis of the syntactic structure of the input text.
Linguistic preprocessing to generate a more natural intonation.
www.ims.uni-stuttgart.de /phonetik/synthesis/synthesis_demo.html   (682 words)

  
 POS Tagging Overview
There are many approaches to automated part of speech tagging.
Unsupervised learning of disambiguation rules for part of speech tagging.
This handout is intended to serve as a brief introduction to the types of tagging schemes commonly used today, although no specific system will be discussed.
www.georgetown.edu /faculty/ballc/ling361/tagging_overview.html   (682 words)

  
 466656.txt
This improvement is mainly due to the lower overall ambiguity rate: part-of-speech pre-tagging solved the "semantic" ambiguity for 40% of the ambiguous words in Test 1.
Realizing of course that semantic tagging is a much more difficult problem than partof-speech tagging, we decided nonetheless to perform an experiment to see how well words can be semantically disambiguated using techniques that have proven to be effective in part-of-speech tagging.
3 HMM Tagging Statistical part of speech disambiguation relies on the fact that certain sequences of parts of speech are more probable than others.
nlp.fi.muni.cz /projekty/wnportal/ps/txt/466656.txt   (1897 words)

  
 ESPER: Home
Festival also provides much of the infrastructure that detailed text analysis requires: such as controllable, punctuation and tokenization, part of speech tagging, utterance representation, well-defined extraction of data for machine learning techniques.
ESPER is a component of the StoryTeller project, which focuses on speech synthesis for children's stories.
This involves finding (and defining) appropriate markup for children's story text that is sufficient for modelling of intonation, tagging and parsing of the text, as well as discovering what aspects of language make detectable effects on intonation prosody.
fife.speech.cs.cmu.edu /esper   (308 words)

  
 Word sense disambiguation - encyclopedia article about Word sense disambiguation.
It is instructive to compare the WSD problem with the problem of part-of-speech tagging Part-of-speech tagging is the process of marking up the words in a text with their corresponding parts of speech.
Both involve disambiguating or tagging with words, be it with senses or parts of speech.
However, algorithms used for one do not tend to work well for the other, mainly because the part of speech of a word is primarily determined by the immediately adjacent 1-3 words, whereas the sense of a word may be determined by words a fair way further away.
digbig.com /4cedn   (308 words)

  
 Jorge Graña - Articles & Communications
Graña Gil, J.; Barcala Rodríguez, F.M.; Vilares Ferro, J. Formal Methods of Tokenization for Part-of-Speech Tagging.
Graña Gil, J.; Alonso Pardo, M.A.; Vilares Ferro, M. A Common Solution for Tokenization and Part-of-Speech Tagging: One-Pass Viterbi Algorithm vs. Iterative Approaches.
Graña Gil, J.; Andrade Sánchez, G.; Vilares Ferro, J. Compilation of Constraint-based Contextual Rules for Part-of-Speech Tagging into Finite State Transducers.
www.dc.fi.udc.es /~grana/grana-menu-publications-articles.html   (1334 words)

  
 Home Page: Evelyne Tzoukermann
Since joining Bell Laboratories in 1989, my research interests have spread on a number of areas, including computational morphology, natural language understanding, part-of-speech tagging, and text-to-speech for French.
After that, I spent 2 years at the IBM Watson Research Center in Yorktown Heights, N.Y., where I worked in computational morphology and part-of-speech disambiguation in the Natural Language group as well as in the Speech Recoginition group.
Tzoukermann, Evelyne, Dragomir R. Radev, and William A. Gale, ``Tagging French Without Lexical Probabilities'', (1997, to appear), in Natural Language Processing using Very Large Corpora, Eds Amstrong, Susan and Kenneth Church and Pierre Isabelle and, Evelyne Tzoukermann and David Yarowsky, Kluwer.
www.bell-labs.com /project/tts/evelyne.html   (1334 words)

Try your search on: Qwika (all wikis)

Factbites
  About us   |   Why use us?   |   Reviews   |   Press   |   Contact us  
Copyright © 2005-2007 www.factbites.com Usage implies agreement with terms.