Buch, Englisch, 334 Seiten, Format (B × H): 160 mm x 241 mm, Gewicht: 1490 g
Buch, Englisch, 334 Seiten, Format (B × H): 160 mm x 241 mm, Gewicht: 1490 g
Reihe: Text, Speech and Language Technology
ISBN: 978-0-7923-5896-1
Verlag: Springer Netherlands
In both the linguistic and the language engineering community, the creation and use of annotated text collections (or annotated corpora) is currently a hot topic. Annotated texts are of interest for research as well as for the development of natural language pro cessing (NLP) applications. Unfortunately, the annotation of text material, especially more interesting linguistic annotation, is as yet a difficult task and can entail a substan tial amount of human involvement. Allover the world, work is being done to replace as much as possible of this human effort by computer processing. At the frontier of what can already be done (mostly) automatically we find syntactic wordclass tagging, the annotation of the individual words in a text with an indication of their morpho syntactic classification. This book describes the state of the art in syntactic wordclass tagging. As an attempt to give an overall view of the field, this book is of interest to (at least) two, possibly very different, types of reader. The first type consists of those people who are using, or are planning to use, tagged material and taggers. They will want to know what the possibilities and impossibilities of tagging are, but are not necessarily interested in the internal working of automatic taggers. This, on the other hand, is the main interest of our second type of reader, the builders of automatic taggers and other natural language processing software.
Zielgruppe
Research
Autoren/Hrsg.
Fachgebiete
- Mathematik | Informatik EDV | Informatik Informatik Künstliche Intelligenz Wissensbasierte Systeme, Expertensysteme
- Mathematik | Informatik Mathematik Stochastik Mathematische Statistik
- Mathematik | Informatik Mathematik Stochastik Wahrscheinlichkeitsrechnung
- Technische Wissenschaften Elektronik | Nachrichtentechnik Elektronik Robotik
- Geisteswissenschaften Sprachwissenschaft Computerlinguistik, Korpuslinguistik
- Mathematik | Informatik EDV | Informatik Daten / Datenbanken Datenkompression, Dokumentaustauschformate
Weitere Infos & Material
I The User’s View.- 1 Orientation.- 2 A Short History of Tagging.- 3 The Use of Tagging.- 4 Tagsets.- 5 Standards for Tagsets.- 6 Performance of Taggers.- 7 Selection and Operation of Taggers.- II The Implementer’s View.- 8 Automatic Taggers: An Introduction.- 9 Tokenization.- 10 Lexicons for Tagging.- 11 Standardization in the Lexicon.- 12 Morphological Analysis.- 13 Tagging Unknown Words.- 14 Hand-Crafted Rules.- 15 Corpus-Based Rules.- 16 Hidden Markov Models.- 17 Machine Learning Approaches.- Appendix A: Example tagsets.- A.1 The Brown Corpus tagset.- A.2 The Penn Treebanktagset.- A.3 The EngCG tagset.- References.