A resource-light approach to morpho-syntactic tagging | Buch | 978-90-420-2768-8 | sack.de

Buch, Englisch, Band 70, 185 Seiten, Format (B × H): 155 mm x 230 mm, Gewicht: 481 g

Reihe: Language and Computers

A resource-light approach to morpho-syntactic tagging


Erscheinungsjahr 2010
ISBN: 978-90-420-2768-8
Verlag: Brill | Rodopi

Buch, Englisch, Band 70, 185 Seiten, Format (B × H): 155 mm x 230 mm, Gewicht: 481 g

Reihe: Language and Computers

ISBN: 978-90-420-2768-8
Verlag: Brill | Rodopi


While supervised corpus-based methods are highly accurate for different NLP tasks, including morphological tagging, they are difficult to port to other languages because they require resources that are expensive to create. As a result, many languages have no realistic prospect for morpho-syntactic annotation in the foreseeable future. The method presented in this book aims to overcome this problem by significantly limiting the necessary data and instead extrapolating the relevant information from another, related language. The approach has been tested on Catalan, Portuguese, and Russian. Although these languages are only relatively resource-poor, the same method can be in principle applied to any inflected language, as long as there is an annotated corpus of a related language available. Time needed for adjusting the system to a new language constitutes a fraction of the time needed for systems with extensive, manually created resources: days instead of years.
This book touches upon a number of topics: typology, morphology, corpus linguistics, contrastive linguistics, linguistic annotation, computational linguistics and Natural Language Processing (NLP). Researchers and students who are interested in these scientific areas as well as in cross-lingual studies and applications will greatly benefit from this work. Scholars and practitioners in computer science and linguistics are the prospective readers of this book.
A resource-light approach to morpho-syntactic tagging jetzt bestellen!

Weitere Infos & Material


List of tables
List of figures
Preface
Introduction
Common tagging techniques
Previous resource-light approaches to NLP
Languages, corpora and tagsets
Quantifying language properties
Resource-light morphological analysis
Cross-language morphological tagging
Summary and further work
Bibliography
Appendices: Tagsets we use; Corpora; Language properties
Citation Index


Anna Feldman is an assistant professor of linguistics and computer science at Montclair State University. She received her Ph.D. from The Ohio State University.

Jirka Hana is a researcher at Charles University in Prague. He holds a Ph.D. degree in linguistics from The Ohio State University and a doctoral degree in computer science from Charles University. He has published numerous articles in computational linguistics.


Ihre Fragen, Wünsche oder Anmerkungen
Vorname*
Nachname*
Ihre E-Mail-Adresse*
Kundennr.
Ihre Nachricht*
Lediglich mit * gekennzeichnete Felder sind Pflichtfelder.
Wenn Sie die im Kontaktformular eingegebenen Daten durch Klick auf den nachfolgenden Button übersenden, erklären Sie sich damit einverstanden, dass wir Ihr Angaben für die Beantwortung Ihrer Anfrage verwenden. Selbstverständlich werden Ihre Daten vertraulich behandelt und nicht an Dritte weitergegeben. Sie können der Verwendung Ihrer Daten jederzeit widersprechen. Das Datenhandling bei Sack Fachmedien erklären wir Ihnen in unserer Datenschutzerklärung.