Springer, 2012. — 127 p.
The impact of computer systems that can understand natural language will be tremendous. To develop this capability we need to be able to automatically and efficiently analyze large amounts of text. Manually devised rules cannot provide the coverage needed to handle the complex structure of natural language, necessitating systems that can automatically learn from examples. To handle the flexibility of natural language, it has become standard practice to use statistical approaches, in which probabilities are assigned to the different readings of a word and to the plausibility of grammatical constructions.
Unfortunately, building and working with rich probabilistic models for real-world problems has proven to be very challenging. Automatically learning highly articulated probabilistic models is difficult, at the very least in terms of parameter estimation. And even if we succeed in learning a good model, inference can be prohibitively slow.
Coarse-to-fine reasoning is an idea that has enabled great advances in scale across a wide range of problems in artificial intelligence. The general idea is simple: when a model is too complex to work with, we construct simpler approximations of it and use those to guide the learning or inference procedures. In computer vision, various coarse-to-fine approaches have been proposed, for example for face detection and general object recognition (Fleuret et al. 2001). Similarly, when building a system that detects humans in images, one might first search for faces and then for the rest of the torso (Lu et al. 2006). Activity recognition in video sequences can also be broken up into smaller parts at different scales (Cuntoor and Chellappa 2007), and similar ideas have been applied to speech recognition (Tang et al. 2006). Despite the intuitive appeal of such methods, it was not obvious how they might be applied to natural language processing (NLP) tasks, where the search spaces are often highly structured and dynamic programming is used to compute probability distributions over the output space.
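As a concrete illustration of this kind of dynamic program, the following minimal Python sketch fills a CKY-style inside chart for a toy probabilistic context-free grammar; the grammar, the sentence and all probabilities are invented for the example and are not taken from this work.

# A minimal sketch of the kind of dynamic program used in NLP:
# computing inside probabilities of a tiny PCFG with a CKY-style chart.
from collections import defaultdict

# Hypothetical toy grammar in Chomsky normal form: (parent, left, right) -> prob
binary_rules = {
    ("S", "NP", "VP"): 1.0,
    ("VP", "V", "NP"): 1.0,
}
# Hypothetical lexicon: (tag, word) -> prob
lexical_rules = {
    ("NP", "she"): 0.5,
    ("NP", "fish"): 0.5,
    ("V", "eats"): 1.0,
}

def inside_chart(words):
    """Fill chart[i][j][A] = total probability that category A spans words[i:j]."""
    n = len(words)
    chart = [[defaultdict(float) for _ in range(n + 1)] for _ in range(n + 1)]
    for i, w in enumerate(words):                      # length-1 spans from the lexicon
        for (tag, word), p in lexical_rules.items():
            if word == w:
                chart[i][i + 1][tag] += p
    for width in range(2, n + 1):                      # longer spans, bottom up
        for i in range(n - width + 1):
            j = i + width
            for k in range(i + 1, j):                  # split point
                for (parent, left, right), p in binary_rules.items():
                    chart[i][j][parent] += p * chart[i][k][left] * chart[k][j][right]
    return chart

chart = inside_chart(["she", "eats", "fish"])
print(chart[0][3]["S"])  # probability mass of all parses rooted in S: 0.25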
In this work, we propose a principled framework in which learning and inference can be seen as two sides of the same coarse-to-fine coin. On both sides we have a hierarchy of models, ranging from an extremely simple initial model to a fully refined final model. During learning, we start with a minimal model and use latent variables to induce increasingly refined models, introducing complexity gradually. Because each learning step introduces only a limited amount of new complexity, estimation is more manageable and requires less supervision. Our coarse-to-fine strategy leads to better parameter estimates, improving the state of the art across different domains and metrics.
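As a rough sketch of this refinement idea (an illustration under assumed details, not the training procedure developed in this work), the snippet below grows a symbol hierarchy by splitting each category into two latent subcategories per level; the perturbed_weights initializer and the symbol names are hypothetical, and the projection back to the coarse parent anticipates the inference regime described next.

# A minimal sketch of growing a model hierarchy by splitting each category
# into latent subcategories, so each level adds only limited new complexity.
import random

def split_symbols(symbols):
    """Each coarse symbol spawns two latent subsymbols at the next level."""
    return [f"{s}-{i}" for s in symbols for i in (0, 1)]

def perturbed_weights(symbols, base=1.0, noise=0.01):
    """Hypothetical initializer: copy the parent's weight and add a tiny
    random perturbation so the subsymbols can specialize during training."""
    return {s: base + random.uniform(-noise, noise) for s in symbols}

level0 = ["NP", "VP"]                  # minimal initial model
level1 = split_symbols(level0)         # ['NP-0', 'NP-1', 'VP-0', 'VP-1']
level2 = split_symbols(level1)         # four NP and four VP subcategories
weights1 = perturbed_weights(level1)   # one (toy) parameter per level-1 subsymbol

def project(symbol):
    """Map a refined subsymbol back to its coarse parent category."""
    return symbol.split("-")[0]

print(level2, [project(s) for s in level2])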
However, because natural language is complex, our final models will necessarily be complex as well. To make inference efficient, we also follow a coarse-to-fine regime: we start with simple, coarse models that resolve easy ambiguities first, while preserving the uncertainty over more difficult constructions. The more complex, fine-grained models are then used only in those places where their rich expressive power is required. The intermediate models of the coarse-to-fine hierarchy are obtained by means of clustering and projection, and allow us to apply models of the appropriate granularity where needed. Our empirical results show that coarse-to-fine inference outperforms other approximate inference techniques on a range of tasks, because it prunes only low-probability regions of the search space and therefore makes very few search errors.
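The pruning regime can be sketched as follows; the items, scores and threshold are invented for illustration and do not reproduce the models evaluated in this work.

# A minimal sketch of coarse-to-fine pruning: a cheap coarse model scores all
# candidate items, low-probability items are pruned, and the expensive fine
# model is evaluated only on the survivors.
PRUNE_THRESHOLD = 1e-3  # hypothetical coarse posterior threshold

def coarse_to_fine(items, coarse_posterior, fine_score):
    """coarse_posterior and fine_score are callables mapping an item to a number."""
    survivors = [x for x in items if coarse_posterior(x) >= PRUNE_THRESHOLD]
    # Only the (few) surviving items pay the cost of the fine-grained model.
    scored = {x: fine_score(x) for x in survivors}
    return max(scored, key=scored.get)

# Toy usage with made-up scores: most readings are cheaply ruled out coarsely.
items = ["reading-A", "reading-B", "reading-C"]
coarse = {"reading-A": 0.6, "reading-B": 0.39, "reading-C": 0.0001}.get
fine = {"reading-A": 0.45, "reading-B": 0.55, "reading-C": 0.9}.get
print(coarse_to_fine(items, coarse, fine))  # 'reading-B'; C never reaches the fine model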
Latent Variable Grammars for Natural Language Parsing
Discriminative Latent Variable Grammars
Structured Acoustic Models for Speech Recognition
Coarse-to-Fine Machine Translation Decoding
Conclusions and Future Work