Strict pattern-based methods
Submitted by Anonymous on Sun, 2008-05-25 19:53.
Strict pattern-based methods of grammar induction are often frustrated by the apparently inexhaustible variety of novel word combinations in large corpora. Statistical methods offer a possible solution by allowing frequent well-formed expressions to overwhelm the infrequent ungrammatical ones. They also have the desirable property of being able to construct robust grammars from positive instances alone. Unfortunately, the zero-frequency problem entails assigning a small probability to every possible word pattern, so unseen ungrammatical n-grams become as probable as unseen grammatical ones. Further, such grammars are unable to take advantage of inherent lexical properties that should allow infrequent words to inherit the syntactic properties of the class to which they belong.
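The zero-frequency problem can be made concrete with a small sketch. The example below is only an illustration under stated assumptions: it uses a bigram model with add-one (Laplace) smoothing, which is one common way of reserving probability for unseen patterns and is not necessarily the method the post has in mind. The toy corpus and the names `train_bigrams` and `smoothed_prob` are hypothetical.

```python
from collections import Counter


def train_bigrams(sentences):
    """Count the vocabulary, context words, and bigrams in whitespace-tokenized sentences."""
    vocab = set()
    contexts = Counter()  # how often each word appears as the first word of a bigram
    bigrams = Counter()
    for sentence in sentences:
        tokens = sentence.split()
        vocab.update(tokens)
        contexts.update(tokens[:-1])
        bigrams.update(zip(tokens, tokens[1:]))
    return vocab, contexts, bigrams


def smoothed_prob(prev, word, vocab, contexts, bigrams, alpha=1.0):
    """Additive-smoothing estimate of P(word | prev); alpha=1.0 is add-one smoothing."""
    return (bigrams[(prev, word)] + alpha) / (contexts[prev] + alpha * len(vocab))


# Hypothetical toy corpus of positive (grammatical) instances only.
corpus = ["the dog barks", "the cat sleeps", "a dog sleeps"]
vocab, contexts, bigrams = train_bigrams(corpus)

seen = smoothed_prob("the", "dog", vocab, contexts, bigrams)    # attested bigram
unseen_ok = smoothed_prob("a", "cat", vocab, contexts, bigrams)  # grammatical, never observed
unseen_bad = smoothed_prob("a", "the", vocab, contexts, bigrams) # ungrammatical, never observed

print(seen, unseen_ok, unseen_bad)
```

In this sketch the attested bigram "the dog" scores higher than either unseen one, but "a cat" and "a the" receive exactly the same probability, because the model sees only counts. Nothing tells it that "cat" belongs to the same class as "dog" and should inherit its distribution after a determiner.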

