The Hybridisation of a Data-driven Parser for Natural Languages

Research output: Chapter in Book/Report/Conference proceedingConference contribution

  • Authors:
  • Sardar Jaf
  • Allan Ramsay

Abstract

Identifying and establishing structural relations between words in natural language sentences is called Parsing . Ambiguities in natural languages make parsing a difficult task. Parsing is even more difficult when dealing with a structurally complex natural language such as Arabic, which has a number of properties that make it particularly difficult to handle. In this paper, we briefly highlight some of the complex structure of Arabic, and we identify different parsing approaches (grammar-driven and data-driven approaches) and briefly discuss their limitations. Our main goal is to produce a hybrid parser, by combining different parsing approaches, which retains the advantages of data-driven approaches but is guided by a set of grammatical rules to produce more accurate results. We describe a novel technique for directly combining different parsing approaches. Results for our initial experiments that we have conducted in this work, and our plans for future work are also presented.

Bibliographical metadata

Original languageEnglish
Title of host publication 6th International Conference on Language and Technology
Pages531-535
Number of pages5
Publication statusPublished - 2013
Event6th International Conference on Language and Technology - Polan
Event duration: 1 Jan 1824 → …

Conference

Conference6th International Conference on Language and Technology
CityPolan
Period1/01/24 → …