Xlike requires the linguistic processing of large numbers of documents in a variety of languages.<\/p>\n
<\/a>Figure 1: Xlike Language Processing Architecture.<\/em><\/p>\n
Therefore all the pipelines (one for each language) have been implemented as web services and may be requested to produce different levels of analysis (e.g. Tokenization, lemmatization, NERC, parsing, relation extraction, etc.). This approach is very appealing due to the fact that it allows to treat every language independently and to execute the whole language analysis process at different threads or computers allowing an easier parallelization (e.g., using external high performance platforms such as Amazon Elastic Compute Cloud EC2<\/a> as needed. Furthermore, it also provides independent development life-cycles for each language which is crucial in this type of research projects. Recall that these web services can be deployed locally or\u00a0remotely, maintaining the option of using them in a stand-alone configuration.<\/p>\n
Semantic Representation<\/strong><\/p>\n
Dependency Parsing<\/strong><\/p>\n
In XLike, we use the so-called graph-based methods for dependency parsing. In particular we use MSTParser<\/a> for Chinese and Croatian, and Treeler<\/a> –a library developed by the UPC team that implements several methods for dependency parsing, among other statistical methods for tagging and parsing– for the other languages.<\/p>\n
Semantic Role Labeling<\/strong><\/p>\n
Word Sense Disambiguation<\/strong><\/p>\n
Frame Extraction<\/strong><\/p>\n
For example, in the sentence in Figure 2, we can find three frames:<\/p>\n
\n- Base<\/strong>: A person or organization being established or grounded somewhere. This frame has two participants: Acme, a participant of type entity playing the theme role (the thing being based), and New York, a participant of type entity playing the role of location.<\/li>\n
- Plan<\/strong>: A person or organization planning some activity. This frame has three participants: Acme, a participant of type entity playing the agent role, now, a participant of type word playing the role of time, and make, a participant of type frame playing the theme role (i.e., the activity being planned).<\/li>\n
- Make<\/strong>: A person or organization creating or producing something. Participants in this frame are: Acme, entity playing the agent role, and products, a participant of type word playing the theme role (i.e., the thing being created).<\/li>\n<\/ul>\n
A graphical representation of the example sentence is presented in Figure 3.<\/p>\n
<\/a>
\nFigure 3: Graphical representation of frames in the example sentence.<\/p>\n
The code that was used to process text and generate the output shown above is available under an open-source licence and can be downloaded here<\/a>.<\/p>\n","protected":false},"excerpt":{"rendered":"