News item

Business and Economy
20.01.2012 19:03
SCIENCE, MEDIA, LANGUAGE, MEETING
  • print
  • smaller font
  • larger font

Cross-Lingual Knowledge Extraction Project Launched at Bled

Foto Foto servis

A meeting of participants in international project XLike.
File photo.

Bled, 20 January (STA) - XLike, an ambitious project aimed at developing technology to monitor and aggregate knowledge that is currently spread across global mainstream and social media and at enabling cross-lingual services, was launched with a three-day meeting at Bled that concluded on Friday.

Featuring Ljubljana-based Jožef Stefan Institute (IJS) as the coordinator, research institutes from Germany, Spain, Croatia and China, as well the Slovenian Press Agency (STA) and contributions from Bloomberg and The New York Times, the three-year project is estimated at almost EUR 5m, EUR 3.5m of which is to come from EU funds.

The idea is to combine scientific insights for a breakthrough in the area of machine-based cross-lingual text understanding, including for the needs of media, the head of the project Marko Grobelnik, who works in the IJS artificial intelligence lab, told the STA.

The project, detailed information on which is available at www.xlike.org, will draw on the latest findings in computational linguistics, machine learning, text mining and semantic technologies.

The software solutions envisaged would for instance enable the connecting of reports by individual media with similar reports on related subject in any language. The detection of plagiarism could be one of the applications.

Also a goal is fast detection of new events via announcements in different languages on the internet. This would give editors a tool for identifying new developments, such as spontaneously organised protests, at an early stage.

The project will focus on six languages - English, German, Spanish and Chinese as major languages, and Catalan and Slovenian as smaller languages.

The basic messages of texts will however be attainable in more than 100 languages, which is to be achieved with the help of a Wikipedia-based corpus.

The meeting at Bled featured guests and representatives of partners, including the IJS, the Karlsruhe Institute of Technology, Universitat Politecnica de Catalunya, University of Zagreb, the Chinese Tsinghua University, and iSOCO from Spain.

The next meeting is expected in April in Barcelona, where the goal is a detailed plan for the execution of the project.

gz/zm
20.01.2012 19:03

© STA 2014. All rights reserved. ISSN 1854-214X