(apologies for X-posting)
We are pleased to announce the release of UBY 1.0 -
a large-scale lexical-semantic resource for natural language processing (NLP)
based on the ISO standard Lexical Markup Framework (LMF):
UBY combines a wide range of information from expert-constructed and collaboratively
constructed resources for English and German.
Currently, UBY holds structurally and semantically interoperable versions of nine
resources in two languages:
* English WordNet, Wiktionary, Wikipedia, FrameNet and VerbNet,
* German Wikipedia, Wiktionary and GermaNet, and multilingual OmegaWiki.
A subset of these resources is linked at the word sense level.
There are monolingual sense alignments between VerbNet–FrameNet and VerbNet–WordNet as
well as between WordNet–Wikipedia and WordNet–Wiktionary.
In addition, UBY provides cross-lingual sense alignments between WordNet and German
OmegaWiki, also including the inter-language links given in Wikipedia and OmegaWiki.
All resources in UBY are represented according to our LMF lexicon model, UBY-LMF.
UBY-LMF captures lexical information at a ﬁne-grained level by employing a large number of
Data Categories from ISOCat.
Highlights of UBY:
* The union of a wide range of heterogeneous resources in a single, standardized
* The linking at the word sense level between a subset of the resources.
UBY is complemented by a Java API, the UBY-API, and conversion tools (e.g., for converting
the resources to UBY-LMF).
The UBY API and conversion tools are available at Google Code:
Highlights of the UBY-API:
* Unified access to the various information types in the nine resources.
* Easy cross-resource access to the various information types in the resources.
A tutorial showing the use of the UBY-API can be found at
A Web Interface for exploring and visualizing UBY is currently being developed and will
soon be available
This project was initiated under the auspices of Prof. Dr. Iryna Gurevych, Ubiquitous
Knowledge Processing Lab (UKP), Technische Universität Darmstadt.
We are grateful for the generous financial support from the Volkswagen Foundation and the
German Research Foundation.
Please direct any questions or suggestions to
Christian M. Meyer, M.Sc.
Ubiquitous Knowledge Processing (UKP Lab)
FB 20 Computer Science Department
Technische Universität Darmstadt
Hochschulstr. 10, D-64289 Darmstadt, Germany
Phone [+49] (0)6151 16-7477, fax -5455, room S2/02/B117
Web Research at TU Darmstadt (WeRC)