2022
pdf
bib
abs
Towards Dynamic Wordnet: Time Flow Hydra
Borislav Rizov
|
Tinko Tinchev
Proceedings of the 5th International Conference on Computational Linguistics in Bulgaria (CLIB 2022)
Hydra is a Wordnet management system where the Synsets from different languages live in a common relational structure (Kripke frame) with a user-frendly GUI for searching, editing and alignment of the objects from the different languages. The data is retrieved by means of a modal logic query language. Despite its many merits the system stores only the current state of the wordnet data. Wordnet editing and development opens questions for wordnet data, structure and its consistency over time. The new Time Flow Hydra uses a Dynamic wordnet model with a discrete time embeded where all the states of all the objects are stored and accessed simultaneously. This provides the ability to track the changes, to detect the desired and undesired results of the data evolution. For example, we can ask which objects 10 days ago had 2 hyponyms, and 5 days later have 3.
2016
pdf
bib
abs
Hydra for Web: A Browser for Easy Access to Wordnets
Borislav Rizov
|
Tsvetana Dimitrova
Proceedings of the 8th Global WordNet Conference (GWC)
This paper presents a web interface for wordnets named Hydra for Web which is built on top of Hydra – an open source tool for wordnet development – by means of modern web technologies. It is a Single Page Application with simple but powerful and convenient GUI. It has two modes for visualisation of the language correspondences of searched (and found) wordnet synsets – single and parallel modes. Hydra for web is available at:
http://dcl.bas.bg/bulnet/.
2015
pdf
bib
Automatic Classification of WordNet Morphosemantic Relations
Svetlozara Leseva
|
Ivelina Stoyanova
|
Maria Todorova
|
Tsvetana Dimitrova
|
Borislav Rizov
|
Svetla Koeva
The 5th Workshop on Balto-Slavic Natural Language Processing
2014
pdf
bib
Coping with Derivation in the Bulgarian Wordnet
Tsvetana Dimitrova
|
Ekaterina Tarpomanova
|
Borislav Rizov
Proceedings of the Seventh Global Wordnet Conference
pdf
bib
Hydra: A Software System for Wordnet
Borislav Rizov
Proceedings of the Seventh Global Wordnet Conference
pdf
bib
Anaphora – Clause Annotation and Alignment Tool.
Borislav Rizov
|
Rositsa Dekova
Proceedings of the Demonstrations at the 14th Conference of the European Chapter of the Association for Computational Linguistics
2012
pdf
bib
Application of Clause Alignment for Statistical Machine Translation
Svetla Koeva
|
Svetlozara Leseva
|
Ivelina Stoyanova
|
Rositsa Dekova
|
Angel Genov
|
Borislav Rizov
|
Tsvetana Dimitrova
|
Ekaterina Tarpomanova
|
Hristina Kukova
Proceedings of the Sixth Workshop on Syntax, Semantics and Structure in Statistical Translation
pdf
bib
abs
Bulgarian X-language Parallel Corpus
Svetla Koeva
|
Ivelina Stoyanova
|
Rositsa Dekova
|
Borislav Rizov
|
Angel Genov
Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC'12)
The paper presents the methodology and the outcome of the compilation and the processing of the Bulgarian X-language Parallel Corpus (Bul-X-Cor) which was integrated as part of the Bulgarian National Corpus (BulNC). We focus on building representative parallel corpora which include a diversity of domains and genres, reflect the relations between Bulgarian and other languages and are consistent in terms of compilation methodology, text representation, metadata description and annotation conventions. The approaches implemented in the construction of Bul-X-Cor include using readily available text collections on the web, manual compilation (by means of Internet browsing) and preferably automatic compilation (by means of web crawling ― general and focused). Certain levels of annotation applied to Bul-X-Cor are taken as obligatory (sentence segmentation and sentence alignment), while others depend on the availability of tools for a particular language (morpho-syntactic tagging, lemmatisation, syntactic parsing, named entity recognition, word sense disambiguation, etc.) or for a particular task (word and clause alignment). To achieve uniformity of the annotation we have either annotated raw data from scratch or transformed the already existing annotation to follow the conventions accepted for BulNC. Finally, actual uses of the corpora are presented and conclusions are drawn with respect to future work.
2008
pdf
bib
abs
Chooser: a Multi-Task Annotation Tool
Svetla Koeva
|
Borislav Rizov
|
Svetlozara Leseva
Proceedings of the Sixth International Conference on Language Resources and Evaluation (LREC'08)
The paper presents a tool assisting manual annotation of linguistic data developed at the Department of Computational linguistics, IBL-BAS. Chooser is a general-purpose modular application for corpus annotation based on the principles of commonality and reusability of the created resources, language and theory independence, extendibility and user-friendliness. These features have been achieved through a powerful abstract architecture within the Model-View-Controller paradigm that is easily tailored to task-specific requirements and readily extendable to new applications. The tool is to a considerable extent independent of data format and representation and produces outputs that are largely consistent with existing standards. The annotated data are therefore reusable in tasks requiring different levels of annotation and are accessible to external applications. The tool incorporates edit functions, pass and arrangement strategies that facilitate annotators work. The relevant module produces tree-structured and graph-based representations in respective annotation modes. Another valuable feature of the application is concurrent access by multiple users and centralised storage of lexical resources underlying annotation schemata, as well as of annotations, including frequency of selection, updates in the lexical database, etc. Chooser has been successfully applied to a number of tasks: POS tagging, WS and syntactic annotation.
pdf
bib
abs
Hydra: a Modal Logic Tool for Wordnet Development, Validation and Exploration
Borislav Rizov
Proceedings of the Sixth International Conference on Language Resources and Evaluation (LREC'08)
This paper presents a multipurpose system for wordnet (WN) development, named Hydra. Hydra is an application for data editing and validation, as well as for data retrieval and synchronization between wordnets for different languages. The use of modal language for wordnet, the representation of wordnet as a relational database and the concurrent access are among its main advantages.