![]() The server version provides support for saving user corpora on server and then accessing the saved corpora via unique URL. There is also a module of the project which makes it possible to deploy the project on server, written on Python3, Flask. The source code for conversion support is located in. It also allows to either upload or paste corpora in plain text and then convert them into CoNLL-U. The interface allows to work with CoNLL-U and CG3 formats, and to convert the data between the formats. The source code for editing support is mostly located in. standalone/lib/cy-style.js.Įditing POS labels, editing deprels, drawing arcs and deleting arcs are undoable and redoable. The source code for visualisation support is located in. This was made to add functionality which brat's visualisation library could not provide, namely, easier implementation of editing functionality and alignment (right-to-left, top to bottom) settings. Cytoscape is a JS graph library primarily developed for biologists, but avaliable to use for different purposes. As a part of this project, I have rewritten the visualisation part using the Cytoscape library. Primarily, the tool was using brat's JavaScript library for visualisation. The tool should work both online and offline and allow a user to edit the annotation in both graphical and text modes. The aim of this project was to create an easy-to-use, quick and interactive interface tool for Universal Dependencies annotation based on the existing Apertium project. The interface allowed the user to either enter their trees in the text area or upload a treebank from a file and switch between sentences. Finally, it requires a web-server in order to be used by a team of annotators.īefore GSoC 2017 Apertium had a web-interface for visualising syntactic trees. Basically, this interface lacks a lot of features that could be very useful for annotation. Secondly, it does not allow to edit tokenisation. Firstly, it does not allow a user to edit the source. However, the tool has a number of issues. ![]() There is a tool for doing syntactic annotation called brat. For creating a good treebank, manual annotation and/or disambiguation is required. It can be used both for the purposes of linguistic research and for training statistical parser, which in turn can serve different purposes of natural language processing. The contributions were made to the master branch of the UD annotatrix repository on GitHub ( last commit).ĭependency treebank is a corpus of sentences with annotated dependency structure. 5 The project's architecture and components.
0 Comments
Leave a Reply. |