Accessing Natural Language Processing Engines and Tasks
Saveski, Grigorij Ljubin (2014)
Saveski, Grigorij Ljubin
2014
Master's Degree Programme in Information Technology
Tieto- ja sähkötekniikan tiedekunta - Faculty of Computing and Electrical Engineering
This publication is copyrighted. You may download, display and print it for Your own personal use. Commercial use is prohibited.
Hyväksymispäivämäärä
2014-12-03
Julkaisun pysyvä osoite on
https://urn.fi/URN:NBN:fi:tty-201412051580
https://urn.fi/URN:NBN:fi:tty-201412051580
Tiivistelmä
This thesis presents how a natural language task can be accessed through the use of natural language processing engine in an easy way. So far the access to the task of part-of-speech tagging and other tasks has been going through the engine command line inter-face, which demands both knowledge and experience in scripting and programming. Moreover, manual work had also been required to prepare the input data in order to be fed into the engine. At the same time all the output files from the task and engine have been handled manually.
To solve these issues, both the OpenNLP engine and its part-of-speech tagging task are integrated into a web interface that can be used by individuals that possess little or no technical knowledge. Furthermore, the system also guides the users through a process where they can input their data and it will automatically be processed and prepared for further use. After that they can follow the rest of the task and use the engine. At various points of the usage, the data is saved so that it can be used later to continue the process from wherever it was stopped. The data files are stored and organized on a server, which helps reusability. At the same time, the structure of the system is easy to extend with other language processing tasks and engines according to future needs. Last but not least, the current implementation makes the whole interface accessible from different locations and is quite portable. No graphical user interface details for the system will be presented in this thesis.
The resulting interface provides for ease of use, access, and expandability. Some challenges in the future include increased complexity of the system because of different tasks and engines. Moreover, certain parts of the process and the structure of the implementation could be improved.
To solve these issues, both the OpenNLP engine and its part-of-speech tagging task are integrated into a web interface that can be used by individuals that possess little or no technical knowledge. Furthermore, the system also guides the users through a process where they can input their data and it will automatically be processed and prepared for further use. After that they can follow the rest of the task and use the engine. At various points of the usage, the data is saved so that it can be used later to continue the process from wherever it was stopped. The data files are stored and organized on a server, which helps reusability. At the same time, the structure of the system is easy to extend with other language processing tasks and engines according to future needs. Last but not least, the current implementation makes the whole interface accessible from different locations and is quite portable. No graphical user interface details for the system will be presented in this thesis.
The resulting interface provides for ease of use, access, and expandability. Some challenges in the future include increased complexity of the system because of different tasks and engines. Moreover, certain parts of the process and the structure of the implementation could be improved.