Book: Besmir Hasanaj, "A Part of Speech Tagging Model for Albanian"

A Part of Speech Tagging Model for Albanian

Manufacturer: "LAP Lambert Academic Publishing"

With the enormous growth of digital information, advanced ways of processing it are needed. The goal is to improve information retrieval, information extraction and natural language processing. One of the most complicated of these processes is text mining, which deals with finding high-quality information in text. This book presents a statistical part-of-speech tagging model for Albanian. Training, testing and evaluation are done with the Apache OpenNLP tool. Tagging is performed with both a basic and a large tagset. The experiments use a tagger model trained on a corpus of standard Albanian text written by Albanian authors. The tagger model is tested using cross-validation and a sample text. Results showed that the accuracy of the trained tagger model in real testing environments was about 70%, and up to 98% when the environment settings were optimized for the best accuracy. It was also noticed that the overall accuracy for this...

Publisher: "LAP Lambert Academic Publishing" (2012)

ISBN: 9783659223273

