Automatic Biological Term Annotation. Using n-gram and Classification Models

Automatic Biological Term Annotation. Using n-gram and Classification Models

Sittichai Jiampojamarn

     

бумажная книга



Издательство: Книга по требованию
Дата выхода: июль 2011
ISBN: 978-3-6391-0733-3
Объём: 96 страниц
Масса: 166 г
Размеры(В x Ш x Т), см: 23 x 16 x 1

Exciting research in biology has resulted in a large amount of biological publications. Knowledge discovery in biology becomes an interesting task which can be established by recognizing terms in text to extract useful information such as interaction relationships. We propose the Automatic Biological Term Annotation (ABTA) system which uses classification methods to annotate terms in text. A novel method is presented to express lexical features in pattern notations. Prefix and suffix characters are used instead of lists of potential terms or external resources. We demonstrate that part-of-speech tag information is the most effective attribute. Creating classification exemplars is conducted from text by using word n-gram model. We illustrate improvements on our system's performance which depends on the feature attributes we define. Biological concept markers are also assigned to each located term indicating its meaning. Our results are comparable to the performance of other existing systems while our system retains simplicity and generalizability.

Данное издание не является оригинальным. Книга печатается по технологии принт-он-деманд после получения заказа.

Каталог