Automated Semantic Analysis of Schematic Data. Learning-based Techniques for Scalable and Automated Semantic Understanding of Template Generated Schematic Web Content

Automated Semantic Analysis of Schematic Data. Learning-based Techniques for Scalable and Automated Semantic Understanding of Template Generated Schematic Web Content

     

бумажная книга



Издательство: Книга по требованию
Дата выхода: июль 2011
ISBN: 978-3-6390-2674-0
Объём: 108 страниц
Масса: 184 г
Размеры(В x Ш x Т), см: 23 x 16 x 1

Content in numerous data sources are not directly amenable to machine processing. This book describes techniques for automated semantic analysis of schematic content which are characterized by being populated from backend databases. Starting with a seed set of hand-labeled instances of semantic concepts in a set of HTML documents, a technique is devised that bootstraps an annotation process for automatic identification of concept instances present in other documents. The technique exploits the observation that semantically related items in schematic HTML documents exhibit consistency in presentation style and spatial locality to learn statistical concept models, using light-weight semantic features. This model directs the annotation of diverse Web documents possessing similar content semantics. The power of these techniques is demonstrated through applications developed for real-life problems that include audio-based assistive browsing for non-visual Web access, focused browsing on handhelds with semantic bookmarks, text data cleaning, and accurate identification of remote homologs of biological protein sequences.

Данное издание не является оригинальным. Книга печатается по технологии принт-он-деманд после получения заказа.

Каталог