IEICE TRANSACTIONS on Information

Design Framework of a Database for Structured Documents with Object Links

Masatoshi YOSHIKAWA, Hiroyuki KATO, Hiroko KINUTANI

Summary:

Structured documents often contain character strings whose semantics can be naturally stored as database values or that correspond directly to database values. Building bilateral logical links between character strings in documents and the corresponding database values makes semantically rich queries expressible. We have introduced a new ADT, named "paratext," to model text that has links with database values. A paratext is logically viewed as consisting of two parallel layers: the "appearance" layer holds ordinary text (i.e., a linear sequence of character strings), while the "reference" layer holds an array of OIDs and literals. Each OID or literal on the reference layer is associated with a contiguous substring of the appearance-layer text and represents the semantics of that substring. We have also designed domain-specific functions for this document model. Using these functions, we can express queries that go back and forth between the two layers. In structured documents, linked character strings can appear as the whole content of a logical element or as phrases inside a logical element. We also present frameworks for implementing the paratext ADT and discuss how traditional full-text indexing techniques can be extended to support paratext.
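
The two-layer view described in the summary can be illustrated with a minimal Python sketch. It is not the authors' implementation: the names `OID`, `Link`, `Paratext`, `link_phrase`, `refs_for`, and `substrings_for` are hypothetical, as are the sample sentence and identifiers. It only assumes what the summary states, namely that each OID or literal on the reference layer is tied to one contiguous substring of the appearance-layer text, and that queries can move between the two layers.

```python
from dataclasses import dataclass, field
from typing import List, Union


@dataclass(frozen=True)
class OID:
    """Hypothetical stand-in for a database object identifier."""
    value: str


# A reference-layer entry is either an OID or a plain literal.
Ref = Union[OID, str, int, float]


@dataclass(frozen=True)
class Link:
    start: int  # inclusive offset into the appearance text
    end: int    # exclusive offset into the appearance text
    ref: Ref    # semantics of the substring appearance[start:end]


@dataclass
class Paratext:
    appearance: str                                   # appearance layer
    links: List[Link] = field(default_factory=list)   # reference layer

    def link_phrase(self, phrase: str, ref: Ref) -> None:
        """Link the first occurrence of `phrase` to `ref`."""
        start = self.appearance.index(phrase)  # raises ValueError if absent
        self.links.append(Link(start, start + len(phrase), ref))

    def refs_for(self, phrase: str) -> List[Ref]:
        """Appearance layer to reference layer: values linked to `phrase`."""
        return [l.ref for l in self.links
                if self.appearance[l.start:l.end] == phrase]

    def substrings_for(self, ref: Ref) -> List[str]:
        """Reference layer to appearance layer: text linked to `ref`."""
        return [self.appearance[l.start:l.end] for l in self.links
                if l.ref == ref]


# Hypothetical sample data, made up for illustration only.
doc = Paratext("The report was written by Yoshikawa in 1999.")
doc.link_phrase("Yoshikawa", OID("person:1"))  # substring linked to an object
doc.link_phrase("1999", 1999)                  # substring linked to a literal
```

With this sketch, `doc.refs_for("Yoshikawa")` moves from the appearance layer to the reference layer, and `doc.substrings_for(1999)` moves in the opposite direction. A linear scan over the links is used here for brevity; it says nothing about how the indexing extensions discussed in the paper work.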

Publication
IEICE TRANSACTIONS on Information Vol.E82-D No.1 pp.147-155
Publication Date
1999/01/25
Type of Manuscript
Special Section PAPER (Special Issue on New Generation Database Technologies)
Category
Web and Document Databases
