Read more
This book constitutes the thoroughly refereed post-workshop proceedings of the 4th International Workshop on Principles of Digital Document Processing, PODDP'98, held in Saint Malo, France, in March 1998.
The 12 revised full papers presented were carefully reviewed during two rounds of selection for inclusion in the book. The book is divided into sections on document models and structures, characterization of documents and corpora, and accessing collections of documents.
List of contents
Document Models and Structures.- Context and Caterpillars and Structured Documents.- A Conceptual Model for Tables.- Analysis of Document Structures for Element Type Classification.- Using Document Relationships for Better Answers.- Characterization of Documents and Corpora.- Generating, Visualizing, and Evaluating High-Quality Clusters for Information Organization.- On the Specification of the Display of Documents in Multi-lingual Computing.- Spotting Topics with the Singular Value Decomposition.- A Linear Algebra Approach to Language Identification.- Accessing Collections of Documents.- Indexed Tree Matching with Complete Answer Representations.- Combining the Power of Query Languages and Search Engines for On-line Document and Information Retrieval : The QIRi@D Environment.- Intensional HTML.- Data Model for Document Transformation and Assembly.
About the author
Charles Nicholas is currently a Professor of Computer Science and Chair of the Computer Science and Electrical Engineering Department at UMBC, where he has been since 1988. He received his Ph.D. from The Ohio State University in 1988. Dr. Nicholas' research interests include electronic document processing, information retrieval, and software engineering.