Text Databases

Author: Crist-Jan Doedens
Publisher: Rodopi
Total Pages: 324
Release: 1994
Genre: Computers
ISBN: 9789051837292

Manipulation of text by means of the computer is well established. Everybody has a word processor on his or her desk, and electronic mail, desktop publishing, text interchange languages, hypertext and multimedia are technologies many will be aware of. However, the full potential of the computer for the management and use of textual information is far from being tapped. Tapping it requires a more principled approach, one that creates a framework on which existing and future technologies can build and in which they can be integrated. This book can be seen as one step on this road. It draws on the experience gained in working with a rich electronic linguistic corpus, the ECA database. A basic text database model is put forward, and several text database retrieval languages are defined and analysed. A clear direction for further research is given. The book is therefore of relevance to researchers and developers in the field of corpus linguistics and in the more general field of electronic text.

Information Retrieval

Author: David A. Grossman
Publisher: Springer
Release: 2012-10-23
Genre: Computers
ISBN: 9781461375326

Information Retrieval: Algorithms and Heuristics is a comprehensive introduction to the study of information retrieval covering both effectiveness and run-time performance. The focus of the presentation is on algorithms and heuristics used to find documents relevant to the user request and to find them fast. The most commonly used algorithms and heuristics are worked through with multiple examples. To facilitate understanding and applications, introductions to and discussions of computational linguistics, natural language processing, probability theory, and library and computer science are provided. While this text focuses on algorithms and not on commercial products per se, the basic strategies used by many commercial products are described. Techniques that can be used to find information on the Web, as well as in other large information collections, are included. This volume is an invaluable resource for researchers, practitioners, and students working in information retrieval and databases. For instructors, a set of PowerPoint slides, including speaker notes, is available online from the authors.

Intelligent Document Retrieval

Author: Udo Kruschwitz
Publisher: Springer Science & Business Media
Total Pages: 205
Release: 2006-01-09
Genre: Computers
ISBN: 1402037686

Collections of digital documents can nowadays be found everywhere in institutions, universities or companies. Examples are Web sites or intranets. But searching them for information can still be painful. Searches often return either large numbers of matches or no suitable matches at all. Such document collections can vary a lot in size and how much structure they carry. What they have in common is that they typically do have some structure and that they cover a limited range of topics. The second point is significantly different from the Web in general. The type of search system that we propose in this book can suggest ways of refining or relaxing the query to assist a user in the search process. In order to suggest sensible query modifications we would need to know what the documents are about. Explicit knowledge about the document collection encoded in some electronic form is what we need. However, typically such knowledge is not available. So we construct it automatically.

Effective Databases for Text & Document Management

Author: Shirley A. Becker
Publisher: IGI Global
Total Pages: 387
Release: 2003-01-01
Genre: Computers
ISBN: 1931777632

Focused on the latest research on text and document management, this guide addresses the information management needs of organizations by providing the most recent findings. It demonstrates how the need for effective databases to house information is affecting organizations worldwide, and how some organizations that possess vast amounts of data are unable to use that data economically and efficiently. A taxonomy for object-oriented databases, metrics for controlling database complexity, and a guide to accommodating hierarchies in relational databases are provided. Also covered is how to apply Java-triggers for X-Link management and how to build signatures.

Text Retrieval Systems In Information Management

Author: G G Chowdhury
Publisher: New Age International
Total Pages: 238
Release: 1996
Genre: Database management
ISBN: 9788122407600

This book aims at helping the reader develop a clear understanding of text retrieval systems, including their nature and characteristics, the steps to be followed in developing a text retrieval system, the software packages available for the purpose, guidelines for choosing appropriate software, and so on. To make the text suitable for all kinds of readers, chapters on the basics of database technology, database management, and file structures appropriate for text retrieval systems have been provided. This book also discusses the major features of library management systems (LMSs), the software packages used for automating library housekeeping operations. The trend is towards developing systems which can provide the actual information sought by the user rather than a reference to the information sources or the part of the text where the search term appears. Such systems apply expert systems and natural language processing techniques, and are called knowledge-based systems (KBSs). This book describes features of these systems and mentions some of the applications of KBSs in library and information activities.

Multimedia Information Retrieval

Author: Peter Schäuble
Publisher: Springer Science & Business Media
Total Pages: 214
Release: 1997-04-30
Genre: Computers
ISBN: 9780792398998

Multimedia Information Retrieval: Content-Based Information Retrieval from Large Text and Audio Databases addresses the future need for sophisticated search techniques that will be required to find relevant information in large digital data repositories, such as digital libraries and other multimedia databases. Because of the dramatically increasing amount of multimedia data available, there is a growing need for new search techniques that provide not only fewer bits, but also the most relevant bits, to those searching for multimedia digital data. This book serves to bridge the gap between classic ranking of text documents and modern information retrieval where composite multimedia documents are searched for relevant information. Multimedia Information Retrieval: Content-Based Information Retrieval from Large Text and Audio Databases begins to pave the way for speech retrieval; only recently has the search for information in speech recordings become feasible. This book provides the necessary introduction to speech recognition while discussing probabilistic retrieval and text retrieval, key topics in classic information retrieval. The book then discusses speech retrieval, which is even more challenging than retrieving text documents because word boundaries are difficult to detect, and recognition errors affect the retrieval effectiveness. This book also addresses the problem of integrating information retrieval and database functions, since there is an increasing need for retrieving information from frequently changing data collections which are organized and managed by a database system. Multimedia Information Retrieval: Content-Based Information Retrieval from Large Text and Audio Databases serves as an excellent reference source and may be used as a text for advanced courses on the topic.

Advances in Information Retrieval

Author: Fabrizio Sebastiani
Publisher: Springer Science & Business Media
Total Pages: 640
Release: 2003-04-08
Genre: Computers
ISBN: 3540012745

This book constitutes the refereed proceedings of the 25th European Conference on Information Retrieval Research, ECIR 2003, held in Pisa, Italy, in April 2003. The 31 revised full papers and 16 short papers presented together with two invited papers were carefully reviewed and selected from 101 submissions. The papers are organized in topical sections on IR and the Web; retrieval of structured documents; collaborative filtering and text mining; text representation and natural language processing; formal models and language models for IR; machine learning and IR; text categorization; usability, interactivity, and visualization; and architectural issues and efficiency.

Text Databases and Document Management: Theory and Practice

Author: Amita Goyal Chin
Publisher: IGI Global
Release: 2000-07-31
Genre: Computers
ISBN: 193070898X

The ability of relational databases to store and manage large amounts of data of a limited range of types is well proven, but representing textual information and subsequently retrieving it in a meaningful fashion still pose many challenges. This is because documents do not easily map to traditional database solutions. The primary objective of Text Databases and Document Management: Theory and Practice is to provide a focal point for concrete theories and practices in the handling of textual data as well as documents as a whole.