Explorations in Automatic Thesaurus Discovery

Author: Gregory Grefenstette
Publisher: Springer Science & Business Media
Total Pages: 313
Release: 2012-12-06
Genre: Computers
ISBN: 1461527104

Explorations in Automatic Thesaurus Discovery presents an automated method for creating a first-draft thesaurus from raw text. It describes the natural language processing steps of tokenization, surface syntactic analysis, and syntactic attribute extraction. From these attributes, word and term similarity is calculated and a thesaurus is created showing important common terms and their relation to each other, common verb-noun pairings, common expressions, and word family members. The techniques are tested on twenty different corpora, ranging from baseball newsgroups, assassination archives, medical X-ray reports, and abstracts on AIDS to encyclopedia articles on animals, and even the text of the book itself. The corpora range from 40,000 to 6 million characters of text, and results are presented for each in the Appendix. The methods described in the book have undergone extensive evaluation. Their time and space complexity are shown to be modest. The results are shown to converge to a stable state as the corpus grows. The similarities calculated are compared to those produced by psychological testing. A method of evaluation using Artificial Synonyms is tested. Gold Standards evaluations show that the techniques significantly outperform non-linguistic-based techniques for the most important words in corpora. Explorations in Automatic Thesaurus Discovery includes applications to the fields of information retrieval using established testbeds, enrichment of existing thesauri, and semantic analysis. Also included are applications showing how to create, implement, and test a first-draft thesaurus.
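
The similarity step described above can be illustrated with a minimal sketch. It assumes that the earlier stages have already produced (word, attribute) pairs, and it uses plain Jaccard overlap as a stand-in measure; the function names, the attribute labels, and the toy data are hypothetical and are not taken from the book.

```python
from collections import defaultdict


def attribute_sets(pairs):
    """Group (word, attribute) pairs into one set of attributes per word."""
    attrs = defaultdict(set)
    for word, attribute in pairs:
        attrs[word].add(attribute)
    return dict(attrs)


def jaccard(a, b):
    """Shared attributes divided by all attributes seen with either word."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def nearest(word, attrs):
    """Rank every other word by similarity to `word`, most similar first."""
    target = attrs.get(word, set())
    scores = [(other, jaccard(target, attrs[other]))
              for other in attrs if other != word]
    # Sort by descending score, then alphabetically to break ties.
    return sorted(scores, key=lambda s: (-s[1], s[0]))


# Hypothetical attributes standing in for the output of syntactic analysis.
pairs = [
    ("pitcher", "throw-SUBJ"), ("pitcher", "left-handed"), ("pitcher", "strike-out-SUBJ"),
    ("catcher", "throw-SUBJ"), ("catcher", "left-handed"), ("catcher", "tag-SUBJ"),
    ("report", "write-DOBJ"), ("report", "medical"),
]

attrs = attribute_sets(pairs)
print(nearest("pitcher", attrs))
```

In this toy data "pitcher" and "catcher" share two of four distinct attributes, so "catcher" ranks first; a first-draft thesaurus entry would list the top-ranked neighbours of each word.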

Introduction to Controlled Vocabularies

Author: Patricia Harpring
Publisher: Getty Publications
Total Pages: 259
Release: 2010-04-13
Genre: Art
ISBN: 160606018X

This detailed book is a “how-to” guide to building controlled vocabulary tools, cataloging and indexing cultural materials with terms and names from controlled vocabularies, and using vocabularies in search engines and databases to enhance discovery and retrieval online. Also covered are the following: What are controlled vocabularies and why are they useful? Which vocabularies exist for cataloging art and cultural objects? How should they be integrated in a cataloging system? How should they be used for indexing and for retrieval? How should an institution construct a local authority file? The links in a controlled vocabulary ensure that relationships are defined and maintained for both cataloging and retrieval, clarifying whether a rose window and a Catherine wheel are the same thing, or how pot-metal glass is related to the more general term stained glass. The book provides organizations and individuals with a practical tool for creating and implementing vocabularies as reference tools, sources of documentation, and powerful enhancements for online searching.

The Devil’s Dictionary

Author: Ambrose Bierce
Publisher: Standard Ebooks
Total Pages: 341
Release: 2021-03-16
Genre: Fiction
ISBN:

“Dictionary, n: A malevolent literary device for cramping the growth of a language and making it hard and inelastic. This dictionary, however, is a most useful work.” Bierce’s groundbreaking Devil’s Dictionary had a complex publication history. It started in the mid-1800s as an irregular column in Californian newspapers under various titles, in which he gradually refined the new-at-the-time idea of an irreverent set of glossary-like definitions. The final name, as we see it titled in this work, did not appear until an 1881 column published in the periodical The San Francisco Illustrated Wasp. There were no publications of the complete glossary in the 1800s. Not until 1906 did a portion of Bierce’s collection get published by Doubleday, under the name The Cynic’s Word Book; the publisher did not want to use the word “Devil” in the title, to the great disappointment of the author. The 1906 word book only went from A to L, however, and the remainder was never released under the compromised title. In 1911 the Devil’s Dictionary as we know it was published in complete form as part of Bierce’s collected works (volume 7 of 12), including the remainder of the definitions from M to Z. It has been republished a number of times, including more recent efforts that added older definitions from his columns that never made it into the original book. Due to the complex nature of copyright, some of those found definitions have unclear public domain status and were not included. This edition of the book does include, however, a set of definitions attributed to his one-and-only “Demon’s Dictionary” column, including Bierce’s classic definition of A: “the first letter in every properly constructed alphabet.” Bierce enjoyed “quoting” his pseudonyms in his work. Most of the poetry, dramatic scenes and stories in this book attributed to others were self-authored and do not exist outside of this work.
This includes the prolific Father Gassalasca Jape, whom he thanks in the preface—“jape” of course having the definition: “a practical joke.” This book is a product of its time and must be approached as such. Many of the definitions hold up well today, but some might be considered less palatable by modern readers. Regardless, the book’s humorous style is a valuable snapshot of American culture from past centuries. This book is part of the Standard Ebooks project, which produces free public domain ebooks.

The Merriam-Webster Thesaurus

Author: Merriam-Webster
Publisher: Merriam-Webster
Total Pages: 0
Release: 2023-06
Genre: Reference
ISBN: 9780877790983

Find the right word fast! This indispensable guide from America's Language Experts is the perfect tool for readers and writers! This all-new edition of The Merriam-Webster Thesaurus features more than 150,000 word choices, including related words, antonyms, and near antonyms. Each main entry provides the meaning shared by the synonyms listed, and abundant usage examples show words used in context. Words are organized alphabetically for ease of use. A great complement to The Merriam-Webster Dictionary and perfect for school, home, or office.

The Highly Selective Dictionary for the Extraordinarily Literate

Author: Eugene Ehrlich
Publisher: Harper Collins
Total Pages: 288
Release: 2009-03-17
Genre: Language Arts & Disciplines
ISBN: 0061746797

Between TV talk shows, radio call-in programs, email and the Internet, spontaneous-talk media has skyrocketed in the '90s. People are interacting more frequently and more fervently than ever before, turning the English language into an indecipherable mess. Now, this unique and concise compendium presents the most confused and misused words in the language today -- words misused by careless speakers and writers everywhere. It defines, discerns and distinguishes the finer points of sense and meaning. Was it fortuitous or only fortunate? Are you trying to remember, or more fully recollect? Is he uninterested or disinterested? Is it healthful or healthy, regretful or regrettable, notorious or infamous? The answers to these and many more fascinating etymological questions can be found within the pages of this invaluable (or is it valuable?) reference.

Webster's New World College Dictionary

Author: Michael E. Agnes
Publisher: Webster's New World
Total Pages: 0
Release: 2003-07
Genre: English dictionary
ISBN: 9780764556029

Webster's Fourth has been adopted by many magazines and newspapers as the definitive guide to the English language as spoken in America. Acclaimed for its 7000+ new words reflecting lifestyle changes, technology, and popular culture, the fourth edition contains 163,000 entries, with synonyms, so that it also functions as a thesaurus. Many entries put words into context as a further guide to understanding, and the dictionary includes 850 illustrations and maps and a world atlas. It's an excellent gift for students, and certainly for anyone who wants an up-to-date and easy-to-use reference for good writing and speaking.

Roget's Thesaurus of the Bible

Author: A. Colin Day
Publisher: Castle Books
Total Pages: 0
Release: 2008-11-07
Genre: Reference
ISBN: 9780785817086

Roget's Thesaurus of the Bible uses Roget's fundamental and brilliant category concept, which groups together all related subjects - similar and opposite - for quick and easy comparisons. If you are researching a particular subject, the category list will lead you to the relevant Bible passages. If you are researching a particular passage, the index of Bible verses will lead you to the appropriate category. As a study tool, this resource is unique. Even the browser will be rewarded with the discovery of related subjects and Bible passages for further exploration. Unlike concordances, this book is not tied to the language of any one Bible version. Nor is it based on a particular theological system. The Bible passages in this book are not from any one version of the Bible, but are paraphrases by Colin Day. In compiling the work, the most popular English translations were used - the Revised Standard Version, New International Version, and New American Standard - with references to the original Hebrew and Greek texts.

Cambridge Advanced Learner's Dictionary

Author: Kate Woodford
Publisher:
Total Pages: 1550
Release: 2003
Genre: Foreign Language Study
ISBN: 9780521824231

The Cambridge Advanced Learner's Dictionary is the ideal dictionary for advanced EFL/ESL learners. Easy to use and with a great CD-ROM - the perfect learner's dictionary for exam success. First published as the Cambridge International Dictionary of English, this new edition has been completely updated and redesigned.
- References to over 170,000 words, phrases and examples explained in clear and natural English
- All the important new words that have come into the language (e.g. dirty bomb, lairy, 9/11, clickable)
- Over 200 'Common Learner Error' notes, based on the Cambridge Learner Corpus from Cambridge ESOL exams
Plus, on the CD-ROM:
- SMART thesaurus - lets you find all the words with the same meaning
- QUICKfind - automatically looks up words while you are working on-screen
- SUPERwrite - tools for advanced writing, giving help with grammar and collocation
- Hear and practise all the words.