Withdraw
Loading…
Using Speech Input for Image Interpretation, Annotation, and Retrieval
Srihari, Rohini K.
Loading…
Permalink
https://hdl.handle.net/2142/25949
Description
- Title
- Using Speech Input for Image Interpretation, Annotation, and Retrieval
- Author(s)
- Srihari, Rohini K.
- Issue Date
- 1997
- Keyword(s)
- Digital Libraries
- electronic information resources
- digital images
- image access
- image retrieval
- linguistic context
- caption based
- audio annotation
- Abstract
- """This research explores the interaction of textual and photographic information in an integrated text/image database environment. Specifically, three different applications involving the exploitation of linguistic con-text in vision are presented. Linguistic context is qualitative in nature and is obtained dynamically. By understanding text accompanying images or video, we are able to extract information useful in retrieving the picture and directing an image interpretation system to identify relevant objects (e.g., faces) in the picture. The latter constitutes a powerful technique for automatically indexing images. A multistage system, PICTION, which uses captions to identify human faces in an accompanying photograph, has been developed. We discuss the use of PICTION's output in content-based retrieval of images to satisfy focus of attention in queries. The design and implementation of a system called Show&Tell—a multimedia system for semi-automated image annotation—is discussed. This system, which combines advances in speech recognition, natural language processing (NLP), and image understanding (IU), is designed to assist in image annotation and to enhance image retrieval capabilities. An extension of this work to video annotation and retrieval is also presented."""
- Publisher
- Graduate School of Library and Information Science, University of Illinois at Urbana-Champaign
- Series/Report Name or Number
- Digital Image Access & Retrieval [papers presented at the 1996 Clinic on Library Applications of Data Processing, March 24-26, 1996 Urbana-Champaign]
- ISSN
- 0069-4789
- Type of Resource
- text
- Language
- en
- Permalink
- http://hdl.handle.net/2142/25949
Owning Collections
1996: Digital Image Access & Retrieval PRIMARY
33th Clinic on Library Applications of Data Processing (1996). Edited by P. Bryan Heidorn and Beth Sandore.Manage Files
Loading…
Edit Collection Membership
Loading…
Edit Metadata
Loading…
Edit Properties
Loading…
Embargoes
Loading…