Computer Recognition of Three-dimensional Objects in a Visual Scene

Computer Recognition of Three-dimensional Objects in a Visual Scene
Author: C. Adolfo Guzman-Arenas
Publisher:
Total Pages: 296
Release: 1968
Genre: Optical pattern recognition
ISBN:

Methods are presented: (1) to partition or decompose a visual scene into the bodies forming it; (2) to position these bodies in three-dimensional space, by combining two scenes that make a stereoscopic pair; (3) to find the regions or zones of a visual scene that belong to its background; (4) to carry out the isolation of objects in (1) when the input has inaccuracies. Running computer programs implement the methods, and many examples illustrate their behavior. The input is a two-dimensional line-drawing of the scene, assumed to contain three-dimensional bodies possessing flat faces (polyhedra); some of them may be partially occluded. Suggestions are made for extending the work to curved objects. Some comparisons are made with human visual perception. The main conclusion is that it is possible to sseparate a picture or scene into the constituent objects exclusively in basis of monocular geometric properties (in basis of pure form); in fact, successful methods are shown. (Author).

Representations and Techniques for 3D Object Recognition and Scene Interpretation

Representations and Techniques for 3D Object Recognition and Scene Interpretation
Author: Derek Hoiem
Publisher: Morgan & Claypool Publishers
Total Pages: 172
Release: 2011
Genre: Computers
ISBN: 1608457281

One of the grand challenges of artificial intelligence is to enable computers to interpret 3D scenes and objects from imagery. This book organizes and introduces major concepts in 3D scene and object representation and inference from still images, with a focus on recent efforts to fuse models of geometry and perspective with statistical machine learning. The book is organized into three sections: (1) Interpretation of Physical Space; (2) Recognition of 3D Objects; and (3) Integrated 3D Scene Interpretation. The first discusses representations of spatial layout and techniques to interpret physical scenes from images. The second section introduces representations for 3D object categories that account for the intrinsically 3D nature of objects and provide robustness to change in viewpoints. The third section discusses strategies to unite inference of scene geometry and object pose and identity into a coherent scene interpretation. Each section broadly surveys important ideas from cognitive science and artificial intelligence research, organizes and discusses key concepts and techniques from recent work in computer vision, and describes a few sample approaches in detail. Newcomers to computer vision will benefit from introductions to basic concepts, such as single-view geometry and image classification, while experts and novices alike may find inspiration from the book's organization and discussion of the most recent ideas in 3D scene understanding and 3D object recognition. Specific topics include: mathematics of perspective geometry; visual elements of the physical scene, structural 3D scene representations; techniques and features for image and region categorization; historical perspective, computational models, and datasets and machine learning techniques for 3D object recognition; inferences of geometrical attributes of objects, such as size and pose; and probabilistic and feature-passing approaches for contextual reasoning about 3D objects and scenes. Table of Contents: Background on 3D Scene Models / Single-view Geometry / Modeling the Physical Scene / Categorizing Images and Regions / Examples of 3D Scene Interpretation / Background on 3D Recognition / Modeling 3D Objects / Recognizing and Understanding 3D Objects / Examples of 2D 1/2 Layout Models / Reasoning about Objects and Scenes / Cascades of Classifiers / Conclusion and Future Directions

Computer Interpretation of Imperfect Line Data as a Three-dimensional Scene

Computer Interpretation of Imperfect Line Data as a Three-dimensional Scene
Author: Gilbert Falk
Publisher:
Total Pages: 406
Release: 1970
Genre: Computer graphics
ISBN:

The paper describes a heuristic scence description program. This program accepts as input a scene represented as a line drawing. Based on a set of known object models the program attempts to determine the identify and location of each object viewed. The most significant feature of the program is its ability to deal with imperfect input data. Also presented are some preliminary results concerning constraints in projections of planar-faced solids. It is shown that for a restricted class of projections, 4 points located in 3-space in addition to complete monocular information are sufficient to specify all the visible point locations precisely. (Author).

High-level Vision

High-level Vision
Author: Shimon Ullman
Publisher: MIT Press
Total Pages: 438
Release: 2000
Genre: Computers
ISBN: 9780262710077

Shimon Ullman focuses on the processes of high-level vision that deal with the interpretation and use of what is seen in the image. In this book, Shimon Ullman focuses on the processes of high-level vision that deal with the interpretation and use of what is seen in the image. In particular, he examines two major problems. The first, object recognition and classification, involves recognizing objects despite large variations in appearance caused by changes in viewing position, illumination, occlusion, and object shape. The second, visual cognition, involves the extraction of shape properties and spatial relations in the course of performing visual tasks such as object manipulation, planning movements in the environment, or interpreting graphical material such as diagrams, graphs and maps. The book first takes up object recognition and develops a novel approach to the recognition of three-dimensional objects. It then studies a number of related issues in high-level vision, including object classification, scene segmentation, and visual cognition. Using computational considerations discussed throughout the book, along with psychophysical and biological data, the final chapter proposes a model for the general flow of information in the visual cortex. Understanding vision is a key problem in the brain sciences, human cognition, and artificial intelligence. Because of the interdisciplinary nature of the theories developed in this work, High-Level Vision will be of interest to readers in all three of these fields.

Representations and Techniques for 3D Object Recognition and Scene Interpretation

Representations and Techniques for 3D Object Recognition and Scene Interpretation
Author: Derek Santhanam
Publisher: Springer Nature
Total Pages: 147
Release: 2022-05-31
Genre: Computers
ISBN: 3031015576

One of the grand challenges of artificial intelligence is to enable computers to interpret 3D scenes and objects from imagery. This book organizes and introduces major concepts in 3D scene and object representation and inference from still images, with a focus on recent efforts to fuse models of geometry and perspective with statistical machine learning. The book is organized into three sections: (1) Interpretation of Physical Space; (2) Recognition of 3D Objects; and (3) Integrated 3D Scene Interpretation. The first discusses representations of spatial layout and techniques to interpret physical scenes from images. The second section introduces representations for 3D object categories that account for the intrinsically 3D nature of objects and provide robustness to change in viewpoints. The third section discusses strategies to unite inference of scene geometry and object pose and identity into a coherent scene interpretation. Each section broadly surveys important ideas from cognitive science and artificial intelligence research, organizes and discusses key concepts and techniques from recent work in computer vision, and describes a few sample approaches in detail. Newcomers to computer vision will benefit from introductions to basic concepts, such as single-view geometry and image classification, while experts and novices alike may find inspiration from the book's organization and discussion of the most recent ideas in 3D scene understanding and 3D object recognition. Specific topics include: mathematics of perspective geometry; visual elements of the physical scene, structural 3D scene representations; techniques and features for image and region categorization; historical perspective, computational models, and datasets and machine learning techniques for 3D object recognition; inferences of geometrical attributes of objects, such as size and pose; and probabilistic and feature-passing approaches for contextual reasoning about 3D objects and scenes. Table of Contents: Background on 3D Scene Models / Single-view Geometry / Modeling the Physical Scene / Categorizing Images and Regions / Examples of 3D Scene Interpretation / Background on 3D Recognition / Modeling 3D Objects / Recognizing and Understanding 3D Objects / Examples of 2D 1/2 Layout Models / Reasoning about Objects and Scenes / Cascades of Classifiers / Conclusion and Future Directions

Pictorial Data Analysis

Pictorial Data Analysis
Author: Robert M. Haralick
Publisher: Springer Science & Business Media
Total Pages: 468
Release: 2012-12-06
Genre: Computers
ISBN: 3642820174

This volume is the collection of lectures and presentations of the NATO AS! On Pictorial Data Analysis, held August 1-12, 1982 in the beautiful chateau de Bonas, Bonas France. The director of the AS! was Robert M. Haralick and the Co-director was Stefano Levialdi. The papers in the book are arranged in two sections first theory and general prinicples and then applications. Local computations play a central role in image processing both when a traditional computer is used and when parallel machines are used for improving image throughput. Levialdi reviews such neighborhood operators. Hung and Kasvand discuss a line thinning application which involves detection of critical points on chain encoded data. Most low level image processing has been done using the digital raster as the basic data structure. Within the last few years many of these basic algorithms have been developed for the quadtree data structure. The quadtree permits easier access to certain kinds of spatial adjacency relationships in a variable resolution context. Rosenfeld reviews the properties of these representations and their uses in image segmentation and property measurement. Besslich discusses an expanded form of an invertible quadtree representation which permits a multiprocessor execution. Gisolfi and Vitulano discuss the C-matrix and C-filtering technique for image and texture feature extraction. O'mara et.al. discuss the application of Codel numbers to image feature extraction. Kropatsch discusses an image segmentation technique which permits the effective use of a variety of different kinds of segmentation techniques.

Three-Dimensional Computer Vision

Three-Dimensional Computer Vision
Author: Yoshiaki Shirai
Publisher: Springer Science & Business Media
Total Pages: 308
Release: 2012-12-06
Genre: Computers
ISBN: 3642824293

The purpose of computer vision is to make computers capable of understanding environments from visual information. Computer vision has been an interesting theme in the field of artificial intelligence. It involves a variety of intelligent information processing: both pattern processing for extraction of meaningful symbols from visual information and symbol processing for determining what the symbols represent. The term "3D computer vision" is used if visual information has to be interpreted as three-dimensional scenes. 3D computer vision is more challenging because objects are seen from limited directions and some objects are occluded by others. In 1980, the author wrote a book "Computer Vision" in Japanese to introduce an interesting new approach to visual information processing developed so far. Since then computer vision has made remarkable progress: various rangefinders have become available, new methods have been developed to obtain 3D informa tion, knowledge representation frameworks have been proposed, geometric models which were developed in CAD/CAM have been used for computer vision, and so on. The progress in computer vision technology has made it possible to understand more complex 3 D scenes. There is an increasing demand for 3D computer vision. In factories, for example, automatic assembly and inspection can be realized with fewer con straints than conventional ones which employ two-dimensional computer vision.

Pattern Analysis and Understanding

Pattern Analysis and Understanding
Author: Heinrich Niemann
Publisher: Springer Science & Business Media
Total Pages: 384
Release: 2013-04-17
Genre: Science
ISBN: 3642748996

In this second edition every chapter of the first edition of Pattern Analysis has been updated and expanded. The general view of a system for pattern analysis and understanding has remained unchanged, but many details have been revised. A short account of light and sound has been added to the introduction, some normalization techniques and a basic introduction to morphological operations have been added to the second chapter. Chapter 3 has been expanded significantly by topics like motion, depth, and shape from shading; additional material has also been added to the already existing sections of this chapter. The old sections of Chap. 4 have been reorganized, a general view of the classification problem has been added and material provided to incorporate techniques of word and object recognition and to give a short account of some types of neural nets. Almost no changes have been made in Chap. 5. The part on representation of control structures in Chap. 6 has been shortened, a section on the judgement of results has been added. Chapter 7 has been rewritten almost completely; the section on formal grammars has been reduced, the sections on production systems, semantic networks, and knowledge acquisition have been expanded, and sections on logic and explanation added. The old Chaps. 8 and 9 have been omitted. In summary, the new edition is a thorough revision and extensive update of the first one taking into account the progress in the field during recent years.