Computer Vision in the Infrared Spectrum

Computer Vision in the Infrared Spectrum
Author: Michael Teutsch
Publisher: Springer Nature
Total Pages: 128
Release: 2022-06-01
Genre: Computers
ISBN: 3031018265

Human visual perception is limited to the visual-optical spectrum. Machine vision is not. Cameras sensitive to the different infrared spectra can enhance the abilities of autonomous systems and visually perceive the environment in a holistic way. Relevant scene content can be made visible especially in situations, where sensors of other modalities face issues like a visual-optical camera that needs a source of illumination. As a consequence, not only human mistakes can be avoided by increasing the level of automation, but also machine-induced errors can be reduced that, for example, could make a self-driving car crash into a pedestrian under difficult illumination conditions. Furthermore, multi-spectral sensor systems with infrared imagery as one modality are a rich source of information and can provably increase the robustness of many autonomous systems. Applications that can benefit from utilizing infrared imagery range from robotics to automotive and from biometrics to surveillance. In this book, we provide a brief yet concise introduction to the current state-of-the-art of computer vision and machine learning in the infrared spectrum. Based on various popular computer vision tasks such as image enhancement, object detection, or object tracking, we first motivate each task starting from established literature in the visual-optical spectrum. Then, we discuss the differences between processing images and videos in the visual-optical spectrum and the various infrared spectra. An overview of the current literature is provided together with an outlook for each task. Furthermore, available and annotated public datasets and common evaluation methods and metrics are presented. In a separate chapter, popular applications that can greatly benefit from the use of infrared imagery as a data source are presented and discussed. Among them are automatic target recognition, video surveillance, or biometrics including face recognition. Finally, we conclude with recommendations for well-fitting sensor setups and data processing algorithms for certain computer vision tasks. We address this book to prospective researchers and engineers new to the field but also to anyone who wants to get introduced to the challenges and the approaches of computer vision using infrared images or videos. Readers will be able to start their work directly after reading the book supported by a highly comprehensive backlog of recent and relevant literature as well as related infrared datasets including existing evaluation frameworks. Together with consistently decreasing costs for infrared cameras, new fields of application appear and make computer vision in the infrared spectrum a great opportunity to face nowadays scientific and engineering challenges.

Visual Domain Adaptation in the Deep Learning Era

Visual Domain Adaptation in the Deep Learning Era
Author: Gabriela Csurka
Publisher: Springer Nature
Total Pages: 182
Release: 2022-06-06
Genre: Computers
ISBN: 3031791754

Solving problems with deep neural networks typically relies on massive amounts of labeled training data to achieve high performance. While in many situations huge volumes of unlabeled data can be and often are generated and available, the cost of acquiring data labels remains high. Transfer learning (TL), and in particular domain adaptation (DA), has emerged as an effective solution to overcome the burden of annotation, exploiting the unlabeled data available from the target domain together with labeled data or pre-trained models from similar, yet different source domains. The aim of this book is to provide an overview of such DA/TL methods applied to computer vision, a field whose popularity has increased significantly in the last few years. We set the stage by revisiting the theoretical background and some of the historical shallow methods before discussing and comparing different domain adaptation strategies that exploit deep architectures for visual recognition. We introduce the space of self-training-based methods that draw inspiration from the related fields of deep semi-supervised and self-supervised learning in solving the deep domain adaptation. Going beyond the classic domain adaptation problem, we then explore the rich space of problem settings that arise when applying domain adaptation in practice such as partial or open-set DA, where source and target data categories do not fully overlap, continuous DA where the target data comes as a stream, and so on. We next consider the least restrictive setting of domain generalization (DG), as an extreme case where neither labeled nor unlabeled target data are available during training. Finally, we close by considering the emerging area of learning-to-learn and how it can be applied to further improve existing approaches to cross domain learning problems such as DA and DG.

Proceedings of the 2022 2nd International Conference on Computer Technology and Media Convergence Design (CTMCD 2022)

Proceedings of the 2022 2nd International Conference on Computer Technology and Media Convergence Design (CTMCD 2022)
Author: Kannimuthu Subramanian
Publisher: Springer Nature
Total Pages: 935
Release: 2023-02-10
Genre: Computers
ISBN: 9464630469

This is an open access book. With the rapid development of society and the continuous progress of computer science and technology, when entering the information age, design has also been integrated into the new media age in time. The application of computer technology in design has broken the limitations of traditional design, achieved a huge breakthrough in the field of design, provided more innovative forms of expression for design, and also achieved subversive changes in design methods. We need to note that design comes from life, and then it is a matter of applying tools and crafting techniques to realize it. For designers, maintaining innovation is still the first and foremost in creation. How to use technology to enable design without relying on technology is still a dilemma. Therefore, it is necessary to create a space for the researchers, practitioners, and enthusiasts in the field of computing and design to gather and discuss this current issue. The International Conference on Computer Technology and Media Convergence Design aims to accommodate this need, as well as to: 1. Advance the academic field by exploring cutting-edge research and applications. 2. Open up new horizons, broaden the horizons of computer technology research and design, 3. Create academic forums to provide opportunities for academic resource sharing and research exchanges. 2022 2nd International Conference on Computer Technology and Media Convergence Design (CTMCD 2022) will be held in Dali, China during May 13-15, 2022. CTMCD2022 invites the researchers, practitioners, and enthusiasts in the field of computing and design to participate and share knowledge. We also accept papers on computer technology and media convergence design.

Learning to Analyze what is Beyond the Visible Spectrum

Learning to Analyze what is Beyond the Visible Spectrum
Author: Amanda Berg
Publisher: Linköping University Electronic Press
Total Pages: 111
Release: 2019-11-13
Genre:
ISBN: 9179299814

Thermal cameras have historically been of interest mainly for military applications. Increasing image quality and resolution combined with decreasing camera price and size during recent years have, however, opened up new application areas. They are now widely used for civilian applications, e.g., within industry, to search for missing persons, in automotive safety, as well as for medical applications. Thermal cameras are useful as soon as there exists a measurable temperature difference. Compared to cameras operating in the visual spectrum, they are advantageous due to their ability to see in total darkness, robustness to illumination variations, and less intrusion on privacy. This thesis addresses the problem of automatic image analysis in thermal infrared images with a focus on machine learning methods. The main purpose of this thesis is to study the variations of processing required due to the thermal infrared data modality. In particular, three different problems are addressed: visual object tracking, anomaly detection, and modality transfer. All these are research areas that have been and currently are subject to extensive research. Furthermore, they are all highly relevant for a number of different real-world applications. The first addressed problem is visual object tracking, a problem for which no prior information other than the initial location of the object is given. The main contribution concerns benchmarking of short-term single-object (STSO) visual object tracking methods in thermal infrared images. The proposed dataset, LTIR (Linköping Thermal Infrared), was integrated in the VOT-TIR2015 challenge, introducing the first ever organized challenge on STSO tracking in thermal infrared video. Another contribution also related to benchmarking is a novel, recursive, method for semi-automatic annotation of multi-modal video sequences. Based on only a few initial annotations, a video object segmentation (VOS) method proposes segmentations for all remaining frames and difficult parts in need for additional manual annotation are automatically detected. The third contribution to the problem of visual object tracking is a template tracking method based on a non-parametric probability density model of the object's thermal radiation using channel representations. The second addressed problem is anomaly detection, i.e., detection of rare objects or events. The main contribution is a method for truly unsupervised anomaly detection based on Generative Adversarial Networks (GANs). The method employs joint training of the generator and an observation to latent space encoder, enabling stratification of the latent space and, thus, also separation of normal and anomalous samples. The second contribution is the previously unaddressed problem of obstacle detection in front of moving trains using a train-mounted thermal camera. Adaptive correlation filters are updated continuously and missed detections of background are treated as detections of anomalies, or obstacles. The third contribution to the problem of anomaly detection is a method for characterization and classification of automatically detected district heat leakages for the purpose of false alarm reduction. Finally, the thesis addresses the problem of modality transfer between thermal infrared and visual spectrum images, a previously unaddressed problem. The contribution is a method based on Convolutional Neural Networks (CNNs), enabling perceptually realistic transformations of thermal infrared to visual images. By careful design of the loss function the method becomes robust to image pair misalignments. The method exploits the lower acuity for color differences than for luminance possessed by the human visual system, separating the loss into a luminance and a chrominance part.

Computer Vision

Computer Vision
Author: Hongbin Zha
Publisher: Springer
Total Pages: 491
Release: 2015-09-18
Genre: Computers
ISBN: 3662485702

The two volumes CCIS 546 and 547 constitute the refereed proceedings of the CCF Chinese Conference on Computer Vision, CCCV 2015, held in Xi'an, China, in September 2015. The total of 89 revised full papers presented in both volumes were carefully reviewed and selected from 176 submissions. The papers address issues such as computer vision, machine learning, pattern recognition, target recognition, object detection, target tracking, image segmentation, image restoration, face recognition, image classification.

Proceedings of the 2012 Joint Workshop of Fraunhofer IOSB and Institute for Anthropomatics, Vision and Fusion Laboratory

Proceedings of the 2012 Joint Workshop of Fraunhofer IOSB and Institute for Anthropomatics, Vision and Fusion Laboratory
Author: Jürgen Beyerer
Publisher: KIT Scientific Publishing
Total Pages: 160
Release: 2014-05-14
Genre: Computers
ISBN: 3866449887

This book is a collection of 11 review technical reports summarizing the presentations at the 2012 Joint Workshop of Fraunhofer IOSB and Vision and Fusion Laboratory at KIT Karlsruhe, made by the students of the both institutions. The topics include image processing, visual inspection, pattern recognition and classification, human-machine interaction, world modeling, and optical signal processing.

Intelligent Systems

Intelligent Systems
Author: Chiranji Lal Chowdhary
Publisher: CRC Press
Total Pages: 294
Release: 2020-01-06
Genre: Business & Economics
ISBN: 0429560044

This volume helps to fill the gap between data analytics, image processing, and soft computing practices. Soft computing methods are used to focus on data analytics and image processing to develop good intelligent systems. To this end, readers of this volume will find quality research that presents the current trends, advanced methods, and hybridized techniques relating to data analytics and intelligent systems. The book also features case studies related to medical diagnosis with the use of image processing and soft computing algorithms in particular models. Providing extensive coverage of biometric systems, soft computing, image processing, artificial intelligence, and data analytics, the chapter authors discuss the latest research issues, present solutions to research problems, and look at comparative analysis with earlier results. Topics include some of the most important challenges and discoveries in intelligent systems today, such as computer vision concepts and image identification, data analysis and computational paradigms, deep learning techniques, face and speaker recognition systems, and more.

Machine Vision Beyond Visible Spectrum

Machine Vision Beyond Visible Spectrum
Author: Riad Hammoud
Publisher: Springer Science & Business Media
Total Pages: 254
Release: 2011-05-30
Genre: Technology & Engineering
ISBN: 3642115683

The material of this book encompasses many disciplines, including visible, infrared, far infrared, millimeter wave, microwave, radar, synthetic aperture radar, and electro-optical sensors as well as the very dynamic topics of image processing, computer vision and pattern recognition. This book is composed of six parts: * Advanced background modeling for surveillance * Advances in Tracking in Infrared imagery * Methods for Pose estimation in Ultrasound and LWIR imagery * Recognition in multi-spectral and synthetic aperture radar * Fusion of disparate sensors * Smart Sensors

Computer Vision in Vehicle Technology

Computer Vision in Vehicle Technology
Author: Antonio M. López
Publisher: John Wiley & Sons
Total Pages: 252
Release: 2017-02-08
Genre: Computers
ISBN: 1118868056

A unified view of the use of computer vision technology for different types of vehicles Computer Vision in Vehicle Technology focuses on computer vision as on-board technology, bringing together fields of research where computer vision is progressively penetrating: the automotive sector, unmanned aerial and underwater vehicles. It also serves as a reference for researchers of current developments and challenges in areas of the application of computer vision, involving vehicles such as advanced driver assistance (pedestrian detection, lane departure warning, traffic sign recognition), autonomous driving and robot navigation (with visual simultaneous localization and mapping) or unmanned aerial vehicles (obstacle avoidance, landscape classification and mapping, fire risk assessment). The overall role of computer vision for the navigation of different vehicles, as well as technology to address on-board applications, is analysed. Key features: Presents the latest advances in the field of computer vision and vehicle technologies in a highly informative and understandable way, including the basic mathematics for each problem. Provides a comprehensive summary of the state of the art computer vision techniques in vehicles from the navigation and the addressable applications points of view. Offers a detailed description of the open challenges and business opportunities for the immediate future in the field of vision based vehicle technologies. This is essential reading for computer vision researchers, as well as engineers working in vehicle technologies, and students of computer vision.

Augmented Vision Perception in Infrared

Augmented Vision Perception in Infrared
Author: Riad I. Hammoud
Publisher: Springer Science & Business Media
Total Pages: 476
Release: 2009-01-01
Genre: Science
ISBN: 1848002777

Throughout much of machine vision’s early years the infrared imagery has suffered from return on investment despite its advantages over visual counterparts. Recently, the ?scal momentum has switched in favor of both manufacturers and practitioners of infrared technology as a result of today’s rising security and safety challenges and advances in thermographic sensors and their continuous drop in costs. This yielded a great impetus in achieving ever better performance in remote surveillance, object recognition, guidance, noncontact medical measurements, and more. The purpose of this book is to draw attention to recent successful efforts made on merging computer vision applications (nonmilitary only) and nonvisual imagery, as well as to ?ll in the need in the literature for an up-to-date convenient reference on machine vision and infrared technologies. Augmented Perception in Infrared provides a comprehensive review of recent deployment of infrared sensors in modern applications of computer vision, along with in-depth description of the world’s best machine vision algorithms and intel- gent analytics. Its topics encompass many disciplines of machine vision, including remote sensing, automatic target detection and recognition, background modeling and image segmentation, object tracking, face and facial expression recognition, - variant shape characterization, disparate sensors fusion, noncontact physiological measurements, night vision, and target classi?cation. Its application scope includes homeland security, public transportation, surveillance, medical, and military. Mo- over, this book emphasizes the merging of the aforementioned machine perception applications and nonvisual imaging in intensi?ed, near infrared, thermal infrared, laser, polarimetric, and hyperspectral bands.