Robust Motion Detection in Real-Life Scenarios

Robust Motion Detection in Real-Life Scenarios
Author: Ester Martínez-Martín
Publisher: Springer Science & Business Media
Total Pages: 117
Release: 2012-07-10
Genre: Computers
ISBN: 1447142160

This work proposes a complete sensor-independent visual system that provides robust target motion detection. First, the way sensors obtain images, in terms of resolution distribution and pixel neighbourhood, is studied. This allows a spatial analysis of motion to be carried out. Then, a novel background maintenance approach for robust target motion detection is implemented. Two different situations are considered: a fixed camera observing a constant background where objects are moving; and a still camera observing objects in movement within a dynamic background. This distinction lies on developing a surveillance mechanism without the constraint of observing a scene free of foreground elements for several seconds when a reliable initial background model is obtained, as that situation cannot be guaranteed when a robotic system works in an unknown environment. Other problems are also addressed to successfully deal with changes in illumination, and the distinction between foreground and background elements.

Wide Area Surveillance

Wide Area Surveillance
Author: Vijayan K. Asari
Publisher: Springer Science & Business Media
Total Pages: 245
Release: 2013-11-19
Genre: Technology & Engineering
ISBN: 3642378412

The book describes a system for visual surveillance using intelligent cameras. The camera uses robust techniques for detecting and tracking moving objects. The real time capture of the objects is then stored in the database. The tracking data stored in the database is analysed to study the camera view, detect and track objects, and study object behavior. These set of models provide a robust framework for coordinating the tracking of objects between overlapping and non-overlapping cameras, and recording the activity of objects detected by the system.

Motion Adaptation, Its Role in Motion Detection Under Natural Image Conditions and Target Detection

Motion Adaptation, Its Role in Motion Detection Under Natural Image Conditions and Target Detection
Author:
Publisher:
Total Pages: 76
Release: 2005
Genre:
ISBN:

The contractor shall continue investigattion to (1) develop a working model for an elaborated elementary motion detector (EMD) that provides a more robust estimate of local image velocity under natural operating conditions than previous implementations of the Reichardt model and (2) establish a feasible mechanism for producing contrast invariance under real-world conditions that will ultimately provide a blueprint for a robust implementation of an adaptive EMD.

Computational Modeling of Human Dorsal Pathway for Motion Processing

Computational Modeling of Human Dorsal Pathway for Motion Processing
Author: Yuheng Wang
Publisher:
Total Pages: 298
Release: 2013
Genre: Computer vision
ISBN:

"Reliable motion estimation in videos is of crucial importance for background identification, object tracking, action recognition, event analysis, self-navigation, etc. Reconstructing the motion field in the 2D image plane is very challenging, due to variations in image quality, scene geometry, lighting condition, and most importantly, camera jittering. Traditional optical flow models assume consistent image brightness and smooth motion field, which are violated by unstable illumination and motion discontinuities that are common in real world videos. To recognize observer (or camera) motion robustly in complex, realistic scenarios, we propose a biologically-inspired motion estimation system to overcome issues posed by real world videos. The bottom-up model is inspired from the infrastructure as well as functionalities of human dorsal pathway, and the hierarchical processing stream can be divided into three stages: 1) spatio-temporal processing for local motion, 2) recognition for global motion patterns (camera motion), and 3) preemptive estimation of object motion. To extract effective and meaningful motion features, we apply a series of steerable, spatio-temporal filters to detect local motion at different speeds and directions, in a way that's selective of motion velocity. The intermediate response maps are calibrated and combined to estimate dense motion fields in local regions, and then, local motions along two orthogonal axes are aggregated for recognizing planar, radial and circular patterns of global motion. We evaluate the model with an extensive, realistic video database that collected by hand with a mobile device (iPad) and the video content varies in scene geometry, lighting condition, view perspective and depth. We achieved high quality result and demonstrated that this bottom-up model is capable of extracting high-level semantic knowledge regarding self motion in realistic scenes. Once the global motion is known, we segment objects from moving backgrounds by compensating for camera motion. For videos captured with non-stationary cameras, we consider global motion as a combination of camera motion (background) and object motion (foreground). To estimate foreground motion, we exploit corollary discharge mechanism of biological systems and estimate motion preemptively. Since background motions for each pixel are collectively introduced by camera movements, we apply spatial-temporal averaging to estimate the background motion at pixel level, and the initial estimation of foreground motion is derived by comparing global motion and background motion at multiple spatial levels. The real frame signals are compared with those derived by forward predictions, refining estimations for object motion. This motion detection system is applied to detect objects with cluttered, moving backgrounds and is proved to be efficient in locating independently moving, non-rigid regions. The core contribution of this thesis is the invention of a robust motion estimation system for complicated real world videos, with challenges by real sensor noise, complex natural scenes, variations in illumination and depth, and motion discontinuities. The overall system demonstrates biological plausibility and holds great potential for other applications, such as camera motion removal, heading estimation, obstacle avoidance, route planning, and vision-based navigational assistance, etc."--Abstract.

Efficient and Robust Machine Learning Methods for Challenging Traffic Video Sensing Applications

Efficient and Robust Machine Learning Methods for Challenging Traffic Video Sensing Applications
Author: Yifan Zhuang
Publisher:
Total Pages: 0
Release: 2022
Genre:
ISBN:

The development of economics and technologies has promoted urbanization worldwide. Urbanization has brought great convenience to daily life. The fast construction of transportation facilities provides various means of transportation for everyday commuting. However, the growing traffic volume has threatened the existing transportation system by raising more traffic safety and congestion issues. Therefore, it is urgent and necessary to implement ITS with dynamic sensing and adjustment abilities. ITS shows great potential to improve traffic safety and efficiency, empowered by advanced IoT and AI. Within this system, the urban sensing and data analysis modules play an essential role in providing primary traffic information for follow-up works, including traffic prediction, operation optimization, and urban planning. Cameras and computer vision algorithms are the most popular toolkit in traffic sensing and analysis tasks. Deep learning-based computer vision algorithms have succeeded in multiple traffic sensing and analysis tasks, e.g., vehicle counting and crowd motion detection. The large-scale deployment of the sensor network and applications of deep learning algorithms significantly magnify previous methods' flaws, which hinder the further expansion of ITS. Firstly, the large-scale sensors and various tasks bring massive data and high workloads for data analysis on central servers. In contrast, annotated data for deep learning training in different tasks is insufficient, which leads to poor generalization when transferring to another application scenario. Additionally, traffic sensing faces adverse conditions with insufficient data and analysis qualities. This dissertation works on proposing efficient and robust machine learning methods for challenging traffic video sensing applications by presenting a systematic and practical workflow to optimize algorithm accuracy and efficiency. This dissertation first considers the high data volume challenge by designing a compression and knowledge distillation pipeline to reduce the model complexity and maintain accuracy. After applying the proposed pipeline, it is possible to further use the optimized algorithm on edge devices. This pipeline also works as the optimization foundation in the remaining works of this dissertation. Besides high data volume for analysis, insufficient training data is a considerable problem when deploying deep learning in practice. This dissertation has focused on two representative scenarios related to public safety – detecting and tracking small-scale persons in crowds and detecting rare objects in autonomous driving. Data augmentation and FSL strategies have been applied to increase the robustness of the machine learning system with limited training data. Finally, traffic sensing targets 24/7 stable operation, even in adverse conditions that reduce visibility and increase image noise with the RGB camera. Sensor fusion by combining RGB and infrared cameras is studied to improve accuracy in all light conditions. In conclusion, urbanization has simultaneously brought opportunities and challenges to the transportation system. ITS shows great potential to take this development chance and handle these challenges. This dissertation works on three data-oriented challenges and improves the accuracy and efficiency of vision-based traffic sensing algorithms. Several ITS applications are explored to demonstrate the effectiveness of the proposed methods, which achieve state-of-the-art accuracy and are far more efficient. In the future, additional research works can be explored based on this dissertation. With the continuing expansion of the sensor network, edge computing will be a more suitable system framework than cloud computing. Binary quantization and hardware-specific operator optimization can contribute to edge computing. Since data insufficiency is common in other transportation applications besides traffic detection, FSL will elevate traffic pattern forecasting and event analysis with a sequence model. For crowd monitoring, the next step will be motion prediction in bird's-eye view based on motion detection results.

Local and Global Neural Mechanisms Underlying the Robust Velocity Coding of Natural Images

Local and Global Neural Mechanisms Underlying the Robust Velocity Coding of Natural Images
Author: Paul Barnett
Publisher:
Total Pages: 263
Release: 2010
Genre: Motion perception (Vision)
ISBN:

Interpreting motion in the natural world presents a major challenge for visual systems. Natural scenes vary enormously in structure, luminance and contrast, all parameters known to modulate the response of biological motion detectors. Nevertheless, many animals overcome this challenge and adopt visually guided behaviour for which the accurate estimation of self-motion and image velocity is required. It is generally accepted that Reichardt correlator-like computations underlie local motion detection in insects. Reichardt correlators, however, generate ambiguous estimates of velocity, because they are sensitive to several additional image parameters, such as those mentioned above. How does the visual system generate accurate estimates of apparent image velocity when the elements underlying local motion detection produce ambiguous velocity signals? This thesis investigates the neural processing of image velocity. I performed sharp electrode intracellular recordings from identified motion sensitive neurons in the lobula plate of the hoverfly, Eristalis tenax. A series of natural and artificial images were used to investigate the processing of a vast range of scenes. I show that the horizontal system (HS) neurons have a remarkable capacity to estimate image velocity reliably for vastly different natural scenes. This property is at odds with the HS neurons' responses to experimenter-defined stimuli. I reveal several activity dependent features of the neural response that may reconcile the ability to accurately encode the velocity of natural images with the mechanisms underlying motion processing. Images that were initially weak neural drivers have long latencies, with responses continuing to increase in magnitude over several hundred milliseconds. Images that were initially strong neural drivers, reached peak responses more rapidly followed by significant reductions in response over longer time scales. Despite being different in sign and time course, these two activity dependent changes in response act as near-ideal normalisers for images that would otherwise produce highly variable response magnitudes. By analysing the time course of neural response and manipulating image contrast, I show that this property is likely to emerge from a combination of static and dynamic non-linarities. When image contrast is reduced, thus reducing the range of input signals to local motion detectors, the essential non-linearity of the Reichardt correlator model provides a good prediction of global responses. Thus, suggesting an important role for non-linear mechanisms being recruited by high contrast local features in the robust encoding of natural scenes. Finally, I use an experimental paradigm that reduces the influence of spatial integration and thus enables the analysis of responses equivalent to the outputs of individual local motion sensitive elements presynaptic to the HS neuron. I show evidence for an adaptive gain reduction that affects the sensitivity of individual motion detector responses to subsequent features. This gain reduction is facilitated by local neighbouring motion stimulation and is thus, well suited to take advantage of the predictable nature of natural scenes.