Statistical analysis of three-dimensional modeling from monocular video streams
by Roy Chowdhury, Amit K., Ph.D., University of Maryland, College Park, 2002, 151 pages; AAT 3080283

Abstract (Summary)

3D scene modeling from a video sequence is considered one of the most important problems in computer vision. A successful solution has many applications, including multimedia communications, surveillance, virtual reality, automatic navigation, and medical prognosis. One of the most powerful techniques for solving this problem is structure from motion (SfM). Briefly, the SfM problem is to recover the absolute or relative depth of static and moving objects from video acquired by one or more cameras. The most challenging case arises when only a monocular video is available and a dense depth estimate is required. Solving this problem requires a detailed understanding of the geometry of the 3D world and its 2D projections onto the image planes. However, the motion between adjacent frames of a video sequence is usually very small, which introduces large errors into its estimation. Hence, to obtain a satisfactory solution, it is important to understand the statistics of these errors and their interaction with the geometry of the problem. The overall aim of this thesis is to show how to combine the statistics describing the quality of the input video data with an understanding of the geometry, in order to obtain an accurate 3D scene reconstruction from a video sequence using the optical flow model.
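
As a rough illustration of why small inter-frame motion is troublesome, the following Python sketch uses a toy model that is not taken from the thesis. It assumes a camera translating sideways, so that the image flow of a point is focal * translation / depth, and it assumes bounded, zero-mean uniform noise on the measured flow. The function name mean_depth_estimate and all parameter values are hypothetical. Because depth is recovered by dividing by the noisy flow, the average recovered depth overshoots the true depth, and the overshoot grows as the translation shrinks.

```python
import random
import statistics


def mean_depth_estimate(true_depth, translation, focal=1.0,
                        noise_half_width=0.01, trials=20000, seed=0):
    """Average depth recovered from noisy flow in a toy lateral-translation model.

    Assumed model (illustrative only): flow = focal * translation / depth,
    with zero-mean uniform noise of the given half-width added to the flow.
    """
    rng = random.Random(seed)
    true_flow = focal * translation / true_depth
    if noise_half_width >= true_flow:
        # Keep the noisy flow strictly positive so the division is well defined.
        raise ValueError("noise must be smaller than the true flow")
    estimates = []
    for _ in range(trials):
        noisy_flow = true_flow + rng.uniform(-noise_half_width, noise_half_width)
        estimates.append(focal * translation / noisy_flow)
    return statistics.fmean(estimates)


true_depth = 10.0
# Larger inter-frame translation: flow is large relative to the noise.
bias_large_motion = mean_depth_estimate(true_depth, translation=0.5) - true_depth
# Smaller inter-frame translation: same noise, much weaker flow signal.
bias_small_motion = mean_depth_estimate(true_depth, translation=0.2) - true_depth

print(f"bias with larger motion:  {bias_large_motion:.3f}")
print(f"bias with smaller motion: {bias_small_motion:.3f}")
```

Even though the flow noise averages to zero, the depth estimate does not average to the true depth, because the reciprocal is a nonlinear function of the measurement. This is only a one-point caricature of the kind of error behavior the abstract describes.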

In our work, we pose the 3D reconstruction problem in an estimation-theoretic framework. We adopt the optical flow paradigm for modeling the motion between the frames of the video sequence. We show how the statistics of the errors in the input motion estimates are propagated through the 3D reconstruction algorithm and affect the quality of the output. We present a new result: that the 3D estimate is always statistically biased, and the magnitude of this bias is significant. In order to demonstrate our analysis in a practical application, we consider the problem of reconstructing a 3D model of a human face from video. An algorithm is proposed that obtains a robust 3D model by fusing two-frame estimates using stochastic approximation theory and then combines it with a generic face model in a Markov chain Monte Carlo optimization procedure. We address the question of how to automatically evaluate the quality of a 3D reconstruction from a video sequence, and present a criterion using concepts from information theory. Finally, we propose a probabilistic registration algorithm that extends the results of our work to create holistic 3D models from multiple video streams.
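
The idea of fusing many two-frame estimates by stochastic approximation can be sketched with a generic Robbins-Monro recursion. The Python example below is an assumption-laden illustration, not the algorithm of the thesis: it fuses scalar depth values for a single point, uses the textbook step size 1/k, and simulates the two-frame estimates with zero-mean Gaussian noise. The name fuse_estimates and the numeric values are hypothetical.

```python
import random


def fuse_estimates(estimates):
    """Fuse a stream of noisy estimates with a Robbins-Monro style recursion.

    Update rule: x_k = x_{k-1} + a_k * (y_k - x_{k-1}), with step a_k = 1/k.
    With this step size the result equals the running sample mean.
    """
    fused = 0.0
    for k, y in enumerate(estimates, start=1):
        fused += (y - fused) / k
    return fused


rng = random.Random(1)
true_depth = 10.0
# Simulated two-frame depth estimates (assumed zero-mean Gaussian noise).
two_frame_estimates = [true_depth + rng.gauss(0.0, 2.0) for _ in range(5000)]

fused_depth = fuse_estimates(two_frame_estimates)
print(f"fused depth: {fused_depth:.3f}")
```

The recursion processes one estimate at a time and never stores the full history, which is the practical appeal of this family of methods for long video sequences. Note that averaging of this kind reduces variance only; it would not by itself remove a systematic bias in the individual estimates.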

Indexing (document details)

Advisor: Chellappa, Rama
School: University of Maryland, College Park
School Location: United States -- Maryland
Keyword(s): Video streaming, Three-dimensional modeling, Monocular video
Source: DAI-B 64/03, p. 1413, Sep 2003
Source type: Dissertation
Subjects: Electrical engineering, Computer science
Publication Number: AAT 3080283
Document URL: http://proquest.umi.com/pqdweb?did=765367871&sid=16&Fmt=7&clientId=13708&RQT=309&VName=PQD
ProQuest document ID: 765367871


