3D Reconstruction from Image Sequences Captured by a Hand-Held Camera

【Author】 Tang Li

【Author information】 Xidian University, Signal and Information Processing, 2003, PhD

【Abstract (translated from the Chinese)】 This thesis studies how to recover a 3D solid model from an uncalibrated image sequence and examines in depth several of the key techniques involved, in particular dense matching of stereo image pairs. Its focus, in both theory and practice, is how to carry out 3D Euclidean reconstruction from a long image sequence in the presence of occlusion and ultimately obtain the complete structure of the object. The algorithms presented here can recover a complete 3D solid model with good realism. The main results are as follows:

1. A new two-level algorithm for reliably matching seed points is proposed. Building on image edge extraction, the algorithm first compares the edge similarity of candidate matches for the target point; this feature matching is simple and reliable. It then compares intensity (gray-level) similarity within a relatively small search range, which yields an accurate match for the target point. The algorithm effectively avoids the matching errors caused by repetitive patterns.

2. A propagation-based dense matching algorithm built on image partitioning is proposed. It applies not only to unrectified image pairs but also to image pairs with large disparities and to sparsely textured image regions. Partitioning the image with the Voronoi diagram of the seed points, and using the feature-tracking results as the starting points of match propagation, effectively eliminates the accumulation of matching errors. Propagation greatly improves matching efficiency and also improves accuracy.

3. A new 3D reconstruction algorithm is proposed that can recover the complete 3D structure of the target object. First, the whole image sequence is divided into several subsequences such that none of the reconstructed points in a subsequence is occluded. An iterative factorization algorithm then computes a local projective reconstruction of the object, and self-calibration upgrades the projective reconstruction to a Euclidean one. At this stage the partial reconstructions obtained from the subsequences are expressed in different coordinate frames. Transferring them into a single frame by a set of similarity transformations gives the 3D structure of the whole object. Finally, the projection matrices and the 3D point coordinates are globally optimized by minimizing the reprojection error. The outstanding advantage of this algorithm is that it recovers the complete structure of an object from a long image sequence, overcoming the loss of data points caused by occlusion.

4. A global optimization algorithm suited to missing data is proposed. To compensate for the drawback of splitting a long image sequence into several subsequences, global optimization with a weighting matrix is applied to the merged structure, minimizing the reprojection error to improve the overall accuracy of the data (the projection matrices and the 3D coordinates of the reconstructed points). By introducing the weighting matrix, the algorithm handles visible and occluded points in the same way and improves the consistency of the data.

5. Using triangulation with edge constraints, 3D reconstruction is carried out on simulated data and on real image sequences with occlusion; the long image sequence is captured in a full circle around the target object. Each reconstructed point is visible in about 10 consecutive images and occluded in the rest. The reconstruction algorithm recovers the complete geometric structure of the object well. Finally, constructing a corresponding scene that mixes virtual and real content further shows that the algorithm is accurate and practical. A 3D scene reconstructed in this way looks more realistic than a purely virtual one.

Problems that need further study in future work: continued study of 3D reconstruction under occlusion, further improving the accuracy and practicality of the algorithm and reducing the steps that require manual intervention; and continued study of camera calibration, adding external constraints to improve its accuracy. In addition, the intrinsic and extrinsic camera parameters can change considerably with small differences in the projection matrix, so the stability of their solution also needs further study.
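As an illustration of the image partitioning described in contribution 2, the sketch below labels each pixel with its nearest seed point, which is a discrete Voronoi partition, and lets a match spread only between pixels in the same cell. It is a minimal sketch, not the thesis's implementation: the names `voronoi_labels` and `may_propagate`, the brute-force nearest-seed search, and the squared Euclidean distance are all assumptions made for this example.

```python
from typing import List, Tuple

Seed = Tuple[int, int]  # (row, col) of a reliably matched seed point (assumed format)


def voronoi_labels(height: int, width: int, seeds: List[Seed]) -> List[List[int]]:
    """Label every pixel with the index of its nearest seed.

    Brute-force discrete Voronoi partition using squared Euclidean
    distance; ties go to the seed with the lower index.
    """
    labels = []
    for r in range(height):
        row = []
        for c in range(width):
            nearest = min(
                range(len(seeds)),
                key=lambda i: (seeds[i][0] - r) ** 2 + (seeds[i][1] - c) ** 2,
            )
            row.append(nearest)
        labels.append(row)
    return labels


def may_propagate(labels: List[List[int]], src: Seed, dst: Seed) -> bool:
    """Hypothetical rule: a match may spread from src to dst only if both
    pixels lie in the same Voronoi cell, so a bad match stays in one cell."""
    return labels[src[0]][src[1]] == labels[dst[0]][dst[1]]


if __name__ == "__main__":
    cells = voronoi_labels(1, 5, [(0, 0), (0, 4)])
    print(cells)
    print(may_propagate(cells, (0, 1), (0, 2)))
    print(may_propagate(cells, (0, 2), (0, 3)))
```

Confining propagation to a cell is what keeps one wrong correspondence from contaminating the whole image, at the cost of needing at least one reliable seed per region of interest.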

【Abstract】 This research addresses the recovery of a realistic textured model from image sequences and several critical issues related to it, such as dense matching of stereo images. The thesis investigates, in both theory and practice, the feasibility of recovering the complete structure of an object from a long image sequence captured around it in the presence of occlusions. In this setting, some points may be visible in a number of frames and then disappear in the following frames. The main contributions of the thesis are as follows:

1. We propose a new two-level matching algorithm for the seed points used in propagation. First, the algorithm compares edge similarity around the target pixel, based on edge extraction. This level of feature matching is both simple and reliable. Intensity similarity is then compared within a small search window constrained by the results of the first-level matching. In this way the corresponding point is located accurately. The algorithm efficiently avoids mismatches caused by repetitive patterns.

2. A novel and efficient dense matching method is proposed, based on propagation over the Voronoi decomposition of the images. Its significant merit is that it can be applied to a wide range of image pairs, including those with large disparities, with or without rectification, and it covers both textured and less textured parts of the images. Dense matching begins from a number of seed points that are reliably matched by feature tracking. Correspondences are then propagated from each seed. Decomposing the images into a Voronoi diagram confines bad propagations to a single cell, which improves both the efficiency and the accuracy of dense matching.

3. A novel 3D reconstruction algorithm for missing data is presented, by which the complete structure of the target can be recovered. First, the images taken around the target are divided into several subsets, each of which shares common feature points. Second, Euclidean reconstruction is performed by iterative factorization, with all of these points visible in every image of a given subset. The results from the different subsets are then brought into a common coordinate frame by similarity transformations. Finally, global optimization is applied to minimize the reprojection error, which refines the data and produces a jointly optimal 3D structure. A significant merit of the algorithm is that it can deal with occlusions, so a complete 3D model is recovered from the long image sequence.

4. A new global optimization algorithm for missing data is proposed. To remedy the drawback of cutting a long image sequence into several subsets in the 3D reconstruction algorithm, global optimization with a weighting matrix is applied to refine the results, with the visible and missing data arranged together. The reprojection error is minimized over the estimated camera matrices and 3D points. In this optimization, the visible points and the missing data are treated uniformly by assigning them different weights. Experiments demonstrate that the algorithm is both effective and accurate.

5. The 3D reconstruction algorithm with constrained triangulation has been tested on both simulated data and real images, with satisfactory results. The long image sequence is taken from 360 degrees around the target. Each point is visible in about 10 consecutive images and occluded in the rest. The complete structure of the building is recovered with realistic textures, and an augmented scene is generated to demonstrate the good performance of the algorithm. Structures recovered in this way look more realistic than purely virtual scenes.

Future research on this topic includes: continuing the work on missing data to further improve its accuracy and feasibility; reducing human interaction in the computation; and improving the robustness of self-calibration using prior knowledge of orthogonal / parallel lines and ort…
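The weighted objective in contribution 4 can be sketched as a sum of squared reprojection errors in which each (image, point) pair carries a weight. This is a sketch under stated assumptions, not the thesis's formulation: the abstract does not give the actual weights, so the example assumes weight 0 for occluded observations and 1 for visible ones, and the function names and data layout are invented for illustration.

```python
from typing import Optional, Sequence, Tuple

Matrix = Sequence[Sequence[float]]          # 3x4 camera (projection) matrix
Point3 = Sequence[float]                    # (x, y, z)
Obs = Optional[Tuple[float, float]]         # measured (u, v), or None if occluded


def project(P: Matrix, X: Point3) -> Tuple[float, float]:
    """Project a 3D point with a 3x4 camera matrix (homogeneous divide)."""
    Xh = (X[0], X[1], X[2], 1.0)
    u, v, w = (sum(P[r][k] * Xh[k] for k in range(4)) for r in range(3))
    return (u / w, v / w)


def weighted_reprojection_error(
    cameras: Sequence[Matrix],
    points: Sequence[Point3],
    observations: Sequence[Sequence[Obs]],   # observations[i][j]: point j in image i
    weights: Sequence[Sequence[float]],      # weights[i][j]; assumed 0.0 when occluded
) -> float:
    """Sum over images i and points j of w_ij * ||project(P_i, X_j) - x_ij||^2."""
    total = 0.0
    for i, P in enumerate(cameras):
        for j, X in enumerate(points):
            w = weights[i][j]
            if w == 0.0:
                continue  # occluded entry contributes nothing; its observation may be None
            u, v = project(P, X)
            ou, ov = observations[i][j]
            total += w * ((u - ou) ** 2 + (v - ov) ** 2)
    return total


if __name__ == "__main__":
    P = [[1.0, 0.0, 0.0, 0.0], [0.0, 1.0, 0.0, 0.0], [0.0, 0.0, 1.0, 0.0]]
    pts = [(2.0, 4.0, 2.0), (1.0, 1.0, 1.0)]
    obs = [[(1.0, 3.0), None]]               # second point is occluded in this image
    print(weighted_reprojection_error([P], pts, obs, [[1.0, 0.0]]))
```

A nonlinear least-squares solver would minimize this quantity over the camera matrices and 3D points; the sketch only evaluates the cost for fixed estimates.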

【Key words】 Epipolar geometry; 3D reconstruction; Self-calibration; Dense matching; Occlusion

【Online publication submitter】 Xidian University

【Classification number】 TP391.41
【Times cited】 11
【Downloads】 851