节点文献

视频场景中的群体目标分析研究

Research on Crowd Target in Video Scene

【作者】 乔伟

【导师】 王汇源;

【作者基本信息】 山东大学 , 信号与信息处理, 2010, 硕士

【摘要】 计算机视觉已成为一个非常活跃的研究领域,它涉及信号采集、图像处理、机器学习、模式识别、行为控制甚至认知科学等多门学科,主要研究视频图像序列中目标的检测、跟踪、行为分析与识别等问题。视频中目标检测和跟踪是计算机视觉的一个方向,目前这方面的研究大多数集中在单或多个物体和人的检测和跟踪。多目标检测和跟踪,也往往将检测和跟踪的目标局限在十个以内。随着人口数目的增加、城市化进程的加快、社会活动频繁增加,公共场合正变得越来越拥挤,大型的集会也日渐增多。于是,对群体活动视频监测的要求变得越来越迫切,但是目前很少有研究涉及群体目标检测和分析。针对这个问题,我们做了如下研究工作:首先,本文对运动场和光流场进行了概述。运动场是描述目标运动的矢量,在没有光照影响的情况下,光流场可以用来表示运动场。光流场是指图像亮度模式的表观运动,可以通过添加限制条件来求解光流约束方程来获得。Lucas-Kanade光流法是比较适合群体运动目标检测和分类的一种光流法,本文用此光流法计算光流场。混沌动力学中,拉格朗日方法尝试跟踪运动流中各像素点的运动轨迹,是处理流体的一种方法。由于群体运动目标的高密度性,可以被认作流体,用混沌动力学的方法处理。有限时间李亚普诺夫指数表示相近粒子间的混合和分离性,反映粒子间的分离程度。根据流图,通过计算龙哥库塔方程获得有限时间李亚普诺夫指数图像。文中比较了运用立体插值和三维反距离加权插值算法对结果的影响,得到运算简单三维反距离加权插值算法更适合实时的应用于群体运动目标的检测和分析的结论。根据获得的有限时间李亚普诺夫指数图像,可以获得运动区域。本文提出了一种改进的Bernsen自适应二值化算法获得有限时间李亚普诺夫指数图像的二值图像。然后用形态学的方法获得大概的运动区域。这时的运动区域存在一个问题是:存在一些“空洞”。为了处理这个问题,可以采用Freeman轮廓提取并填充的方法。在运动区域获得后,需要对其进行分析。本文的分析从两方面进行:方向和密度。对于方向分析,用光流场来获得。提出一种改进的K-均值聚类的方法获得的光流场方向图像。由于视频本身质量原因,得到的结果斑驳不堪。又提出一种小轮廓融合算法对上述结果做改进,有效地去除了各种杂质。对于密度分析,本文采用了纹理的方法。灰度共生矩阵分析是一种纹理分析的方法,其特征参数从不同方面描述图像的纹理。其中对比度反映沟纹的清晰程度,纹理的细致程度,可以通过它获得该运动区域的密度信息。根据各个方向不同的密度信息,通过贝叶斯分类,将不同方向的群体运动目标划分为:稀疏、中等和密集。本文各种算法均通过采用C语言并结合Intel OpenCV库实现。将密集群体系统作为混沌系统处理,首先求解FTLE场。求解龙哥库塔方程时采用反距离加权插值算法,并排除FTLE中标识为非混沌系统的点,表示流动的部分。再用形态学方法处理FTLE场图后获得运动区域。然后,对运动目标区域进行方向分析,使用了改进的K-均值聚类的方法,并提出小轮廓融合算法吸收杂质以优化分割结果。接着对运动目标区域进行密度分析,对分割后的各个区域求取GLCM,以对比度为标准获得密度信息。结果表明,提出的算法有效实现了对从多体到群体的运动目标检测和分析,对群体运动目标的研究有一定的探索意义。

【Abstract】 Computer vision has become a very active research area, which involves signal acquisition, image processing, machine learning, pattern recognition, behavior control and even cognitive science and many other subjects. It researches on the target of video image sequence for detection, tracking, behavior analysis and so on. Target detection and tracking in video is an aspect of computer vision. Current research is only concentrated on single or multiple targets and human monitoring and tracking. The number of targets is limited to 10 or less for multi-target detection and tracking.With the increase of population and the acceleration of urbanization process, public places is becoming more and more crowded and large gatherings are also increasing. As a result, the requirements for crowd target monitoring have become urgent. But few researches have involved in crowd target detection and analysis. Concerning this problem, we have done the following work:First, an overview of motion field and optical flow field is given in this thesis. Motion field are vectors that describe the target motion. Without considering the effect of illumination, the optical flow field can be used to express motion field. Optical flow field is referred to the apparent motion of image brightness patterns. It can be obtained by adding constraints to solve the optical flow constraint equation. Lucas-Kanade optical flow method is suitable for crowd target detection and classification. We use the method to calculate optical flow field in this thesis.In Chaotic dynamics, the Lagrangian algorithm is an approach of dealing with fluid, which attempts to track the motion trajectory of each pixel. Due to the high density, crowd target can be recognized as fluid. Finite time Lyapunov exponent(FTLE) shows the mixing and separation of particles. It reflects the degree of separation between particles. According to flow diagram, FTLE images can be obtained by calculating the Runge-Kutta-Fehlberg equations. The thesis compares cubic interpolation and inverse distance weighted interpolation algorithm in the impact on the results. The simple inverse distance weighted interpolation algorithm is more suitable for real-time crowd motion target detection and analysis.Motion area is obtained based on FTLE image. This thesis presents an improved Bernsen adaptive binary algorithm to get FTLE binary image. Then the motion area is obtained by morphology. There are holes in the motion area. In order to solve this problem, we can use Freeman contour extraction.The motion area needs to be analyzed. The analysis is in two ways:the direction and density. For the direction, optical flow field is used. An improved K-means clustering approach is presented to obtain optical flow direction image. Duo to the quality of video, the results is mottled. The small-contour fusion algorithm is proposed to improve the results, which effectively removes all impurities. For density analysis, this thesis adopts the approach of texture. Gray level co-occurrence matrix (GLCM) analysis is an approach of texture analysis. Its characteristic parameters describe the image texture from different aspects. Contrast reflects the clarity of texture. The density of motion area can be got. According to the different density of all directions and the Bayesian classification, different targets are classified:sparse, middle and dense.All the proposed algorithms are implemented by C and Intel OpenCV library. We treat crowd target as fluid. Firstly, FTLE field is got. For solving Runge-Kutta-Fehlberg equations, inverse distance weighted interpolation algorithm is used. Non-chaotic points are excluded, and the rest represent the flow region. Secondly, motion region can be obtained after morphological processing. Then, K-means clustering is applied for motion direction analysis. Impurities are absorbed by small-contour fusion algorithm to optimize the results. Finally, GLCM is obtained for density analysis. Density information is given by the contrast of GLCM. The results show that the algorithms are effective. It brings a way to crowd target research.

  • 【网络出版投稿人】 山东大学
  • 【网络出版年期】2010年 09期
节点文献中: 

本文链接的文献网络图示:

本文的引文网络