节点文献

基于人脸视频的疲劳检测

Fatigue Recognition Based on Face Videos

【作者】 郭秀明

【导师】 陈震;

【作者基本信息】 吉林大学 , 计算机应用技术, 2008, 硕士

【摘要】 疲劳驾驶是交通事故的一个重要原因,对财产和生命造成巨大的损失,驾驶员疲劳检测对提高交通安全具有非常重要的意义。驾驶员在疲劳时面部会出现许多特有的表现。本文提出了一个非侵入式系统,通过融合两个和疲劳相关的视觉特征来检测疲劳。文中提取了来自眼部、嘴部的视觉特征来分析疲劳,表现为疲劳时出现的眼睛闭合和打哈欠。人在疲劳时,眼睛闭合的时间和频率都会变大。为了判定眼睛是否闭合,本文首先在LBP处理的基础上,提取眼睛分区的直方图特征,然后用AdaBoost算法选择最具判别能力的直方图特征构建强分类器得到眼部的视觉特征。为了判定是否打哈欠,本文通过灰度投影定位左右嘴角位置,然后用Gabor小波提取左右嘴角的纹理特征,最后通过LDA分类判别得到嘴部的视觉特征。在决策层,本文提取的两个视觉特征通过信息融合的方法得到最终的疲劳判定结果。为了测试提出的算法,本文自建了一个人脸疲劳数据集合。在自然环境中,用摄像头采集了10人的面部视频数据。然后从中提取面部疲劳的图像,创建了人脸疲劳数据库。库中包括5女5男的1600幅图像。平均每个人有160幅图像(80幅正常图像和80幅疲劳的图像)。试验结果表明本文提出的方法比单个视觉特征可靠性和准确性都有比较大提高。

【Abstract】 Driver fatigue is a major factor in the causes of automobile accidents.In recent years.Traffic transport project of our country is developing quickly, but the malign traffic transport is also ascending. 78.5 percent of deaths by all kinds of accidents is caused by traffic accident during 2002. The traffic accidents caused by driver fatigue took part about 20 percent, which exceeded 40 percent of the most serious traffic accidents. It is reported that traffic accidents caused by driver fatigue take large percent in America, Japanese and England and so on. It can be seen that driver fatigue has become the main hidden trouble of traffic accidents. Therefore, detecting driver fatigue is extremely important to improving transportation safety.Driver fatigue monitoring technologies can be divided into subjective methods and objective methods. Subjective methods get driver fatigue levels from subjective questionaries, note tables made by drivers, questionaries about the habit of sleep, and Stanford measurement tables of sleep. Objective methods detect driver fatigue from the parameters about drivers and cars. Research on fatigue monitoring can make the measures for the prevention of dangers caused by driver fatigue more convenient and direct. In the past decade, more and more attention has been paid to the technology research and equipment development for fatigue monitoring. We can divide the methods for driver fatigue monitoring into three categories: physiological parameters, individual characters of drivers and performance of cars under driver control based on the information source. When people are in fatigue state, their circadian characters vary much. Circadian signals have some metabolic rules. Investigators get people’s circadian signals such as electroencephalogram, cardiogram, electro-oculogram, electromyogram and so on through med equipment to detect drivers’fatigue in real time. Detecting fatigue based on drivers’characters is to get drivers’videos through cameras, analyze the videos using computer optic method and judge whether drivers are in fatigue state or not according the results of the characters information. When drivers are in fatigue state, the veracity, agility and smartness of their actions all will be influenced, so it is feasible to detect drivers’fatigue by how the cars are controlled by drivers.When a driver fatigues, he will take many special visual cues on his face. In this paper we present a novel framework based on computer vision for driver fatigue detection that combines various visual cues typically related to fatigue. We combine visual cues from mouths and eyes systematically which characterize eye closed and yawning to infer driver fatigue.Eyes’changes are the most important character to reflect peoples’fatigue state, when man is tired, his eyelids stir frequently and eyes close for longer time. Judge eyes’state through detecting eyes’character seen as a charater to detect fatigue. AdaBoost is used to extract the most discriminative features from the LBP features of eye areas and constructs a highly accurate classifier to get the eye visual cue. LBP arithmetic operator is not parameterization which describes local dimensional structure. LBP is a strong tool to describe texture. Because of strong texture-distinguish ability and smaller compute cost, LBP has been used in pattern recognition widely. Basic LBP arithmetic operator converts the 3*3 pixels around every pixel of the image into binary numeric according to the grey level of the centre and takes the result as a binary digit. LBP has 256 possible values, so LBP codes can denote 256 LBP modes, and every value denotes a LBP local mode. In this article, we use LBP8,2 arithmetic operator to distill character of every pixesl and compute the LBP value of every pixel, the original eye image and LBP image then come into being.The article uses Adaboost arithmetic to choose the LBP character of eyes and train strong categorizer. Adaboost arithmetic is the most popular Boosting arithmetic, and the AdaBoosting arithmetic advanced by Freund and Schapire gives a very simple and effective study method to choose character and not-linearly classfy.AdaBoost has achieved much success in detecting face and other application areas. AdaBoost is an ensemble study algorithm. It creates a series of weak categorizers. AdaBoost conclude advanced arithmetic and basic arithmetic, basic or weak arithmetic is actually to choose suitable discriminated function, advanced arithmetic is a iterative process, it chooses the weak categorizer according to their capabilities in the training gather, then a strong categorizer forms by the linear combination of weak categorizers. Yawning is also an important cue of driver fatigue. Fusion the cues of eye closed and mouth yawning can get a more accurate and robust fatigue detection. In the paper, to get the mouth cue, we propose to use gray projection to locate the left and right mouth corners, extract the texture features by Gabor wavelets, and apply LDA to classify the mouth features to detect yawning. When drivers yawn, the texture of the mouth has great change while the geometric feature changes. The texture of the mouth includes more information. Gabor wavelets are widely used in computer vision, texture analysis and object recognition. The two dimensional Gabor wavelets can get the best resolution in spatial field, temporal field and direction. In the experiments, we use a discrete form of the Gabor wavelets which include 5 frequencies and 8 orientations. The image convolves with the real and imaginary parts of the Gabor wavelets at the left and right moth corners. Then, at each moth corner, we get 40 complex Gabor wavelet coefficients. Through the Gabor transformation, we can represent the texture of the left and right mouth corners with a vector of 80 elements. The purpose of the Fisher linear discriminant is to find a feature projection space which minimizes the ratio of the within class scatter of the projected samples to the between class scatter. That is to say the samples of different classes are separated optimally i.e. the more the scatter between classes is, the better it is while the samples from the same class are congregated i.e. the less the scatter within the classed is, the better it is. So, the samples can be separated correctly in the projection space.The analysis and understanding of fatigue based on single character have some localization, and the dubious factors may influence veracity and dependability of some fatigue characters. Therefor, many researchers fuse many characters information to analyze fatigue state. Analyse of fatigue state through fusing many characters information is to distinguish whether fatigue brings or not through two or more of blink, staring direction, gesture of head, ship of mouth, and expression of face and so on. In the article, information amalgamation occurs in the decision-making level. The result through matching of eyes’clue and mouth’clue composites a character vector which is input into LDA, and the last decision-making comes from the matching of the LDA’s output and the stencil.The method advanced by the article is tested by ten people in real life, which contain changes of gestures and illumination. The result indicates that visual fatigue clues of the article have stronger dependability and veracity.

【关键词】 LBPAdaBoostGaborLDA分类信息融合
  • 【网络出版投稿人】 吉林大学
  • 【网络出版年期】2008年 10期
  • 【分类号】TP391.41
  • 【被引频次】2
  • 【下载频次】283
节点文献中: 

本文链接的文献网络图示:

本文的引文网络