Research on Deep Learning-based Computer Vision (김형규)
- Computer Vision is one of the latest areas of computer science that studies the part of a machine’s vision
- The DMS Vision team is working on an interactive healthcare cleaning robot system for the Silver Generation.
- Object detection to detect people,
- Face recognition to recognize a particular user,
- Emotion Recognition for monitoring user emotions,
- Action Recognition to recognize the user’s motion,
- Pose Estimation to perform the user’s exercise assistance role.
Face and Emotion Recognition (장이나)
그림 1
Thesis Title: Multi-task Emotion Recognition Based on Context-aware and Attention Module(장이나)
- Summary:Emotion recognition is an important research topic in the field of computer vision, and is the primary problem in affective computing, as well as the focus and difficulty of research. It would have a better impact on our lives if we could use inexpensive machines to monitor and understand the emotional information of others. However, there is currently no system that can do such a job. Because human emotional states are expressed in various ways, such as speech, expressions, actions, the environment in which a person is living, and various physiological signals, it is difficult to accurately reflect human emotions by relying on a single characteristic parameter and its features. We propose a framework, Muti-task Emotion Recognition (MTER), with four main models: face feature extraction model, body feature extraction model and context (scene) feature extraction model, and then fusion classification model. It is used to analyze images containing multiple people and recognize fused emotions based on face facial features, body features, and contextual information. The face feature and body feature extraction module takes the face and body parts of the image as input and the information implicit in the image such as facial expression, head position and body pose is extracted. In order to make the emotion recognition actively applied to real life, Fine-tuned Mobilenet lightweight network is utilized to reduce the computational effort and increase the recognition speed.
Survey (박상윤)
- Related Work Suvey
- Emotion Recognition(SOTA)
- Multitask Emotion Recognition with Incomplete Labels
- Deep-Emotion: Facial Expression Recognition Using Attentional Convolutional Network
- Multitask Emotion Recognition with Incomplete Labels
- Spatial Transformer Networks
- Emotion Recognition(SOTA)
Pose and Activity Recognition(루카이)
Thesis Title AiPE: Attention in Pose Estimation A transformer based pose estimation method(루카이)
- Summary :We propose a bottom-up human pose estimation method that combines the latest achievements in computer vision-transformer. Compared with the original version using VGG, our method uses a multi-headed self-attention mechanism to better localize the key points of the human body and connect the limbs. Even for some blurred, low-resolution images we can still perform human pose estimation, and our method achieves very good results in single and few people situations. However, in dense crowd conditions, our method does not perform human pose estimation as well as VGG, which stems from the weakness of transformer in analyzing small targets. It may also be due to the fact that we did not use the four block stack like the original swin transformer in order to reduce the computational effort and control the downsampling.
Survey (조문충)
- Related Work Suvey
- Activity Recognition(SOTA)
- Multi-Label Activity Recognition using Activity-specific Features and Activity Correlations
- Masked Autoencoders Are Scalable Vision Learners
- Activity recognition in video or picture files uses the attention mechanism to recognize more efficiently and quickly. At the same time, multiple activity recognition in a single video file, and related papers such as multi-modal recognition methods are also under investigation.
- Vision Transformer
- Activity Recognition(SOTA)
Research Results
- [Master Thesis] 장이나 석사논문 Multi-task Emotion Recognition Based on Context-aware and Attention Module,2022.06
- [Master Thesis] 루카이 석사논문 AiPE: Attention in Pose Estimation A transformer based pose estimation method, 2022.06