In computer vision, understanding humans is a crucial and challenging goal, as it requires recognizing and interpreting human presence and behavior in images or videos. Within this domain, human parsing is a particularly demanding task: it requires accurately locating the human region and partitioning it into multiple semantic parts. As a dense prediction task, it calls for substantial computational power and high-precision models. With the continued development of computer vision technologies, human parsing has been widely applied to other human-centric tasks, such as pose estimation and human image generation, and these applications are expected to play an increasingly important role in future artificial intelligence research.

<br>To achieve real-time human parsing tasks on devices with limited computational re-
<br>sources, we have designed and introduced a lightweight human parsing model. We chose
<br>
<br>Resnet18 as the core network structure and simplified the traditional pyramid module used
<br>
<br>to obtain high-definition contextual information, thus significantly reducing the complex-
<br>ity of the model. Additionally, to enhance the parsing accuracy of the model, we integrated
<br>
<br>a spatial attention fusion strategy. Our lightweight model exhibits efficient performance
<br>and achieves high segmentation accuracy on the commonly used dataset for human parsing
<br>tasks, Look into Person (LIP). Although traditional models perform excellently in terms of
<br>segmentation accuracy, their high complexity and abundance of parameters restrict their
<br>use on devices with limited computational resources. To further improve the accuracy of
<br>
<br>our lightweight network, we also implemented knowledge distillation techniques. The tra-
<br>ditional knowledge distillation method uses the Kullback-Leibler (KL) divergence to match
<br>
<br>the prediction probability scores of teacher-student models. However, this approach may
<br>be ineffective at learning useful knowledge when there is a significant difference between
<br>the teacher and student networks. Therefore, we adopted a new distillation standard,
<br>based on inter-class and intra-class relationships in prediction results, which significantly
<br>improves parsing accuracy. Empirical evidence has shown that, while maintaining high
<br>segmentation accuracy, our lightweight model has substantially reduced the number of
<br>parameters, thereby achieving our expected goals.