Toward an Efficient Deep Image Restoration Method

Author(s)
안남혁
Alternative Author(s)
Namhyuk Ahn
Advisor
손경아
Department
일반대학원 인공지능학과
Publisher
The Graduate School, Ajou University
Publication Year
2021-08
Language
eng
Keyword
Data efficiencyDeep neural networkImage restorationLow-level visionModel efficiencyMulti-modal distortion
Alternative Abstract
Image restoration is a classic but challenging problem in the computer vision field. In recent, deep learning-based methods have achieved superior performance by the huge capacity of neural networks and the large volume of the dataset. However, are these heavy models applicable in real-world applications? If not suitable, what are the directions the deep methods would take? To answer these questions, the thesis explore three aspects: model efficiency, data efficiency, and multi-modal distortion. In model efficiency, We define the "efficiency" as both network size and the computation cost to run the network. Many studies have focused on the former alone, but in reality, the latter one is the key ingredient because of the runtime latency and the battery consumption issues. To tackle this, We devise the network structure and rethinking the training strategy to maintain the performance as much as possible while effectively advance both efficiency aspects: network size, and the number of the operations. For data efficiency, We investigate the data augmentation and the unsupervised training in the image restoration task. The data augmentation method is fruitful when the training dataset is small or the network capacity is large without any computation cost in runtime. The unsupervised training assumes the scenario where only low-quality images exist, much challenging compared to the supervised regime. These two concepts have been well analyzed in the high-level vision field, but not many in the image restoration community. With both training strategies, We achieve the huge performance leap to the recent image restoration methods in many real-world scenarios and datasets. Last but not least, We tackle the multi-modal distortion, in particular, when multiple distortions corrupt the different regions of image. The single distortion restoration network or the distortion recognition-restoration pipeline system are not satisfactory in terms of both the performance and the efficiency when serving a model. In contrast, the proposed multi-expert network based on the multi-task learning and the analysis of the multi-modal distribution performs superior restoration accuracy with reasonable computation cost and good efficiency in model serving perspective.
URI
https://dspace.ajou.ac.kr/handle/2018.oak/20359
Fulltext

Appears in Collections:
Graduate School of Ajou University > Department of Artificial Intelligence > 4. Theses(Ph.D)
Files in This Item:
There are no files associated with this item.
Export
RIS (EndNote)
XLS (Excel)
XML

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

Browse