Daily Arxiv

This is a page that curates AI-related papers published worldwide.
All content here is summarized using Google Gemini and operated on a non-profit basis.
Copyright for each paper belongs to the authors and their institutions; please make sure to credit the source when sharing.

Refine-IQA: Multi-Stage Reinforcement Finetuning for Perceptual Image Quality Assessment

Created by
  • Haebom

Author

Ziheng Jia, Jiaying Qian, Zicheng Zhang, Zijian Chen, Xiongkuo Min

Outline

This paper proposes Refine-IQA, a novel framework that applies reinforcement learning fine-tuning (RFT) to low-level vision domains, particularly image quality assessment (IQA). To address the rule-based reward and low-level visual quality recognition shortcomings of existing RFT-based IQA methods, the Refine-Perception-20K dataset is constructed and a multi-task reward function is designed to enhance the model's visual quality recognition. In the second stage, a probability difference compensation strategy is introduced to supervise the "thinking" process. As a result, Refine-IQA achieves excellent performance on both recognition and scoring tasks, and in particular, it excels on quality interpretation benchmarks.

Takeaways, Limitations

Takeaways:
We present a novel framework for effectively applying RFT to IQA, a low-level vision domain.
Improving the visual quality perception and 'thinking' process of the model through multi-task reward functions and probability difference compensation strategies.
Contributed to the advancement of RFT-based IQA research by building the Refine-Perception-20K dataset.
Achieved excellent performance on both recognition and scoring tasks, as well as quality interpretation benchmarks.
Limitations:
Further validation is needed regarding the scale and diversity of the Refine-Perception-20K dataset.
Further experiments are needed to evaluate the generalization performance of the proposed method.
Performance may be limited for certain types of image distortion.
👍