Knowledge Guided Semi-supervised Learning for Quality Assessment of User Generated Videos

Shankhanil Mitra; Rajiv Soundararajan

doi:10.1609/aaai.v38i5.28221

Authors

Shankhanil Mitra Indian Institute of Science
Rajiv Soundararajan Indian Institute of Science

DOI:

https://doi.org/10.1609/aaai.v38i5.28221

Keywords:

CV: Video Understanding & Activity Analysis, CV: Computational Photography, Image & Video Synthesis, CV: Representation Learning for Vision, ML: Semi-Supervised Learning, ML: Unsupervised & Self-Supervised Learning

Abstract

Perceptual quality assessment of user generated content (UGC) videos is challenging due to the requirement of large scale human annotated videos for training. In this work, we address this challenge by first designing a self-supervised Spatio-Temporal Visual Quality Representation Learning (ST-VQRL) framework to generate robust quality aware features for videos. Then, we propose a dual-model based Semi Supervised Learning (SSL) method specifically designed for the Video Quality Assessment (SSL-VQA) task, through a novel knowledge transfer of quality predictions between the two models. Our SSL-VQA method uses the ST-VQRL backbone to produce robust performances across various VQA datasets including cross-database settings, despite being learned with limited human annotated videos. Our model improves the state-of-the-art performance when trained only with limited data by around 10%, and by around 15% when unlabelled data is also used in SSL. Source codes and checkpoints are available at https://github.com/Shankhanil006/SSL-VQA.

Knowledge Guided Semi-supervised Learning for Quality Assessment of User Generated Videos

Authors

DOI:

Keywords:

Abstract

Downloads

Published

How to Cite

Issue

Section

Information