An Online Presentation Slide Assessment System Using Visual and Semantic Segmentation Features

Authors

  • Shengzhou Yi The University of Tokyo
  • Junichiro Matsugami Rubato Co., Ltd.
  • Hiroshi Yumoto P&I Information Engineering Co., Ltd.
  • Toshihiko Yamasaki The University of Tokyo

DOI:

https://doi.org/10.1609/aaai.v37i13.27090

Keywords:

Presentation Slide, Web Education Service, Feature Learning, Semantic Segmentation

Abstract

In this study, we present a new presentation slide assessment system that can extract the structural features from any slide file formats. Our previous work used a neural network to identify novice vs. well-designed presentation slides based on visual and structural features. However, the structural feature extraction was only applicable to PowerPoint files. To solve this problem, we extract the semantic segmentation from the slide images as a new format of structural features. The proposed multi-modal Transformer extracts the features from the original images and semantic segmentation results to assess the slide design. The prediction targets are the top-10 checkpoints pointed out by the professional consultants. Class-imbalanced learning and multi-task learning methods are also applied to improve the accuracy. The proposed model only requiring the slide images achieved an average accuracy of 81.67% that is comparative to the performance of the previous work requiring the PowerPoint files.

Downloads

Published

2023-09-06

How to Cite

Yi, S., Matsugami, J., Yumoto, H., & Yamasaki, T. (2023). An Online Presentation Slide Assessment System Using Visual and Semantic Segmentation Features. Proceedings of the AAAI Conference on Artificial Intelligence, 37(13), 16494-16496. https://doi.org/10.1609/aaai.v37i13.27090