TY  - JOUR
AU  - Zhao, Chenye
AU  - Mangat, Jasmine
AU  - Koujalgi, Sujay
AU  - Squicciarini, Anna
AU  - Caragea, Cornelia
PY  - 2022/05/31
Y2  - 2024/03/28
TI  - PrivacyAlert: A Dataset for Image Privacy Prediction
JF  - Proceedings of the International AAAI Conference on Web and Social Media
JA  - ICWSM
VL  - 16
IS  - 1
SE  - Dataset Papers
DO  - 10.1609/icwsm.v16i1.19387
UR  - https://ojs.aaai.org/index.php/ICWSM/article/view/19387
SP  - 1352-1361
AB  - Image privacy issues have become an important challenge as millions of images are being shared on social networking sites every day. Often due to users' lack of privacy awareness and social pressure, users' posted images reveal sensitive information and may be easily used to their detriment. To address these issues, several recent studies have proposed machine learning models to automatically identify whether an image contains private information. However, progress on this important task has been hampered by the absence of reliable, publicly available, up-to-date datasets. To this end, we introduce PrivacyAlert, a dataset developed from recent images extracted from Flickr and annotated with privacy labels (private or public). Our data collection process is based on state-of-the-art privacy taxonomy and captures a comprehensive set of image types of various sensitivity. We perform a comprehensive analysis of our dataset and report image privacy prediction results using classic and deep learning models to set the ground for future studies. Our dataset is publicly available at: https://doi.org/10.5281/zenodo.6406870.
ER  - 