YouNiverse: Large-Scale Channel and Video Metadata from English-Speaking YouTube

Authors

  • Manoel Horta Ribeiro EPFL
  • Robert West EPFL

DOI:

https://doi.org/10.1609/icwsm.v15i1.18125

Keywords:

Social network analysis; communities identification; expertise and authority discovery

Abstract

YouTube plays a key role in entertaining and informing people around the globe. However, studying the platform is difficult due to the lack of randomly sampled data and of systematic ways to query the platform's colossal catalog. In this paper, we present YouNiverse, a large collection of channel and video metadata from English-language YouTube. YouNiverse comprises metadata from over 136k channels and 72.9M videos published between May 2005 and October 2019, as well as channel-level time-series data with weekly subscriber and view counts. Leveraging channel ranks from socialblade.com, an online service that provides information about YouTube, we are able to assess and enhance the representativeness of the sample of channels. Additionally, the dataset also contains a table specifying which videos a set of 449M anonymous users commented on. YouNiverse, publicly available at https://zenodo.org/record/4650046, will empower the community to do research with and about YouTube.

Downloads

Published

2021-05-22

How to Cite

Horta Ribeiro, M., & West, R. (2021). YouNiverse: Large-Scale Channel and Video Metadata from English-Speaking YouTube. Proceedings of the International AAAI Conference on Web and Social Media, 15(1), 1016-1024. https://doi.org/10.1609/icwsm.v15i1.18125