We Can Explain Your Research in Layman's Terms: Towards Automating Science Journalism at Scale

Authors

  • Rumen Dangovski Massachusetts Institute of Technology
  • Michelle Shen Massachusetts Institute of Technology
  • Dawson Byrd Massachusetts Institute of Technology
  • Li Jing Massachusetts Institute of Technology
  • Desislava Tsvetkova Sofia University
  • Preslav Nakov Qatar Computing Research Institute, HBKU
  • Marin Soljačić Massachusetts Institute of Technology

DOI:

https://doi.org/10.1609/aaai.v35i14.17507

Keywords:

Applications

Abstract

We propose to study Automating Science Journalism (ASJ), the process of producing a layman's terms summary of a research article, as a new benchmark for long neural abstractive summarization and story generation. Automating science journalism is a challenging task as it requires paraphrasing complex scientific concepts to be grasped by the general public. Thus, we create a specialized dataset that contains scientific papers and their Science Daily press releases. We demonstrate numerous sequence to sequence (seq2seq) applications using Science Daily with the aim of facilitating further research on language generation, which requires extreme paraphrasing and coping with long research articles. We further improve the quality of the press releases using co-training with scientific abstracts of sources or partitioned press releases. Finally, we apply evaluation measures beyond ROUGE and we demonstrate improved performance for our method over strong baselines, which we further confirm by quantitative and qualitative evaluation.

Downloads

Published

2021-05-18

How to Cite

Dangovski, R., Shen, M., Byrd, D., Jing, L., Tsvetkova, D., Nakov, P., & Soljačić, M. (2021). We Can Explain Your Research in Layman’s Terms: Towards Automating Science Journalism at Scale. Proceedings of the AAAI Conference on Artificial Intelligence, 35(14), 12728-12737. https://doi.org/10.1609/aaai.v35i14.17507

Issue

Section

AAAI Technical Track on Speech and Natural Language Processing I