Understanding the Semantic Structures of Tables with a Hybrid Deep Neural Network Architecture

Kyosuke Nishida; Kugatsu Sadamitsu; Ryuichiro Higashinaka; Yoshihiro Matsuo

doi:10.1609/aaai.v31i1.10484

Authors

Kyosuke Nishida NTT Corporation
Kugatsu Sadamitsu NTT Corporation
Ryuichiro Higashinaka NTT Corporation
Yoshihiro Matsuo NTT Corporation

Keywords:

Web Tables, Classification, Deep Learning, Recurrent Neural Networks, Convolutional Neural Networks

Abstract

We propose a new deep neural network architecture, TabNet, for table type classification. Table type is essential information for exploring the power of Web tables, and it is important to understand the semantic structures of tables in order to classify them correctly. A table is a matrix of texts, analogous to an image, which is a matrix of pixels, and each text consists of a sequence of tokens. Our hybrid architecture mirrors the structure of tables: its recurrent neural network (RNN) encodes the sequence of tokens in each cell to create a 3D table volume, like image data, and its convolutional neural network (CNN) captures semantic features, e.g., the existence of rows describing properties, to classify tables. Experiments using Web tables with various structures and topics demonstrated that TabNet achieved considerable improvements over state-of-the-art methods specialized for table classification and over other deep neural network architectures.
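
The data flow described in the abstract (tokens in each cell, then a rows x columns x features volume, then convolution and classification) can be illustrated with a toy sketch. This is not the authors' implementation: it assumes a plain tanh recurrent unit, a single 2x2 convolution with global max pooling, random untrained weights, and a hash-like toy embedding. All names and sizes (`EMBED_DIM`, `HIDDEN_DIM`, `NUM_FILTERS`, `NUM_CLASSES`, `encode_cell`, `table_to_volume`, `classify`) are hypothetical and chosen only to make the shapes visible.

```python
import math
import random

random.seed(0)

# Hypothetical sizes, chosen only for illustration (not taken from the paper).
EMBED_DIM = 4
HIDDEN_DIM = 3
KERNEL = 2
NUM_FILTERS = 5
NUM_CLASSES = 2


def rand_matrix(rows, cols):
    return [[random.uniform(-0.5, 0.5) for _ in range(cols)] for _ in range(rows)]


# Untrained random weights: the sketch shows data flow, not learned behavior.
W_in = rand_matrix(HIDDEN_DIM, EMBED_DIM)
W_rec = rand_matrix(HIDDEN_DIM, HIDDEN_DIM)
filters = [
    [[[random.uniform(-0.5, 0.5) for _ in range(HIDDEN_DIM)]
      for _ in range(KERNEL)] for _ in range(KERNEL)]
    for _ in range(NUM_FILTERS)
]
W_out = rand_matrix(NUM_CLASSES, NUM_FILTERS)


def embed(token):
    # Toy deterministic embedding seeded by character codes (an assumption).
    rng = random.Random(sum(ord(ch) for ch in token))
    return [rng.uniform(-1.0, 1.0) for _ in range(EMBED_DIM)]


def encode_cell(text):
    # Simple tanh RNN over the cell's tokens; the final state represents the cell.
    h = [0.0] * HIDDEN_DIM
    for token in text.split():
        x = embed(token)
        h = [
            math.tanh(
                sum(W_in[i][j] * x[j] for j in range(EMBED_DIM))
                + sum(W_rec[i][k] * h[k] for k in range(HIDDEN_DIM))
            )
            for i in range(HIDDEN_DIM)
        ]
    return h


def table_to_volume(table):
    # rows x columns x HIDDEN_DIM, analogous to height x width x channels.
    return [[encode_cell(cell) for cell in row] for row in table]


def conv_max_pool(volume):
    # One KERNEL x KERNEL convolution, ReLU, then global max pooling per filter.
    rows, cols = len(volume), len(volume[0])
    features = []
    for f in filters:
        best = 0.0  # starting at 0 acts as the ReLU floor
        for r in range(rows - KERNEL + 1):
            for c in range(cols - KERNEL + 1):
                s = sum(
                    f[dr][dc][k] * volume[r + dr][c + dc][k]
                    for dr in range(KERNEL)
                    for dc in range(KERNEL)
                    for k in range(HIDDEN_DIM)
                )
                best = max(best, s)
        features.append(best)
    return features


def classify(table):
    feats = conv_max_pool(table_to_volume(table))
    logits = [sum(W_out[c][i] * feats[i] for i in range(NUM_FILTERS))
              for c in range(NUM_CLASSES)]
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]  # softmax over hypothetical table types


table = [["Name", "Population"],
         ["Tokyo", "13.9 million"],
         ["Osaka", "8.8 million"]]
volume = table_to_volume(table)
probs = classify(table)
```

Here `volume` has one `HIDDEN_DIM`-length vector per cell, so the 3x2 example table becomes a 3x2x3 volume that the convolution treats like a small multi-channel image. Because the weights are random, the values in `probs` carry no meaning beyond forming a valid distribution.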

Published

2017-02-10

How to Cite

Nishida, K., Sadamitsu, K., Higashinaka, R., & Matsuo, Y. (2017). Understanding the Semantic Structures of Tables with a Hybrid Deep Neural Network Architecture. Proceedings of the AAAI Conference on Artificial Intelligence, 31(1). https://doi.org/10.1609/aaai.v31i1.10484

Issue

Vol. 31 No. 1 (2017): Thirty-First AAAI Conference on Artificial Intelligence

Section

AAAI Technical Track: AI and the Web
