Detecting VoIP Data Streams: Approaches Using Hidden Representation Learning

Maya Kapoor; Michael Napolitano; Jonathan Quance; Thomas Moyer; Siddharth Krishnan

doi:10.1609/aaai.v37i13.26840

Authors

Maya Kapoor Parsons Corporation
Michael Napolitano Parsons Corporation
Jonathan Quance Parsons Corporation
Thomas Moyer University of North Carolina at Charlotte
Siddharth Krishnan University of North Carolina at Charlotte

DOI:

https://doi.org/10.1609/aaai.v37i13.26840

Keywords:

Deep Packet Inspection, Network Traffic Analysis, Density-based Clustering, Neural Networks, Network Security

Abstract

The use of voice-over-IP technology has rapidly expanded over the past several years, and has thus become a significant portion of traffic in the real, complex network environment. Deep packet inspection and middlebox technologies need to analyze call flows in order to perform network management, load-balancing, content monitoring, forensic analysis, and intelligence gathering. Because the session setup and management data can be sent on different ports or out of sync with VoIP call data over the Real-time Transport Protocol (RTP) with low latency, inspection software may miss calls or parts of calls. To solve this problem, we engineered two different deep learning models based on hidden representation learning. MAPLE, a matrix-based encoder which transforms packets into an image representation, uses convolutional neural networks to determine RTP packets from data flow. DATE is a density-analysis based tensor encoder which transforms packet data into a three-dimensional point cloud representation. We then perform density-based clustering over the point clouds as latent representations of the data, and classify packets as RTP or non-RTP based on their statistical clustering features. In this research, we show that these tools may allow a data collection and analysis pipeline to begin detecting and buffering RTP streams for later session association, solving the initial drop problem. MAPLE achieves over ninety-nine percent accuracy in RTP/non-RTP detection. The results of our experiments show that both models can not only classify RTP versus non-RTP packet streams, but could extend to other network traffic classification problems in real deployments of network analysis pipelines.

Detecting VoIP Data Streams: Approaches Using Hidden Representation Learning

Authors

DOI:

Keywords:

Abstract

Downloads

Published

How to Cite

Issue

Section

Information

Developed By

Subscription