On the Distribution of Traffic Volumes in the Internet and its Implications

Submitted by richard on Fri, 06/28/2019 - 10:23
Proc IEEE Infocom

This paper updates previous work on fitting traffic profiles. We use more modern statistical techniques to question (and refute) previous assumptions about heavy tails in statistics. In this case we believe that the best fit for traffic volume per unit time is the log-normal distribution. Tail distributions an have big impacts for capacity planning and for prediction of pricing (say 95th percentile).

Mobile Sensor Data Anonymization

Submitted by richard on Wed, 06/05/2019 - 09:42
ACM/IEEE International Conference on Internet-of-Things Design and Implementation

This paper looks the problem of releasing time-series data when privacy is a concern. It uses information theory to look at what extra information could "leak" if our device sends motion data. For example, can users be reidentified or can features such as height and weight be determined. A machine learning framework is given that can produce a tradeoff between allowing useful data to pass through while distorting the signal minimally to disguise information we wish to be private.

Building Distributed Temporal Graphs From Event Streams

Submitted by richard on Mon, 07/02/2018 - 15:49
Second Workshop on Advances in Mining Large-Scale Time Dependent Graphs (TD-LSG)

This paper describes the Raphtory system which is used to analysis large-scale time-varying graph systems. It can ingest streaming graph information and store the complete graph history. It enables queries to be made over the graphs at different points in that graph's history.

Faces in the Clouds: Long-Duration, Multi-User, Cloud-Assisted Video Conferencing

Submitted by richard on Mon, 02/27/2017 - 09:27
Transactions on Cloud Computing (to appear)

This paper is a simulation based study of cloud assisted multi-user video streaming. It is based upon two use cases (one related to video poker the other related to MOOCs). The paper looks at strategies for placing cloud locations to facilitate streaming using Amazon EC2 cloud locations. The paper compares a strategy that dynamically picks new locations for cloud hosts as time goes on. Interestingly this seems to provide little benefit compared with simply having a good initial choice of sites even when users may drop into and out of a cloud chat session over the course of many hours.