August 21, 2018
Overview of DEEM 2018 The ACM SIGMOD Second Workshop on Data Management for End-to-End Machine Learning (DEEM) was successfully held last June in Houston, TX. The goal of DEEM is to bring together researchers and practitioners at the intersection of applied machine learning (ML) and data management/systems research to discuss data management/systems issues in ML […]
Read moreJune 25, 2018
Information visualization is an essential tool in the arsenal of a data scientist: visualizations help identify trends and patterns, spot outliers and anomalies, and verify hypotheses. Moreover, visualizations are visceral and intuitive: they tell us stories about our data; they educate, delight, inform, enthrall, amaze, and clarify. This has led to the overwhelming popularity of […]
Read moreFebruary 14, 2018
The web is an ever-evolving source of information, with data and knowledge derived from it powering a great range of modern applications. Accompanying the huge wealth of information, web data also introduces numerous challenges due to its size, diversity, volatility, inaccuracy, and contradictions. This year’s WebDB 2018 theme emphasizes the challenges and opportunities that arise […]
Read moreApril 27, 2017
Google has recently announced that its flagship wide-area database named Spanner has been made available on the Google Cloud. Google Spanner is the next generation globally-distributed database built inside Google and announced to the world through the paper published in OSDI 2012 [1]. This article explores the implication of Google Spanner, in particular to the […]
Read moreOctober 6, 2016
Self-driving cars, ride-sharing service (e.g., Uber and Lyft), and Pokemon Go are just three examples of recent disruptive applications that gained huge market share and publicity. It is expected that each self-driving car will generate 2 PB of data per year, with 10 Million of such cars by 2020. Uber has 2+ Billion rides so […]
Read more