Archive for the Machine Learning category

Eugene Wu

Where Does Database Research Go From Here?

Databases, Machine Learning

The past few years of generative AI have upended research agendas across academia. Having just spent my sabbatical in the Bay Area, where the San Francisco fog is mixed with a tinge of forest fire and LLMs, I wanted to reflect on the role of the academic database research community within this sea change from the […]

Mahsa Baktash and Zi (Helen) Huang

A Leap from Model-Centric to Data Centric AI

Data Science, Machine Learning

Data, a major component of any deep learning solution, is often undervalued in ML projects, which results in lower-than-expected accuracy and requires hours and hours of model tuning. According to Andrew Ng, 99% of recent publications are model-centric, with only 1% being data-centric. He argues that there should be a balance between […]

Arun Kumar

Automation of Data Prep, ML, and Data Science: New Cure or Snake Oil?

AutoML, Data Preparation, Machine Learning

For almost 30 years, the DB / data management community has intensively studied the vexing pains of data integration, cleaning, and transformation. This research has largely been in the contexts of RDBMSs, SQL-oriented business intelligence (BI), and knowledge base construction. But as the emerging interdisciplinary field of Data Science gains prominence, the massive pain of […]

Yunyao Li and Shivakumar Vaithyanathan

Role of AI in Enterprise Applications

Analytics, Machine Learning

The recent return of AI summer and the enthusiastic uptake of AI in the commercial world can be loosely attributed to three innovations: Apple’s Siri, Google’s self-driving cars, and IBM Watson on Jeopardy!. This enthusiasm stems from the belief that AI will influence a wide range of applications across multiple industry segments. While such enthusiasm is, […]

Arun Kumar

ML/AI Systems and Applications: Is the SIGMOD/VLDB Community Losing Relevance?

Databases, Machine Learning

Overview of DEEM 2018: The ACM SIGMOD Second Workshop on Data Management for End-to-End Machine Learning (DEEM) was successfully held last June in Houston, TX. The goal of DEEM is to bring together researchers and practitioners at the intersection of applied machine learning (ML) and data management/systems research to discuss data management/systems issues in ML […]

