Machine Learning and Data Science at Cal

Rapid Response Research for COVID-19: ML and Data Science at Cal | Keynote Spark + AI Summit 2020

June 29, 2020

The Spark + AI Summit 2020 is the largest gathering bringing together data teams in the Apache Spark™ community. As described by the Spark + AI Summit host, Databricks, the "sessions, and training will cover data engineering and data science content, along with best practices for productionizing AI — keeping training data fresh with stream processing, quality monitoring, testing, and serving models at a massive scale."

The Division of Computing, Data Science, and Society's (CDSS) Associate Provost Jennifer Chayes was among a host of dynamic keynote speakers who presented at the virtual event held June 22-26, 2020. Berkeley Professor Hany Farid, Electrical Engineering and Computer Science (EECS), and the School of Information was also among the presenters to highlight the latest in big data, analytics, and AI.

Chayes describes her keynote as highlighting, "the overall vision of CDSS and how it is transforming education and research at UC Berkeley, building bridges across a diverse set of programs, and disrupting the traditional siloed university structure. The emergence of the COVID-19 pandemic has accelerated ramp-up of this new Division and the interdisciplinary research and collaboration it fosters. It also has highlighted the importance of delivering inclusive, rigorous data science education at scale, a hallmark of the Berkeley program. I will draw on examples from across campus of how computing and data are being used to address the pandemic, and how these challenges will stress the scale, performance, privacy, and resilience of the underlying data systems, driving the next generation of requirements for systems like Spark."