people working on a white board

Berkeley Data Stack

The Berkeley Data Stack is a collection of open source tools that help enable large-scale data science research and education efforts across UC Berkeley, including Jupyter Interactive computing notebooks, textbooks, and autograding tools.

people consulting

Data Peer Consulting

Students help peers on everything from using Jupyter notebooks to learning SQL on an appointment and drop-in basis.

people talking at computers

Data Science Education Workshops

Resources from our pedagogy workshops, including the academic resource kit, are available to support instructors in learning and incorporating data science approaches and teaching tools.

people working at computer

Data Scholars

Data Scholars addresses issues of underrepresentation in the data science community by establishing a community that is welcoming, educational, and empowering for underrepresented and nontraditional students.

people talking

Student Opportunities

Students are the heart of our program, helping to create new data science content and courses, support outreach and communications, and advance research.

stylized graph lines


Berkeley's core data science classes use interactive online textbooks that are available to all.

Overhead shot of open space and stairs with the text "Modules: Bringing data to every classroom"


Data science modules are short explorations into data science that give students the opportunity to work hands-on with a data set relevant to their course and receive some instruction on the principles of data analysis, statistics, and computing.

UC Berkeley's Campanile in the background with yellow ginkgo leaves in the foreground.


Berkeley Datahub refers to the campus infrastructure set up on Datahub is a tool that allows Berkeley’s data science program to utilize cloud computing infrastructure to deploy scalable resources that enable users to interact remotely with a standardized, common computing environment.