The Berkeley Data Stack is a collection of open-source tools that support large-scale data science research and education across UC Berkeley, including Jupyter interactive computing notebooks, textbooks, and autograding tools.
Resources from our pedagogy workshops, including the academic resource kit, are available to support instructors in learning and incorporating data science approaches and teaching tools.
Data Scholars addresses underrepresentation in data science by building a community that is welcoming, educational, and empowering for underrepresented and nontraditional students.
Data science modules are short explorations of data science that give students the opportunity to work hands-on with a data set relevant to their course and to receive instruction in the principles of data analysis, statistics, and computing.
Berkeley Datahub is the campus infrastructure hosted at datahub.berkeley.edu. It lets Berkeley’s data science program use cloud computing to deploy scalable resources, so that users can work remotely in a standardized, common computing environment.