Supporting Data Science - A Campus-Wide Workshop

Saturday, February 23, 2013, 8 am - 11 am
Banatao Auditorium, Sutardja Dai Hall (CITRIS building), UC Berkeley         (Directions here)

Click for agenda here                       Provide feedback here

Dramatic increases in the scale of data collection, analysis, and dissemination are fostering a revolution across the natural, mathematical, and social sciences. Progress in all these fields has come to depend on individuals and collaborative teams that combine domain expertise, computational knowledge, and statistical skills.

UC Berkeley recognizes the challenges that go along with the data science transition. These challenges go beyond the pathbreaking application of data science approaches. They raise key questions about how data science can be supported at Berkeley:

  • How can we best enable a new generation of scientists to realize the potential of this torrent of data and support their career trajectories?
  • How can we facilitate and reward the development of data science tools?
  • How do we understand the university’s role in data curation and security?
  • How do we advance modern data science with coherent curricula for undergraduate and graduate students across the natural, mathematical, and social sciences?

This workshop is organized by the Vice Chancellor for Research with assistance from a group of data science researchers across campus. It will inform a funding proposal in the very near future to develop new mechanisms to support data science at Berkeley.

We are seeking participation from faculty, research staff, postdocs, and others. We want to understand the opportunities and challenges of doing data science at Berkeley for individuals at all career stages.

Please come for as much of the workshop as you can. To register please click here.  If you cannot make it but want to send your thoughts, please provide your comments here. You can also contact



8:00 am - Welcome

8:20 am - Lightning talks to introduce campus programs (2-3 min each)

Speakers include:

  • Josh Bloom, Associate Professor, Astronomy; Director, Center for Time Domain Informatics
  • Charles Marshall, Professor, Integrative Biology; Director, UC Museum of Paleontology
  • Henry Brady, Goldman School of Public Policy
  • Fernando Perez, Associate Researcher, Henry H. Wheeler Jr. Brain Imaging Center
  • Ion Stoica, Professor, Electrical Engineering and Computer Science
  • Cathryn Carson, Associate Professor, History; Acting Director of Social Sciences Data Laboratory "D-Lab"
  • Bin Yu, Professor, Statistics
  • Dave Dineen, Ph.D. and Cyrus Afrasiab, Berkeley Phylogenomics Group (Kimmen Sjölander)
  • Lee Fleming, Professor, Industrial Engineering; Director of the Fung Institute for Engineering Leadership
  • Marti Hearst, Professor, School of Information
  • additional speakers TBD


9:00 am - Structured brainstorming on key issues/themes:
Parallel sessions:
- Data science challenges facing research teams
- Data management, archiving/curation, security
- Career paths inside the university (faculty and academic staff)

- New forms of organization on campus and across fields
- Software development ecosystem
- Education (grad and undergrad)

10:30 am - Closing remarks
(30 min)