UC Berkeley researchers have made groundbreaking contributions in mathematics, statistics, and computer science, including machine-learning and visualization techniques. These efforts are complemented by an array of data collection, management and curation services. The centers, institutes and programs highlighted below advance a wide range of data science methodologies:

 

Large-scale  interdisciplinary efforts 

AMPLab
 

AMPLab

Working at the intersection of three massive trends: powerful machine learning, cloud computing, and crowdsourcing, the AMPLab is integrating Algorithms, Machines, and People to make sense of Big Data. They are creating a new generation of analytics tools to answer deep questions over dirty and heterogeneous data by extending and fusing machine learning, warehouse-scale computing and human computation. They validate these ideas on real-world problems including participatory sensing, urban planning, and personalized medicine with their application and industrial partners.

Center for Information Technology Research in the Interest of Society (CITRIS)
 

Center for Information Technology Research in the Interest of Society (CITRIS)

The Center for Information Technology Research in the Interest of Society (CITRIS) creates information technology solutions for many of our most pressing social, environmental, and health care problems. CITIRS was created to “shorten the pipeline” between world-class laboratory research and the creation of start-ups, larger companies, and whole industries. CITRIS facilitates partnerships and collaborations among more than 300 faculty members and thousands of students from numerous departments at four University of California campuses with industrial researchers from over 60 corporations.

SDAV - Scalable Data Management, Analysis, and Visualization
 

 

SDAV - Scalable Data Management, Analysis, and Visualization

SDAV provides comprehensive expertise in scientific data management, analysis, and visualization aimed at transferring state of the art techniques into operational use by application scientists on leadership-class computing facilities.  It is a collaboration tapping the expertise of researchers at six laboratories and and in seven universities.

Simons Institute for the Theory of Computing
 
 

Simons Institute for the Theory of Computing

The Simons Institute for the Theory of Computing facilitates collaborative research in theoretical computer science. Established in July 2012 with support from the Simons Foundation, its goal is to bring together the world's leading researchers in theoretical computer science and related fields, as well as the next generation of outstanding young scholars, to explore deep unsolved problems about the nature and limits of computation.

 

Computer Science 

Computational Research at the Berkeley Lab
 

Computational Research at the Berkeley Lab

The Computational Research Division conducts research and development in mathematical modeling and simulation, algorithm design, data storage, management and analysis, computer system architecture and high-performance software implementation. They collaborate directly with scientists across the Berkeley Lab, the Department of Energy and industry to solve some of the world’s most challenging computational and data management and analysis problems in a broad range of scientific and engineering fields, including materials science, biology, climate modeling, astrophysics, fusion science, and many others.

 Electrical Engineering and Computer Sciences (EECS)

Electrical Engineering and Computer Sciences (EECS)

EECS offers research and instructional programs in electrical engineering and computer science. Their key strengths lie in the integration of fundamental theoretical ideas with practical applications, leading to a wide range of cross-disciplinary, collaborative projects. The integration of electrical engineering and computer science forms the core, with strong interactions that extend into biological sciences, mechanical and civil engineering, physical sciences, chemistry, mathematics, operations research, and more.

IPython
 

IPython 

Founded at UC Berkeley, IPython is an interactive shell for the Python programming language that offers enhanced introspection, additional shell syntax, tab completion and rich history.

National Energy Research Scientific Computing Center (NERSC)
 

National Energy Research Scientific Computing Center (NERSC)

The National Energy Research Scientific Computing Center (NERSC), a division of the Berkeley Lab, is the primary scientific computing facility for the Office of Science in the U.S. Department of Energy. As one of the largest facilities in the world devoted to providing computational resources and expertise for basic scientific research, NERSC is a world leader in accelerating scientific discovery through computation.

 

Statistics 

Department of Statistics
 

Department of Statistics

The Department of Statistics is engaged in research and education in probability and statistics. In addition to developing fundamental theory and methodology, they are actively involved in statistical problems that arise in such diverse fields as molecular biology, geophysics, astronomy, AIDS research, neurophysiology, sociology, political science, education, demography, and the U.S. Census.

 

Geospatial Data Analysis & Collection

 
Archaeological Research Facility (ARF)
 

 

Archaeological Research Facility (ARF)

The Archaeological Research Facility (ARF) encourages and carries out archaeological field and laboratory research conducted by UC Berkeley archaeologists and related specialists. As a field of research, archaeology is inherently interdisciplinary and collaborative; not only are there intimate research collaborations among natural scientists, social scientists and/or humanities scholars, but archaeology is practiced by scholars who expectedly hold faculty and/or research positions in a variety of departments, ranging from classics to earth and planetary science.

Electronic Cultural Atlas Initiative (ECAI)
 

Electronic Cultural Atlas Initiative (ECAI)

The Electronic Cultural Atlas Initiative, established in 1997 by Emeritus Prof. Lewis Lancaster of UC Berkeley, is a digital humanities initiative involving numerous professors and institutions around the world with the stated goal of creating a networked digital atlas by creating tools and setting standards for dynamic, digital maps.

Geographic Information Systems (GIS)
 

Geographic Information Systems (GIS)

The Geographic Information Systems at UC Berkeley coordinate GIS activities across campus, such as: classes, talks, workshops, jobs, experience.

Geospatial Innovation Facility (GIF)
 

Geospatial Innovation Facility (GIF)

The Geospatial Innovation Facility at UC Berkeley's College of Natural Resources provides leadership and training across a broad array of integrated mapping technologies, such as Remote Sensing, Geographic Information Systems (GIS), Global Positioning Systems (GPS), and modeling. The GIF offers innovative geospatial approaches to environmental research projects and grant opportunities.

UrbanSim
 

UrbanSim

UrbanSim is an open source urban simulation system designed by Paul Waddell at UC Berkeley and developed with numerous collaborators to support metropolitan land use, transportation, and environmental planning.

 

Data Collection and Digitalization 

California Digital Library
 

California Digital Library

In collaboration with the UC libraries and other partners, the California Digital Library has assembled one of the world’s largest digital research libraries and changed the way that faculty, students, and researchers discover and access information.

CollectionSpace
 

CollectionSpace

CollectionSpace is an open-source collections management application that meets the needs of museums, historical societies, and other collection-holding organizations. The Phoebe A. Hearst Museum of Anthropology uses CollectionSpace to manage and provide online access to its collection of more than 3.8 million cataloged objects of material culture from around the world. The University and Jepson Herbaria is working to expand CollectionSpace to support research-driven interoperability, including the aggregation of content for the Consortium of California Herbaria.

 

Visualization 

Berkeley Center for New Media (BCNM)
 

Berkeley Center for New Media (BCNM)

The Berkeley Center for New Media is a focal point for research and teaching about new media, led by a highly trans-disciplinary community of 120 affiliated faculty, advisors, and scholars, from 35 UC Berkeley departments. Their mission is to critically analyze and help shape developments in new media from cross-disciplinary and global perspectives that emphasize humanities and the public interest.

School of Information (ISchool)
 

School of Information (ISchool)

The School of Information is a graduate research and education community committed to expanding access to information and to improving its usability, reliability, and credibility while preserving security and privacy. This requires the insights of scholars from diverse fields such as, information and computer science, design, social sciences, management, law, and policy.

Visualization Group
 

Visualization Group

The Visualization Group aims to assist researchers in achieving their scientific goals – solving some of the world's most challenging problems in scientific data understanding – through visualization and analytics while simultaneously advancing the state-of-the-art in visualization through their own research.