Machine Learning for Science
June 1, 2020
Berkeley Lab data scientist Daniela Ushizima is exploring whether image recognition algorithms and a data analysis pipeline powered by machine learning can help accurately distinguish COVID-19 abnormalities in CT scans and chest X-rays from those of other, overlapping respiratory illnesses.
About Machine Learning at Berkeley Lab
Machine learning is a promising branch of artificial intelligence that Berkeley Lab scientists develop and employ in hundreds of projects every day. Our researchers track atomic particles, search for better battery materials, analyze traffic patterns, improve crop yields, pinpoint extreme weather in exascale climate simulations, and piece together metagenomic puzzles from billions of DNA fragments using tools, technology, and advanced mathematics, much of it developed by Berkeley Lab scientists.
A Powerful Scientific Tool
Machine learning is a branch of artificial intelligence that makes inferences from raw data using sophisticated algorithms and powerful computers. For online shoppers, that means better "you might also like..." suggestions. But for scientists, machine learning tools can reveal profound insights hiding in ballooning datasets.
Thanks to better instruments, including technologies developed at Berkeley Lab, we can see things at a microscopic and atomic scale, measure vibrations imperceptible to the human eye, and capture high-resolution images of objects millions of light-years away. But those instruments also produce vastly larger datasets than ever before. The Large Synoptic Survey Telescope (LSST) will produce 20 terabytes of data every night, about 60 petabytes over its lifetime. The Large Hadron Collider has already produced 900 petabytes of data (50 petabytes in 2018 alone) and expects to create another 500 petabytes by 2024. Conventional data analysis alone can't keep up.
Using machine learning techniques, models can be automatically derived from that data. These models can be used to identify features, reduce complexity, and control experiments.
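As a rough illustration of what "deriving a model from data" means, the sketch below fits a straight line to noisy synthetic measurements using ordinary least squares, one of the simplest learning methods. It is not drawn from any Berkeley Lab project: the function names `fit_line` and `make_synthetic_data`, the synthetic data, and the "true" relationship y = 2x + 1 are all assumptions made up for this example.

```python
import random


def fit_line(xs, ys):
    """Ordinary least-squares fit of y = slope * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Spread of x, and how x and y vary together.
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


def make_synthetic_data(n=200, seed=0):
    """Hypothetical 'instrument' data: y = 2x + 1 plus Gaussian noise."""
    rng = random.Random(seed)
    xs = [rng.uniform(0.0, 10.0) for _ in range(n)]
    ys = [2.0 * x + 1.0 + rng.gauss(0.0, 0.5) for x in xs]
    return xs, ys


xs, ys = make_synthetic_data()
slope, intercept = fit_line(xs, ys)
# The learned parameters should land close to the hidden values 2 and 1.
print(f"learned model: y = {slope:.2f} * x + {intercept:.2f}")
```

The two fitted numbers stand in for all 200 noisy points, which is the sense in which a learned model can reduce complexity. Scientific applications typically use far richer models, but the principle is the same.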
Math, Software & Tools to Spur Innovation
Berkeley Lab's research into machine learning builds on its foundational work in mathematics to develop methods that are consistent with physical laws, robust in the presence of noisy or biased data, and capable of being interpreted and explained in scientifically meaningful ways.
As a Department of Energy National Laboratory, we develop and share the algorithms, software, tools, and libraries that are foundational to scientific machine learning. We gather, organize, and store huge scientific datasets in areas such as materials, energy, environment, biology, genomics, and astronomy. And we develop tools and advanced networking facilities to make these datasets more searchable and accessible.