An international team of scientists, including from the University of Cambridge, have launched a new research collaboration that will leverage the same technology behind ChatGPT to build an AI-powered tool for scientific discovery.
The team launched the initiative, called Polymathic AI earlier this week, alongside the publication of a series of related papers on the arXiv open access repository.
“This will completely change how people use AI and machine learning in science,” said Polymathic AI principal investigator Shirley Ho, a group leader at the Flatiron Institute’s Center for Computational Astrophysics in New York City.
The idea behind Polymathic AI “is similar to how it’s easier to learn a new language when you already know five languages,” said Ho.
Starting with a large, pre-trained model, known as a foundation model, can be both faster and more accurate than building a scientific model from scratch. That can be true even if the training data isn’t obviously relevant to the problem at hand.
“It’s been difficult to carry out academic research on full-scale foundation models due to the scale of computing power required,” said co-investigator Miles Cranmer, from Cambridge’s Department of Applied Mathematics and Theoretical Physics and Institute of Astronomy. “Our collaboration with Simons Foundation has provided us with unique resources to start prototyping these models for use in basic science, which researchers around the world will be able to build from—it’s exciting.”
“Polymathic AI can show us commonalities and connections between different fields that might have been missed,” said co-investigator Siavash Golkar, a guest researcher at the Flatiron Institute’s Center for Computational Astrophysics.
“In previous centuries, some of the most influential scientists were polymaths with a wide-ranging grasp of different fields. This allowed them to see connections that helped them get inspiration for their work. With each scientific domain becoming more and more specialized, it is increasingly challenging to stay at the forefront of multiple fields. I think this is a place where AI can help us by aggregating information from many disciplines.”
“Despite rapid progress of machine learning in recent years in various scientific fields, in almost all cases, machine learning solutions are developed for specific use cases and trained on some very specific data,” said co-investigator Francois Lanusse, a cosmologist at the Center national de la recherche scientifique (CNRS) in France.
“This creates boundaries both within and between disciplines, meaning that scientists using AI for their research do not benefit from information that may exist, but in a different format, or in a different field entirely.”
Polymathic AI’s project will learn using data from diverse sources across physics and astrophysics (and eventually fields such as chemistry and genomics, its creators say) and apply that multidisciplinary savvy to a wide range of scientific problems. The project will “connect many seemingly disparate subfields into something greater than the sum of their parts,” said project member Mariel Pettee, a postdoctoral researcher at Lawrence Berkeley National Laboratory.
“How far we can make these jumps between disciplines is unclear,” said Ho. “That’s what we want to do—to try and make it happen.”
ChatGPT has well-known limitations when it comes to accuracy (for instance, the chatbot says 2,023 times 1,234 is 2,497,582 rather than the correct answer of 2,496,382). Polymathic AI’s project will avoid many of those pitfalls, Ho said, by treating numbers as actual numbers, not just characters on the same level as letters and punctuation. The training data will also use real scientific datasets that capture the physics underlying the cosmos.
Transparency and openness are a big part of the project, Ho said. “We want to make everything public. We want to democratize AI for science in such a way that, in a few years, we’ll be able to serve a pre-trained model to the community that can help improve scientific analyses across a wide variety of problems and domains.”
More information: Michael McCabe et al, Multiple Physics Pretraining for Physical Surrogate Models, arXiv (2023). DOI: 10.48550/arxiv.2310.02994
Siavash Golkar et al, xVal: A Continuous Number Encoding for Large Language Models, arXiv (2023). DOI: 10.48550/arxiv.2310.02989
Francois Lanusse et al, AstroCLIP: Cross-Modal Pre-Training for Astronomical Foundation Models, arXiv (2023). DOI: 10.48550/arxiv.2310.03024
News
According to Researchers, Your Breathing Patterns Could Hold the Key to Better Memory
Breathing synchronizes brain waves that support memory consolidation. A new study from Northwestern Medicine reports that, much like a conductor harmonizes various instruments in an orchestra to create a symphony, breathing synchronizes hippocampal brain waves to [...]
The Hidden Culprit Behind Alzheimer’s Revealed: Microglia Under the Microscope
Researchers at the CUNY Graduate Center have made a groundbreaking discovery in Alzheimer’s disease research, identifying a critical link between cellular stress in the brain and disease progression. Their study focuses on microglia, the brain’s immune [...]
“Mirror Bacteria” Warning: A New Kind of Life Could Pose a Global Threat
Mirror life, a concept involving synthetic organisms with reversed molecular structures, carries significant risks despite its potential for medical advancements. Experts warn that mirror bacteria could escape natural biological controls, potentially evolving to exploit [...]
Lingering Viral Fragments: The Hidden Cause of Long COVID
Long COVID, affecting 5-10% of COVID-19 patients, might be caused by the enduring presence of the virus in the body. Research suggests that viral fragments, possibly live, linger and lead to symptoms. Addressing this involves antiviral treatments, enhanced [...]
Hidden Scars: How COVID Lockdowns Altered Teen Brains Forever
Research from the University of Washington revealed that COVID-19 lockdowns led to accelerated cortical thinning in adolescents, impacting brain development significantly. This effect was more pronounced in females than males, raising concerns about long-term brain health. The study [...]
Simple Blood Test To Detect Dementia Before Symptoms Appear
UCLA researchers have identified placental growth factor (PlGF) as a potential blood biomarker for early detection of cognitive impairment and dementia. High PlGF levels correlate with increased vascular permeability, suggesting its role in the development [...]
Investing Goldman Sachs asks ‘Is curing patients a sustainable business model?’
Goldman Sachs analysts attempted to address a touchy subject for biotech companies, especially those involved in the pioneering “gene therapy” treatment: cures could be bad for business in the long run. “Is curing patients [...]
The risks of reversed chirality: Study highlights dangers of mirror organisms
A groundbreaking study evaluates the feasibility, risks, and ethical considerations of creating mirror bacteria with reversed chirality, highlighting potential threats to health and ecosystems. In a recent study published in Science, a team of researchers [...]
Alarming Mutation in H5N1 Virus Raises Pandemic Red Flags
NIH-funded study concludes that the risk of human infection remains low A recent study published in Science and funded by the National Institutes of Health (NIH) has found that a single alteration in a protein on the surface [...]
Scientists Discover Genetic Changes Linked to Autism, Schizophrenia
The Tbx1 gene influences brain volume and social behavior in autism and schizophrenia, with its deficiency linked to amygdala shrinkage and impaired social incentive evaluation. A study published in Molecular Psychiatry has linked changes in brain [...]
How much permafrost will melt this century, and where will its carbon go?
Among the many things global warming will be melting this century—sea ice, land glaciers and tourist businesses in seaside towns across the world—is permafrost. Lying underneath 15% of the northern hemisphere, permafrost consists of [...]
A Physics Discovery So Strange It’s Changing Quantum Theory
MIT physicists surprised to discover electrons in pentalayer graphene can exhibit fractional charge. New theoretical research from MIT physicists explains how it could work, suggesting that electron interactions in confined two-dimensional spaces lead to novel quantum states, [...]
Inside the Nano-Universe: New 3D X-Ray Imaging Transforms Material Science
A cutting-edge X-ray method reveals the 3D orientation of nanoscale material structures, offering fresh insights into their functionality. Researchers at the Swiss Light Source (SLS) have developed a groundbreaking technique called X-ray linear dichroic orientation tomography [...]
X-chromosome study reveals hidden genetic links to Alzheimer’s disease
Despite decades of research, the X-chromosome’s impact on Alzheimer’s was largely ignored until now. Explore how seven newly discovered genetic loci could revolutionize our understanding of the disease. Conventional investigations of the genetic contributors [...]
The Unresolved Puzzle of Long COVID: 30% of Young People Still Suffer After Two Years
A UCL study found that 70% of young people with long Covid recovered within 24 months, but recovery was less likely among older teenagers, females, and those from deprived backgrounds. Researchers emphasized the need [...]
Needle-Free: New Nano-Vaccine Effective Against All COVID-19 Variants
A new nano-vaccine developed by TAU and the University of Lisbon offers a needle-free, room-temperature-storable solution against COVID-19, targeting all key variants effectively. Professor Ronit Satchi-Fainaro’s lab at Tel Aviv University’s Faculty of Medical and [...]