An international team of scientists, including from the University of Cambridge, have launched a new research collaboration that will leverage the same technology behind ChatGPT to build an AI-powered tool for scientific discovery.
The team launched the initiative, called Polymathic AI earlier this week, alongside the publication of a series of related papers on the arXiv open access repository.
“This will completely change how people use AI and machine learning in science,” said Polymathic AI principal investigator Shirley Ho, a group leader at the Flatiron Institute’s Center for Computational Astrophysics in New York City.
The idea behind Polymathic AI “is similar to how it’s easier to learn a new language when you already know five languages,” said Ho.
Starting with a large, pre-trained model, known as a foundation model, can be both faster and more accurate than building a scientific model from scratch. That can be true even if the training data isn’t obviously relevant to the problem at hand.
“It’s been difficult to carry out academic research on full-scale foundation models due to the scale of computing power required,” said co-investigator Miles Cranmer, from Cambridge’s Department of Applied Mathematics and Theoretical Physics and Institute of Astronomy. “Our collaboration with Simons Foundation has provided us with unique resources to start prototyping these models for use in basic science, which researchers around the world will be able to build from—it’s exciting.”
“Polymathic AI can show us commonalities and connections between different fields that might have been missed,” said co-investigator Siavash Golkar, a guest researcher at the Flatiron Institute’s Center for Computational Astrophysics.
“In previous centuries, some of the most influential scientists were polymaths with a wide-ranging grasp of different fields. This allowed them to see connections that helped them get inspiration for their work. With each scientific domain becoming more and more specialized, it is increasingly challenging to stay at the forefront of multiple fields. I think this is a place where AI can help us by aggregating information from many disciplines.”
“Despite rapid progress of machine learning in recent years in various scientific fields, in almost all cases, machine learning solutions are developed for specific use cases and trained on some very specific data,” said co-investigator Francois Lanusse, a cosmologist at the Center national de la recherche scientifique (CNRS) in France.
“This creates boundaries both within and between disciplines, meaning that scientists using AI for their research do not benefit from information that may exist, but in a different format, or in a different field entirely.”
Polymathic AI’s project will learn using data from diverse sources across physics and astrophysics (and eventually fields such as chemistry and genomics, its creators say) and apply that multidisciplinary savvy to a wide range of scientific problems. The project will “connect many seemingly disparate subfields into something greater than the sum of their parts,” said project member Mariel Pettee, a postdoctoral researcher at Lawrence Berkeley National Laboratory.
“How far we can make these jumps between disciplines is unclear,” said Ho. “That’s what we want to do—to try and make it happen.”
ChatGPT has well-known limitations when it comes to accuracy (for instance, the chatbot says 2,023 times 1,234 is 2,497,582 rather than the correct answer of 2,496,382). Polymathic AI’s project will avoid many of those pitfalls, Ho said, by treating numbers as actual numbers, not just characters on the same level as letters and punctuation. The training data will also use real scientific datasets that capture the physics underlying the cosmos.
Transparency and openness are a big part of the project, Ho said. “We want to make everything public. We want to democratize AI for science in such a way that, in a few years, we’ll be able to serve a pre-trained model to the community that can help improve scientific analyses across a wide variety of problems and domains.”
More information: Michael McCabe et al, Multiple Physics Pretraining for Physical Surrogate Models, arXiv (2023). DOI: 10.48550/arxiv.2310.02994
Siavash Golkar et al, xVal: A Continuous Number Encoding for Large Language Models, arXiv (2023). DOI: 10.48550/arxiv.2310.02989
Francois Lanusse et al, AstroCLIP: Cross-Modal Pre-Training for Astronomical Foundation Models, arXiv (2023). DOI: 10.48550/arxiv.2310.03024
News
Ethics in Nanomedicine: Key Issues and Principles
Nanomedicine, a branch of nanotechnology, is revolutionizing healthcare by enabling the manipulation of materials at the nanoscale to diagnose, treat, and prevent diseases. Unlike traditional treatments, nanoparticles (NPs) are highly precise in targeting diseased [...]
A call for robust H5N1 influenza preparedness and response
As the global threat of H5N1 influenza looms with outbreaks across species and continents including the U.S., three international vaccine and public health experts say it is time to fully resource and support a [...]
Mucosal COVID-19 boosters outperform mRNA shots in preventing upper airway infections
In a recent study published in Nature Immunology, a team of researchers from the United States used non-human primate models to compare the protection conferred by an intramuscular booster dose of the bivalent messenger ribonucleic acid [...]
How Space Travel Really Changes Astronauts – From the Inside Out
International team reveals previously unknown effects on physiology that could shape the future of long-duration space missions. Researchers have discovered significant changes in the gut microbiome due to spaceflight, which affects host physiology and [...]
Breakthrough in blood stem cell development offers hope for leukemia and bone marrow failure
Melbourne researchers have made a world first breakthrough into creating blood stem cells that closely resemble those in the human body. And the discovery could soon lead to personalized treatments for children with leukemia [...]
Scientists Develop Game-Changing Needle-Free COVID-19 Intranasal Vaccine
A new mucosal COVID-19 vaccine poised to revolutionize the delivery process is especially beneficial for those with a fear of needles. A next-generation COVID-19 mucosal vaccine is set to be a game-changer not only when delivering [...]
Scientists Develop All-in-One Solution To Catch and Destroy “Forever Chemicals”
A new water treatment system developed by UBC researchers efficiently removes and destroys PFAS pollutants using a dual-action catalyst, offering a sustainable and cost-effective solution for water purification challenges. Chemical engineers at the University of [...]
New method accelerates drug discovery from years to months
Researchers from the University of Cincinnati College of Medicine and Cincinnati Children's Hospital have found a new method to increase both speed and success rates in drug discovery. The study, published Aug. 30 in [...]
A new smart mask analyzes your breath to monitor your health
Your breath can give away a lot about you. Each exhalation contains all sorts of compounds, including possible biomarkers for disease or lung conditions, that could give doctors a valuable insight into your health. [...]
Study reveals the role of blood clotting in COVID-19
In a study that reshapes what we know about COVID-19 and its most perplexing symptoms, scientists have discovered that the blood coagulation protein fibrin causes the unusual clotting and inflammation that have become hallmarks [...]
A Novel Cancer Vaccine Combining Nano-11 and ADU-S100
In a recent article published in npj Vaccines, researchers detailed the development of a novel cancer vaccine that combines a plant-derived nanoparticle adjuvant, Nano-11, with a clinically tested STING agonist, ADU-S100. The primary objective was [...]
AI spots cancer and viral infections with nanoscale precision
Researchers have developed an artificial intelligence which can differentiate cancer cells from normal cells, as well as detect the very early stages of viral infection inside cells. The findings, published today in a study [...]
Tiny shards of plastic are increasingly infiltrating our brains, study says
Human brain samples collected at autopsy in early 2024 contained more tiny shards of plastic than samples collected eight years prior, according to a preprint posted online in May. A preprint is a study which has not yet [...]
Scientists Have Discovered Strange DNA in Our Brains – and It Could Be Shortening Our Lives
According to the research, these mitochondrial DNA insertions could be linked to early death. Mitochondria in brain cells frequently insert their DNA into the nucleus, potentially impacting lifespan, as those with more insertions were found to [...]
Watch Out After a Hospital Stay: You Could Be Exposing Your Family to Superbugs
Research indicates hospitals contribute to the local spread of antibiotic-resistant infections. A recent study published in the journal Infection Control & Hospital Epidemiology by the Society for Healthcare Epidemiology of America suggests that family members of [...]
Molecular Trickery: How COVID-19 Silently Sabotages the Human Immune System
Researchers have discovered that SARS-CoV-2 manipulates the human immune system by forcing cells to produce non-functional proteins, hindering the body’s antiviral defenses. This groundbreaking study by teams from prestigious Brazilian universities highlights potential targets for new COVID-19 treatments, [...]