Interactive symbolic regression with co-design mechanism through offline reinforcement learning

Symbolic Regression holds great potential for uncovering underlying mathematical and physical relationships from observed data. However, the vast combinatorial space of possible expressions poses significant challenges for previous online search methods and pre-trained transformer models, which most...

Full description

Saved in:
Bibliographic Details
Published in:Nature communications Vol. 16; no. 1; pp. 3930 - 12
Main Authors: Tian, Yuan, Zhou, Wenqi, Viscione, Michele, Dong, Hao, Kammer, David S., Fink, Olga
Format: Journal Article
Language:English
Published: London Nature Publishing Group UK 26.04.2025
Nature Publishing Group
Nature Portfolio
Subjects:
ISSN:2041-1723, 2041-1723
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Symbolic Regression holds great potential for uncovering underlying mathematical and physical relationships from observed data. However, the vast combinatorial space of possible expressions poses significant challenges for previous online search methods and pre-trained transformer models, which mostly do not consider the integration of domain experts’ prior knowledge. To address these challenges, we propose the Symbolic Q-network, an advanced interactive framework for large-scale symbolic regression. Unlike previous transformer-based SR approaches, Symbolic Q-network leverages reinforcement learning without relying on a transformer-based decoder. Furthermore, we propose a co-design mechanism, where the Symbolic Q-network facilitates effective interaction with domain experts at any stage of the equation discovery process. Our extensive experiments demonstrate Sym-Q performs comparably to existing pretrained models across multiple benchmarks. Furthermore, our experiments on real-world cases demonstrate that the interactive co-design mechanism significantly enhances Symbolic Q-network’s performance, achieving greater performance gains than standard autoregressive models. Symbolic Regression is a method for uncovering mathematical and physical relationships from data. Here, the authors propose Sym-Q, an interactive reinforcement learning-based framework with a co-design mechanism that enables efficient and guided equation discovery.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
content type line 23
ISSN:2041-1723
2041-1723
DOI:10.1038/s41467-025-59288-y