基于Kolmogorov-Arnold Networks的Grad-Shafranov方程自由边界问题求解方法

张津瑞; 栾其斌; 丁思琪; 刘天源; 顾翔; 王正汹

doi:10.7498/aps.75.20260331

摘要
等离子体平衡计算是磁约束核聚变理论研究、装置设计和实时控制的核心问题之一。为克服Grad-Shafranov方程自由边界问题的迭代求解成本高、现有机器学习方法泛化能力弱的局限，本文提出了基于Kolmogorov-Arnold Network的快速求解方法。工作基于Shape Editor自由边界求解模块构建拉丁超立方采样数据集，并在模型中引入方程残差约束和等离子体电流约束。模型还在损失函数中引入B样条基函数系数L1正则化项和熵正则化项，并采取剪枝策略简化网络结构。训练完成后通过符号回归将模型表达为显式数学形式，磁通预测决定系数可达到0.995342。基于保持边界点一阶偏导数不变的线性外推方法，本文将符号回归结果外推至训练样本空间外，并通过比较外推的磁通分布和限制器边界的位置关系实现了89.1%的偏滤器与限制器位型判断准确率。该求解方法单次计算时间由传统求解器的分钟级缩短至毫秒级，且存储占用空间显著降低。

关键词:
Grad-Shafranov方程 /

Kolmogorov-Arnold Network /

物理信息神经网络 /

线性外推
Abstract
The computation of plasma equilibrium is essential for theoretical research, physical design, and real-time control of magnetic confinement fusion devices. To overcome the significant computational burdens of traditional iteration-based Grad-Shafranov (G-S) solvers and the inherent extrapolation limitations of existing "black-box" data-driven models, this work proposes an explicit, highly efficient, and physics-consistent numerical solver based on the Kolmogorov-Arnold Network (KAN). Using the free-boundary module of Shape Editor (SE), a high-quality dataset comprising 9,292 valid divertor configurations was constructed within a seven-dimensional parameter space via Latin Hypercube Sampling. To ensure physical consistency, a semi-supervised learning framework was developed by embedding G-S equation residuals and plasma current conservation constraints into the loss function. To address the spatial discontinuity of current density at coil boundaries, poloidal field coil currents were superimposed as known source terms rather than being directly predicted. Furthermore, a pruning method utilizing L1 regularization and entropy across all KAN layers was employed to sparsify the network, enabling the transformation of the trained model into explicit analytical expressions via symbolic regression. The symbolic model achieves a high determination coefficient (R²) of 0.9953 for magnetic flux prediction, and error correlation analysis confirms the numerical stability of the symbolic extraction. A comparative analysis with traditional Multilayer Perceptrons further demonstrates the superiority of the KAN architecture in terms of accuracy and interpretability. Based on a linear extrapolation method utilizing first-order gradient continuity, the model's input parameter range is extended beyond the training space. This extrapolated magnetic flux distribution enables the identification of plasma configurations through its positional relationship with the limiter. Validation on 1,000 out-of-distribution test samples yields an overall classification accuracy of 89.1% in distinguishing between divertor and limiter configurations, with a divertor recall rate of 98.9%. A detailed analysis of the misclassified samples reveals the underlying factors affecting extrapolation performance and further delineates the boundary of the model's applicability. Finally, performance benchmarks indicate that the proposed solver reduces single-step computation time from tens of seconds to the millisecond level, with a memory footprint of approximately 10.25 MB. This work provides an efficient and interpretable tool for rapid parameter scans and operational window analysis in tokamaks.

Keywords:
Grad-Shafranov equation /

Kolmogorov-Arnold Network /

Physics-Informed Neural Networks /

Linear extrapolation
施引文献

搜索

基于Kolmogorov-Arnold Networks的Grad-Shafranov方程自由边界问题求解方法

A Solution Method for the Free-Boundary Grad-Shafranov Equation Based on Kolmogorov-Arnold Networks

摘要

Abstract

施引文献

目录