Hostname: page-component-6766d58669-kl59c Total loading time: 0 Render date: 2026-05-14T19:00:06.879Z Has data issue: false hasContentIssue false

Towards fast and accurate predictions of radio frequency power deposition and current profile via data-driven modelling: applications to lower hybrid current drive

Published online by Cambridge University Press:  18 August 2022

G.M. Wallace*
Affiliation:
MIT Plasma Science and Fusion Center, Cambridge, MA 02139, USA
Z. Bai
Affiliation:
Lawrence Berkeley National Laboratory, Computing Sciences Area, Berkeley, CA 94720, USA
R. Sadre
Affiliation:
Lawrence Berkeley National Laboratory, Computing Sciences Area, Berkeley, CA 94720, USA
T. Perciano
Affiliation:
Lawrence Berkeley National Laboratory, Computing Sciences Area, Berkeley, CA 94720, USA
N. Bertelli
Affiliation:
Princeton Plasma Physics Laboratory, Princeton, NJ 08540, USA
S. Shiraiwa
Affiliation:
Princeton Plasma Physics Laboratory, Princeton, NJ 08540, USA
E.W. Bethel
Affiliation:
Lawrence Berkeley National Laboratory, Computing Sciences Area, Berkeley, CA 94720, USA Dept. of Computer Science, San Francisco State University, 1600 Holloway Ave., San Francisco, CA 94132, USA
J.C. Wright
Affiliation:
MIT Plasma Science and Fusion Center, Cambridge, MA 02139, USA
*
Email address for correspondence: wallaceg@mit.edu
Rights & Permissions [Opens in a new window]

Abstract

Three machine learning techniques (multilayer perceptron, random forest and Gaussian process) provide fast surrogate models for lower hybrid current drive (LHCD) simulations. A single GENRAY/CQL3D simulation without radial diffusion of fast electrons requires several minutes of wall-clock time to complete, which is acceptable for many purposes, but too slow for integrated modelling and real-time control applications. The machine learning models use a database of more than 16 000 GENRAY/CQL3D simulations for training, validation and testing. Latin hypercube sampling methods ensure that the database covers the range of nine input parameters ($n_{e0}$, $T_{e0}$, $I_p$, $B_t$, $R_0$, $n_{\|}$, $Z_{{\rm eff}}$, $V_{{\rm loop}}$ and $P_{{\rm LHCD}}$) with sufficient density in all regions of parameter space. The surrogate models reduce the inference time from minutes to $\sim$ms with high accuracy across the input parameter space.

Information

Type
Research Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
Copyright © The Author(s), 2022. Published by Cambridge University Press
Figure 0

Figure 1. Wave equation solvers (TORIC+Petra-M or GENRAY) couple to FP equation solvers (CQL3D) through the quasi-linear diffusion operator, $D_{QL}$. The FP solver perturbs the distribution function and couples information on quasi-linear RF power absorption, $P_{abs}$, back to the wave solver. This process iterates until the solutions converge. Rough per-iteration wall clock runtimes for each code are indicated in the figure. TORIC+Petra-M and GENRAY figures reproduced from Shiraiwa et al. (2017) and Wallace et al. (2010), respectively.

Figure 1

Table 1. Parameters varied in the GENRAY/CQL3D database.

Figure 2

Table 2. Best hyperparameters obtained for the MLP model.

Figure 3

Figure 2. Hyperparameter tuning of the random forest regression model for the both current and power profiles: (a,c) cross-validated MSE versus the number of estimators using the maximum tree depth of 100, 1000 and 2000; (b,d) cross-validated MSE versus number of estimators using the minimum samples split of 2, 3 and 4.

Figure 4

Figure 3. Results from the hyperparameter optimisation for the GPR models for both current profile and power deposition. For the current profile the Matern 32 kernel obtained superior results. In the case of power deposition the Exponential kernel performed the best, although the results are very similar among the different set of parameters and kernels.

Figure 5

Figure 4. Prediction results for current profiles: (a,d,g) MLP, (b,e,h) RFR and (cf,i) GPR. (ac) Good fits, (df) average fitsand (gi) poor fits representations for the same test data using the three ML models.

Figure 6

Figure 5. Prediction results for power deposition: (a,d,g) MLP, (b,e,h) RFR and (cf,i) GPR. (ac) Good fits, (df) average fits and (gi) poor fits representations for the same test data using the three ML models.

Figure 7

Figure 6. Histograms of the MSE in MLP, RFR and GPR ML models for (a) current and (b) power deposition.

Figure 8

Figure 7. Pairwise comparisons of the entire MSE of current prediction among the MLP, RFR and GPR ML models for the inference records. The three representative good, average and poor test instances as in figure 4 are highlighted.

Figure 9

Figure 8. Pairwise comparisons of the entire MSE of power prediction among the MLP, RFR and GPR ML models for the inference records. The three representative good, average and poor test instances as in figure 5 are highlighted.

Figure 10

Figure 9. (a) Inferred current density profile using the MLP surrogate model for four well-diagnosed EAST discharges. (b) Current profiles for the same set of discharges published in a study by Garofalo. Data in subfigure (b) reproduced from figure 11(b) of Garofalo et al. (2017). The orange–red–blue–green legend applies to both figures.

Figure 11

Table 3. Evaluation of our three ML models using MSE. For the five-fold cross-validation (CV) process we present the mean ($\mu$) and standard deviation ($\sigma$) of the MSE across all folds. The second column presents the prediction results of each final model trained using the full training data.

Figure 12

Table 4. Timing requirements. The first column reports the total execution time needed to run the entire five-fold cross-validation (CV). The second column reports the time necessary to train each final model along with per sample inference time.