Research on plasma vertical displacement calculation based on neural network

H.H. Song; B. Shen; Q.P. Yuan; B.H. Guo; Y.H. Wang; D.L. Chen; R.R. Zhang; B.J. Xiao

doi:10.1017/S0022377822001015

Research on plasma vertical displacement calculation based on neural network

Part of: Machine Learning for Plasma Physics and Fusion Energy

Published online by Cambridge University Press: 17 January 2023

H.H. Song

B. Shen

Q.P. Yuan ,

B.H. Guo ,

Y.H. Wang ,

D.L. Chen ,

R.R. Zhang and

B.J. Xiao

Show author details

H.H. Song: Affiliation:
Institute of Plasma Physics, Chinese Academy of Sciences, PO BOX 1126, Hefei 230031, PR China University of Science and Technology of China, Hefei 230031, PR China
B. Shen*: Affiliation:
Institute of Plasma Physics, Chinese Academy of Sciences, PO BOX 1126, Hefei 230031, PR China
Q.P. Yuan*: Affiliation:
Institute of Plasma Physics, Chinese Academy of Sciences, PO BOX 1126, Hefei 230031, PR China
B.H. Guo: Affiliation:
College of Physics and Optoelectronic Engineering, Shenzhen University, Shenzhen 518060, PR China
Y.H. Wang: Affiliation:
Institute of Plasma Physics, Chinese Academy of Sciences, PO BOX 1126, Hefei 230031, PR China
D.L. Chen: Affiliation:
Institute of Plasma Physics, Chinese Academy of Sciences, PO BOX 1126, Hefei 230031, PR China
R.R. Zhang: Affiliation:
Institute of Plasma Physics, Chinese Academy of Sciences, PO BOX 1126, Hefei 230031, PR China
B.J. Xiao: Affiliation:
Institute of Plasma Physics, Chinese Academy of Sciences, PO BOX 1126, Hefei 230031, PR China
*: † Email addresses for correspondence: biaoshen@ipp.ac.cn, qpyuan@ipp.ac.cn
† Email addresses for correspondence: biaoshen@ipp.ac.cn, qpyuan@ipp.ac.cn

Article contents

Abstract
Introduction
Database construction
Construction of neural network
Model training and testing
Conclusion
Funding
Declaration of interests
References

Rights & Permissions

Abstract

Plasma vertical displacement control is essential for the stable operation of tokamak devices. The traditional plasma vertical displacement calculation method is not suitable for balancing speed and accuracy simultaneously, which is necessary for real-time feedback control. In this study, neural networks are used to rapidly detect vertical displacement recognition. Based on a fully connected neural network, the vertical displacement calculation model is trained and tested using magnetic data of approximately 2000 shots. To compare the effects of different inputs on vertical displacement calculation, different magnetic measurement diagnostic signals are used to train and test the model. Compared with a full magnetic measurement dataset, 39 magnetic measurement signals (38 magnetic probes and plasma current) show better accuracy with mean square error <0.0005. The model is tested using historical experimental data, and it demonstrates accurate vertical displacement calculation even in the case of a vertical displacement event. In general, neural network algorithm has great application potential in vertical displacement calculation.

Keywords

neural network tokamak vertical displacement magnetic measurement signals

Type: Research Article
Information: Journal of Plasma Physics , Volume 89 , Issue 1 , February 2023 , 895890104

DOI: https://doi.org/10.1017/S0022377822001015 [Opens in a new window]

NASA ADS Abstract Service [Opens in a new window]
Copyright: Copyright © The Author(s), 2023. Published by Cambridge University Press

1. Introduction

In contemporary tokamak devices, the method of plasma cross-section extension is typically used to improve the confinement performance of plasma for obtaining high-parameter plasma; this method is also applicable for future tokamak devices (Liu Reference Liu2015). For the elongated plasma equilibrium configuration, there is a natural axisymmetric vertical instability, and it is one of the main reasons causing large disruption. Feedback control of plasma vertical instability is an effective method to achieve stable operation of tokamak plasma (Wang Reference Wang2018). If the vertical displacement control can be detected more quickly and accurately, the adverse effect on discharge can be effectively reduced, and plasma with better performance can be obtained. To reduce the adverse effects of vertical instability, a four-channel digital signal processing controller is used in the TCV device in Switzerland for controlling the vertical displacement of plasma at a frequency of 200 kHz. ASDEX-U in Germany is equipped with a passive feedback system, which significantly mitigates the growth rate of vertical displacement (Gruber, Lackner & Pautasso Reference Gruber, Lackner and Pautasso1993). A passive plate is used in the vacuum chamber of the KSTAR device in Korea to increase the passive control ability of the device for ensuring vertical instability (Jhanga et al. Reference Jhanga, Kesselb, Pomphrey, Jardin, Lee and Chang1999). At the JET, an extreme shape controller is used to enhance shape control during the transient phases of plasma current ramp-up and ramp-down; moreover, a current limit avoidance system is implemented to avoid saturation of the actuators with these transient phases (De Tommasi et al. Reference De Tommasi2014). NSTX-U, KSTAR and other similar devices have attempted to implement real-time methods for estimating the vertical displacement velocity using magnetic flux loops; based on the vertical displacement velocity, a good control effect has been obtained for these devices (Pomphrey, Jardin & Ward Reference Pomphrey, Jardin and Ward1989; Ward & Hofmann Reference Ward and Hofmann1994; Gates et al. Reference Gates, Kessel and Menard2006). In DIII-D, a neural network is used to receive the equilibrium data from RTEFIT for estimating the vertical displacement growth rate, and then proportional–integral–differential control is used for adjusting the plasma configuration in the plasma control system (PCS) (Sammuli, Barr & Humphreys Reference Sammuli, Barr and Humphreys2021).

Plasma vertical displacement cannot be measured directly; the plasma equilibrium reconstruction code EFIT (Huang Reference Huang2017) uses external diagnostic measurement data to reconstruct the actual plasma configurations and solve the Grad–Shafranov equation in the vacuum region. Plasma shape control parameters including vertical displacement are provided; however, the EFIT calculation cycle is at the millisecond level, which cannot meet the requirements of vertical displacement fast control. The vertical displacement control system on EAST is used under the condition of a certain equilibrium; based on the rigid model RZIP, a matrix is deduced to estimate the vertical displacement (Yuan et al. Reference Yuan, Xiao and Luo2013). The PCS (Qiping Reference Qiping2009) calculates the corresponding control command; then, it transmits the command to a fast control power supply, and finally, the power supply realises the control. The EAST device in China has fast vertical displacement control with a 100 μs control cycle (Liu Reference Liu2015; Huihui & Qiping Reference Huihui and Qiping2020). When the actual plasma configuration is greatly different from the assumed equilibrium configuration, the vertical displacement estimated by the matrix deviates, resulting in an unsatisfactory control effect. In recent years, with rapid advances in neural networks, increasing numbers of researchers have realised that neural network tools may provide unique advantages in research on controlled fusion. On HL-2A, a hybrid neural network model based on bidirection long short-term memory network and long short-term memory network is used for plasma position prediction, resulting in excellent prediction (Yang et al. Reference Yang, Liu, Song and Li2020). In this study, neural networks are used to explore the advantages of vertical displacement control.

Neural networks are used to fit the vertical displacement, remove the restrictions of simplified physical models, establish a new mapping relationship between electromagnetic measurement signals and vertical displacement, and provide a new calculation method for vertical displacement. To calculate the vertical displacement using a neural network, a vertical displacement database is first established; thereafter, an appropriate neural network model is chosen for training and testing after analysing the test results.

The rest of this paper comprises four parts. Section 2 describes the composition and processing of the database according to the research steps. The principle and structural parameters of the neural networks are introduced in § 3. The test results of the models are introduced, and the advantages and disadvantages of different models are compared in § 4. Finally, a brief conclusion is presented in § 5.

2. Database construction

At EAST, the centre of gravity of the current is different in all standard divertor configurations, including double null, upper single null and lower single null (Yuan et al. Reference Yuan, Xiao and Luo2013). To ensure model accuracy, the database should contain sufficient sample data for all the different configurations. Further, one single shot on EAST contains several diagnosis signals (Guo, Chen & Shen Reference Guo, Chen and Shen2021; Guo, Shen & Chen Reference Guo, Shen and Chen2021). Of these, only 88 vertical displacement control signals are chosen, containing 14 poloidal field (PF) coil current signals, 35 magnetic flux (FL) loops, 38 magnetic probes and plasma current. These signals are closely related to the vertical displacement. The EAST device interface diagram and cross-section of probe position are shown in figure 1.

Figure 1. EAST device interface diagram.

The vertical displacement calculated using EFIT is selected as the target of the network model; 2000 shots covering EAST shot numbers from 97 500 to 99 500 in the MDSplus database are analysed. The time scale and sampling rate of signal data collected by PCS are different from those of EFIT. Therefore, all data were first aligned chronologically. The signals collected by PCS were re-sampled according to the time axis of EFIT. Second, different collected signals represent different physical meanings, and the order of magnitude of data is not the same, and it requires standardised data processing. Typical standardisation methods include Z-score standardisation, logistic standardisation and maximum–minimum values standardisation. Z-score standardisation is applicable for situations where the maximum and minimum values of unknown data and the sample distribution are very discrete, and the logistic standardised sample index data are relatively concentrated on both sides of 0. Considering the actual demand, the normalisation method of maximum and minimum values can arrange all data chronologically between [0, 1]. Numerical problems caused by orders-of-magnitude differences are avoided. The data were then shuffled and divided into training and test sets.

To explore the influence of different magnetic measurement signals on vertical displacement, input data in the database are divided into two groups after data processing. In one group, all magnetic measurement signals are used as the input data of training, and in the other group, magnetic probe signals are selected as the input data of training. The output data are all vertical displacement values calculated by EFIT. For these two sets of data, models are established for training.

3. Construction of neural network

A fully connected neural network is one of the most widely used neural network models. It comprises three parts, namely an input layer, a hidden layer and an output layer (Hu & Zeng Reference Hu and Zeng2009). The input layer of the fully connected neural network is equivalent to the stimulus given by the outside world, and it is considered as the input data. The hidden layer is the equivalent of the human brain where neurons interact. The output layer represents the response of neurons after multi-level interaction. The fully connected neural network contains multiple neurons and transmits signals forward and errors back through neurons, and thus it has the advantages of self-organisation, self-adaptability and self-learning, and it can deal with complex problems quickly and stably. Its structure is shown in figure 2.

Figure 2. Fully connected neural network architecture.

The model structure of the neural network is determined using experimental results. It comprises five layers. The input layer comprises 128 neurons, and three hidden layers are set with 16 neurons in each layer. The number of nodes in the hidden layer is set according to the formula ${n_1} = \sqrt {n + m} + a$ (Shen, Wang & Gao Reference Shen, Wang and Gao2008), where n is the number of neurons in the input layer, m is the number of neurons in the output layer and a is a constant between [1, 10]. The structure is shown in table 1.

Table 1. Structural parameters of neural network.

4. Model training and testing

4.1. All magnetic measurement signals model

All electromagnetic measurement signals (total number = 88) are used as input to train the model, and it is labelled as the 88-input model. According to neural network training rules, we train the model 500 times. The loss function of training and testing is shown in figure 3(a); the horizontal axis shows the training iterations, the blue line represents the training loss function value and the orange line represents the test loss function value. By constantly adjusting the model parameters, the loss is minimised and the model reaches the optimal convergence state, in which the maximum error of the model training results is 0.003186. Then, 500 shots with different configurations whose shot number is from 99 501 to 100 000 were used as the test. The mean square error of the model-predicted value of each test shot is shown in figure 3(b); the horizontal axis is the number of shots, and each point represents the mean square error between the model-predicted value and the target value of the shot data. The mean square error for the test data is concentrated between [0.00, 0.02].

Figure 3. The 88-input model training and test loss function diagram (a) and mean square error (MSE) of test set (b).

The test set contains all configurations, and one configuration shot @100000 is selected for testing (figure 4). The horizontal axis in figure 4 is time (seconds) and the vertical axis is vertical displacement (metres). The red line is the vertical displacement result calculated by EFIT and the blue line is the vertical displacement result fitted by the neural network model. The mean square error is 0.000002.

Figure 4. The 88-input model fitting vertical displacement result diagram @100000.

4.2. Model of magnetic probe signals

Only magnetic probe signals and plasma current signal are used as input to train the model in the second group; the PF coil current and FL loops are discarded. The model of magnetic probe signals is labelled as the 39-input model. The test results are shown in figure 5. The model training and test loss function values are ≤0.00005, and the mean square errors for test data are concentrated between [0.0000, 0.0005].

Figure 5. The 39-input model training and test loss function diagram (a) and mean square error (MSE) of test set (b).

Similar to § 4.1, one configuration shot @100000 is selected for testing (figure 6). The mean square error is 0.000001.

Figure 6. The 39-input model fitting vertical displacement result diagram @100000.

4.3. Comparison of the two models

The models with 39 and 88 inputs are compared; the mean square errors of the same test set data are smaller and more concentrated. The relevant error parameters of configuration are shown in figure 7.

Figure 7. Error parameter values of the two models @100000.

As shown in figure 7, for the test shot @100000, all error parameter values of the 39-input model are less than those of the 88-input model. Compared with the 39-input model, the 88-input model added PF coil current signals and FL signals as inputs. For PF coil current signals, the PF coils are located outside a vacuum vessel, which is considered as a passive structure. The PF coil current changes, especially high-frequency changes, are shielded by the vacuum vessel, implying that the PF coil current cannot directly reflect the plasma vertical position at the current moment. The FL signals are integral signals, which are mainly contributed by the ohmic field. The effects of plasma vertical displacement on the FL signals are approximately 0.05 mV mm⁻¹ (for 400 kA plasma), and they are easily affected by electronic noise. At the same time, fewer input signals greatly improve the speed of model calculation. The 39-input model is chosen for application in this study.

Specific configuration experimental shot @102913 whose vertical displacement changes drastically, that is, the shot disrupted due to a vertical displacement event, is selected for testing the 39-input model; the test results are shown in figure 8. Same as in figure 4, the red line is the vertical displacement result calculated by EFIT and the blue line is the vertical displacement result fitted by the neural network model. The maximum error between them is 0.007494, the mean square error is 0.000004 and the mean absolute error is 0.001520. Compared with shot@100000, error parameters are slightly larger, but are much less than the maximum controllable error of vertical displacement. Thus, the proposed model could also achieve effective control in a particular configuration.

Figure 8. The 39-input model fitting vertical displacement result diagram @102913.

5. Conclusion

This study is based on the magnetic probe measurement data of the EAST tokamak device and the vertical displacement data calculated by EFIT. The relationship between magnetic measurement signals and vertical displacement is explored using neural networks, and the better model is tested for accuracy. By simulating neural network tests, better control of vertical displacement in the future and the prediction of vertical displacement is achieved. Fewer signals are required to estimate the vertical displacement, realising faster control of vertical displacement. The next step of this study is to make the model a real-time one and carry out experimental verification of the vertical displacement control system.

Acknowledgements

Editor William Dorland thanks the referees for their advice in evaluating this article.

Funding

This work is supported by the National MCF Energy R&D Program of China (no. 2018YFE0302100) and the National Natural Science Foundation of China (no. 11905250, no. 12075285).

Declaration of interests

The authors report no conflict of interest.

References

De Tommasi, G., et al. 2014 Shape control with the XSC during plasma current ramp-up and ramp-down at the JET tokamak. J. Fusion Energy 33, 149–157.CrossRef Google Scholar

Gates, D.A., Kessel, C., Menard, J., et al. 2006 Progress towards steady state on NSTX. Nucl. Fusion 46 (3), 22–28.CrossRef Google Scholar

Gruber, O., Lackner, K. & Pautasso, G. 1993 Vertical displacement events and halo currents. Plasma Phys. Control. Fusion 35B, 191.CrossRef Google Scholar

Guo, B.H., Chen, D.L. & Shen, B. 2021 Disruption prediction on EAST tokamak using a deep learning algorithm. Plasma Phys. Control. Fusion 63, 115007.CrossRef Google Scholar

Guo, B.H., Shen, B. & Chen, D.L. 2021 Disruption prediction using a full convolutional neural network on EAST. Plasma Phys. Control. Fusion 63 (2), 025008.CrossRef Google Scholar

Hu, J. & Zeng, X. 2009 An efficient activation function for BP neural network. In 2009 International Workshop on Intelligent Systems and Applications, pp. 1–4. IEEE.CrossRef Google Scholar

Huang, Y. 2017 Study of MHD equilibrium reconstruction and plasma control based on GPU parallel computation. PhD thesis, University of Chinese Academy of Science.Google Scholar

Huihui, S. & Qiping, Y. 2020 Design and implementation of fast control on the vertical displacement based on high-speed acquisition. J. Instrum. Technol. 4, 37–42.Google Scholar

Jhanga, H., Kesselb, C., Pomphrey, N., Jardin, S.C., Lee, G.S. & Chang, C.S. 1999 Design calculations for fast plasma position control in Korea superconducting tokamak advanced research. Fusion Engng Des. 45, 101–115.CrossRef Google Scholar

Liu, L. 2015 The simulation and experimental study of passive stability and active control for EAST vertical instability. PhD thesis, University of Chinese Academy of Science.Google Scholar

Pomphrey, N., Jardin, S.C. & Ward, D.J. 1989 Feedback stabilization of the axisymmetric instability of a deformable tokamak plasma. Nucl. Fusion 29 (3), 465.CrossRef Google Scholar

Qiping, Y. 2009 Plasma control system with Linux cluster structure. PhD thesis, University of Chinese Academy of Sciences.Google Scholar

Sammuli, B.S., Barr, J.L. & Humphreys, D.A. 2021 Avoidance of vertical displacement events in DIII-D using a neural network growth rate estimator. Fusion Engng Des. 169, 112492.CrossRef Google Scholar

Shen, H.-Y., Wang, Z. & Gao, C. 2008 Determining the number of BP neural network hidden layer units. Tianjin Univ. Technol. 5, 13–15.Google Scholar

Wang, Y. 2018 EAST shape and position control opentimization and system identification. PhD thesis, University of Chinese Academy of Science.Google Scholar

Ward, D.J. & Hofmann, F. 1994 Active feedback stabilization of axisymmetric modes in highly elongated tokamak plasmas. Nucl. Fusion 34 (3), 401–415.CrossRef Google Scholar

Yang, B., Liu, Z., Song, X. & Li, X. 2020 Design of HL-2A plasma position predictive model based on deep learning. Plasma Phys. Control. Fusion 62, 125022.CrossRef Google Scholar

Yuan, Q., Xiao, B., Luo, Z., et al. 2013 Plasma current, position and shape feedback control on EAST. Nucl. Fusion 53 (4), 043009.CrossRef Google Scholar