Hostname: page-component-77f85d65b8-45ctf Total loading time: 0 Render date: 2026-04-19T07:10:03.780Z Has data issue: false hasContentIssue false

Sampling balanced high-quality data to train an automatic mesh generator

Published online by Cambridge University Press:  18 November 2025

Jie Pan
Affiliation:
Concordia Institute for Information Systems Engineering, Concordia University , Montreal, Canada
Jingwei Huang
Affiliation:
Department of Engineering Management & Systems Engineering, Old Dominion University, Norfolk, VA, USA
Gengdong Cheng
Affiliation:
Department of Engineering Mechanics, Dalian University of Technology , Dalian, China
Yong Zeng*
Affiliation:
Concordia Institute for Information Systems Engineering, Concordia University , Montreal, Canada
*
Corresponding author: Yong Zeng; Email: yong.zeng@concordia.ca
Rights & Permissions [Opens in a new window]

Abstract

In real-world scenarios, high-quality data are often scarce and imbalanced, yet it is essential for the optimal performance of data-driven algorithmic models. Data synthesis methods are commonly used to address this issue; however, they typically rely heavily on the original dataset, which limits their ability to significantly improve performance. This article presents a quality function-based method for directly generating high-quality data and applies it to a mesh generation algorithm to demonstrate its efficiency and effectiveness. The proposed approach samples input–output pairs of the algorithm based on their feature spaces, selects high-quality samples using a defined quality function that evaluates the suitability of outputs for their corresponding inputs, and trains a feedforward neural network to learn the mapping relationship using the selected data. Experimental results show that the learning cost is significantly reduced while maintaining competitive performance compared to two representative meshing algorithms.

Information

Type
Research Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2025. Published by Cambridge University Press
Figure 0

Figure 1. A sequence of decisions to complete the mesh. At each time step $ {t}_i $, an element (in red) is extracted from the current boundary (in blue). The boundary is then updated by removing the element and serves as the meshing boundary in the next time step $ {t}_{i+1} $. This process continues until the updated boundary becomes the final element.

Figure 1

Figure 2. An example of the input with $ {L}_r=4,n=2,g=3 $ (Pan et al., 2023).

Figure 2

Figure 3. Four basic rule types to form a quadrilateral element. $ {V}_0 $ is the reference vertex. The newly generated vertices $ {V}_t $ and edges are marked in yellow. Type 1 involves adding two extra edges; type 0 and 2 involve adding one extra edge; and type 3 involves adding three edges.

Figure 3

Figure 4. Quality function-based data generation procedure for mesh generation. The mesh generation algorithm consists of a set of input–output pairs. The points and edges in black are from the right and left sides of the reference vertex, and the ones in yellow are three neighboring points from the corresponding fan-shaped area.

Figure 4

Figure 5. Updated adjacent boundary quality for different solution types. $ {V}_0 $ is the reference vertex.

Figure 5

Figure 6. Comparison of datasets with four levels of quality thresholds, $ \tau \in \left\{\mathrm{0.6,0.7,0.75,0.8}\right\} $. Five kinds of metrics are used to represent the characteristics of the dataset, including element quality ($ {\eta}^e $ in Equation 3), boundary quality ($ {\eta}^b $), quality ($ \eta $), angle (i.e., $ \angle {V}_{l,1}{V}_0{V}_{r,1} $ in the input), and the averaged segment length.

Figure 6

Figure 7. Comparison of element quality with four levels of sample size, $ M\in \left\{5e3,1e4,4e4,1e5\right\} $. Element quality is used to measure the meshing results with different levels of sample size.

Figure 7

Figure 8. Comparison of the vertex distribution of samples. The first row [i.e., Subfigure (a)] is the distribution of all the vertices in the input–output samples extracted from Gmsh. The second row [i.e., Subfigure (b)] is the vertex distribution of samples generated by FreeMesh-DG with quality threshold $ \tau \ge 0.7 $. Types 0, 1, and 2 correspond to the three basic rules in the output. Only type 1 needs to generate a new vertex (in red) to form an element. Blue vertices represent the neighboring vertices around the reference Vertex; yellow vertices represent the vertices in the fan-shaped area; all of them are included in the input. The x and y axes are the coordinate axes of the vertex.

Figure 8

Figure 9. Comparison of the angle distribution of samples. The angle ranges from 0.5 to 2.5 radians. Types 0, 1, and 2 correspond to the three basic rules in the output. Subfigure (a) is the angle distribution of samples from Gmsh. Subfigure (b) is the vertex distribution of samples generated by FreeMesh-DG with a quality threshold $ \tau \ge 0.7 $.

Figure 9

Table 1. Meshing results comparison

Figure 10

Table 2. Averaged mesh quality metrics over the five domains