Hostname: page-component-77f85d65b8-v2srd Total loading time: 0 Render date: 2026-04-18T03:57:46.307Z Has data issue: false hasContentIssue false

Merging pruning and neuroevolution: towards robust and efficient controllers for modular soft robots

Published online by Cambridge University Press:  03 February 2022

Giorgia Nadizar
Affiliation:
Department of Engineering and Architecture, University of Trieste, Italy Email: giorgia.nadizar@phd.units.it, emedvet@units.it, fapellegrino@units.it, marco.zullich@phd.units.it Department of Computer Science, Artificial Intelligence Lab, Oslo Metropolitan University, Norway
Eric Medvet
Affiliation:
Department of Engineering and Architecture, University of Trieste, Italy Email: giorgia.nadizar@phd.units.it, emedvet@units.it, fapellegrino@units.it, marco.zullich@phd.units.it
Hola Huse Ramstad
Affiliation:
Department of Neuromedicine and Movement Science, Norwegian University of Science and Technology, Norway Email: olahuser@oslomet.no
Stefano Nichele
Affiliation:
Department of Computer Science, Artificial Intelligence Lab, Oslo Metropolitan University, Norway Department of Holistic Systems, Simula Metropolitan Center for Digital Engineering, Norway Email: stenic@oslomet.no
Felice Andrea Pellegrino
Affiliation:
Department of Engineering and Architecture, University of Trieste, Italy Email: giorgia.nadizar@phd.units.it, emedvet@units.it, fapellegrino@units.it, marco.zullich@phd.units.it
Marco Zullich
Affiliation:
Department of Engineering and Architecture, University of Trieste, Italy Email: giorgia.nadizar@phd.units.it, emedvet@units.it, fapellegrino@units.it, marco.zullich@phd.units.it
Rights & Permissions [Opens in a new window]

Abstract

Artificial neural networks (ANNs) can be employed as controllers for robotic agents. Their structure is often complex, with many neurons and connections, especially when the robots have many sensors and actuators distributed across their bodies and/or when high expressive power is desirable. Pruning (removing neurons or connections) reduces the complexity of the ANN, thus increasing its energy efficiency, and has been reported to improve the generalization capability, in some cases. In addition, it is well-known that pruning in biological neural networks plays a fundamental role in the development of brains and their ability to learn. In this study, we consider the evolutionary optimization of neural controllers for the case study of Voxel-based soft robots, a kind of modular, bio-inspired soft robots, applying pruning during fitness evaluation. For a locomotion task, and for centralized as well as distributed controllers, we experimentally characterize the effect of different forms of pruning on after-pruning effectiveness, life-long effectiveness, adaptability to new terrains, and behavior. We find that incorporating some forms of pruning in neuroevolution leads to almost equally effective controllers as those evolved without pruning, with the benefit of higher robustness to pruning. We also observe occasional improvements in generalization ability.

Information

Type
Research Article
Copyright
© The Author(s), 2022. Published by Cambridge University Press
Figure 0

Figure 1. Frames of the two VSR morphologies used in the experiments. The color of each voxel encodes the ratio between its current area and its rest area: red indicated contraction, yellow rest state, and green expansion. The circular sector drawn at the center of each voxel indicates the current sensed values: subsectors represent sensors and are, where appropriate, internally divided into slices according to the sensor dimensionality $\mathrm{m}$. The rays of the vision sensors are shown in red.

Figure 1

Figure 2. A schematic representation of the centralized controller for a $3 \times 1$ VSR with two sensors in each voxel. Blue and red curved arrows represent the connection of the ANN with inputs (sensors) and outputs (actuators), respectively.

Figure 2

Figure 3. A schematic representation of the distributed controller for a $3\times1$ VSR with two sensors in each voxel and $n_c=1$ communication channel per side. Blue and red curved arrows represent the connection of the ANN with inputs (sensors and input communication channels) and outputs (actuator and output communication channels), respectively.

Figure 3

Figure 4. Mean absolute difference e between the output of a pruned ANN and the output of the corresponding unpruned ANN vs. the pruning rate $\rho$, for different ANN structures and with different pruning criteria (color) and scopes (linetype).

Figure 4

Figure 5. Comparison of the output of pruned and unpruned versions of two ANNs of different structures: $n_{\mathrm{input}}=10$, $n_{\mathrm{layers}}=0$, above, and $n_{\mathrm{input}}=100$, $n_{\mathrm{layers}}=2$, below. Pruning occurs at $t_p= 5\ \mathrm{s}$.

Figure 5

Table 1. Number of parameters to be optimized by the EA for each controller architecture and ANN topology

Figure 6

Figure 6. Fitness $v_x$ (median with lower and upper quartiles across the 10 repetitions) vs. pruning rate $\rho$, for different pruning criteria (color), controller architectures (plot row), and ANN topologies(plot column).

Figure 7

Figure 7. Fitness $v_x$ (median with lower and upper quartiles across the 10 repetitions) vs. pruning rate $\rho$, for different pruning criteria (color), VSR morphologies (plot row), and ANN topologies (plot column).

Figure 8

Figure 8. Median, lower quartiles, and upper quartiles of re-assessment velocity $v_x$ vs. re-assessment pruning rate $\rho$ of individuals evolved with and without pruning for different ANN topologies for the centralized controller and biped morphology.

Figure 9

Figure 9. Different types of terrains employed for measuring VSR adaptability.

Figure 10

Figure 10. Median, lower quartiles, and upper quartiles of re-assessment velocity $v_x$ vs. pruning rate $\rho$ averaged across re-assessment terrains for different pruning criteria, VSR controller architectures, VSR morphologies, and ANN topologies.

Figure 11

Figure 11. Fitness $v_x$ (median with lower and upper quartiles across the10 repetitions) vs. pruning rate $\rho$, for controllers evolved with $t_i=t_p=20 \ \mathrm{s}$ (first column) and controllers evolved with $t_i=5\ \mathrm{s}$ and $t_p=20\ \mathrm{s}$ (second column). Both controllers share the centralized architecture, the ANN topology with $n_{\textrm{layers}}=1$, and the biped morphology.

Figure 12

Figure 12. Median, lower quartiles, and upper quartiles of velocity $v_x$ vs. pruning rate $\rho$ of individuals before (solid line) and after (dashed line) pruning for different pruning criteria (color).

Figure 13

Figure 13. Average re-assessment velocity $v_x$ (median with lower and upper quartiles across the 10 repetitions) vs. pruning rate $\rho$, for controllers evolved with $t_i=t_p=20\ \mathrm{s}$ (first column) and controllers evolved with $t_i=5\ \mathrm{s}$ and $t_p=20 \ \mathrm{s}$ (second column). Both controllers share the centralized architecture, the ANN topology with $n_{\textrm{layers}}=1$, and the biped morphology.

Figure 14

Figure 14. Scatter plots of the first two components resulting from the PCA analysis of the features described in Section 5.5 for the biped VSR with a centralized controller. Each subplot corresponds to a pruning criterion (row) and a pruning rate $\rho$ (column); pruning is applied at $t_p=t_i=20\ \mathrm{s}$. The color of the bubble indicates whether the evaluation of velocity and feature extraction were performed before or after the occurrence of pruning, while the size of the bubble is proportional to the achievedvelocity $v_x$.

Figure 15

Figure 15. Scatter plots of the first two components resulting from the PCA analysis of the features described in Section 5.5 for the biped VSR with a centralized controller. Each subplot corresponds to a pruning criterion (row) and a pruning rate $\rho$ (column); pruning is applied at $t_p=20\ \mathrm{s}$ ($t_i=5\ \mathrm{s}$). The color of the bubble indicates whether the evaluation of velocity and feature extraction were performed before or after the occurrence of pruning, while the size of the bubble is proportional to the achieved velocity $v_x$.