Hostname: page-component-89b8bd64d-sd5qd Total loading time: 0 Render date: 2026-05-07T01:43:24.192Z Has data issue: false hasContentIssue false

Learn multi-step object sorting tasks through deep reinforcement learning

Published online by Cambridge University Press:  06 May 2022

Jiatong Bao*
Affiliation:
School of Electrical, Energy and Power Engineering, Yangzhou University, Yangzhou 225000, China School of Instrument Science and Engineering, Southeast University, Nanjing 210000, China
Guoqing Zhang
Affiliation:
School of Electrical, Energy and Power Engineering, Yangzhou University, Yangzhou 225000, China
Yi Peng
Affiliation:
School of Electrical, Energy and Power Engineering, Yangzhou University, Yangzhou 225000, China
Zhiyu Shao
Affiliation:
School of Electrical, Energy and Power Engineering, Yangzhou University, Yangzhou 225000, China
Aiguo Song
Affiliation:
School of Instrument Science and Engineering, Southeast University, Nanjing 210000, China
*
*Corresponding author. E-mail: jtbao@yzu.edu.cn

Abstract

Robotic systems are usually controlled to repetitively perform specific actions for manufacturing tasks. The traditional control methods are domain-dependent and model-dependent with cost of much human efforts. They cannot meet the new requirements of generality and flexibility in many areas such as intelligent manufacturing and customized production. This paper develops a general model-free approach to enable robots to perform multi-step object sorting tasks through deep reinforcement learning. Taking projected heightmap images from different time steps as input without extra high-level image analysis and understanding, critic models are designed to produce a pixel-wise Q value map for each type of action. It is a new trial to apply pixel-wise Q value-based critic networks to solve multi-step sorting tasks that involve many types of actions and complex action constraints. The experimental validations on simulated and realistic object sorting tasks demonstrate the effectiveness of the proposed approach. Qualitative results (videos), code for simulated and realistic experiments, and pre-trained models are available at https://github.com/JiatongBao/DRLSorting

Information

Type
Research Article
Copyright
© The Author(s), 2022. Published by Cambridge University Press

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)

Article purchase

Temporarily unavailable