Motion Control of Flexible-Joint Robotic Arms for Variable-Station Warehouse Sorting Based on Proximal Policy Optimization

Xiaogang Zhu; Xiangzhen Pan; Xiaojing Gao

doi:10.61187/ita.v4i1.296

Authors

Xiaogang Zhu
zxg198812@126.com
School of Architecture and Engineering, Yantai Institute of Technology, Yantai, Shandong, 264000, China
Xiangzhen Pan
School of Management Science and Engineering, Shandong Technology and Business University, Yantai, Shandong, 264005, China
Xiaojing Gao
School of Architecture and Engineering, Qingdao Binhai University, Qingdao, Shandong, 266555, China

Keywords

Proximal policy optimization, Variable workstation, Warehouse sorting, Flexible joint, Robotic arm, Motion control

Abstract

In variable-station warehouse sorting, the motion control of flexible-joint robotic arms must simultaneously address three coupled challenges: flexible joint characteristics, dynamic workstation environments, and sorting task requirements. This coupling makes motion control highly complex. To address this, a motion control method for flexible-joint robotic arms in variable-station warehouse sorting is proposed, based on the Proximal Policy Optimization (PPO) algorithm. After analyzing the variable-station warehouse sorting model and the robotic arm's control system architecture, a PPO-based motion control model is constructed. The model treats the robotic arm as an agent and designs a multidimensional state space that encompasses station coordinates and cargo status. It divides the action space into overall arm movement and end-effector rotation, and establishes a reward function that incorporates continuous rewards, sparse rewards, and penalties. An LSTM is introduced to capture temporal motion correlations and to predict advantage function values under different actions as workstation coordinates change. The PPO algorithm then obtains the motion control commands with the highest cumulative reward, such as the angular velocity and torque of each joint and the gripper opening degree (gripping force), and these commands drive the robotic arm. Experiments demonstrate that this method achieves position control errors as low as 0.1 mm and gripping force errors as low as 0.05 N for flexible-joint robotic arms in variable-station warehouse sorting. Sorting speeds reach 30 items per minute, meeting the high-precision and high-robustness control demands of variable-station warehouse sorting.
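As a rough illustration of two ideas named in the abstract, the sketch below combines a continuous term, a sparse term, and a penalty into one per-step reward, and evaluates the standard PPO clipped surrogate objective. It is a minimal sketch under stated assumptions, not the authors' implementation: the names `RewardWeights`, `composite_reward`, and `ppo_clipped_objective`, all weight values, the distance-based shaping term, the collision flag, and the clipping range of 0.2 are hypothetical choices made only for this example.

```python
import math
from dataclasses import dataclass


@dataclass(frozen=True)
class RewardWeights:
    # All values are illustrative assumptions, not taken from the paper.
    distance_scale: float = 1.0      # weight of the continuous distance term
    success_bonus: float = 10.0      # sparse reward when the target is reached
    collision_penalty: float = 5.0   # penalty applied on a collision
    success_tolerance: float = 1e-3  # hypothetical "reached" threshold (metres)


def composite_reward(gripper_xyz, station_xyz, collided, w=RewardWeights()):
    """Return a continuous + sparse + penalty reward for one time step."""
    distance = math.dist(gripper_xyz, station_xyz)
    reward = -w.distance_scale * distance      # continuous shaping term
    if distance <= w.success_tolerance:
        reward += w.success_bonus              # sparse success term
    if collided:
        reward -= w.collision_penalty          # penalty term
    return reward


def ppo_clipped_objective(ratios, advantages, clip_eps=0.2):
    """Mean PPO clipped surrogate objective (the quantity PPO maximises).

    ratios[i] is pi_new(a|s) / pi_old(a|s) for sample i and
    advantages[i] is the estimated advantage of that sample.
    """
    total = 0.0
    for ratio, advantage in zip(ratios, advantages):
        clipped_ratio = min(max(ratio, 1.0 - clip_eps), 1.0 + clip_eps)
        # Take the pessimistic (smaller) of the unclipped and clipped terms.
        total += min(ratio * advantage, clipped_ratio * advantage)
    return total / len(ratios)
```

Calling `composite_reward((0, 0, 0), (3, 4, 0), collided=True)` combines a distance of 5 with the collision penalty, and `ppo_clipped_objective([1.5], [1.0])` shows the clipping effect: the probability ratio of 1.5 is capped at 1.2, which limits how far a single update can move the policy.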
