Motion Control of Flexible-Joint Robotic Arms for Variable-Station Warehouse Sorting Based on Proximal Policy Optimization
Keywords:
Proximal policy optimization, Variable workstation, Warehouse sorting, Flexible joint, Robotic arm, Motion controlAbstract
In variable-station warehouse sorting scenarios, the motion control of flexible-joint robotic arms must simultaneously address the triple-coupled challenges of flexible joint characteristics, dynamic workstation environments, and sorting task requirements, making arm motion control of the robotic arm highly complex. To address this, a motion control method for flexible-joint robotic arms in variable-station warehouse sorting is proposed, based on Proximal Policy Optimization (PPO algorithm). After analyzing the variable-station warehouse sorting model and the robotic arm's control system architecture, a motion control model based on Proximal Policy Optimization is constructed. This model maps the robotic arm as an agent, by designing a multidimensional state space that encompassing station coordinates and cargo status. It divides the action space into overall arm movement and end-effector rotation, establishing a reward function incorporating continuous rewards, sparse rewards, and penalties. An LSTM is introduced to capture temporal motion correlations, predicting advantageous function values under different actions as workstation coordinates change. The PPO algorithm obtains the robotic arm motion control commands with the highest cumulative reward value—such as angular velocity and torque for each joint, along with gripper opening degree (gripping force)—for robotic arm motion control. Experiments demonstrate that this method achieves position control errors as low as 0.1 mm and gripping force errors reduced to 0.05N for flexible-joint robotic arms in variable-workstation warehouse sorting. Sorting speeds reach 30 items per minute, meeting the high-precision and high-robustness control demands of variable-workstation warehouse sorting.
Downloads
References
Abdelghani, D., Mohamed, Z. A. Nouara, A. A Novel Stepper Motor Haptic Interface for Efficient Robotic Task Programming. Journal Europeen des Systemes Automatises, 2024, 57(5), 1369-1376. https://doi.org/10.18280/jesa.570512
Parvin, M., Jafar, M., Keyvan, A. V. Robotic date fruit harvesting using machine vision and a 5‐DOF manipulator. Journal of Field Robotics, 2023, 40(6), 1408-1423. https://doi.org/10.1002/rob.22184
Steafan, E. K. Zachary, C. D. Continuous Gesture Control of a Robot Arm: Performance Is Robust to a Variety of Hand-to-Robot Maps. IEEE Transactions on Biomedical Engineering, 2024, 71(3), 944-953. https://doi.org/10.1109/TBME.2023.3323601
Sinha, A. K., Thalmann, N. M. Cai, Y. Measuring Anthropomorphism of a New Humanoid Hand-Arm System. International Journal of Social Robotics, 2023, 15(8), 1341-1363. https://doi.org/10.1007/s12369-023-00999-x
Hernandez-Sanchez, A., Chairez, I., Poznyak, A., et al. Cueing end-effector acceleration of a two-link robotic arm by dynamic averaged sub-gradient integral sliding mode control. Asian Journal of Control: Affiliated with ACPA, the Asian Control Professors Association, 2023, 25(4), 2577-2587. https://doi.org/10.1002/asjc.2994
Mozhi, G. T., Sundareswari, M. B. Dhanalakshmi, K. Dhanalakshmi. Bidirectional Position Control of a Prismatic joint for Motorized Single Link Robotic Arm Using Adaptive Super- Twisting Sliding Mode Control. Journal of The Institution of En-gineers (India), Series B. Electrical eingineering, electronics and telecommunication engineering, computer engineering, 2023, 104(5), 1035-1042. https://doi.org/10.1007/s40031-023-00908-w
Leanza, S., Juliana, L. Y., Kaczmarski, B., et al. Elephant Trunk Inspired Multimodal Deformations and Movements of Soft Robotic Arms. Advanced functional materials, 2024, 34(29),2400396.1-2400396.10. https://doi.org/10.1002/adfm.202400396
Benyamin, S., Hossein, M., Arian, S., et al. Programmable Shape-Preserving Soft Robotics Arm via Multimodal Multistability. Advanced functional materials, 2025, 35(6), 2407651.1-2407651.16. https://doi.org/10.1002/adfm.202407651
Lancaster, P., Mavrogiannis, C., Srinivasa, S., et al. Electrostatic brakes enable individual joint control of underactuated, highly articulated robots. The International Journal of Robotics Research, 2024, 43(14), 2204-2220. https://doi.org/10.1177/02783649241250362
Miroslav, M., Milena, K., Vladimir, P., et al. Optimizing the Position of a Robotic Arm Using Statistical Methods. Manufac-turing Technology, 2024, 24(4),618-625. https://doi.org/10.21062/mft.2024.073
Marco, B., Gianluca, R., Luca, Z., et al. Lightweight Human-Friendly Robotic Arm Based on Transparent Hydrostatic Transmissions. IEEE Transactions on Robotics: A publication of the IEEE Robotics and Automation Society, 2023, 39(5),4051-4064. https://doi.org/10.1109/TRO.2023.3290310
Hichame, T., Mohamed, E. H., Daachi, T. M. Real-time adaptive super twisting algorithm based on PSO algorithm: application for an exoskeleton robot. Robotica: International journal of information, education and research in robotics and artificial in-telligence, 2024, 42(6),1816-1841. https://doi.org/10.1017/S0263574724000547
Schorr, L., Cobilean, V., Mavikumbure, H. S., et al. Industrial workspace detection of a robotic arm using combined 2D and 3D vision processing. The International Journal of Advanced Manufacturing Technology, 2025, 136(3/4), 1317-1326. https://doi.org/10.1007/s00170-024-14901-0
Islem, K., Abdelhak, B., Mohamed, T. Enhancing pose estimation for mobile robots: A comparative analysis of deep rein-forcement learning algorithms for adaptive Extended Kalman Filter-based estimation. Engineering Applications of Artificial Intelligence, 2025, 150(Jun.),110548.1-110548.22. https://doi.org/10.1016/j.engappai.2025.110548
Kim, D., Choi, M., Um, J. Digital twin for autonomous collaborative robot by using synthetic data and reinforcement learning. Robotics and Computer Integrated Manufacturing: An International Journal of Manufacturing and Product and Process De-velopment, 2024, 85(Feb.),102632.1-102632.13. https://doi.org/10.1016/j.rcim.2023.102632
Moslem, M., Abbas, Z. K., Mahdi, B., et al. Sustainable Robotic Joints 4D Printing with Variable Stiffness Using Reinforcement Learning. Robotics and Computer Integrated Manufacturing: An International Journal of Manufacturing and Product and Process Development, 2024, 85(Feb.), 102636.1-102636.12. https://doi.org/10.1016/j.rcim.2023.102636
Mani, A., Reza, A. Intelligent ergonomic optimization in bimanual worker-robot interaction: A Reinforcement Learning ap-proach. Automation in construction, 2024, 168(Dec. Pt.A),105741.1-105741.13. https://doi.org/10.1016/j.autcon.2024.105741
Anh, V. L., Dinh, T. V., Nguyen, T. D., et al. Complete coverage planning using Deep Reinforcement Learning for polyia-monds-based reconfigurable robot. Engineering Applications of Artificial Intelligence, 2024, 138(Dec. Pt.B), 109424.1-109424.14. https://doi.org/10.1016/j.engappai.2024.109424
Pablo, R. B., Matteo, S., Angeliki, K., et al. Neutrons Sensitivity of Deep Reinforcement Learning Policies on EdgeAI Accel-erators. IEEE Transactions on Nuclear Science, 2024, 71(8 Pt.1),1480-1486. https://doi.org/10.1109/TNS.2024.3387087
Chen, G., Peng, Y., Zhang, M. An adaptive clipping approach for proximal policy optimization. arXiv preprint arXiv:1804.06461, 2018. https://doi.org/10.48550/arXiv.1804.06461
Published
How to Cite
Issue
Section
Copyright (c) 2026 Xiaogang Zhu, Xiangzhen Pan, Xiaojing Gao

This work is licensed under a Creative Commons Attribution 4.0 International License.





