Design and Simulation of an Autonomous System Using Reinforcement Learning for Articulated Vehicle Pose Adjustment

Qanbari Senjegani, Moein; Sadedel, Majid

doi:10.48311/mme.2025.117147.82868

Design and Simulation of an Autonomous System Using Reinforcement Learning for Articulated Vehicle Pose Adjustment

Document Type : Original Article

Authors

Moein Qanbari Senjegani

Majid Sadedel

Mechanical Engineering Department, Tarbiat Modares University, Tehran, Iran

10.48311/mme.2025.117147.82868

Abstract

Automatic posture adjustment is a key application in the domain of intelligent vehicles, playing a fundamental role in enhancing safety and optimizing vehicle maneuvering operations. Articulated Vehicles (AVs), due to their high degrees of freedom and the complex, nonlinear dynamics resulting from the joint between the tractor and trailer, present a more challenging control problem compared to rigid vehicles. The objective of this research is to design and simulate an automatic control system for articulated vehicle posture adjustment utilizing a Deep Reinforcement Learning (DRL) framework. This system can also serve as a foundation for more advanced applications, such as autonomous parking.

In this study, the precise modeling of articulated vehicle dynamics and the jackknifing phenomenon was initially carried out. The developed model was validated using the specialized software, TruckSim. Subsequently, to reduce computational complexity, the learning process was segmented into two distinct phases: maneuver preparation and final posture adjustment. For training the intelligent agent, Deep Deterministic Policy Gradient (DDPG) and Twin Delayed DDPG (TD3) algorithms were employed, which were optimized with neural networks comprising three to five hidden layers. Evaluation results indicated that the TD3 algorithm, owing to its superior ability to maintain the stability of the learning process, outperformed DDPG. Ultimately, the proposed control system, with the optimal structure for each phase, achieved success rates of 96.6% in the preparation phase and 94.6% in the final adjustment phase, thereby confirming the high efficiency and reliability of the DRL-based system in addressing the control challenges of articulated vehicles.

Keywords

Motion Planning

Articulated Vehicle

Reinforcement Learning

Deep Deterministic Policy Gradient (DDPG)

Twin Delayed Deep Deterministic Policy Gradient (TD3)

Subjects

Mechatronics

1] H. Rezaei, H. N. Pishkenari, S. Jafari, and E. M. J. Al-Rashed, "Automatic parking of an articulated vehicle using ANFIS," in Proc. IEEE Int. Symp. Ind. Electron. (ISIE), Taipei, Taiwan, 2013, pp. 2519-2524.

[2] R. P. Vieira, E. V. Argento, and T. C. Revoredo, "Trajectory planning for car-like robots through curve parametrization and genetic algorithm optimization with applications to autonomous parking," IEEE Latin Am. Trans., vol. 20, no. 2, pp. 309-316, Feb. 2022, doi: 10.1109/TLA.2022.9661471.

[3] B. Li, T. Acarman, Y. Zhang, Y. Ouyang, C. Yaman, Q. Kong, X. Zhong, and X. Peng, "Optimization-based trajectory planning for autonomous parking with irregularly placed obstacles: A lightweight iterative framework," IEEE Trans. Intell. Transp. Syst., vol. 23, no. 8, pp. 11970-11981, Aug. 2022, doi: 10.1109/TITS.2021.3109011.

[4] H. R. Nedamani, M. Soleymanifard, A. Safaeifar, and P. M. Khiabani, "Soft Computing-Based Driver Modeling for Automatic Parking of Articulated Heavy Vehicles," SAE Int. J. Commer. Veh., vol. 16, no. 03-16-04, pp. 268-281, 2023, doi: 10.4271/02-16-04-0027.

[5] S. Azadi, H. R. Nedamani, and R. Kazemi, "Automatic parking of an articulated vehicle using ANFIS," Global J. Sci. Eng. Technol., vol. 3, pp. 93-104, 2013.

[6] J. Zhang, H. Chen, S. Song, and F. Hu, "Reinforcement learning-based motion planning for automatic parking system," IEEE Access, vol. 8, pp. 154485-154501, Aug. 2020, doi: 10.1109/ACCESS.2020.3017770.

[7] S. Song, H. Chen, H. Sun, and M. Liu, "Data efficient reinforcement learning for integrated lateral planning and control in automated parking system," Sensors, vol. 20, no. 24, p. 7297, Dec. 2020, doi: 10.3390/s20247297.

[8] T. Lei, J. Wang, and Z. Yao, "Modelling and stability analysis of articulated vehicles," Appl. Sci., vol. 11, no. 8, p. 3663, Apr. 2021, doi: 10.3390/app11083663.

[9] B. Saeedi and M. Sadedel, "Implementation of Behavior-Based Navigation Algorithm on Four-Wheel Steering Mobile Robot," Int. J. Adv. Manuf. Technol., vol. 113, no. 3-4, pp. 903-920, Mar. 2021. doi: 10.22059/jcamech.2021.330072.648

[10] A. Molaei and M. Amirkhani, "Distributed Reinforcement Learning Framework for Autonomous Driving on Highways," Iranian Journal of Intelligent Systems, vol. 10, no. 2, pp. 45-59, 2021.

[11] R. Rizehvandi and A. Azadi, "Deep Reinforcement Learning for Highway Driving Decision-Making," Journal of Transportation Systems, vol. 15, no. 3, pp. 112-128, 2024.

[12] A. Adrisi, K. Bagherzadeh Chehreh, and A. Naderi, "Proposing an Innovative Model Based on Reinforcement Learning for Intelligent Vehicle Routing in a Dynamic Network (Case Study: City of Isfahan)," in Proc. Int. Conf. Civil Eng., Archit., Urbanism Contemp. Iran, 2017.

[13] T. T. Li, J. Wang, and Z. Yao, "Nonlinear Dynamic Model for Articulated Vehicles and Stability Analysis," J. Veh. Dyn., vol. 42, no. 1, pp. 56-68, Jan. 2018.

[14] J. Jung, H. Chen, S. Song, and F. Hu, "Side Slip Angle Estimation with Dual Kalman Filters," J. Veh. Control, vol. 25, no. 2, pp. 134-147, Feb. 2019.

[15] J. Doe, H. Chen, S. Song, and F. Hu, "Lyapunov Control for Articulated Systems," IEEE Trans. Control Syst. Technol., vol. 24, no. 5, pp. 1789-1802, Sep. 2016.

[16] K. D. A. van de Vendel, J. Elfring, and A. J. Aertssen, "Path Planning for a Reverse Parking Maneuver using Reinforcement Learning," TU Eindhoven Repository (pure.tue.nl), 2025.

[17] A. Aertssen, R. Huisman, I. Besselink, and M. J. G. R. van de Molengraft, "Real-Time Trajectory Planning for an Autonomous Articulated Commercial Vehicle," in Proc. 43rd Benelux Meeting Syst. Control, 2024, pp. 1-10.

[18] Y. Cheng, X. Hu, K. Chen, X. Yu, and Y. Luo, "Online longitudinal trajectory planning for connected and autonomous vehicles in mixed traffic flow with deep reinforcement learning approach," J. Transp. Eng. A: Syst., vol. 150, no. 7, Jul. 2024, doi: 10.1061/JTEPBS.TEENG-8286.

[19] H. Pei, J. Zhang, Y. Zhang, H. Xu, and L. Li, "Self-organized routing for autonomous vehicles via deep reinforcement learning," IEEE Trans. Veh. Technol., vol. 72, no. 4, pp. 4351-4363, Apr. 2023, doi: 10.1109/TVT.2022.3225808.

[20] F. Ye, S. Zhang, P. Wang, and C.-Y. Chan, "A survey of deep reinforcement learning algorithms for motion planning and control of autonomous vehicles," in Proc. IEEE Intell. Veh. Symp. (IV), Nagoya, Japan, 2021, pp. 1075-1080, doi: 10.1109/IV48863.2021.9575880.

[21] R. Cheng, G. Orosz, R. M. Murray, and J. W. Burdick, "End-to-End Safe Reinforcement Learning through Barrier Functions for Safety-Critical Continuous Control Tasks," arXiv preprint arXiv:1903.08792, 2019.

[22] H. Liu and L. Wang, "TD3 based collision free motion planning for robot navigation," arXiv preprint arXiv:2405.15460, 2024.

[23] T. Xu, Z. Meng, W. Chen, and Z. Li, "End-to-end autonomous driving decision method based on improved TD3 algorithm in complex scenarios," Sensors, vol. 24, no. 15, p. 4962, 2024, doi: 10.3390/s24154962.

[24] Z. Zhang, Y. Luo, Y. Chen, H. Zhao, Z. Ma, and H. Liu, "Automated parking trajectory generation using deep reinforcement learning," arXiv preprint arXiv:2504.21071, 2025.

[25] I. M. Anagnostara, E. Tsardoulias, and A. L. Symeonidis, "Deep reinforcement learning and imitation learning for autonomous parking simulation," Electronics, vol. 14, no. 10, p. 1992, 2025, doi: 10.3390/electronics14101992.

[26] K. Wahdan, N. Ehab, Y. Mansy, and A. El Mougy, "Dynamic path planning for autonomous vehicles using adaptive reinforcement learning TD3 model," in Proc. ICAART, Rome, Italy, 2024, pp. 272-279, doi: 10.5220/0012363300003636.

[27] D. Attard, "Reinforcement learning for autonomous navigation of articulated vehicles," M.Sc. Thesis, University of Malta, 2024.

[28] S. van Dam, L. Wisell, K. Shingade, M. Sadeghi Reineh, and F. Bruzelius, "Safe Control Allocation of Articulated Heavy Vehicles," in Proc. AVEC, LNME, 2024, doi: 10.1007/978-3-031-70392-8_1.

[29] J. Wang, L. Chu, Y. Zhang, Y. Mao, and C. Guo, "Intelligent Vehicle Decision-Making and Trajectory Planning Method Based on Deep Reinforcement Learning in the Frenet Space," Sensors, vol. 23, no. 5, p. 2463, 2023, doi: 10.3390/s23052463.

[30] "Mercedes-Benz Actros 2011 blueprint," Drawing Database, Feb. 2025. [Online]. Available: Drawing Database.

[31] I. Han, "Path Planning for Perpendicular Parking of Large Articulated Vehicles Based on Qualitative Kinematics and Geometric Methods," Autonomous Intelligent Systems, vol. 5, no. 3, p. 48, 2023, doi: 10.3390/vehicles503004.

Volume 26, Issue 5
April 2026
Pages 383-399

XML

PDF 1.75 M

Article View 167
PDF Download 72

Modares Mechanical Engineering

Design and Simulation of an Autonomous System Using Reinforcement Learning for Articulated Vehicle Pose Adjustment

Volume 26, Issue 5April 2026Pages 383-399

Files

Share

How to cite

Statistics

Volume 26, Issue 5
April 2026
Pages 383-399