Navigasi UAV (Unmanned Aerial Vehicle) Otonom Menggunakan Algoritma SAC (Soft Actor-Critic) dengan Pendekatan Deep Reinforcement Learning

Anwari, Anggra

Please use this identifier to cite or link to this item: http://repository.ipb.ac.id/handle/123456789/165175

Title:	Navigasi UAV (Unmanned Aerial Vehicle) Otonom Menggunakan Algoritma SAC (Soft Actor-Critic) dengan Pendekatan Deep Reinforcement Learning
Other Titles:	Autonomous UAV (Unmanned Aerial Vehicle) Navigation Using SAC (Soft Actor-Critic) Algorithm with a Deep Reinforcement learning Approach
Authors:	Supriyo, Prapto Tri Najib, Mohamad Khoirun Anwari, Anggra
Issue Date:	2025
Publisher:	IPB University
Abstract:	Navigasi otonom pada Unmanned Aerial Vehicle (UAV) di lingkungan dalam ruangan atau tanpa sinyal GPS menjadi tantangan krusial karena pendekatan konvensional seringkali tidak adaptif terhadap kondisi dinamis. Penelitian ini mengusulkan solusi menggunakan algoritma Soft Actor-Critic (SAC) dengan pendekatan Deep Reinforcement Learning. Model dilatih selama 350.000 langkah simulasi dalam lingkungan Unreal Engine dan AirSim, yang terdiri dari koridor dengan 15 ruangan dimana tekstur dinding dan posisi lubang diacak untuk meningkatkan kemampuan generalisasi. Pelatihan menggunakan fungsi reward yang mempertimbangkan penalti atas tabrakan dan penggunaan energi, serta mengevaluasi tiga jenis input sensor: DepthMap, Single-RGB, dan Multi-RGB. Meskipun nilai reward selama pelatihan cenderung negatif (-95 hingga -97) akibat desain penalti yang ketat, hasil pengujian menunjukkan tingkat keberhasilan navigasi yang sangat tinggi. Model dengan input DepthMap menunjukkan performa terbaik, mencapai tingkat keberhasilan 95% dengan waktu penyelesaian rata-rata 12.3 detik. Kinerja ini mengungguli model Multi-RGB (94% dengan 12.9 detik) dan Single-RGB (91% dengan 13.4 detik). Temuan ini membuktikan bahwa algoritma SAC efektif memandu UAV di lingkungan kompleks, namun perbaikan pada fungsi reward dan arsitektur tetap diperlukan untuk meningkatkan efisiensi energi dan generalisasi model. Penelitian ini menegaskan bahwa input DepthMap adalah data sensor paling andal untuk tugas navigasi serupa. Autonomous navigation for Unmanned Aerial Vehicles (UAVs) in indoor or GPS-denied environments presents a crucial challenge, as conventional approaches often lack adaptability to dynamic conditions. This study proposes a solution using the Soft Actor-Critic (SAC) algorithm with a Deep Reinforcement Learning approach. The model was trained for 350,000 simulation timesteps in an environment built with Unreal Engine and the AirSim plugin. This environment features a corridor with 15 enclosed rooms where wall textures and hole positions are randomized to improve the agent's generalization capabilities. The training process utilized a reward function that balanced penalties for collisions and energy consumption, while evaluating three sensor input types: DepthMap, Single-RGB, and Multi-RGB. Despite the training rewards being negative (ranging from -95 to -97) due to the strict penalty design, testing revealed a very high navigation success rate. The model using DepthMap input demonstrated the best performance, achieving a 95% success rate with an average completion time of 12.3 seconds. This surpassed the performance of the Multi-RGB model (94% success rate with a 12.9-second completion time) and the Single-RGB model (91% success rate with a 13.4-second completion time). These findings demonstrate that the SAC algorithm can effectively guide a UAV in complex environments. However, improvements to the reward function and network architecture are still needed to enhance energy efficiency and model generalization. This study confirms that DepthMap input is the most reliable sensor data for similar navigation tasks.
URI:	http://repository.ipb.ac.id/handle/123456789/165175
Appears in Collections:	UT - Mathematics

Files in This Item:

File	Description	Size	Format
cover_G5401201086_29ebf424e9a1440a8a80c55057f65082.pdf	Cover	1.89 MB	Adobe PDF	View/Open
fulltext_G5401201086_948aea1f0dd54b2e8b1870788712b292.pdf Restricted Access	Fulltext	6.1 MB	Adobe PDF	View/Open
lampiran_G5401201086_40f62f2e8fdf44d8bb8c4f292c726067.pdf Restricted Access	Lampiran	2.08 MB	Adobe PDF	View/Open

Show full item record Recommend this item

DSpace JSPUI

DSpace preserves and enables easy and open access to all types of digital content including text, images, moving images, mpegs and data sets