CuRLA: Curriculum Learning Based Deep Reinforcement Learning for Autonomous Driving

📅 2025-01-09
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
To address the core challenges of poor generalization, insufficient safety guarantees, and opaque decision-making in deep reinforcement learning (DRL) agents for autonomous driving, this paper proposes a novel DRL framework integrating dual-level curriculum learning with safety-driven representation learning. Methodologically: (1) A progressive environmental difficulty escalation mechanism is designed in CARLA, jointly optimizing scenario complexity and multi-source reward balancing; (2) A novel joint modeling approach unifies variational autoencoder (VAE)-based implicit representation learning with proximal policy optimization (PPO) under safety constraints, enhancing both policy robustness and decision interpretability; (3) A dynamic collision penalty mechanism is introduced to enforce safe exploration. Experiments demonstrate that the proposed method reduces collision rates by 42% in unseen dynamic scenarios and improves cross-scenario task success rates by 31%, significantly advancing agent adaptability, safety, and generalization capability.

Technology Category

Application Category

📝 Abstract
In autonomous driving, traditional Computer Vision (CV) agents often struggle in unfamiliar situations due to biases in the training data. Deep Reinforcement Learning (DRL) agents address this by learning from experience and maximizing rewards, which helps them adapt to dynamic environments. However, ensuring their generalization remains challenging, especially with static training environments. Additionally, DRL models lack transparency, making it difficult to guarantee safety in all scenarios, particularly those not seen during training. To tackle these issues, we propose a method that combines DRL with Curriculum Learning for autonomous driving. Our approach uses a Proximal Policy Optimization (PPO) agent and a Variational Autoencoder (VAE) to learn safe driving in the CARLA simulator. The agent is trained using two-fold curriculum learning, progressively increasing environment difficulty and incorporating a collision penalty in the reward function to promote safety. This method improves the agent's adaptability and reliability in complex environments, and understand the nuances of balancing multiple reward components from different feedback signals in a single scalar reward function. Keywords: Computer Vision, Deep Reinforcement Learning, Variational Autoencoder, Proximal Policy Optimization, Curriculum Learning, Autonomous Driving.
Problem

Research questions and friction points this paper is trying to address.

Autonomous Driving
Deep Reinforcement Learning
Adaptability and Safety
Innovation

Methods, ideas, or system contributions that make the work stand out.

Deep Reinforcement Learning
Adaptive Driving Strategies
PPO Algorithm with VAE
🔎 Similar Papers
No similar papers found.
B
Bhargava Uppuluri
Birla Institute of Technology & Science, Pilani, Hyderabad Campus, Jawahar Nagar, Kapra Mandal, Medchal District - 500078, Telangana, India
A
Anjel Patel
Birla Institute of Technology & Science, Pilani, Hyderabad Campus, Jawahar Nagar, Kapra Mandal, Medchal District - 500078, Telangana, India
Neil Mehta
Neil Mehta
Asso Dean and Professor, Cleveland Clinic, Cleveland Clinic Lerner College of Medicine of Case
Education technologySocial Media in LearningElectronic Health recordseLearningArtificial Int
S
Sridhar Kamath
Birla Institute of Technology & Science, Pilani, Hyderabad Campus, Jawahar Nagar, Kapra Mandal, Medchal District - 500078, Telangana, India
Pratyush Chakraborty
Pratyush Chakraborty
Associate Professor, BITS Pilani, Hyderabad
Smart GridGame TheoryControl TheoryOptimizationSustainable Energy