Traffic light management using reinforcement learning methods

Can, Sultan Kübra

Traffic light management using reinforcement learning methods

dc.contributor.author	Can, Sultan Kübra
dc.contributor.department	AGÜ, Fen Bilimleri Enstitüsü, Elektrik ve Bilgisayar Mühendisliği Ana Bilim Dalı	en_US
dc.date.accessioned	2022-12-14T12:44:19Z
dc.date.available	2022-12-14T12:44:19Z
dc.date.issued	2022	en_US
dc.date.submitted	2022-06
dc.description.abstract	Traffic lights have been around since 19th century, and aims to ease the chaos happening in intersections. It’s recorded that, people spend hours in traffic leading degradations in human health and environment. Even though its main purpose is to reduce traffic congestion and decrease the number of accidents, most of the approaches cannot adapt very well to fast changing dynamics and growing demands of the intersections with modern world developments. Fixed-time approaches use predefined settings, and to maximize its success time slots are identified. Although there are successful attempts, they don’t answer today’s demands of traffic. To overcome this problem, adaptive controllers are developed, and detectors and sensors are added to systems to enable adoption and dynamism. Recently, reinforcement learning has shown its capability to learn the dynamics of complex environments such as urban traffic. Although it was studied in single junction systems, one of the problems was the lack of consistency with how the real world system works. Most of the systems assume the environment is fully observable or actions would be freely executed using simulators. This study aims to merge usefulness of reinforcement learning methods with real world constraints. The experiments conducted have shown that, with queue data obtained from sensors located at the beginning and at the end of the roads and limited action spaces it works very well and A2C is able to learn the dynamics of the environment while converging and stabilizes itself in a respectively short duration.	en_US
dc.description.abstract	Trafik ışıkları, 19. yüzyıldan bu yana aktif olarak kavşaklardaki karmaşıklığı ve düzensizliği azaltmak amacı ile faaliyet gösteriyorlar. Kaynaklara göre, insanlar trafikte saatler geçiriyor, ki bu da hem insan sağlığı hem de çevre bakımından bozulmalara sebep oluyor. Trafik ışıklarının görevi trafik sıkışıklığını ve kaza sayısını azaltmak olsa da, şu an çalışan çoğu sistem modern zamanın gelişmeleri ile artan isteklere ve hızlı değişen kavşak dinamiklerine uyum sağlayamıyor. Bunlardan biri olan sabit zamanlı sistemler, önceden tanımlanmış ayarları kullanıyorlar ve performansını daha da artırmak için zaman dilimleri tanımlanıyor. Başarılı girişimler ve düzeltmeler görülse de, bugünün ihtiyaçlarına cevap veremiyorlar. Daha sonra, sistemlere sensörler ve detektörler eklenerek daha akıllı, dinamik ve adaptif sistemler geliştirildi. Son çalışmalar ise, pekiştirmeli öğrenmenin ve özellikle pekiştirmeli derin öğrenmenin kavşaklar gibi karmaşık ortamların dinamiklerini öğrenebildiğini gösterdi. Tekli kavşaklarda buna yönelik çalışmalar olmasına rağmen, gerçek dünya ile tam olarak tutarlı olmadığı, simülatörler vasıtasıyla tüm ortamın görünür ve karar verilen aksiyonların sınırsız olabileceğinin varsayıldığı fark edildi. Bu çalışma, pekiştirmeli öğrenme yöntemlerinin başarısı ve sağladığı fayda ile gerçek dünyanın sınırlarını birleştirmeyi hedeflemektedir. Bu çalışmada yapılmış olan deneyler gösteriyor ki, her bir yolun başına ve sonuna yerleştirilmiş olan sensörler vasıtası ile elde edilen kuyruk değerleri ve kısıtlı aksiyonlar kullanılarak geliştirilen pekiştirmeli öğrenme yöntemleri iyi bir performans sergiliyor ve özellikle A2C yöntemi çevrenin dinamiklerini öğrenerek nispeten kısa sürede yakınsıyor ve stabil hale geliyor.	en_US
dc.description.tableofcontents	TABLE OF CONTENTS 1. INTRODUCTION.............................................................................................. 1 1.1 OFFLINE APPROACHES....................................................................................... 2 1.2 ADAPTIVE CONTROLLERS .................................................................................. 2 1.3 REINFORCEMENT LEARNING BASED APPROACHES............................................... 2 2. REINFORCEMENT LEARNING..................................................................... 4 2.1 ADVANTAGE ACTOR-CRITIC (A2C).................................................................... 6 2.2 TRUST REGION POLICY OPTIMIZATION (TRPO)................................................... 7 2.3 PROXIMAL POLICY OPTIMIZATION (PPO)............................................................ 7 3. METHODOLOGY............................................................................................. 9 3.1 TRAFFIC SIMULATION ........................................................................................ 9 3.2 STATE REPRESENTATION...................................................................................11 3.3 ACTION SPACE DEFINITION ...............................................................................12 3.4 REWARD FUNCTION DESIGN..............................................................................13 3.5 VEHICLE GENERATION METHOD........................................................................14 4. RESULTS..........................................................................................................16 4.1 EXPERIMENT DESIGN........................................................................................16 4.2 RESULTS ..........................................................................................................20 5. CONCLUSIONS AND FUTURE PROSPECTS...............................................23 5.1 CONCLUSIONS ..................................................................................................23 5.2 SOCIETAL IMPACT AND CONTRIBUTION TO GLOBAL SUSTAINABILITY ..................23 5.3 FUTURE PROSPECTS..........................................................................................24	en_US
dc.identifier.uri	https://hdl.handle.net/20.500.12573/1421
dc.language.iso	eng	en_US
dc.publisher	Abdullah Gül Üniversitesi, Fen Bilimleri Enstitüsü	en_US
dc.relation.publicationcategory	Tez	en_US
dc.rights	info:eu-repo/semantics/openAccess	en_US
dc.subject	Deep Reinforcement Learning	en_US
dc.subject	Urban Traffic Control	en_US
dc.title	Traffic light management using reinforcement learning methods	en_US
dc.title.alternative	Pekiştirmeli öğrenme yöntemi tabanlı trafik ışık yönetim sistemleri	en_US
dc.type	masterThesis	en_US

Files

Original bundle

Now showing 1 - 1 of 1

Name:: 747532.pdf
Size:: 2.17 MB
Format:: Adobe Portable Document Format
Description:: Yüksek Lisans Tezi

Download

License bundle

Now showing 1 - 1 of 1

Name:: license.txt
Size:: 1.44 KB
Format:: Item-specific license agreed upon to submission
Description:

Download

Collections

Elektrik ve Bilgisayar Mühendisliği Ana Bilim Dalı Tez Koleksiyonu