Structured Cooperative Reinforcement Learning With Time-Varying Composite Action Space | PERPUSTAKAAN UNIVERSITAS KATOLIK PARAHYANGAN

Pencarian berdasarkan :

Pencarian terakhir:

Text

Structured Cooperative Reinforcement Learning With Time-Varying Composite Action Space

Li, Wenhao - Nama Orang; Wang, Xiangfeng - Nama Orang; Jin, Bo - Nama Orang; Luo, Dijun - Nama Orang; Zha, Hongyuan - Nama Orang;

In recent years, reinforcement learning has achieved excellent results in low-dimensional static action spaces such as games and simple robotics. However, the action space is usually composite, composed of multiple sub-action with different functions, and time-varying for practical tasks. The existing sub-actions might be temporarily invalid due to the external environment, while unseen sub-actions can be added to the current system. To solve the robustness and transferability problems in time-varying composite action spaces, we propose a structured cooperative reinforcement learning algorithm based on the centralized critic and decentralized actor framework, called SCORE. We model the single-agent problem with composite action space as a fully cooperative partially observable stochastic game and further employ a graph attention network to capture the dependencies between heterogeneous sub-actions. To promote tighter cooperation between the decomposed heterogeneous agents, SCORE introduces a hierarchical variational autoencoder, which maps the heterogeneous sub-action space into a common latent action space. We also incorporate an implicit credit assignment structure into the SCORE to overcome the multi-agent credit assignment problem in the fully cooperative partially observable stochastic game. Performance experiments on the proof-of-concept task and precision agriculture task show that SCORE has significant advantages in robustness and transferability for time-varying composite action space.

Ketersediaan

Barcode		Tipe Koleksi	Nomor Panggil	Lokasi	Status
art145400	null	Artikel		Gdg9-Lt3	Tersedia namun tidak untuk dipinjamkan - No Loan

Informasi Detail

Judul Seri: IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE; Vol.44 No.11 Part 2 November 2022
No. Panggil: -
Penerbit: : .,
Deskripsi Fisik: p. 8618-8634
Bahasa: English
ISBN/ISSN: -
Klasifikasi: NONE
Tipe Isi: -
Tipe Media: -
Tipe Pembawa: -
Edisi: -
Subjek: COOPERATIVE MULTI-AGENT REINFORCEMENT LEARNING
COMPOSITE ACTION SPACE
TIME-VARYING ACTION SPACE
Info Detail Spesifik: DOI: 10.1109/TPAMI.2021.3102140
Pernyataan Tanggungjawab: Wenhao Li, Xiangfeng Wang, Bo Jin, Dijun Luo, Hongyuan Zha

Versi lain/terkait

Tidak tersedia versi lain

Lampiran Berkas

Tidak Ada Data

Komentar

Anda harus masuk sebelum memberikan komentar