Accelerating Primal-Dual Methods for Regularized Markov Decision Processes | PERPUSTAKAAN UNIVERSITAS KATOLIK PARAHYANGAN

Text

Accelerating Primal-Dual Methods for Regularized Markov Decision Processes

Li, Haoya - Personal Name; Dhillon, Inderjit S. - Personal Name; Ying, Lexing - Personal Name; Yu, Hsiang-Fu - Personal Name;

Entropy regularized Markov decision processes have been widely used in reinforcement learning. This paper is concerned with the primal-dual formulation of the entropy regularized problems. Standard first-order methods suffer from slow convergence due to the lack of strict convexity and concavity. To address this issue, we first introduce a new quadratically convexified primal-dual formulation. The natural gradient ascent descent of the new formulation enjoys global convergence guarantee and exponential convergence rate. We also propose a new interpolating metric that further accelerates the convergence significantly. Numerical results are provided to demonstrate the performance of the proposed methods under multiple settings.

Availability

Barcode		Collection Type	Call Number	Location	Status
art148728	null	Article		Gdg9-Lt3	Available, not for loan

Detailed Information

Series Title: SIAM JOURNAL ON OPTIMIZATION; Vol. 34 No. 1 January-March 2024
Call Number: -
Publisher: -
Physical Description: p. 764-789
Language: English
ISBN/ISSN: -
Classification: NONE
Content Type: -
Media Type: -
Carrier Type: -
Edition: -
Subject: MARKOV DECISION PROCESS
REINFORCEMENT LEARNING
PRIMAL-DUAL METHOD
ENTROPY REGULARIZATION
Specific Detail Info: https://doi.org/10.1137/21M1468851
Statement of Responsibility: Haoya Li, Hsiang-Fu Yu, Lexing Ying, Inderjit S. Dhillon

Other/Related Versions

No other versions available

File Attachments

No data
