PERPUSTAKAAN UNIVERSITAS KATOLIK PARAHYANGAN

Fairness-Oriented Learning for Optimal Individualized Treatment Rules

There has recently been a surge in methodological development for optimal individualized treatment rule (ITR) estimation. The standard methods in the literature are designed to maximize the pot…

Availability: 1

Provably Efficient Reinforcement Learning with Linear Function Approximation

Jordan, Michael I.; Wang, Zhaoran; Yang, Zhuoran; Jin, Chi

Modern reinforcement learning (RL) is commonly applied to practical problems with an enormous number of states, where function approximation must be deployed to approximate either the value functio…

Availability: 1

Learning Zero-Sum Simultaneous-Move Markov Games Using Function Approximation…

Wang, Zhaoran; Chen, Yudong; Xie, Qiaomin; Yang, Zhuoran

We develop provably efficient reinforcement learning algorithms for two-player zero-sum finite-horizon Markov games with simultaneous moves. To incorporate function approximation, we consider a fam…

Availability: 1

A Two-Timescale Stochastic Algorithm Framework for Bilevel Optimization: Comp…

Hong, Mingyi; Wai, Hoi-To; Wang, Zhaoran; Yang, Zhuoran

This paper analyzes a two-timescale stochastic algorithm framework for bilevel optimization. Bilevel optimization is a class of problems which exhibits a two-level structure, and its goal is to min…

Availability: 1

Tensor Graphical Model: Non-Convex Optimization and Statistical Inference

Yang, Jian; Liu, Han; Wang, Zhaoran; Sun, Will Wei; Cheng, Guang; Lyu, Xiang

We consider the estimation and inference of graphical models that characterize the dependency structure of high-dimensional tensor-valued data. To facilitate the estimation of the precision matrix …

Availability: 1
