| PERPUSTAKAAN UNIVERSITAS KATOLIK PARAHYANGAN

Motivated by operations research applications, such as inventory control and real-time bidding, we consider undiscounted reinforcement learning in Markov decision processes under model uncertainty …

Ketersediaan1

Tambahkan ke dalam keranjang

Tampilkan Detail Sitasi

Inventory Balancing with Online Learning

Bookmark Share

Simchi-Levi, David Ma, Will Cheung, Wang Chi Wang, Xinshang

We study a general problem of allocating limited resources to heterogeneous customers over time under model uncertainty. Each type of customer can be serviced using different actions, each of which…

Ketersediaan1

Tambahkan ke dalam keranjang

Tampilkan Detail Sitasi

Hedging the Drift : Learning to Optimize Under Nonstationarity

Bookmark Share

Simchi-Levi, David Cheung, Wang Chi Zhu, Ruihao

We introduce data-driven decision-making algorithms that achieve state-of-the-art dynamic regret bounds for a collection of nonstationary stochastic bandit settings. These settings capture applicat…

Ketersediaan1

Tambahkan ke dalam keranjang

Tampilkan Detail Sitasi