Training Neural Networks by Lifted Proximal Operator Machines
We present the lifted proximal operator machine (LPOM) for training fully-connected feed-forward neural networks. LPOM represents each activation function as an equivalent proximal operator and adds these proximal operators to the network's objective function as penalties. The resulting problem is block multi-convex in all layer-wise weights and activations, which allows us to develop a new block coordinate descent (BCD) method with a convergence guarantee to solve it. Owing to this formulation and solution method, LPOM uses only the activation function itself and requires no gradient steps, so it avoids the gradient vanishing and exploding issues that often plague gradient-based methods. It can also handle general non-decreasing Lipschitz continuous activation functions. Additionally, LPOM is almost as memory-efficient as stochastic gradient descent, and its parameters are relatively easy to tune. We further implement and analyze parallel solutions of LPOM. We first propose a general asynchronous-parallel BCD method with a convergence guarantee, and then use it to solve LPOM, yielding an asynchronous-parallel LPOM. For higher speed, we also develop a synchronous-parallel LPOM. We validate the advantages of LPOM on various network architectures and datasets. We also apply the synchronous-parallel LPOM to autoencoder training and demonstrate its fast convergence and superior performance.
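As a rough illustration of the lifted, block multi-convex idea described in the abstract, the sketch below (NumPy; not from the paper) trains a one-hidden-layer ReLU network by keeping the hidden activations as free variables and alternating convex block updates, with no gradient steps. It relies only on the standard fact that ReLU(z) = argmin_{x >= 0} ½‖x − z‖², i.e. ReLU is a proximal operator (a projection onto the nonnegative orthant). The quadratic coupling penalty, the projected activation update, and the ridge terms `mu` and `lam` are simplifying assumptions made for this toy example; the actual LPOM objective uses activation-specific proximal-operator penalties.

```python
# Toy lifted / block coordinate descent training of a one-hidden-layer
# ReLU network. Illustrative only: NOT the exact LPOM objective.
# Key fact used: relu(z) = argmin_{x >= 0} 0.5 * ||x - z||^2,
# i.e. ReLU is the proximal operator (projection) onto {x >= 0}.

import numpy as np

rng = np.random.default_rng(0)

def relu(z):
    # ReLU as a proximal operator: projection onto the nonnegative orthant.
    return np.maximum(z, 0.0)

# Toy regression data: inputs X0 (d0 x m) and targets Y (d2 x m).
d0, d1, d2, m = 8, 16, 1, 200
X0 = rng.standard_normal((d0, m))
Y = np.sin(X0.sum(axis=0, keepdims=True))

# Layer weights and the "lifted" hidden activations X1 (free variables).
W1 = 0.1 * rng.standard_normal((d1, d0))
W2 = 0.1 * rng.standard_normal((d2, d1))
X1 = relu(W1 @ X0)

mu = 1.0    # penalty weight coupling X1 to W1 @ X0 (assumption)
lam = 1e-3  # small ridge term to keep least-squares solves well-posed

for it in range(200):
    # Block 1: update hidden activations X1 (W1, W2 fixed).
    # minimize_{X1 >= 0} 0.5*||Y - W2 X1||^2 + (mu/2)*||X1 - W1 X0||^2
    # Solve the unconstrained quadratic, then project onto {X1 >= 0}
    # (a simple projected heuristic; the exact LPOM subproblem differs).
    A = W2.T @ W2 + mu * np.eye(d1)
    B = W2.T @ Y + mu * (W1 @ X0)
    X1 = relu(np.linalg.solve(A, B))

    # Block 2: update W1 (X1 fixed) by ridge least squares.
    W1 = (X1 @ X0.T) @ np.linalg.inv(X0 @ X0.T + lam * np.eye(d0))

    # Block 3: update W2 (X1 fixed) by ridge least squares.
    W2 = (Y @ X1.T) @ np.linalg.inv(X1 @ X1.T + lam * np.eye(d1))

# Feed-forward prediction with the trained weights.
Y_hat = W2 @ relu(W1 @ X0)
print("train MSE:", float(np.mean((Y - Y_hat) ** 2)))
```

Each block update is a convex subproblem with a closed-form (or projected closed-form) solution, which is the property the abstract refers to as block multi-convexity.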
| Barcode | Collection Type | Call Number | Location | Status |
| --- | --- | --- | --- | --- |
| art143108 | null | Artikel | Gdg9-Lt3 | Available, but not for loan - No Loan |
No other versions available