Strengthening Gradient Descent by Sequential Motion Optimization for Deep Neural Networks
In this article, we explore the advantages of heuristic mechanisms and devise a new optimization framework named sequential motion optimization (SMO) to strengthen gradient-based methods. The key idea of SMO is inspired by a movement mechanism in a recent metaheuristic method called balancing composite motion optimization (BCMO). Specifically, SMO establishes a sequential motion chain of two gradient-guided individuals, a leader and a follower, to enhance the effectiveness of parameter updates in each iteration. A surrogate gradient model with low computational cost is theoretically established to estimate the gradient of the follower from that of the leader through the chain rule during training. Experimental results on training quality for both fully connected multilayer perceptrons (MLPs) and convolutional neural networks (CNNs), evaluated on three popular benchmark datasets (MNIST, Fashion-MNIST, and CIFAR-10), demonstrate the superior performance of the proposed framework compared with vanilla stochastic gradient descent (SGD) implemented via the backpropagation (BP) algorithm. Although this study introduces only vanilla gradient descent (GD) as the main gradient-guided factor in SMO for deep neural network (DNN) training, the framework has great potential to be combined with other gradient-based variants to improve their effectiveness and to solve other large-scale optimization problems in practice.
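To make the leader-follower mechanism concrete, the following is a minimal Python sketch of the idea as described in the abstract, applied to a toy quadratic loss rather than a real DNN. The specific update rules, the linear leader-to-follower coupling, and the names `lr` and `follow_coef` are illustrative assumptions based only on this abstract, not the authors' exact formulation.

```python
# Illustrative sketch of a sequential motion chain with a gradient-guided
# leader and follower. The coupling rule and hyperparameters below are
# assumptions for demonstration; see the paper for the actual SMO updates.
import numpy as np

def loss(w):
    # Toy quadratic objective standing in for a DNN training loss.
    return 0.5 * np.sum(w ** 2)

def grad(w):
    # Analytic gradient of the toy objective (backprop would supply this
    # for a real network).
    return w

rng = np.random.default_rng(0)
leader = rng.normal(size=5)    # gradient-guided individual
follower = rng.normal(size=5)  # individual moving relative to the leader
lr, follow_coef = 0.1, 0.5     # assumed step size and coupling coefficient

for step in range(100):
    g_leader = grad(leader)
    leader = leader - lr * g_leader  # ordinary GD step for the leader

    # Under an assumed linear coupling
    #   follower_new = follower + follow_coef * (leader - follower),
    # a chain-rule estimate lets the follower reuse the leader's gradient
    # (scaled by the coupling) instead of computing a second backward
    # pass -- the "surrogate gradient" idea at low computational cost.
    surrogate_g = follow_coef * g_leader
    follower = follower + follow_coef * (leader - follower) - lr * surrogate_g

    # Keep the better individual as the leader of the next iteration,
    # forming the sequential motion chain.
    if loss(follower) < loss(leader):
        leader, follower = follower, leader

print("final loss:", loss(leader))
```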
Barcode | Collection Type | Call Number | Location | Status
---|---|---|---|---
art146824 | null | Article | Gdg9-Lt3 | Available but not for loan - No Loan
No other versions available.