Abstract
The training process for deep learning and pattern recognition normally involves the use of convex and strongly convex optimization algorithms such as AdaBelief and SAdam to handle lots of 'uninformative' samples that should be ignored, thus incurring extra calculations. To solve this open problem, we propose to design bandit sampling method to make these algorithms focus on 'informative' samples during training process. Our contribution is twofold: first, we propose a convex optimization algorithm with bandit sampling, termed AdaBeliefBS, and prove that it converges faster than its original version; second, we prove that bandit sampling works well for strongly convex algorithms, and propose a generalized SAdam, called SAdamBS, that converges faster than SAdam. Finally, we conduct a series of experiments on various benchmark datasets to verify the fast convergence rate of our proposed algorithms.
| Original language | English |
|---|---|
| Pages (from-to) | 565-577 |
| Number of pages | 13 |
| Journal | IEEE Transactions on Emerging Topics in Computational Intelligence |
| Volume | 7 |
| Issue number | 2 |
| DOIs | |
| State | Published - 1 Apr 2023 |
| Externally published | Yes |
Bibliographical note
Publisher Copyright:© 2017 IEEE.
Keywords
- Bandit sampling
- convex optimization
- image processing
- training algorithm
ASJC Scopus subject areas
- Computer Science Applications
- Control and Optimization
- Computational Mathematics
- Artificial Intelligence
Fingerprint
Dive into the research topics of 'Towards Faster Training Algorithms Exploiting Bandit Sampling From Convex to Strongly Convex Conditions'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver