Designing Optimizer for Deep Learning by Sudipta Mazumder
Material type: Text
Publication details: IIT Jodhpur, Department of Computer Science and Technology, 2023
Description: vii, 10p. HB
DDC classification: 006.31 M476D
Item type | Home library | Collection | Call number | Status | Date due | Barcode
---|---|---|---|---|---|---
Thesis | S. R. Ranganathan Learning Hub, Reference | Theses | 006.31 M476D | Not for loan | | TM00524
Optimizers are central to training convolutional networks: by adjusting the weights and the effective learning rate, they drive the loss down and improve accuracy while speeding up convergence. Among the many optimizers proposed over the years, SGD and Adam are the most widely used, with Adam often preferred because it mitigates the dying-gradient problem that SGD can suffer from. There is still room for improvement, however. This thesis introduces a new algorithm that aims to surpass Adam by computing angular gradients (cosine and tangent of the angle) between consecutive steps. The algorithm uses the gradients of the current step and the two preceding steps; supplying this extra information lets the optimizer make better-informed updates at faster convergence rates. We have evaluated this approach on benchmark datasets against other state-of-the-art optimizers and obtained superior results in almost every case.
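The angular idea described in the abstract can be sketched roughly as follows. This is a minimal illustration, assuming a tangent-based damping of the step size derived from the angle between two consecutive gradients; the function name and the exact update rule are assumptions for illustration, and the thesis's actual algorithm (which also uses a third, older gradient and presumably Adam-style moment terms) is not reproduced here:

```python
import numpy as np

def angular_sgd_step(w, g_t, g_prev, lr=0.01, eps=1e-8):
    """Hypothetical update step: scale the learning rate by the angle
    between the current gradient g_t and the previous gradient g_prev."""
    # cosine of the angle between consecutive gradient vectors
    cos = np.dot(g_t, g_prev) / (np.linalg.norm(g_t) * np.linalg.norm(g_prev) + eps)
    angle = np.arccos(np.clip(cos, -1.0, 1.0))
    # tangent-based damping: aligned gradients keep (almost) the full step,
    # while a sharp change of direction shrinks the step toward zero
    factor = 1.0 / (1.0 + abs(np.tan(angle / 2.0)))
    return w - lr * factor * g_t
```

The intuition is that when successive gradients point in nearly the same direction the optimizer can safely take a full step, whereas a large angle between them signals oscillation and warrants a smaller, more cautious step.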