Evaluating learning rate effects on long short-term memory for Indonesian sentiment classification

Serly Eldina, Tekad Matulatan, Novrizal Fattah Fahmitra

Abstract


Hyperparameter optimization is a crucial process for enhancing the performance of deep learning models, particularly in the context of Indonesian sentiment classification. This study examines the impact of varying learning rates on a long short-term memory (LSTM) architecture trained with the adaptive moment estimation (Adam) optimizer. The dataset comprises 9,295 Indonesian comments automatically labeled by the Indonesian Bidirectional Encoder Representations from Transformers (IndoBERT) model. Stratified k-fold cross-validation was employed to maintain class balance during training. Learning curves were analyzed to evaluate convergence and identify potential overfitting, while early stopping was applied when performance improvements became insignificant. The one-way analysis of variance (ANOVA) test (p-adj = 0.000575 < 0.05) revealed significant differences among the learning rate variations. Post-hoc analysis indicated the learning rates of 0.0001, 0.001, and 0.002 differ significantly from 0.02. Descriptive statistics showed that a learning rate of 0.001 was the most optimal, achieving the highest validation accuracy while maintaining a relatively low variance. Evaluation across two data categories demonstrated that lower learning rates (0.0001 and 0.002) achieved the best accuracy, 78.71% on in-domain data, whereas higher learning rates (0.01 and 0.02) performed better on cross-domain data with 36% accuracy. These findings highlight the crucial role of learning rate selection in determining model stability and generalization capability.

Keywords


adaptive moment estimation; early stopping; learning rate; long short-term memory; one-way analysis of variance; sentiment classification;

Full Text:

PDF


DOI: http://doi.org/10.12928/telkomnika.v24i2.27398

Refbacks

  • There are currently no refbacks.


Creative Commons License
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.

TELKOMNIKA Telecommunication, Computing, Electronics and Control
ISSN: 1693-6930, e-ISSN: 2302-9293
Universitas Ahmad Dahlan, 4th Campus
Jl. Ringroad Selatan, Kragilan, Tamanan, Banguntapan, Bantul, Yogyakarta, Indonesia 55191
Phone: +62 (274) 563515, 511830, 379418, 371120
Fax: +62 274 564604

View TELKOMNIKA Stats