A Review of Document Classification Techniques Using Machine Learning and Deep Learning

Sanjay M. Pardhi; Sampada M. Margaj

doi:10.53032/tvcr/2025.v7n2.23

Authors

Sanjay M. Pardhi aDepartment of Computer Science, University of Mumbai, Vidyanagari, Mumbai, India Research Scholar, bDepartment of Computer Science, Kirti M. Doongursee College of Arts, Science & Commerce, Dadar, Mumbai, India
Sampada M. Margaj Kirti M. Doongursee College of Arts, Science & Commerce, Department of Computer Science, Mumbai, India

DOI:

https://doi.org/10.53032/tvcr/2025.v7n2.23

Keywords:

Document, Classification, Deep Learning, RNN, BERT

Abstract

The study shows different machine learning and natural language processing techniques are used to address fully automated text classification of extensive datasets. The research looks at multiple studies which employ probabilistic models with deep learning approaches and established machine learning methods to identify documents. The discussion evaluates target model advantages against disadvantages while exploring future development paths in order to resolve the need for highly accurate scalable classification systems. This research evaluates how transformer-based models recently developed will affect classification model outcomes.

References

Sebastiani, F. (2002). Machine learning in automated text categorization. ACM Computing Surveys (CSUR), 34(1), 1–47. https://doi.org/10.1145/505282.505283 DOI: https://doi.org/10.1145/505282.505283

Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A. N., Kaiser, Ł., Polosukhin, I. (2017). Attention is all you need. In Advances in Neural Information Processing Systems (NeurIPS) (pp. 5998–6008). Curran Associates, Inc.

Devlin, J., Chang, M. W., Lee, K., Toutanova, K. (2018). BERT: Pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:1810.04805. https://arxiv.org/abs/1810.04805

Minaee, S., Kalchbrenner, N., Cambria, E., Nikzad, N., Chenaghlu, M., Gao, J. (2021). Deep learning-based text classification: A comprehensive review. ACM Computing Surveys (CSUR), 54(3), 1–40. https://doi.org/10.1145/3439726 DOI: https://doi.org/10.1145/3439726

Baygin, M. (2018). Classification of text documents based on naïve Bayes using n-gram features. Journal of Information Science and Engineering, 34(4), 987–1002. DOI: https://doi.org/10.1109/IDAP.2018.8620853

Deshmukh, R., Patil, M., Bhosale, R. (2019). A document classification using NLP and recurrent neural network. International Journal of Computer Applications, 181(5), 23–29. https://doi.org/10.5120/ijca2019918562

Schwenk, H., Li, X. (2018). A corpus for multilingual document classification in eight languages. Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC).

Cheng, Y. (2019). Document classification based on convolutional neural network and hierarchical attention network. Neural Networks, 110, 56–64. https://doi.org/10.1016/j.neunet.2019.07.004 DOI: https://doi.org/10.1016/j.neunet.2019.07.004

Adhikari, A., Ram, A., Tang, R., Lin, J. (2019). DocBERT: BERT for document classification. Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing (EMNLP), 961–969. https://doi.org/10.18653/v1/D19-1094 DOI: https://doi.org/10.18653/v1/D19-1094

Huang, X., Paul, M. J. (2018). Examining temporality in document classification Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics (NAACL), 394–404. https://doi.org/10.18653/v1/N18-1036 DOI: https://doi.org/10.18653/v1/N18-1036

Aggarwal, C. C., Zhai, C. (2012). Mining text data. Springer. https://doi.org/10.1007/978-1-4614-3223-4 DOI: https://doi.org/10.1007/978-1-4614-3223-4

Mikolov, T., Sutskever, I., Chen, K., Corrado, G. S., Dean, J. (2013). Distributed representations of words and phrases and their compositionality. In Advances in Neural Information Processing Systems (NeurIPS) (pp. 3111–3119).

Howard, J., Ruder, S. (2018). Universal language model fine-tuning for text classification. In Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (ACL) (pp. 328–339). Association for Computational Linguistics. https://doi.org/10.18653/v1/P18-1031 DOI: https://doi.org/10.18653/v1/P18-1031

Raffel, C., Shazeer, N., Roberts, A., Lee, K., Narang, S., Matena, M., Zhou, Y., Li, W., Liu, P. J. (2020). Exploring the limits of transfer learning with a unified text-to-text transformer. Journal of Machine Learning Research, 21(140), 1–67.

Samek, W., Wiegand, T., Müller, K. R. (2019). Explainable artificial intelligence: Understanding, visualizing and interpreting deep learning models. arXiv preprint arXiv:1708.08296. https://arxiv.org/abs/1708.08296 DOI: https://doi.org/10.1007/978-3-030-28954-6_1

A Review of Document Classification Techniques Using Machine Learning and Deep Learning

Authors

DOI:

Keywords:

Abstract

References

Downloads

Published

How to Cite

Issue

Section

License

Indexing

Current Issue

Information

Keywords

People Reached Us!

Images

Impact Factor