Jiayou Chao and Wei Zhu
Adv. Artif. Intell. Mach. Learn., 4 (1):1977-1990
1. Jiayou Chao: Stony Brook University
2. Wei Zhu: Stony Brook University
DOI: https://dx.doi.org/10.54364/AAIML.2024.41113
Article History: Received on: 10-Jan-24, Accepted on: 03-Mar-24, Published on: 10-Mar-24
Corresponding Author: Jiayou Chao
Email: jiayou.chao@stonybrook.edu
Citation: Jiayou Chao, Wei Zhu (2024). EFFICIENT MULTI-DOMAIN TEXT RECOGNITION DEEP NEURAL NETWORK PARAMETERIZATION WITH RESIDUAL ADAPTERS. Adv. Artif. Intell. Mach. Learn., 4 (1 ):1977-1990
Recent advancements in deep neural networks have markedly enhanced the performance of computer vision tasks, yet the specialized nature of these networks often necessitates extensive data and high computational power. Addressing these requirements, this study presents a novel neural network model adept at optical character recognition (OCR) across diverse domains, leveraging the strengths of multi-task learning to improve efficiency and generalization. The model is designed to achieve rapid adaptation to new domains, maintain a compact size conducive to reduced computational resource demand, ensure high accuracy, retain knowledge from previous learning experiences, and allow for domain-specific performance improvements without the need to retrain entirely. Rigorous evaluation on open datasets has validated the model's ability to significantly lower the number of trainable parameters without sacrificing performance, indicating its potential as a scalable and adaptable solution in the field of computer vision, particularly for applications in optical text recognition.