ISSN :2582-9793

EFFICIENT MULTI-DOMAIN TEXT RECOGNITION DEEP NEURAL NETWORK PARAMETERIZATION WITH RESIDUAL ADAPTERS

Original Research (Published On: 10-Mar-2024 )
EFFICIENT MULTI-DOMAIN TEXT RECOGNITION DEEP NEURAL NETWORK PARAMETERIZATION WITH RESIDUAL ADAPTERS
DOI : https://dx.doi.org/10.54364/AAIML.2024.41113

Jiayou Chao and Wei Zhu

Adv. Artif. Intell. Mach. Learn., 4 (1):1977-1990

Jiayou Chao : Stony Brook University

Wei Zhu : Stony Brook University

Download PDF Here

DOI: https://dx.doi.org/10.54364/AAIML.2024.41113

Article History: Received on: 10-Jan-24, Accepted on: 03-Mar-24, Published on: 10-Mar-24

Corresponding Author: Jiayou Chao

Email: jiayou.chao@stonybrook.edu

Citation: Jiayou Chao, Wei Zhu (2024). EFFICIENT MULTI-DOMAIN TEXT RECOGNITION DEEP NEURAL NETWORK PARAMETERIZATION WITH RESIDUAL ADAPTERS. Adv. Artif. Intell. Mach. Learn., 4 (1 ):1977-1990

          

Abstract

    

Recent advancements in deep neural networks have markedly enhanced the performance of computer vision tasks, yet the specialized nature of these networks often necessitates extensive data and high computational power. Addressing these requirements, this study presents a novel neural network model adept at optical character recognition (OCR) across diverse domains, leveraging the strengths of multi-task learning to improve efficiency and generalization. The model is designed to achieve rapid adaptation to new domains, maintain a compact size conducive to reduced computational resource demand, ensure high accuracy, retain knowledge from previous learning experiences, and allow for domain-specific performance improvements without the need to retrain entirely. Rigorous evaluation on open datasets has validated the model's ability to significantly lower the number of trainable parameters without sacrificing performance, indicating its potential as a scalable and adaptable solution in the field of computer vision, particularly for applications in optical text recognition.

Statistics

   Article View: 526
   PDF Downloaded: 13