Natural Language Processing in Low-Resource Languages: Progress and Prospects
Author(s): Ritul Phukan1, Monalisa Daimari2, Anupam Kharghoria3, Biman Basumatary3
Affiliation: 1,2,3Department of Computer Science and Engineering, Assam Down Town University, Guwahati, India
Page No: 4-8
Volume issue & Publishing Year: Volume 2, Issue 9, Sep 2025
Journal: International Journal of Advanced Multidisciplinary Application (IJAMA)
ISSN NO: 3048-9350
DOI: https://doi.org/10.5281/zenodo.17582873
Abstract:
Low-resource languages, i.e., languages with limited annotated corpora, lexicons, and digital resources, pose major challenges for modern natural language processing (NLP). Recent progress in transfer learning, multilingual pretraining, parameter-efficient adaptation, data augmentation, and community-driven dataset creation has substantially improved capabilities for many such languages, yet large performance gaps remain compared to high-resource languages. This article surveys the technical advances that enable NLP for low-resource languages, including unsupervised and weakly supervised methods, multilingual and massively multilingual models, few-shot and in-context learning with large language models, and adapter/LoRA-style parameter-efficient fine-tuning. We examine practical pipelines for tasks such as machine translation, speech recognition, OCR, and information extraction; describe prominent dataset and community projects; summarize typical evaluation strategies and their pitfalls; and outline promising research directions, including community data collection, privacy-preserving methods, on-device adaptation, and ethics-aware deployment. The review highlights approaches that balance performance, compute cost, and data efficiency, and recommends research and deployment practices to accelerate inclusive language technology.
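As a concrete illustration of the adapter/LoRA-style parameter-efficient fine-tuning the abstract mentions, the sketch below wraps a frozen linear layer with a trainable low-rank update in PyTorch. It is a minimal toy, not code from any surveyed system; the rank r=8, scaling alpha=16, and 768-dimensional projection are illustrative assumptions.

```python
import torch
import torch.nn as nn

class LoRALinear(nn.Module):
    """Frozen pretrained linear layer plus a trainable low-rank update:
    h = W0 x + (alpha / r) * B(A(x)), with only A and B receiving gradients."""

    def __init__(self, base: nn.Linear, r: int = 8, alpha: float = 16.0):
        super().__init__()
        self.base = base
        for p in self.base.parameters():
            p.requires_grad = False          # freeze pretrained weights W0
        self.lora_a = nn.Linear(base.in_features, r, bias=False)   # down-projection A
        self.lora_b = nn.Linear(r, base.out_features, bias=False)  # up-projection B
        nn.init.normal_(self.lora_a.weight, std=0.01)
        nn.init.zeros_(self.lora_b.weight)   # B = 0: the wrapped layer starts out
                                             # identical to the pretrained one
        self.scaling = alpha / r

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return self.base(x) + self.scaling * self.lora_b(self.lora_a(x))

# Toy usage: adapting one 768-dim projection trains ~12k parameters
# instead of ~590k for full fine-tuning of the same layer.
layer = LoRALinear(nn.Linear(768, 768), r=8)
x = torch.randn(2, 768)
print(layer(x).shape)                                                  # torch.Size([2, 768])
print(sum(p.numel() for p in layer.parameters() if p.requires_grad))   # 12288
```

In a full model, wrappers like this would typically replace the attention projections, leaving the multilingual backbone frozen so that each low-resource language needs only a small set of adapter weights.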
Keywords: Low-resource languages, transfer learning, multilingual pretraining, few-shot learning, LoRA / adapters, data augmentation, machine translation, speech datasets, Masakhane, Common Voice
References:
- [1] A. Conneau et al., "Unsupervised cross-lingual representation learning at scale," in Proc. ACL, 2020, pp. 8440–8451.
- [2] S. Ruder, I. Vulić, and A. Søgaard, "A survey of cross-lingual word embedding models," J. Artif. Intell. Res., vol. 65, pp. 569–631, 2019.
- [3] J. Tiedemann, "Parallel data, tools and interfaces in OPUS," in Proc. LREC, 2012, pp. 2214–2218.
- [4] W. Nekoto et al., "Participatory research for low-resourced machine translation: A case study in African languages," in Findings of ACL: EMNLP 2020, pp. 2144–2160.
- [5] K. Heffernan, A. Salesky, and A. Post, "Bitext mining using distant supervision for low-resource languages," in Proc. NAACL-HLT, 2021, pp. 3617–3629.
- [6] R. Ardila et al., "Common Voice: A massively-multilingual speech corpus," in Proc. LREC, 2020, pp. 4218–4226.