Home > Published Issues > 2026 > Volume 17, No. 2, 2026 >
JAIT 2026 Vol.17(2): 311-332
doi: 10.12720/jait.17.2.311-332

A Sentiment and Context-Aware Machine Translation Framework for Polite English-to-Tamil Conversion of Offensive Language with Cultural Sensitivity Consideration

Rama S 1,* and R. Mythili 2
1. Department of Computer Science and Engineering, SRM Institute of Science and Technology, Ramapuram, Chennai, India
2. Department of Information Technology, SRM Institute of Science and Technology, Ramapuram, Chennai, India
Email: rama.sugavanam@gmail.com (R.S.); mythilir2@srmist.edu.in (R.M.)
*Corresponding author

Manuscript received August 8, 2025; revised September 23, 2025; accepted October 9, 2025; published February 10, 2026.

Abstract—Neural Machine Translation (NMT) involves the automated conversion of text from one language into another, aiming for linguistic accuracy and fluency. The domain of machine-based text translation and sentence matching undergoes rapid transformation due to swift developments in deep learning methods. However, training NMT systems demand extensive parallel datasets. For languages with scarce data resources, producing high-quality translations remain a challenge, often resulting in less reliable outputs. Here, a politeness and sentiment-aware English-Tamil NMT module has been implemented. Initially, English text is gathered from the data sources and undergoes multitask classification with the developed Robustly Optimized Bidirectional Encoder Representations from Transformers Pretraining Approach-Residual Long Short-Term Memory (RoBERTa-ResLSTM) network, and the module finds the sentiment class and offensive class from the text. This module maps the English words into the compact word embedding space, and ResLSTM helps in finding the contextual semantics of the word. The resulting annotations related to the sentiment class and offensive class from the RoBERTa-ResLSTM are sent over to the politeness-controlled transfer Model. This module is built with the Multi-head Cross Attention-based mT5 (MC-AmT5) network. For better cross-handling of large multiple-language texts and sentence sequences, the proposed MC-AmT5 undergo hyperparameter tuning with the newly developed Modified Motion Vector-based Tornado optimizer with Coriolis force (MMV-MToC). The outcome from the politeness-controlled transfer model translated the English text into an equivalent Tamil text with better matching accuracy. For showcasing the effectiveness of the module, the model is experimented on standard datasets and compared against similar models.
 
Keywords—sentiment and context-aware machine translation, Robustly Optimized Bidirectional Encoder Representations from Transformers Pretraining Approach (RoBERTa), Residual Long Short-Term Memory (ResLSTM), Multi-head Cross Attention-based mT5 network (MC-AmT5), Modified Motion Vector-based Tornado optimizer with Coriolis force (MMV-MToC)

Cite: Rama S and R. Mythili, "A Sentiment and Context-Aware Machine Translation Framework for Polite English-to-Tamil Conversion of Offensive Language with Cultural Sensitivity Consideration," Journal of Advances in Information Technology, Vol. 17, No. 2, pp. 311-332, 2026. doi: 10.12720/jait.17.2.311-332

Copyright © 2026 by the authors. This is an open access article distributed under the Creative Commons Attribution License which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited (CC BY 4.0).

Article Metrics in Dimensions