An Invasive Embedding Model in Favor of Low-Resource Languages Understanding

📌 If you use this work, please cite our paper as follows:

@article{tahery2025,
  author    = {Saedeh Tahery and Saeed Farzi},
  title     = {An Invasive Embedding Model in Favor of Low-Resource Languages Understanding},
  journal   = {ACM Transactions on Asian and Low-Resource Language Information Processing},
  year      = {2025},
  url       = {https://dl.acm.org/doi/10.1145/3771926}
}

ACM Author-Izer Service: Download Paper (PDF)

🗺️ Overview

Cross-lingual natural language understanding (NLU) tasks, such as intent detection (ID) and slot filling (SF), suffer from performance degradation due to language-specific information embedded in multilingual pre-trained models. This is especially problematic in data-scarce scenarios.
We propose an encoder-decoder model with adversarial learning that eliminates language-specific information while preserving semantic meaning to tackle this issue. Our approach enhances knowledge transferability across languages, leading to better zero-shot cross-lingual performance.

🏋️‍♂️ Training and Data Utilization

The training process consists of a strategic adversarial learning phase, where three key components interact dynamically:

Generator → Creates language-independent contextual representations.
Discriminator → Evaluates representations to detect language identity.
Decoder → Reconstructs the original input, ensuring semantic preservation.

📊 Main Results

Our model demonstrates strong zero-shot performance across multiple languages on Facebook-multilingual (XTOD) and Persian-ATIS datasets:

Language	ID Accuracy	SF F1-Score
Spanish	94.15	70.44
Thai	82.61	17.60
Persian	86.45	59.60

Name		Name	Last commit message	Last commit date
Latest commit History 14 Commits
data		data
README.md		README.md
main_code.ipynb		main_code.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

An Invasive Embedding Model in Favor of Low-Resource Languages Understanding

🗺️ Overview

🏋️‍♂️ Training and Data Utilization

📊 Main Results

About

Uh oh!

Releases

Packages

Languages

saedeht/Cross-Lingual-NLU_Invasive-Embedding-Model

Folders and files

Latest commit

History

Repository files navigation

An Invasive Embedding Model in Favor of Low-Resource Languages Understanding

🗺️ Overview

🏋️‍♂️ Training and Data Utilization

📊 Main Results

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages