Nonstandard constructions: syntactic calques

UD_Italian-Valico, a treebank of L2 Italian, treats syntactic calques (typically from the learners' L1s) similarly to how foreign material is treated under the [guidelines for code-switched analysis](https://universaldependencies.org/foreign.html#option-1-code-switched-analysis) (example from Valico [here](https://journals.openedition.org/ijcol/1007#:~:text=Example%2022)). We followed the same approach in UD_Swedish-SweLL, e.g.:

<img width="500" height="314" alt="example from SweLL" src="https://github.com/user-attachments/assets/184ac5c5-3a83-4e07-9e95-907286c84385" />

The problem (which hasn't occurred yet, but is bound to happen) is that "borrowing" guidelines from other languages might result in validation errors, as the categories and structures used don't match the language-specific guidelines.

A solution could be to mark syntactic calques with `Lang=CODE_OF_THE_CALQUED_LANG` in the MISC field of each of the tokens that make them up, but I'm afraid that could be misleading, as that that is currently reserved for actual foreign words. Alternatively, these cases could be assimilated to those mentioned in  #1178 (there is definitely an overlap!), but that would make the rationale for the chosen analysis less transparent.
Any thoughts?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Nonstandard constructions: syntactic calques #1181

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Nonstandard constructions: syntactic calques #1181

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions