Improve language recognition for Quarto Projects using R or Python on GitHub #9810
-
DescriptionWhen using Quarto with R or Python, GitHub does not correctly recognize the primary languages of the repository. Instead, it primarily identifies the repository as TeX or HTML, depending on the output format of the rendered files. This misclassification affects the visibility and discoverability of Quarto projects that utilize R or Python. Steps to reproduce
Expected behaviourGitHub only recognizes and displays TeX or HTML as the corresponding languages in the repository, ignoring the R or Python code. Proposed solutionThis is ultimately a GitHub issue, but I believe that a request from the Quarto development team would be more likely prioritized than a request from a lambda Quarto user. I suggest that a lead developer from the Quarto team bring up this issue to the GitHub development team. |
Beta Was this translation helpful? Give feedback.
Replies: 2 comments 10 replies
-
That's a GitHub issue/feature. You can configure the automatic recognition operated by GitHub. |
Beta Was this translation helpful? Give feedback.
-
Hello @mcanouil and many thanks for the feedback.
In these 3 example: the github projects contain several long quarto files that are the core of it, plus theur rendered webpages (html, js, css) and in these 3 cases, quarto is never identified by github, and the rendered content is recognized very inconsistently (jss and css not identified in 1., html not identified in 2., Tex hallucinated in 3). |
Beta Was this translation helpful? Give feedback.
That's beyond Quarto.
"qmd" file are detected as markdown.
Linguist does not detect language in markdown code blocks because that's markdown.
That's the same thing with Rmarkdown files.
If you use Python code using reticulate/R in an R script, Python code won't be recognised as Python.
Look at what linguist does and how it works.
Override parameters as you wish and as described in the repository.
No matter who open an issue, it's very unlikely Markdown will be considered as a coding language in GitHub statistics and that code within code will ever be detected by linguist.
Note that the "languages" section on GitHub does not count "markdown" as a coding language (qmd and rmd are markdown).