Add support for processing language parameter in message entities #2466

pplulee · 2025-04-05T15:29:11Z

Description

Add support for processing code language parameter in <pre> tag under message entities

Describe your tests

How did you test your change?

Python version: 3.12

OS: Windows 11

Send a message containing a code block to the bot, then see the result of the parsed HTML code.

Checklist:

I added/edited example on new feature/change (if exists)
My changes won't break backward compatibility
I made changes both for sync and async

Badiboy · 2025-04-05T16:41:47Z

Looks fine. Did you perform tests of this update?

pplulee · 2025-04-06T02:34:38Z

I was unable to run some of the tests, but that seems to be the original problem, not caused by these updates.

For example, in test_telebot.py, I'm confused as to why empty data was used. This caused the tests to fail.

pplulee · 2025-04-06T04:12:00Z

I updated the test file with some optimisations. For the issues mentioned above, environment variables are used directly. All tests now pass.

Also, I added tests for the apply_html_entities function.

Badiboy · 2025-04-06T13:14:02Z

For example, in test_telebot.py, I'm confused as to why empty data was used.

You need to pass YOUR bot token there.

Badiboy · 2025-04-06T13:17:26Z

For the issues mentioned above, environment variables are used directly.

Not absolutely correct due to TOKEN var may not exist if nothing in environment, but better then ''...

pplulee · 2025-04-06T13:17:38Z

For example, in test_telebot.py, I'm confused as to why empty data was used.

You need to pass YOUR bot token there.

I saw environment variables for token, but some tests used them and some didn't, so I was confused. I updated to use the same environment variables for all tests so we don't have to manually edit the test files.

Badiboy · 2025-04-06T13:25:15Z

func is designed wrong.

    def func(upd_text, subst_type=None, url=None, user=None, custom_emoji_id=None, language=None):
        upd_text = upd_text.decode("utf-16-le")
        if subst_type == "text_mention":
            subst_type = "text_link"
            url = "tg://user?id={0}".format(user.id)
        elif subst_type == "mention":
            url = "https://t.me/{0}".format(upd_text[1:])
        upd_text = upd_text.replace("&", "&amp;").replace("<", "&lt;").replace(">", "&gt;")
        if not subst_type or not _subs.get(subst_type):
            return upd_text
        subs = _subs.get(subst_type)
        if subst_type == "custom_emoji":
            return subs.format(text=upd_text, custom_emoji_id=custom_emoji_id)
        elif (subst_type == "pre") and language:
                return upd_text='<code class="language-{0}">{1}</code>'.format(language, upd_text)
        return subs.format(text=upd_text, url=url)

Otherwise symbols like < and > will break the markdown.

pplulee · 2025-04-06T14:04:35Z

func is designed wrong.

    def func(upd_text, subst_type=None, url=None, user=None, custom_emoji_id=None, language=None):
        upd_text = upd_text.decode("utf-16-le")
        if subst_type == "text_mention":
            subst_type = "text_link"
            url = "tg://user?id={0}".format(user.id)
        elif subst_type == "mention":
            url = "https://t.me/{0}".format(upd_text[1:])
        upd_text = upd_text.replace("&", "&amp;").replace("<", "&lt;").replace(">", "&gt;")
        if not subst_type or not _subs.get(subst_type):
            return upd_text
        subs = _subs.get(subst_type)
        if subst_type == "custom_emoji":
            return subs.format(text=upd_text, custom_emoji_id=custom_emoji_id)
        elif (subst_type == "pre") and language:
                return upd_text='<code class="language-{0}">{1}</code>'.format(language, upd_text)
        return subs.format(text=upd_text, url=url)

Otherwise symbols like < and > will break the markdown.

Thanks for pointing out the problem. I've modified the code along the lines of your idea and added a test for special characters.

Badiboy · 2025-04-06T15:34:34Z

Thank you!

Copilot

Copilot reviewed 2 out of 2 changed files in this pull request and generated 1 comment.

Comments suppressed due to low confidence (1)

tests/test_telebot.py:47

Ensure that TOKEN is defined and imported in this test file to prevent an undefined variable error.

tb = telebot.TeleBot(TOKEN)

telebot/formatting.py

Badiboy · 2025-04-06T15:36:41Z

Copilot reviewed

@pplulee It was just for test )

Add support for processing language parameter in message entities

19bf138

Add test for HTML parsing

e270371

Fix special character conversion

d25dae8

Badiboy merged commit 7c3824e into eternnoir:master Apr 6, 2025
7 checks passed

Badiboy requested a review from Copilot April 6, 2025 15:35

Copilot AI reviewed Apr 6, 2025

View reviewed changes

telebot/formatting.py Show resolved Hide resolved

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add support for processing language parameter in message entities #2466

Add support for processing language parameter in message entities #2466

pplulee commented Apr 5, 2025

Uh oh!

Badiboy commented Apr 5, 2025

Uh oh!

pplulee commented Apr 6, 2025

Uh oh!

pplulee commented Apr 6, 2025

Uh oh!

Badiboy commented Apr 6, 2025

Uh oh!

Badiboy commented Apr 6, 2025

Uh oh!

pplulee commented Apr 6, 2025

Uh oh!

Badiboy commented Apr 6, 2025

Uh oh!

pplulee commented Apr 6, 2025

Uh oh!

Uh oh!

Badiboy commented Apr 6, 2025

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Badiboy commented Apr 6, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Add support for processing language parameter in message entities #2466

Add support for processing language parameter in message entities #2466

Conversation

pplulee commented Apr 5, 2025

Description

Describe your tests

Checklist:

Uh oh!

Badiboy commented Apr 5, 2025

Uh oh!

pplulee commented Apr 6, 2025

Uh oh!

pplulee commented Apr 6, 2025

Uh oh!

Badiboy commented Apr 6, 2025

Uh oh!

Badiboy commented Apr 6, 2025

Uh oh!

pplulee commented Apr 6, 2025

Uh oh!

Badiboy commented Apr 6, 2025

Uh oh!

pplulee commented Apr 6, 2025

Uh oh!

Uh oh!

Badiboy commented Apr 6, 2025

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Badiboy commented Apr 6, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants