[nudebird] add extractor support#9142

Open
NecRaul wants to merge 7949 commits into mikf:master from NecRaul:extractor/nudebird

NecRaul commented Feb 27, 2026

This PR adds extractor support for Nudebird along with documentation and unit tests. Closes #9118.

Changes

  • Add NudebirdExtractor base class
  • Add sub-extractors:
    • NudebirdPostExtractor
    • NudebirdCategoryExtractor
    • NudebirdTagExtractor
    • NudebirdPopularExtractor
    • NudebirdSearchExtractor
  • Register nudebird in extractor __init__.py
  • Add documentation entry in docs/supportedsites.md
  • Add unit tests for all supported URL types
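A rough, self-contained sketch of how the class layout listed above might fit together. It does not import gallery-dl: the base class here is a stand-in, and the domain `nudebird.example`, every URL pattern, and the `from_url`/`find` helpers are hypothetical placeholders rather than the code in this PR.

```python
import re


class NudebirdExtractor:
    """Stand-in base class; the real one would derive from gallery-dl's Extractor."""
    category = "nudebird"
    subcategory = ""
    # Hypothetical root; the actual domain is not stated in this description.
    root = r"(?:https?://)?(?:www\.)?nudebird\.example"
    pattern = ""

    def __init__(self, match):
        # Keep the captured URL parts (slug, tag name, search query, ...).
        self.groups = match.groups()

    @classmethod
    def from_url(cls, url):
        match = re.match(cls.pattern, url)
        return cls(match) if match else None


class NudebirdPostExtractor(NudebirdExtractor):
    subcategory = "post"
    pattern = NudebirdExtractor.root + r"/post/([^/?#]+)"  # assumed URL shape


class NudebirdCategoryExtractor(NudebirdExtractor):
    subcategory = "category"
    pattern = NudebirdExtractor.root + r"/category/([^/?#]+)"  # assumed


class NudebirdTagExtractor(NudebirdExtractor):
    subcategory = "tag"
    pattern = NudebirdExtractor.root + r"/tag/([^/?#]+)"  # assumed


class NudebirdPopularExtractor(NudebirdExtractor):
    subcategory = "popular"
    pattern = NudebirdExtractor.root + r"/popular/?$"  # assumed


class NudebirdSearchExtractor(NudebirdExtractor):
    subcategory = "search"
    pattern = NudebirdExtractor.root + r"/?\?s=([^&#]+)"  # assumed


EXTRACTORS = (
    NudebirdPostExtractor,
    NudebirdCategoryExtractor,
    NudebirdTagExtractor,
    NudebirdPopularExtractor,
    NudebirdSearchExtractor,
)


def find(url):
    """Return an instance of the first extractor whose pattern matches, else None."""
    for cls in EXTRACTORS:
        extr = cls.from_url(url)
        if extr is not None:
            return extr
    return None
```

The idea is that each sub-extractor only contributes a `subcategory` and a URL `pattern`, while shared behavior lives in the base class; the unit tests for "all supported URL types" would then amount to checking that each URL shape resolves to the expected class.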

Notes

There are two somewhat hacky code snippets here; see the code comments below.

mikf and others added 30 commits January 20, 2026 17:29
"Bato.to has shut down."

There are mirror sites, but they are unscrapeable
due to heavily obfuscated HTML and JS
- add 'callback' argument to _pagination()
- generalize 'author' metadata collection
* [tiktok] download best quality videos
* [tiktok] code formatting fix
* simplify sorting in '_extract_video_urls'
Support chapter and manga extractors with metadata extraction.
Remove unused variable, use ternary expressions, deduplicate strip calls.
fixes regression introduced in d57dc48
also prevents previous _mtime_meta entries from affecting new files
check for all (sub)domains and not only '.site.tld'
add workaround for (sub)album items without 'pid' field
AngeredBacterium and others added 28 commits February 21, 2026 13:17
and update mastodon extractor tests
keep memory-cached results throughout process runtime
instead of only per extractor run
- prevent test failures when running tests in any order
- remove database file at exit
- config_deprecated
- _cache
- _cache_memory
- _dump
a first attempt at detecting and filtering these "videos"
- allow selecting multiple strategies to try including their order
- implement 'web' strategy
- Add NudebirdExtractor base class
- Add various sub-extractors (post, category, tag, popular, search)
- Update __init__.py to include nudebird

Development

Successfully merging this pull request may close these issues.

Add support for "NudeBird"

10 participants