Architecture for Anubis on containerized environments and multiple backends #536
Replies: 3 comments 3 replies
-
My current solution is:
-
@joshuaganger I have one ConfigMap for rules I do not want to let anything get past:

```yaml
apiVersion: v1
kind: ConfigMap
metadata:
  name: anubis-config-general
  namespace: anubis
data:
  botPolicy.yaml: |
    bots:
      # Generic catchall rule
      - name: generic-browser
        remote_addresses: [0.0.0.0/0]
        action: CHALLENGE

    dnsbl: false

    # By default, send HTTP 200 back to clients that either get issued a challenge
    # or a denial. This seems weird, but this is load-bearing due to the fact that
    # the most aggressive scraper bots seem to really really want an HTTP 200 and
    # will stop sending requests once they get it.
    status_codes:
      CHALLENGE: 200
      DENY: 200
```

And one custom ConfigMap for the rules I am actually working with:

```yaml
apiVersion: v1
kind: ConfigMap
metadata:
  name: anubis-config-custom
  namespace: anubis
data:
  botPolicy.yaml: |
    ## Anubis has the ability to let you import snippets of configuration into the main
    ## configuration file. This allows you to break up your config into smaller parts
    ## that get logically assembled into one big file.
    ##
    ## Of note, a bot rule can either have inline bot configuration or import a
    ## bot config snippet. You cannot do both in a single bot rule.
    ##
    ## Import paths can either be prefixed with (data) to import from the common/shared
    ## rules in the data folder in the Anubis source tree or will point to absolute/relative
    ## paths in your filesystem. If you don't have access to the Anubis source tree, check
    ## /usr/share/docs/anubis/data or in the tarball you extracted Anubis from.
    bots:
      # Pathological bots to deny
      # This correlates to data/bots/ai-robots-txt.yaml in the source tree
      - import: (data)/bots/ai-robots-txt.yaml
      - import: (data)/bots/cloudflare-workers.yaml
      - import: (data)/bots/headless-browsers.yaml
      - import: (data)/bots/us-ai-scraper.yaml
      # Allow common "keeping the internet working" routes (well-known, favicon, robots.txt)
      - import: (data)/common/keep-internet-working.yaml
      # Generic catchall rule
      - name: generic-browser
        user_agent_regex: >-
          Mozilla|Opera
        action: CHALLENGE
      - name: jellyfin
        user_agent_regex: >-
          'Ktor client'|JellyfinMediaPlayer|AppleCoreMedia|Dart
        action: ALLOW
      - name: gitlab
        user_agent_regex: >-
          git|RenovateBot|Gitlab|gitlab-runner|gitlab-kas
        action: ALLOW
      - name: authentik
        user_agent_regex: >-
          goauthentik.io
        action: ALLOW
      # Punish any bot with "bot" in the user-agent string
      # This is known to have a high false-positive rate, use at your own risk
      - name: generic-bot-catchall
        user_agent_regex: (?i:bot|crawler)
        action: CHALLENGE
        challenge:
          difficulty: 16  # impossible
          report_as: 4    # lie to the operator
          algorithm: slow # intentionally waste CPU cycles and time

    dnsbl: false

    # By default, send HTTP 200 back to clients that either get issued a challenge
    # or a denial. This seems weird, but this is load-bearing due to the fact that
    # the most aggressive scraper bots seem to really really want an HTTP 200 and
    # will stop sending requests once they get it.
    status_codes:
      CHALLENGE: 200
      DENY: 200
```
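For context, a minimal sketch of how a policy ConfigMap like the ones above could be wired into an Anubis Deployment. The Deployment name, image tag, and backend Service URL here are assumptions for illustration; `BIND`, `TARGET`, and `POLICY_FNAME` are the environment variables Anubis reads, as I understand its docs:

```yaml
apiVersion: apps/v1
kind: Deployment
metadata:
  name: anubis-general          # hypothetical name
  namespace: anubis
spec:
  replicas: 1
  selector:
    matchLabels: { app: anubis-general }
  template:
    metadata:
      labels: { app: anubis-general }
    spec:
      containers:
        - name: anubis
          image: ghcr.io/techarohq/anubis:latest
          env:
            - name: BIND
              value: ":8080"                       # listen on TCP, not a unix socket
            - name: TARGET
              value: "http://backend.default.svc"  # hypothetical upstream Service
            - name: POLICY_FNAME
              value: /data/cfg/botPolicy.yaml
          volumeMounts:
            - name: policy
              mountPath: /data/cfg
              readOnly: true
      volumes:
        - name: policy
          configMap:
            name: anubis-config-general
```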
Beta Was this translation helpful? Give feedback.
-
I am using one instance of Anubis per SNI; the other rule set is used by my "per-SNI" Anubis instances.
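One way the per-SNI wiring could look (all names and hostnames here are assumptions, not from the thread): each hostname gets its own Anubis Service, and the Ingress rule for that host points at it.

```yaml
apiVersion: v1
kind: Service
metadata:
  name: anubis-gitlab            # hypothetical per-SNI instance
  namespace: anubis
spec:
  selector: { app: anubis-gitlab }
  ports:
    - port: 8080
      targetPort: 8080
---
apiVersion: networking.k8s.io/v1
kind: Ingress
metadata:
  name: gitlab
  namespace: anubis
spec:
  rules:
    - host: gitlab.example.com   # the SNI this instance serves
      http:
        paths:
          - path: /
            pathType: Prefix
            backend:
              service:
                name: anubis-gitlab
                port: { number: 8080 }
```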
-
Hello Anubis Community,
I'm looking to enable Anubis on a containerized Kubernetes cluster, but with some specific restrictions.
Here's a diagram of the target architecture.
Looking at the documentation, especially the suggested architecture for running alongside nginx, it recommends using unix sockets to send requests back to nginx for proper routing to the backend. This is not possible in containerized environments, where sharing sockets across pods is not an option.
Is this architecture achievable today with Anubis? If not, is something like this being discussed as a future improvement?
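As a sketch of the alternative being asked about (all names assumed, and this is not the officially documented layout): since Anubis can bind to a TCP port, nginx could hand traffic to an Anubis ClusterIP Service with a plain `proxy_pass` instead of a unix socket, and Anubis would then forward verified requests to its configured backend over HTTP.

```yaml
apiVersion: v1
kind: ConfigMap
metadata:
  name: nginx-conf               # hypothetical
  namespace: anubis
data:
  default.conf: |
    server {
      listen 80;
      server_name app.example.com;
      location / {
        # Hand traffic to the Anubis Service over TCP instead of a unix socket;
        # Anubis then proxies verified requests to its TARGET backend.
        proxy_pass http://anubis.anubis.svc.cluster.local:8080;
        proxy_set_header Host $host;
        proxy_set_header X-Real-IP $remote_addr;
      }
    }
```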