Skip to content

Conversation

@JonathanPLev
Copy link

Added a custom metric to detect which cloud genAI services a website accesses.


Test websites:

@github-actions
Copy link

https://almanac.httparchive.org/en/2022/

WPT result details

Changed custom metrics values:

{
  "_cloud_genAI_services": []
}
https://example.com

WPT result details

Changed custom metrics values:

{
  "_cloud_genAI_services": []
}
https://chatgpt.com

WPT result details

Changed custom metrics values:

{
  "_cloud_genAI_services": {
    "ChatGPT": [
      "https://chatgpt.com",
      "https://chatgpt.com/cdn-cgi/challenge-platform/h/b/scripts/jsd/07af8245c728/main.js?",
      "https://chatgpt.com/cdn-cgi/challenge-platform/h/b/jsd/r/0.037247701934645515:1750803236:y2LJkKXZw3oNGxvuUfA8A9jAEbhKunBf03ZFelqxKUM/954fe90e191dd6e7",
      "https://chatgpt.com/backend-api/edge",
      "https://chatgpt.com/ces/v1/projects/oai/settings",
      "https://ab.chatgpt.com/v1/rgstr?k=client-nb0qtYlZuy2tCMN5s5ncnuIBCJncjRViT0IzFm7GqST&st=javascript-client&sv=3.17.0&t=1750806752270&sid=08cd4074-4751-4a30-a2b5-f9a2764a9564&ec=102&gz=1",
      "https://chatgpt.com/backend-anon/me",
      "https://chatgpt.com/backend-anon/conversation/init",
      "https://chatgpt.com/backend-anon/prompt_library/?limit=8&use_v2=true&model_slug=auto",
      "https://chatgpt.com/backend-anon/system_hints",
      "https://chatgpt.com/backend-anon/user_is_in_search_holdout_with_web_disabled",
      "https://chatgpt.com/backend-anon/settings/voices",
      "https://chatgpt.com/backend-anon/sentinel/chat-requirements",
      "https://ab.chatgpt.com/v1/rgstr?k=client-nb0qtYlZuy2tCMN5s5ncnuIBCJncjRViT0IzFm7GqST&st=javascript-client&sv=3.17.0&t=1750806752707&sid=08cd4074-4751-4a30-a2b5-f9a2764a9564&ec=9&gz=1",
      "blob:https://chatgpt.com/edb4871b-5f57-4bcc-af61-d429473b64f7",
      "https://chatgpt.com/cdn-cgi/challenge-platform/scripts/jsd/main.js"
    ]
  }
}
https://www.yes4youth.co.za/4-business?hsLang=en

WPT result details

Changed custom metrics values:

{
  "_cloud_genAI_services": []
}

@max-ostapenko
Copy link
Contributor

@JonathanPLev the technology detection is tackled here https://github.com/HTTPArchive/wappalyzer

You'll be able to use the request URLs, as well as JS variables or DOM elements for detection.

Please move your detections there.

Unfortunately I'll have to close the PR in this repo.

@JonathanPLev
Copy link
Author

@max-ostapenko is adding my metrics to that repo the same process as for this one? I don't see information on how to add to that repo in that ReadME.

alternatively, would I be able to adjust my detection or add some sort of other analysis that would qualify the detection for this repo?

Thank you!

@max-ostapenko
Copy link
Contributor

@JonathanPLev here is a recent example of technology added: HTTPArchive/wappalyzer#136

I also notice that you use API endpoint URLs - these are usually used on the backend. There is also an argument against detecting such usage scenarios: #165 (comment)

There usually should be some JS library, or JS variable that help detect technology on the webpage. Please see the patterns supported for detection: https://github.com/HTTPArchive/wappalyzer?tab=readme-ov-file#patterns-optional

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants