- NEW: Added LLM-based domain classification with openai_cat() and claude_cat() functions
- Support for OpenAI GPT models and Anthropic Claude models for domain categorization
- Flexible custom category schemas - users can specify their own categories or use defaults
- Consistent API design matching existing *_cat() functions for seamless integration
- Built-in rate limiting and error handling for API calls
- REMOVED: BrightCloud support due to service unavailability
- Updated documentation URLs from HTTP to HTTPS where applicable
- Fixed Shallalist references to reflect service discontinuation
- Shallalist stopped its service, so downloaded the latest Shalla DB and changed the URL from which we fetch the Shallalist file
- URL fixes. In resubmission now because the site from which data was downloaded went down, which broke some tests
- R package supporting headless browsing has been abandoned. So removing trusted_cat. Sigh.
- Function for checking if a domain is a university domain using https://github.com/Hipo/university-domains-list
- Changes due to move to a new repo.
- Basic BrightCloud function added
- Adds not_news classifier that classifies not news based on published work.
- Passes expect_lint_free
- Shallalist and DMOZ data read in with stringsAsFactors set to FALSE.
- Swapped the DMOZ data to domain level category data, included English translations of non-English categories, quote protection of multiple categories.
- Accounting for changes in RSelenium: startServer(), for instance, is deprecated. But currently only allows passing of log for trusted_cat.
- Fixed bug in shalla_cat when multiple domain names are passed as arguments
- Fixed small issue with adult_ml1_cat() whose returned data.frame had a column that was a named list. The column is now a vector.
- If an unknown domain is passed to virustotal, it will return an empty data.frame rather than throw an error.
- Initial release