1. Check the default `RobotFileParser` 2. `is_not_crawlable(link)` + `can_fetch(crawler, link)` → `is_doable(link)`