Noticed tests failing on https://github.com/SubstanceSearch, wanted to check why.
I found that in some instances, there are multiple entries for the same substance just with slightly different names. A potential fix for this could be to link a substance to any other entries via its main name or any of its aliases.
In others, discussion pages ended up in the final scrape (see https://en.wikipedia.org/wiki/Help:Talk_pages).
These 2 issues are the main causes of the tests failing in the main repo. A fallback could in theory be introduced to fix both of these issues but I wanted to let you know prior to anything else.
Example 1 (substance naming)
Entry under mushrooms with only TripSit information.

Entry under psilocybin mushrooms with only Psychonaut information.

Example 2 (discussion pages)
Entry under dextroamphetamine
