>>21355Thanks. You should have gotten my email by now, if the one you put in your email field is real.
It will probably be a bit until I get a working website up and running again due to privacy concerns with owning a clearnet site and not securing my hidden service. Although when I do, I'd be more than happy to have you test it.
I currently just get the title, description, and keywords and check for banned words in there while also accounting for leetspeak. I do this rather than searching the full website for any banned words, as mentioning anything in those meta fields is more intentional and thus might rule out some false positives.
In the future I hope to use something like pytorch-text to classify websites. This would also allow for further search filtering (like possible scam detection or categorizing hidden services), not just sorting out bad sites. I know Ahmia already does something like this, I use their blocklist and their indexed sites as a starting point, but they seem to have a few false positives.
5014f8ebd867a23f02209e61de4980a0