-
-
Notifications
You must be signed in to change notification settings - Fork 44
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
False positives: "can use", "via ssh" #28
Comments
Yeah, adding |
True. My issue was more about whether there could be a way to detect these innocent legitimate two word messages. |
Yeah there isn't really one besides using the false positives list. You could create a PR to add them to the default false positives if you'd like: Line 4 in b5570db
|
It would take some work on your end, but you could process my comprehensive false positives list in a code generator, as follows:
If you're wondering, I generated it using a dictionary search of words and pairs of words, combined with my own additions. The downside is that my filter operates a bit differently (has some interesting heuristics), and doesn't require certain false positives to be explicitly included in its list. In these cases, you would still need to maintain your own false positive list and/or replicate the dictionary search. |
True! The downside here is that you would be including the entire list, when only a subset is relevant to goaway. A build step/code generator is more work, but could avoid wasting space in the compiled binary by filtering in advance.
I check in on this repository every once in a while, as it was and is a great source of inspiration for my profanity filters 😃 |
We could trim the file once as needed |
Of course, I could add these to the false positives list, but maybe there's a better, more general way to tackle these.
The text was updated successfully, but these errors were encountered: