When I search for anything on Google or DuckDuckGo, more than half of the results are useless AI-generated articles.
Those articles are written to land at the top of the results, since search engines use algorithms to index and rank websites and pages.
If we manually curate “good” websites (newspapers, forums, encyclopedias, anything that can be considered a good source) and only index their contents, would it be possible to create a good ol’ fashioned search engine? Does something like that already exist?
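To make the idea concrete, here is a rough sketch of the filtering step only, assuming the curated list is just a set of domain names. The domains and the names `ALLOWED_DOMAINS`, `is_allowed` and `filter_results` are placeholders made up for illustration, and a real engine would still need crawling, indexing and ranking on top of this.

```python
from urllib.parse import urlsplit

# Hypothetical curated allowlist; these domains are placeholders only.
ALLOWED_DOMAINS = {"wikipedia.org", "example-newspaper.test", "example-forum.test"}


def is_allowed(url, allowed=ALLOWED_DOMAINS):
    """Return True if the URL's host is a curated domain or a subdomain of one."""
    host = (urlsplit(url).hostname or "").lower().rstrip(".")
    # Match the domain exactly, or as a suffix preceded by a dot, so that
    # "en.wikipedia.org" passes but "notwikipedia.org" does not.
    return any(host == d or host.endswith("." + d) for d in allowed)


def filter_results(urls, allowed=ALLOWED_DOMAINS):
    """Keep only the URLs whose host is on the curated list, preserving order."""
    return [u for u in urls if is_allowed(u, allowed)]
```

A crawler could call `is_allowed` before fetching or indexing a page, so nothing outside the curated list ever reaches the index.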
It looks like that list only focuses on AI images, which can be useful but is probably narrower than what OP is looking for.
It links to a related project that might be more relevant though.
https://github.com/NotaInutilis/Super-SEO-Spam-Suppressor
The blocklist is over 6 MB, though. Please be aware that I have not vetted the list yet and can't vouch for it.
Good call, thanks for the added link!