google-style search filtering
Posted: 29 Jun 2010 15:18
by "google style" I am referring to the way that google (and probably other search engines) ignore words like "the", "and", and "to".
So here is what I am proposing:
we remove words like "the", "and", and "to" from the search terms but add them (only the terms used in that particular search) to the search filter box and automatically apply them to the results. If one or more of these words occur in a row (for example "in the") then search normally. The end result should be the same, but with less spam.
I considered including the previous or next term (eg: "make the" or "the final") with quotation marks surrounding them in the filter box as well, but after watching the search monitor for a while I see that this would not work in many cases, since people already try to outsmart the spammers by re-ordering there search terms (and probably entering them, in the correct order, in quotations in the filters box)
Since spammers do not know if or where these terms are required they will have difficulty creating realistic spam results. If they do not include the correct terms, or do not include any, then the fake results will not even show up. If they try to include all of them in all results then they can be easily blocked with security filters.
So here is what I am proposing:
we remove words like "the", "and", and "to" from the search terms but add them (only the terms used in that particular search) to the search filter box and automatically apply them to the results. If one or more of these words occur in a row (for example "in the") then search normally. The end result should be the same, but with less spam.
I considered including the previous or next term (eg: "make the" or "the final") with quotation marks surrounding them in the filter box as well, but after watching the search monitor for a while I see that this would not work in many cases, since people already try to outsmart the spammers by re-ordering there search terms (and probably entering them, in the correct order, in quotations in the filters box)
Since spammers do not know if or where these terms are required they will have difficulty creating realistic spam results. If they do not include the correct terms, or do not include any, then the fake results will not even show up. If they try to include all of them in all results then they can be easily blocked with security filters.