Doable strategies against blog spam?
posted: May 20th, 2006 · by: Sven
My old Wordpress blog had the option to turn on “moderated comments”. That means that I’ve recieved an email notification about every comment asking me to “please moderate this comment”. The email contained a link to instantly approve and a link to reject the comment.
Sigh. My blog never has been linked to that extensively that I’d expected comment spam to become a problem in the first place. I’ve been wrong. Wordpress seems to allure blog spam like a light bulb flies on a midsummer evening. I recieved some hundred mail notifications per week, asking me to moderate cheap perfumes, pills and online bets (for the world cup, lately). Finally my email spam filter caught up and started to sort out the notifications from my own blog. Hum.
Having switched over to Typo I was curious about Typos anti-spam measurements and came up with some thoughts about possible “joint-forces” strategies against blog spam.
In the Typo admin area there’s some “blacklist” option in the admin interface, which claims to compare the users IP address to “local and remote blacklists”. The interface explains this as “Good defense against spam bots.”
Also you’re recommended to not “allow non-ajax comments” which means that comments will only accepted when send via Ajax (which most spammers don’t seem to be capable of).
Don’t know about the blacklist option yet. I think I’ll read up the code. But I wonder why there’s any need of the non-ajax comments option at all when the IP comparsion really does it’s job well.
While I’ve been thinking about this someone dropped an annoucement for a Rails <a href=”http://cuttingtheredtape.blogspot.com/2006/05/actsasclassifiable.html”>act_as_classifiable plugin to the RoR Mailinglist.
Hmmm. I wonder if there’s been any attemp to accomplish something like this:
- Manually confirmed Typo blogs build up a “trusted blogs net” featuring a distributed notification service about obvious spam requests. By obvious I mean things like requests to Wordpress or other missing files on a Typo blog or manually moderated comments. When there’s any such request on any Typo blog, it will be made visible to all other Typo installations by distributing it somehow (either through a central server or some kind of peer-to-peer approach).
- Each HTTP request will be analysed (i.e. “learned”) as a whole (like <a href=”http://www.homelandstupidity.us/software/bad-behavior/”>Bad Behaviour does it) through a Bayesian filter as “bad/spam” so that each blog acts as a user to the Bayesian knowledge of the Typo blog net as a whole.
- This knowledge will be published back to the blogs so that each blog can decide on additional measurements – like actually blocking the whole request even for GET with a 412 error (“you’re not allowed to read my blog.”) when a Request fails a local Baysian test.
This way the following conditions would trigger whole HTTP Requests to be marked/learned as “bad”.
- it comes from an IP address known as a spammer by a remote blacklist
- it has been moderated manually as spam
HTTP Requests that come from IP addresses that are used by clients who occasionally log in will be learned as “good”.
This approach could also be used to greatly enhance a comment moderation queue I think. Depending on the results of the Bayesian analysis the notification mails could include a marker in the subject line indicating that the system requires human feedback for this comment. Clicking on a given URL in the mail would learn/unlearn the comment as good/bad. Comments that have been recognized as “good” could immidiately be published.
Newly “learned” spam sources could be published to services like Akismet, SURBL or DNSBL.
Also there could be an additional interface to allow for complaints when your comment has been treated as “bad” by the system.
Another thought:
I don’t know enough about how the majority of spam bots work but I assume that they’re using some “fire and forget” strategy – by just issuing the POST request to several known “candidate” URLs.
If so, this would allow for a very basic additional measurement – simply add some kind of “please confirm your comment” page or (in case of Ajax being used) alert box. Someone who just “fires-and-forgets” would fail this handshake.
If not so, this on the other hand would allow to send the spammer to a honeypot which could make him wait half an hour or so to complete the request and additionally logging as much of information as possible. (I’ve read about 1&1 having been asked to provide their logs to a lawsuit against a spammer but can’t remember the details.)
Chris said June 16th, 2006 at 10:01 PM ¶
I just found this page ... which I found a nice overview.
Too Biased mentions that there's (been?) some kind of "Sophisticated Spam protection" in Typo 2.0 ... would probably be interesting to check this out?
jack said January 24th, 2011 at 03:25 PM ¶
UCVHOST has changed the face of web hosting industry in a major way, people were paying gold for peanuts (and it is still happening). cheap hosting has become synonym with UCVHOST, anybody and everybody who wants a reliable and affordable domain web hosting visits UCVHOST and gets either windows vps or Linux hosting from UCVHOST. UCVHOST sells cheap hosting WITHOUT hidden terms and conditions where as competition has huge MSA and SLA’s which are good enough to confuse a seasoned lawyer also. For clients by now Business with us for the value of windows vps became very critical piece of puzzle for their whole operation, uptime and performance became a huge concern.. However it came with a cost, dedicated servers proved to be at least 100 times expensive in comparison to any windows or Linux plans. Somewhere in the labs engineers were working on splicing raw power of a server into virtual instances, this technology was called as Virtualization also termed as or virtual private servers. Also UCVHOST comes handy when you are looking for remotely hosted and managed FOREX MetaTrader4 terminals. Our forex vps platform is all geared up in fight of pips, our platform support any number of expert advisory (EA) and along with an assure of 100% uptime. Our Virtual Forex Tradng Terminals are well equipped to help you in making money .
chat said March 31st, 2011 at 08:17 PM ¶
The following cleaned up the issue:
Dependencies.loadoncepaths -= Dependencies.loadoncepaths.select{|path| \ path =~ %r(^#{File.dirname(FILE)}) }
Okey oyunu said May 12th, 2011 at 04:25 PM ¶
Thanks a lot for this nice post. Tüm dünya artik okey oyunu oynuyor. Yillardir bir çok oyun programi olmasina ragmen, içlerinden en güzeli olarak nitelendirebilecegimiz tek bir site göze çarpmaktadir. Diger tüm okey oyunu programlarinin aksine ücretsiz olmasi ve 3 boyutlu olarak hizmet vermesi mükemmel bir gelismedir. Sizlerde www.okey-oyunu.com adresinden bu essiz okey oyununu indirebilirsiniz. Kullanimi çok basit ve Türkçe dil seçenegi ile kolaylikla oyuna baslayabilirsiniz. Ister kendi ülkenizden, isterseniz dünyanin tüm farkli bölgelerinden dilediginiz oyun odalarini seçerek, oyuna hemen baslayabilirsiniz. Okey oyunu oynamak için artik arkadas bile aramaniza gerek kalmadan, bilgisayarinizdan 100 binlerce üye ile online olarak okey oyununu oynamanin zevkine varabilirsiniz.