Computing.Net > Forums > Web Development > Can SpamBots read images?

Computer Problems? Computing.Net has over 1,000,000 posts about all things technology related! Over 90% answered within 24 hours! Click here to get for your free account now!

Can SpamBots read images?

Reply to Message Icon

Name: doghead
Date: December 8, 2008 at 12:32:20 Pacific
OS: WinXP
CPU/Ram: P4 3.0
Manufacturer/Model: Hewlett-packard / DC5000 SFF
Comment:

I figure that the simplest way to put my email address on my new site is to just insert an image of my business card, which is a nicely designed card as well. My contacts are all on the card. Could any kind of spambot pick up the email address from an image??

Thanks very much.


Report Offensive Message For Removal

Sponsored Link
Ads by Google

Response Number 1
Name: Alphawave
Date: December 8, 2008 at 19:26:53 Pacific
Reply:

Yes they can. They use Optical Character Recognition (OCR).

Don't squat with your spurs on!


Report Offensive Follow Up For Removal

Response Number 2
Name: StuartS
Date: December 10, 2008 at 16:15:47 Pacific
Reply:

OCR cannot read a graphics image stored in memory. OCR can only read something once it is printed. Thats where the Optical bit comes in!

optical adj. 1 of sight; visual. 2 of or according to optics. 3 aiding sight.  optically adv.

All a Spamm bot will see is a bunch of miscellaneous data that it wont be able to resolve into readable characters.

What a spam bot might do is recognise it as an graphics file and download it on the of chance that it might contain an Email address but it would take human intervention to confirm it. Not impossible but very unlikely.

Stuart


Report Offensive Follow Up For Removal

Response Number 3
Name: doghead
Date: December 10, 2008 at 22:55:24 Pacific
Reply:

Thanks for those replies.

I was able to cut, paste, and insert a little javascript, which will apparently make the thing unspammable. Well.... I previewed it in my browser, and it looks like an other hypertext email address; and in the browser's 'page source,' the javascript was of course visible. So, is that unspammable? Can the bots read neither the javascript hypertext link [which is loud and clear to the human eye], nor the address in the javascript in 'page source' [which of course uses no @ symbol] ???

Thanks again.


Report Offensive Follow Up For Removal

Response Number 4
Name: StuartS
Date: December 11, 2008 at 02:04:23 Pacific
Reply:

Spam bots will be able to read the Java scrip, including the address of the page source. But if it does all it will get is a graphics file.

The data in a graphics file is meaningless until it is rendered by either a printer or a graphics processor. You need to look inside a graphics file and think how a spam bot could make any sense of that.

Stuart


Report Offensive Follow Up For Removal

Response Number 5
Name: doghead
Date: December 11, 2008 at 11:34:23 Pacific
Reply:

So, spambots simply read the page source, no more or less?


Report Offensive Follow Up For Removal

Related Posts

See More



Response Number 6
Name: Michael J (by mjdamato)
Date: December 12, 2008 at 15:02:58 Pacific
Reply:

I'll have to disagree with StuartS. There are OCR readers out there that spammers use which can decypher the text on a web page. That is the reason the captcha images used have become more and more complex.

Typically these are used to create accounts on sites built upon a common framework (such as PHPNuke) so they can then spam the forum.

I guess it would be possible to use this functionality to scan sites looking for email addresses, but I think the threat is low.

A JavaScript solution will work fine IF the email is properly obfruscated in the code. Something like this:

<html>
<head>

<script type="text/JavaScript">

function spamProof(uname, site, tld)
{
email = '';
email += uname + '@' + site + '.' + tld + '
';
document.write(email);
}
</script>

</head>

<body>

My email address is:
<script type="text/JavaScript">spamProof('me', 'here', 'com');</script>

</body>
</html>

Michael J


Report Offensive Follow Up For Removal

Response Number 7
Name: StuartS
Date: December 12, 2008 at 16:32:44 Pacific
Reply:


In that case the Spam bot downloads the image and renders it, something that could be done, but hardly worthwhile in the majority of cases.

Downloading and rendering a Captchar image is worthwhile because the Spam bot knows it is going to find something useful in it.

Downloading end rendering every image it finds on a web page is not going to be worthwhile as the vast majority of them will not contain any useful information.

Stuart


Report Offensive Follow Up For Removal

Response Number 8
Name: doghead
Date: December 13, 2008 at 08:55:38 Pacific
Reply:

Thanks for all those.

I found this little cut & paste javascript mailto:


<script language="JavaScript"><!--
var name = "MyName";
var domain = "MyDomain.com";
document.write('');
document.write(name + '@' + domain + '
');
// --></script>

However, I notice that it has an @ symbol, which would be visible in Page Source. Maybe the bots look for those.... ?

Anyway - you should bear in mind that this is like nuclear physics or jazz theory to me - I am now onto a fill-in form with recaptcha, although I am not finished yet. I still have to tell the form what to do with the data and plug in the recaptcha thing. I am doing this with DIY software.

So even the recaptcha method is crackable?
Yikes! I should have people submit DNA samples before they get my email address.


Report Offensive Follow Up For Removal
Reply to Message Icon






Post Locked

This post is quite old and has been locked from receiving new replies. Please create a new posting instead.


Go to Web Development Forum Home


Sponsored links

Ads by Google


Results for: Can SpamBots read images?

SSIs can't read/write subdirectory www.computing.net/answers/webdevel/ssis-cant-readwrite-subdirectory/2001.html

can't see images on my website!! www.computing.net/answers/webdevel/cant-see-images-on-my-website/1292.html

background images won't load www.computing.net/answers/webdevel/background-images-wont-load/272.html