ICANN Email Archives: [gnso-ff-pdp-may08]

[gnso-ff-pdp-may08] Whois through DNS - The Details

To: "gnso-ff-pdp-May08@xxxxxxxxx" <gnso-ff-pdp-May08@xxxxxxxxx>
Subject: [gnso-ff-pdp-may08] Whois through DNS - The Details
From: Marc Perkel <marc@xxxxxxxxxx>
Date: Fri, 18 Jul 2008 11:01:33 -0700


Continuing with my line of thinking .....

WHY HAVE A DNS VERSION FOR WHOIS

The reason to have a DNS-based version of Whois is for high-speed, distributed access to whois data for real-time queries of whois information. As someone in the spam filtering business I'm looking at it from that perspective. There are other perspectives as well and others should join in.

What I do is front-end spam filtering. The customers point their MX to our servers, we filter the mail, and forward the good email on to the customer's existing server. This is done in real time so that the email users don't see any noticeable delays.

In the spam filtering world we use DNS to post all kinds of data, much of which is unrelated to the original purpose of DNS. We are essentially using DNS as a high-speed database and we, in the spam filtering world, have very solid and widely used tools for doing this. So I'm talking about using existing and well-accepted technology. No new or untested technology is being suggested here.

The existing Whois infrastructure isn't suitable to handle the speed or load levels required. DNS, however, is suitable, and it is an established method used in the spam filtering industry. Thus the idea that I'm suggesting here is to take parts of the public information that already exists in whois and make that same information available to the world through a different, high-speed protocol.


WHAT WHOIS INFORMATION NEEDS TO BE AVAILABLE THROUGH DNS?

Generally the registrant is of little use in detecting spam. However, the registrant might be useful in detecting good email. Many spam filtering companies don't focus only on actively detecting spam; many of us also focus on detecting good email to avoid false positives. It is more important not to block good email than it is to block junk email. So in the spam filtering world we might create a registrant reputation database that would be useful in classifying good registrants.

As to fast flux, if the registrars were to publish DNS nameserver change information through DNS, that could be useful. This is not part of the current Whois protocol, but it is public information in that if I were to monitor all domains I could construct this information myself, although it would be massively inefficient. So this wouldn't reveal anything that wasn't already technically public.

I would also be interested in the age of the domain, or alternatively the starting date, or perhaps the expiration date. Much fraud is done by new domains, which I might subject to a higher level of scrutiny. However, if the domain is paid up several years in advance then that indicates permanence, which can be used as a whitelisting rule.
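
A minimal Python sketch of such an age rule follows. The function name, the 30-day and two-year thresholds, and the score values are all illustrative assumptions, not values taken from any real filter.

```python
from datetime import date


def age_score(created: date, expires: date, today: date) -> int:
    """Return a score adjustment: positive means more scrutiny, negative means more trust."""
    score = 0
    # Assumed threshold: a domain younger than 30 days gets extra scrutiny.
    if (today - created).days < 30:
        score += 2
    # Assumed threshold: paid up two or more years ahead suggests permanence.
    if (expires - today).days >= 2 * 365:
        score -= 1
    return score
```

A real filter would combine this adjustment with many other signals rather than act on it alone.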

Another key piece of information would be the registrar of the domain. There may be some registrars that are very exclusive and very expensive that spammers would never use (with the exception of free mail domains like Google, Yahoo, Hotmail). But more importantly - if I know the registrar and I detect an issue then I know where to report the problem. For example, if someone impersonating Wells Fargo Bank registers wellsfargo.cn and sends fraud spam, and I know that they registered with godaddy.com, then I could send an automated email to the registrar saying that a domain under their control is being used for fraud.

Additionally, the email address of the technical contact is also useful for reporting problems.
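
To make the list of fields above concrete, here is a sketch of how a filter might read them if they were published as a single DNS TXT record. No such record format exists today; the `key=value; key=value` layout and the field names `registrar`, `created`, `expires` and `tech` are purely hypothetical, chosen only for illustration.

```python
def parse_whois_txt(txt: str) -> dict[str, str]:
    """Parse a hypothetical 'key=value; key=value' whois TXT record into a dict."""
    fields: dict[str, str] = {}
    for part in txt.split(";"):
        part = part.strip()
        # Skip empty or malformed pieces instead of failing the whole lookup.
        if not part or "=" not in part:
            continue
        key, value = part.split("=", 1)
        fields[key.strip().lower()] = value.strip()
    return fields


# Hypothetical record content, not real whois data.
record = "registrar=registrar.example; created=2008-07-01; expires=2009-07-01; tech=hostmaster@domain.example"
info = parse_whois_txt(record)
```

The point of the sketch is only that a handful of short fields fits comfortably in the kind of DNS answer that spam filters already consume.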

I believe much spam, fraud, and abuse can be stopped through fast, automated reporting of problems. If this information were published through DNS then we in the spam filtering community could work with the registrars to quickly report and shut down domains being used for fraud.
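
As an illustration of automated reporting, the sketch below builds (but does not send) a report message with Python's standard `email.message.EmailMessage`. The function name, the addresses and the wording of the report are assumptions for the example; how a registrar would actually want to receive reports is not specified here.

```python
from email.message import EmailMessage


def build_abuse_report(domain: str, contact: str, sender: str, evidence: str) -> EmailMessage:
    """Build a plain-text fraud report addressed to a registrar or technical contact."""
    msg = EmailMessage()
    msg["From"] = sender
    msg["To"] = contact
    msg["Subject"] = f"Fraud report for domain {domain}"
    msg.set_content(
        f"The domain {domain}, registered through you, appears in fraud spam.\n\n"
        f"Evidence:\n{evidence}\n"
    )
    return msg


# Placeholder addresses and evidence, for illustration only.
report = build_abuse_report(
    "bad-domain.example",
    "abuse@registrar.example",
    "reports@filter.example",
    "Phishing link seen in 120 messages within 5 minutes.",
)
```

Sending would be a separate step (for example with `smtplib`), kept out of the sketch so it can be run without a mail server.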

WHY AN INFORMATION BASED SOLUTION IS BETTER THAN A RESTRICTION BASED SOLUTION

The quick answer is response speed and precision. Policy is slow and imprecise. I can't think of any way through policy that we can distinguish good fast flux from bad fast flux. If we restrict free speech then it might take years to undo the damage. And those in the fraud world will just change methods and move on.

In the spam filtering world, if I see a new scam I might be able to write a new rule in minutes and block that scam. If my rule also takes out some free speech I can modify my rule to fix that quickly. The more information I have to work with, the more accurately I can distinguish between free speech and fraud. Thus the free speech is passed and the fraud is blocked and reported.


THE NATURE OF SPAM AND FRAUD

A little education about spam and fraud. In order for a fraud to work there has to be a plan that includes advertising the scam to victims, getting victims to respond, and getting the victims' money. If any part of the process is disrupted then that scam fails and we win.

Spam always wants you to do something. They want you to click on this link. They want you to reply to an email address. They want you to call some phone number. And one of the easiest ways of detecting spam and fraud is to focus on what the message wants you to do.

Most spam wants you to click on a link and go to a web site. These links either have an IP address or a domain name as part of the link. IP addresses are more easily shut down. However, a domain name using fast flux is not.

When I get an email I scan it for links to domains that are blacklisted in URIBL lists. These lists use DNS as a database (as I am suggesting here) to list domains that are web sites used by spammers to get your money and defraud you. These lists are built cooperatively by spam filtering companies coming together and building them. Thus if we see a domain name being used for fraud spam then we can blacklist it and stop all email that links to that domain. This disrupts one part of the process and makes the fast fluxing useless.
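
The mechanics of such a lookup can be sketched in a few lines of Python. The example extracts the hosts from links in a message body and builds the DNS name that would be queried. The zone `uribl.example` is a placeholder, not a real list, and the sketch stops short of the actual DNS query so it can be run offline. Real lists usually expect the registered domain rather than the full hostname, and conventionally answer with an address in 127.0.0.0/8 when a name is listed; neither detail is handled here.

```python
import re
from urllib.parse import urlsplit

# Simple pattern for http/https links; real filters use more thorough extraction.
URL_RE = re.compile(r"https?://[^\s<>\"']+", re.IGNORECASE)


def link_hosts(body: str) -> set[str]:
    """Return the lowercased hostnames of all links found in a message body."""
    hosts: set[str] = set()
    for url in URL_RE.findall(body):
        try:
            host = urlsplit(url).hostname
        except ValueError:
            continue  # malformed URL, ignore it
        if host:
            hosts.add(host.lower().rstrip("."))
    return hosts


def uribl_query_name(host: str, zone: str = "uribl.example") -> str:
    """Build the DNS name to look up for a host in a URIBL-style zone."""
    return f"{host.rstrip('.')}.{zone}"


body = "Click http://Bad-Domain.example/login now or https://ok.example/x"
queries = sorted(uribl_query_name(h) for h in link_hosts(body))
```

Each name in `queries` would then be resolved with an ordinary DNS A-record lookup, which is what makes this approach fast enough for real-time filtering.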

If we can detect that a domain that isn't yet listed is fast fluxing, we can determine whether the domain should be listed, taking into account whois information and combining it with other information we know. This will allow us in the real-time world to respond faster and stop fraud using the information provided. And the faster we can make an accurate determination, the faster we can stop the fraud.


DETECTING SPAM BY BEHAVIOR

Most of the spam that I block is based on the behavior of the spammer rather than the content of the message. There are tricks that only spammers do, and if that trick can be detected it can be blocked based on the spammer using the trick. Sometimes it's a combination of factors, where if the message is doing A, B and C then only spammers do that. Thus fast fluxing in itself might not be spam. But fast fluxing AND wanting you to give up a password would be.

I have some interesting tricks to detect spambots and I can detect spambots with near 100% accuracy on the first attempt to send spam. One thing I do is post fake high numbered MX records pointing to IP addresses on the same machine that hosts the lowest numbered MX record. I also have a middle ring of fallback servers, so in theory these high MX records should never see traffic. However, spammers often try to go in the back door, thinking the backup servers have less spam filtering than the main server. So they try the high numbered MX records first. Thus hosts hitting the high MX records are noted and I return a 451 temporary error telling them to come back later. This by itself doesn't get them blacklisted.

Spam bots, however, after being rejected or delivering spam, don't close the connection with a QUIT command, since their spam is delivered and being polite just uses up processor and bandwidth that can be used to spam someone else. So I also watch for the missing QUIT and note that as well. So if I see the combination of high numbered MX hits AND no QUIT, and there is any one of the other sins I track (bogus HELO, etc.), I can instantly ID the IP as a spam bot and can get it into the blacklist within 2 minutes of the attempted spam. And anyone using my blacklist can then block spam on their system based on my listing of the virus infected IP.
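
The combination rule described above can be written down as a short Python sketch. The record type and field names are assumptions made for the example; the actual tracking is of course done inside the mail server, not in a standalone structure like this.

```python
from dataclasses import dataclass


@dataclass
class ConnectionRecord:
    """What was observed for one connecting IP (hypothetical field names)."""
    hit_high_mx: bool       # connected to one of the fake high numbered MX records
    sent_quit: bool         # closed the SMTP session politely with QUIT
    bogus_helo: bool = False
    other_sins: int = 0     # count of any other tracked offences


def is_spambot(c: ConnectionRecord) -> bool:
    """High MX hit AND no QUIT AND at least one other sin marks a spambot."""
    extra_sin = c.bogus_helo or c.other_sins > 0
    return c.hit_high_mx and not c.sent_quit and extra_sin


# A high MX hit alone only earns a 451 temporary error, not a listing.
only_high_mx = ConnectionRecord(hit_high_mx=True, sent_quit=True)
typical_bot = ConnectionRecord(hit_high_mx=True, sent_quit=False, bogus_helo=True)
```

Requiring all three conditions is what keeps the false positive rate low: each signal on its own can have an innocent explanation.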

This is just an example of what we and others in the spam filtering industry do. We look at a lot of information and make automated decisions. The Whois information would help us do a better job. So the solution to fast flux might not be in ICANN doing something to stop it, but in helping others, through information, to stop it.


CONTACT INFORMATION FOR REPORTING IS IMPORTANT

Often filtering companies detect a problem that could be stopped at the source if we could just alert the source that there is a problem. In the case of spambots, the source is the ISP who provides access to the internet to the virus infected victim. If the ISPs knew of problems then they could take action like temporary port 25 blocks or calling their customer to let them know their computer has been hacked so they can fix it. Thus I suggest that, through WHOIS and policy, we create a problem reporting infrastructure so that those of us who detect a problem can communicate that to those who need to know about the problem. And we need high-speed DNS-based whois so that we can use automation to do this.


CONCLUSION

I believe that the world's spam bots can be completely (or 99%+) defeated through information, communication, ISP tools, and publishing best practices, and this can be done without restricting free speech or civil liberties. This war is winnable and if we are careful and think it through we can have a nearly fraud free, spam free world. Quite frankly, I'd like to win this spam war and put myself out of business. I have other things I want to do with my life and although I make a good living at this I have better things to do.

Hopefully I have given you all something to think about. Feel free to jump in and expand or tell me why I'm wrong.
