I need someone to collect current WHOIS information for 457k (457,000) .pl domains. I will send you a list of the domains i need.
The WHOIS server accepts 100 queries per 24 hours from a single IP address. That is important. You need to have some proxy servers ready.
How you collect the data is up to you. I will be able to verify if all the data is correct and up to date.
The data has to be delivered in raw format as a MySQL file, with one column having the raw data stored in form of array, with each line of output stored as a single array element. Here is a simple output from WHOIS server:
DOMAIN NAME: [login to view URL]
registrant type: organization
nameservers: ns.kkibci.pl. [[login to view URL]]
ns2.kkibci.pl. [[login to view URL]]
created: 2010.04.16 14:45:37
last modified: 2014.04.17 08:17:07
renewal date: 2015.04.16 14:45:37
no option
dnssec: Unsigned
REGISTRAR:
NetArt Spolka Akcyjna S.K.A.
ul. Cystersow 20A
31-553 Krakow
Polska/Poland
Here's how i'd like it to be delivered, ideally:
a:27:{i:0;s:1:"
";i:1;s:32:"DOMAIN NAME: [login to view URL]
";i:2;s:37:"registrant type: organization
";i:3;s:54:"nameservers: ns.kkibci.pl. [[login to view URL]]
";i:4;s:53:" ns2.kkibci.pl. [[login to view URL]]
";i:5;s:44:"created: 2010.04.16 14:45:37
";i:6;s:44:"last modified: 2014.04.17 08:17:07
";i:7;s:44:"renewal date: 2015.04.16 14:45:37
";i:8;s:2:"
";i:9;s:11:"no option
";i:10;s:2:"
";i:11;s:33:"dnssec: Unsigned
";i:12;s:2:"
";i:13;s:2:"
";i:14;s:12:"REGISTRAR:
";i:15;s:30:"NetArt Spolka Akcyjna S.K.A.
";i:16;s:21:"ul. Cystersow 20A
";i:17;s:15:"31-553 Krakow
";i:18;s:15:"Polska/Poland
";i:19;s:18:"
";i:20;s:19:"
";i:21;s:18:"
";i:22;s:18:"
";i:23;s:14:"
";i:24;s:2:"
";i:25;s:98:"WHOIS displays data with a delay not exceeding 15 minutes in relation to the .pl Registry system
";i:26;s:62:"Registrant data available at [login to view URL]";}