Antisplog index feeded with 2 milion blogs
Today I paused the AntiSplog spider at 2,056,958 Blog. Frist I was going to study the behaviour on 1 million then I keep it running for the second million since it didn't took long time. But I needed to stop it for technical reasons, to let me finish studying more cases and run the filter on the indexed blogs.
The two milion blogs have been collected in about 4 (four) days, and there is a special charecteristics for this sample that I'll talk more about it in the future. Because I envisage that a large percentage of this sample will be splogs.
I thought that I could run the filter within one week, but I think it will take some more time. The sample is larger than my first prediction, and it's more representative of real blogging activities in all over the world. But It's already a good step to track splogs.


