I'd love a list of id's for active accounts and another list of id's
for inactive ones, by some sensible criteria of activity. Publishing
this is in twitter.com's interest, admittedly for that large first and
second crawl. I'm calling this for everyone:
Stop doing this. You are stressing the system and producing questionable
results. You run a very high risk of blacklisting. Also, there are many many
existing studies that go over this same ground of active users and break the
data down in painstaking detail.
Instead, take the Spritzer sample