There's been some progress, but it's unfortunately not public. The DHS PREDICT project (www.predict.org) includes various captured data sets, including about 200 gig of artificial data sets we generated to support a research project. PREDICT data's only available to researchers based in the US who meet the program requirements.
Not quite true. You can apply for any data set. You have to tell what you want to use it for and there are different levels of how you can use the logs. The log supplier defines what the policies are around the logs. Some suppliers require US based consumption, etc. There is a bit of a process to sign up, but it's open.
Check it out Raffael -- Raffael Marty @zrlram Chief Security Strategist @ Splunk> Security Visualization: http://secviz.org raffy.ch/blog
