Re: [tor-dev] Better relay uptime visualisation

2015-12-08 Thread nusenu
> Also, here are the steps to reproduce: > > wget > https://collector.torproject.org/archive/relay-descriptors/consensuses/consensuses-2015-11.tar.xz > tar xvJf consensuses-2015-11.tar.xz > go get git.torproject.org/user/phw/sybilhunter.git > sybilhunter -data consensuses-2015-11/

Re: [tor-dev] Better relay uptime visualisation

2015-12-08 Thread Philipp Winter
On Mon, Dec 07, 2015 at 01:44:47PM -0800, David Fifield wrote: > On Mon, Dec 07, 2015 at 02:51:23PM -0500, Philipp Winter wrote: > > I spent some time improving the existing relay uptime visualisation [0]. > > Inspired by a research paper [1], the new algorithm uses single-linkage > > clustering

Re: [tor-dev] Better relay uptime visualisation

2015-12-08 Thread Philipp Winter
On Mon, Dec 07, 2015 at 09:57:18PM +, nusenu wrote: > > and every column is a relay. White pixels mean > > that a relay was offline and black pixels means that a relay was > > online. Red pixels are used to highlight suspiciously similar clusters. > > I assume they are highlighted only if

Re: [tor-dev] Better relay uptime visualisation

2015-12-08 Thread Philipp Winter
On Tue, Dec 08, 2015 at 04:52:45PM +, nusenu wrote: >> Also, here are the steps to reproduce: >> >> wget >> https://collector.torproject.org/archive/relay-descriptors/consensuses/consensuses-2015-11.tar.xz >> tar xvJf consensuses-2015-11.tar.xz >> go get

Re: [tor-dev] Better relay uptime visualisation

2015-12-08 Thread Philipp Winter
On Mon, Dec 07, 2015 at 11:43:38PM -0500, grarpamp wrote: > Can a one be generated covering each year and maybe a five year one. I haven't checked the complexity of the clustering algorithm I use, but it's probably quadratic. I think a full year worth of uptimes would require pruning the data,

Re: [tor-dev] Better relay uptime visualisation

2015-12-07 Thread Tim Wilson-Brown - teor
> On 8 Dec 2015, at 10:43, Tom Ritter wrote: > > On 7 December 2015 at 13:51, Philipp Winter > wrote: >> I spent some time improving the existing relay uptime visualisation [0]. >> Inspired by a research paper [1], the new algorithm uses

[tor-dev] Better relay uptime visualisation

2015-12-07 Thread Philipp Winter
I spent some time improving the existing relay uptime visualisation [0]. Inspired by a research paper [1], the new algorithm uses single-linkage clustering with Pearson's correlation coefficient as distance function. The idea is that relays are grouped next to each other if their uptime (basically

Re: [tor-dev] Better relay uptime visualisation

2015-12-07 Thread David Fifield
On Mon, Dec 07, 2015 at 02:51:23PM -0500, Philipp Winter wrote: > I spent some time improving the existing relay uptime visualisation [0]. > Inspired by a research paper [1], the new algorithm uses single-linkage > clustering with Pearson's correlation coefficient as distance function. > The idea

Re: [tor-dev] Better relay uptime visualisation

2015-12-07 Thread nusenu
Philipp Winter: > Red pixels are used to highlight suspiciously similar clusters. Last year [1] there were a few huge groups, 3 of them are not flagged (black lines, not red) even though they look like a perfectly matching group? [1]

Re: [tor-dev] Better relay uptime visualisation

2015-12-07 Thread David Fifield
On Tue, Dec 08, 2015 at 10:47:08AM +1100, Tim Wilson-Brown - teor wrote: > > On 8 Dec 2015, at 10:43, Tom Ritter <[1]t...@ritter.vg> wrote: > > On 7 December 2015 at 13:51, Philipp Winter <[2]p...@nymity.ch> wrote: > > I spent some time improving the existing relay uptime

Re: [tor-dev] Better relay uptime visualisation

2015-12-07 Thread grarpamp
Can a one be generated covering each year and maybe a five year one. And three other check sets but sorted left to right by first online date FP AS As to the actual FP's, all I can think of is including a second text file with pixel number to FP mappings. Or some "maps" style online zooming.

Re: [tor-dev] Better relay uptime visualisation

2015-12-07 Thread Philipp Winter
On Mon, Dec 07, 2015 at 05:43:01PM -0600, Tom Ritter wrote: > On 7 December 2015 at 13:51, Philipp Winter wrote: > > I spent some time improving the existing relay uptime visualisation [0]. > > Inspired by a research paper [1], the new algorithm uses single-linkage > > clustering