Re: Graph Output formats

2011-09-18 Thread Ted Dunning
On Sun, Sep 18, 2011 at 2:15 PM, Grant Ingersoll wrote: > Cool, I've pushed my changes to ClusterDumper to Lucid's github account > (lucidimagination) and am planning on pushing all of it to Mahout this week. > It is now possible to output CSV, Text (the current option) and GraphML. > Easy enoug

Re: Graph Output formats

2011-09-18 Thread Grant Ingersoll
Cool, I've pushed my changes to ClusterDumper to Lucid's github account (lucidimagination) and am planning on pushing all of it to Mahout this week. It is now possible to output CSV, Text (the current option) and GraphML. Easy enough to extend to output JSON or whatever. I would imagine it wo

Re: Graph Output formats

2011-09-18 Thread Ted Dunning
You have to make one hack to make sure that the JS downloads from your local server, but that is easy. On Sun, Sep 18, 2011 at 12:17 PM, Ted Dunning wrote: > Yes. The old stuff from google used to require their servers and was very > limited on size of data. > > This newer stuff is not. > > > O

Re: Graph Output formats

2011-09-18 Thread Ted Dunning
Yes. The old stuff from google used to require their servers and was very limited on size of data. This newer stuff is not. On Sun, Sep 18, 2011 at 4:46 AM, Grant Ingersoll wrote: > > On Sep 17, 2011, at 9:22 PM, Ted Dunning wrote: > > > I strongly recommend Google's visualization API. > > Cool

Re: Graph Output formats

2011-09-18 Thread Grant Ingersoll
On Sep 17, 2011, at 9:22 PM, Ted Dunning wrote: > I strongly recommend Google's visualization API. Cool. Here I thought it required using Goog's servers, but I guess not. So you can run the server and hit it locally? > > This is divided into two parts, the reporting half and the data source

Re: Graph Output formats

2011-09-17 Thread Ted Dunning
I strongly recommend Google's visualization API. This is divided into two parts, the reporting half and the data source half. The reporting half is pretty good and very easy to use from javascript. It is the library that underlies pretty much all of Google's internal and external web visualizati

Re: Graph Output formats

2011-09-17 Thread Grant Ingersoll
I'll be checking in an abstraction, people can implement writers as they see fit. FWIW, I'm mostly looking for something that can be used in a vizualization toolkit, such as Gephi (although all be impressed if any of them can handle 7M points) -Grant On Sep 16, 2011, at 7:14 PM, Ted Dunning

Re: Graph Output formats

2011-09-16 Thread Ted Dunning
Indeed. I strongly prefer the other two for expressivity. On Fri, Sep 16, 2011 at 4:37 PM, Jake Mannix wrote: > On Fri, Sep 16, 2011 at 3:30 PM, Ted Dunning > wrote: > > > I think that Avro and protobufs are the current best options for large > data > > assets like this. > > > > (or serialized

Re: Graph Output formats

2011-09-16 Thread Jake Mannix
On Fri, Sep 16, 2011 at 3:30 PM, Ted Dunning wrote: > I think that Avro and protobufs are the current best options for large data > assets like this. > (or serialized Thrift)

Re: Graph Output formats

2011-09-16 Thread Ted Dunning
I think that Avro and protobufs are the current best options for large data assets like this. On Fri, Sep 16, 2011 at 2:44 PM, Jake Mannix wrote: > Can I vote for whichever one isn't based on XML? :) > > I really can't imagine encoding a 10-billion node graph in XML. Or rather, > I can, and I'm

Re: Graph Output formats

2011-09-16 Thread Lance Norskog
What's your displayer? And what formats does it use? On Fri, Sep 16, 2011 at 2:29 PM, Grant Ingersoll wrote: > Yeah, I hear you. I've actually just modeled it like our VectorWriter and > it will be pluggable. I'm likely just going to do CSV and GML to start (the > latter being XML) Maybe we ne

Re: Graph Output formats

2011-09-16 Thread Grant Ingersoll
Yeah, I hear you. I've actually just modeled it like our VectorWriter and it will be pluggable. I'm likely just going to do CSV and GML to start (the latter being XML) Maybe we need YAGF (yet another graph format)? I used to do a lot of NLP processing and output XML and always felt like what

Re: Graph Output formats

2011-09-16 Thread Tanton Gibbs
I have used XML to represent very large graphs (billions of nodes). It is as bad as you would imagine. On Fri, Sep 16, 2011 at 1:44 PM, Jake Mannix wrote: > Can I vote for whichever one isn't based on XML? :) > > I really can't imagine encoding a 10-billion node graph in XML.  Or rather, > I can,

Re: Graph Output formats

2011-09-16 Thread Jake Mannix
Can I vote for whichever one isn't based on XML? :) I really can't imagine encoding a 10-billion node graph in XML. Or rather, I can, and I'm skeered. On Fri, Sep 16, 2011 at 1:02 PM, Grant Ingersoll wrote: > I'm going to write a converter to dump out clusters and their points to a > graph

Graph Output formats

2011-09-16 Thread Grant Ingersoll
I'm going to write a converter to dump out clusters and their points to a graph structure so they can be displayed. Gephi (and others) supports a myriad of formats: http://gephi.org/users/supported-graph-formats/ * GEXF * GDF * GML * GraphML * Pajek NET * GraphViz DOT * CSV * UCINET DL