Re: [Biojava-l] Rooted trees in nexus files

Thasso Griebel Wed, 04 Nov 2009 03:59:22 -0800

Hi,

A getRoot() function sounds good. It would return the String labelof the root node, the same as which identifies the correspondingvertex in the JGraphT model. An equivalent setRoot() would be nice.

Though you have to keep in mind that switching the root to anothernode has certain implications on the tree structure and this has to betaken into account when the newick string is parsed and the graph iscreated. You have to parse the graph from newick and then "reroot" thetree as the root might not be equal to the one specified in the newickstring.

Personally I would also alter the methods that return JGraphTs sothat they return their Directed equivalents if possible. I believethat these can still be unrooted - you'd have to check the JGraphTdocumentation to make sure.

You have to change that method signature if you want to use the samemethod. The only relationship between JGraphTs UndirectedGraph and theDirectedGraph counterpart is that they both extend the Graphinterface, but a DirectedGraph is not an UndirectedGraph. Switching toDirectedGraph definitely breaks the current API ! I don't know how youusually handle such situations in BioJava, but this clearly breakscompatibility. Maybe it would be better to introduce a new method thatreturns directed graphs ?


cheers,
-thasso

Richard.

On 3 Nov 2009, at 18:55, Tiago Antão wrote:
But the point is that the class interface changes to the outsideuser:
1. How does one report back the root to the user?
2. Regarding the prefix stuff, should the user be allowed tospecify a
preferred prefix?

Both this things imply interface changes visible to users.
If you still need volunteers to do the change, I can do it. But Ineed
to know what changes to the user interface are to be done.
For 1, maybe a method getRoot, returning a string with the name ofthe
root node?
For 2, maybe an extended version of the parse function with a suffix
as input parameter?

2009/11/3 Richard Holland <[email protected]>:
1. Lack of knowledge of root node
The Newick tree string is read as-is and is not parsed. It onlygets parsedat the point of conversion to a Undirected or WeightedGraph insidethe
TreeBlocks.java source code (inside the two types of get-As-JGraphT
methods). It's at this point the string is parsed and it's herethat rootnote determination should take place. It's already known whether&R or &Uhave been specified here, which should help the code work out whatto do.
2. The p* stuff.
Exactly the same part of the code as described above. Wherever itpushesvalues to the stack but prepends them with 'p' first, you'll needto changethe 'p' to some instance variable and provide a getter/setter tochange it,
with 'p' being the default setting.

cheers,
Richard
Tiago
2009/11/3 Richard Holland <[email protected]>:
Agreed that there is a bug. Now all we need is someone to go inand fix
it!
:)

cheers,
Richard

On 3 Nov 2009, at 18:16, Tiago Antão wrote:
2009/11/3 Thasso Griebel <[email protected]>:
There is a way to uniquely get a root from a newick string.Usually arooted newick is surrounded with brackets, which indicates theroot as
the
highest node in the tree. For example:

(A, (B,C))
Agree, it is quite easy to get the root of the tree from thenewickrepresentation. But it should be done on parsing and returnedin someway by the parsing system. If the user has to do it again, itmeans
that the user has to parse it again just to know the root node.
I would also suggest to generally parse trees as rooted trees(maybe
jsut
for th initial internal model). Creating an unrooted tree froma rooted
one
is easy, remove the root and forget about directions. Theother way
might
be
hard and ambiguous.
100% agree.
The newick _representation_ always has a root by virtue of theway itis done. If that root has meaning or not depends. Doing as yousuggest
seems the most reasonable idea.
I would add that even if it is an unrooted tree, the topologymight beof interest. In my case I am doing a comparative visualizer anditmight be nice for the user to be able to visualize the topologyasspecified. It has no biological meaning, but in practice, formany
users, it helps.
I note that PhyloXML (even by virtue of being a XML format)always
represents the phylogenies as trees (not weigthed DAGs). There an
attribute rooted which can be true or false.

But, anyway. Even assuming a very conservative view on this, the
current parser, for rooted trees, does not allow to determinewhere isthe root. I think that there would be a consensus that that isa bug?
Tiago
--
Richard Holland, BSc MBCS
Operations and Delivery Director, Eagle Genomics Ltd
T: +44 (0)1223 654481 ext 3 | E: [email protected]
http://www.eaglegenomics.com/
--
"The hottest places in hell are reserved for those who, in times of
moral crisis, maintain a neutrality." - Dante
--
Richard Holland, BSc MBCS
Operations and Delivery Director, Eagle Genomics Ltd
T: +44 (0)1223 654481 ext 3 | E: [email protected]
http://www.eaglegenomics.com/
--
"The hottest places in hell are reserved for those who, in times of
moral crisis, maintain a neutrality." - Dante
--
Richard Holland, BSc MBCS
Operations and Delivery Director, Eagle Genomics Ltd
T: +44 (0)1223 654481 ext 3 | E: [email protected]
http://www.eaglegenomics.com/


--
Dipl. Inf. Thasso Griebel-------------------Lehrstuhl fuer Bioinformatik
Office 3426--http://bio.informatik.uni-jena.de--Institut fuer Informatik
Phone +49 (0)3641 9-46454-----------Friedrich-Schiller-Universitaet Jena
Fax +49 (0)3641 9-46452----------Ernst-Abbe-Platz 2, 07743 Jena, Germany




_______________________________________________
Biojava-l mailing list  -  [email protected]
http://lists.open-bio.org/mailman/listinfo/biojava-l

Re: [Biojava-l] Rooted trees in nexus files

Reply via email to