Re: MoreLikeThis interestingTerms : SortableIntField breaks XML

2009-11-22 Thread Yonik Seeley
If you could help create a testcase for MoreLikeThisHandlerTest it would be great! (does this only apply to the MTL handler and not the component?) I've been trying to reproduce, but I haven't been able to (i.e. I can't even get as far as getting any interestingTerms output to appear in the

MoreLikeThis interestingTerms : SortableIntField breaks XML

2009-11-10 Thread Chantal Ackermann
Hi everyone, I've just realised that this line in the result for an MLT query is broken: float name=decade:€#0;ߐ0.2517573/float (this is the last child of element interestingTerms see more output below) decade should contain the int 2000 from what I can see in the results for that query. The

Re: MoreLikeThis interestingTerms : SortableIntField breaks XML

2009-11-10 Thread Yonik Seeley
On Tue, Nov 10, 2009 at 8:01 AM, Chantal Ackermann chantal.ackerm...@btelligent.de wrote: I've just realised that this line in the result for an MLT query is broken: float name=decade:€#0;ߐ0.2517573/float (this is the last child of element interestingTerms see more output below) Looks like

Re: MoreLikeThis interestingTerms : SortableIntField breaks XML

2009-11-10 Thread Chantal Ackermann
Hi Yonik, I'll do that. Is this a general requirement: that the terms have to be externalized? Because the TermVectorComponent doesn't externalize either. Shall the ticket mention that? Thanks, Chantal Response using TermVectorComponent: lst name=termVectors − lst name=doc-0 str

Re: MoreLikeThis interestingTerms : SortableIntField breaks XML

2009-11-10 Thread Yonik Seeley
On Tue, Nov 10, 2009 at 8:54 AM, Chantal Ackermann chantal.ackerm...@btelligent.de wrote: Hi Yonik, I'll do that. Is this a general requirement: that the terms have to be externalized? Because the TermVectorComponent doesn't externalize either. Shall the ticket mention that? It generally

Re: MoreLikeThis interestingTerms : SortableIntField breaks XML

2009-11-10 Thread Chantal Ackermann
Hi Yonik, I was about to create the issue in JIRA and for that I copied the output to a text editor (JEdit, in fact). But JEdit displays valid XML which I am not able to paste into here (Thunderbird). I think Thunderbird is not using UTF-8, even if it asks whether it should send UTF-8. This