RE: Luke - in need of maintainer

2006-06-01 Thread mark harwood
I can pick this up, but I don't think I've got much more bandwidth than Andrzej to work on it. I certainly don't have the time now for a port to an Apache-friendly GUI framework but ultimately I think Luke should end up under the contrib section where it can be managed and benefit from the

RE: Luke - in need of maintainer

2006-06-01 Thread Chris Nokleberg
On Thu, 01 Jun 2006 09:22:56 +0100, mark harwood wrote: Andrzej, have you tried contacting the Thinlet author to see if he would consider releasing the old Thinlet code under an Apache license? If this doesn't work out, the quickest solution might be to code a clean-room implementation of the

Re: Luke - in need of maintainer

2006-06-01 Thread Simon Willnauer
When the Summer of Code Project ends I'll still maintain and work on the GData Server but I'm willing to help maintaining Luke and maybe porting it to Swing / RCP based application. This won't happen until Oktober ;) simon On 6/1/06, Chris Nokleberg [EMAIL PROTECTED] wrote: On Thu, 01 Jun

Lexicon access questions

2006-06-01 Thread eks dev
We have faced the following use case: In order to optimize performance and more importantly quality of search results we are forced to attach more attributes to particular words (Terms). Generic attributes like TF, IDF are usefull to model our similarity only up to some level. Examples: 1.

Re: Luke - in need of maintainer

2006-06-01 Thread Andrzej Bialecki
mark harwood wrote: I can pick this up, but I don't think I've got much more bandwidth than Andrzej to work on it. I certainly don't have the time now for a port to an Apache-friendly GUI framework but ultimately I think Luke should end up under the contrib section where it can be managed and

[jira] Commented: (LUCENE-584) Decouple Filter from BitSet

2006-06-01 Thread JIRA
[ http://issues.apache.org/jira/browse/LUCENE-584?page=comments#action_12414224 ] Peter Schäfer commented on LUCENE-584: -- thanks, this looks interesting. Regards, Peter Decouple Filter from BitSet --- Key:

Re: Flexible Indexing (was Re: Lucene Planning)

2006-06-01 Thread Marvin Humphrey
On Jun 1, 2006, at 5:48 AM, Grant Ingersoll wrote: Someone on the list a while ago suggested moving Term Vectors out of the postings and storing them separately, as then they don't have to be merged (but they doc ids would have to be kept up to date) Yes, that was me. :) I suggested

Re: Luke - in need of maintainer

2006-06-01 Thread Dawid Weiss
Please contact Dawid Weiss (in CC:), he had a well-advanced port, perhaps it just needs a little polishing (Polish-ing? :) . Yes, this project is in fact still on my list... I do have a partial implementation of Thinlet API that emulates it in Swing. With a JGoodies look and feel the

Re: Lexicon access questions

2006-06-01 Thread Chuck Williams
This approach comes to mind. You could model your semantic tags as tokens and index them at the same positions as the words or phrases to which they apply. This is particularly easy if you can integrate your taggers with your Analyzer. You would probably want to create one or more new Query

[jira] Updated: (LUCENE-578) Summer of Code GDATA Server --Project structure and simple version to start with--

2006-06-01 Thread Simon Willnauer (JIRA)
[ http://issues.apache.org/jira/browse/LUCENE-578?page=all ] Simon Willnauer updated LUCENE-578: --- Attachment: diff_01_06_06.diff New Diff file this sould do the job for initial revision. If not i have to do the diff on the linux system - the other svn

[jira] Created: (LUCENE-586) Very inefficient implementation of MultiTermDocs.skipTo

2006-06-01 Thread Andrew Hudson (JIRA)
Very inefficient implementation of MultiTermDocs.skipTo --- Key: LUCENE-586 URL: http://issues.apache.org/jira/browse/LUCENE-586 Project: Lucene - Java Type: Improvement Components: Index, Search Reporter:

Re: Luke - in need of maintainer

2006-06-01 Thread Chris Hostetter
: cooperate in any sane fashion, to no avail. He's not interested in : changing the license, he's not even interested in any contributions. Licensing issues are one of those things i'm always glad other people understand and worry about -- because i don't want to -- so forgive me if this is a

Re: Luke - in need of maintainer

2006-06-01 Thread Andrzej Bialecki
Chris Hostetter wrote: : cooperate in any sane fashion, to no avail. He's not interested in : changing the license, he's not even interested in any contributions. Licensing issues are one of those things i'm always glad other people understand and worry about -- because i don't want to -- so

[jira] Resolved: (LUCENE-583) ISOLatin1AccentFilter discards position increments of filtered terms

2006-06-01 Thread Yonik Seeley (JIRA)
[ http://issues.apache.org/jira/browse/LUCENE-583?page=all ] Yonik Seeley resolved LUCENE-583: - Fix Version: 2.1 Resolution: Fixed Assign To: Yonik Seeley I just committed a fix for this. ISOLatin1AccentFilter discards position

[jira] Resolved: (LUCENE-575) SpellChecker min score is increased by time

2006-06-01 Thread Yonik Seeley (JIRA)
[ http://issues.apache.org/jira/browse/LUCENE-575?page=all ] Yonik Seeley resolved LUCENE-575: - Fix Version: 2.1 Resolution: Fixed Assign To: Yonik Seeley I just commited this. Thanks Karl! SpellChecker min score is increased by time

Re: svn commit: r410680 - in /lucene/java/branches/lucene_2_0: CHANGES.txt src/jsp/results.jsp

2006-06-01 Thread DM Smith
Just my opinion based on working with SVN with an entirely different development model: Trunk is production and branches are different people individual or collaborative efforts, with one branch for maintenance changes. When enough maintenance changes are ready for a release, we merge it

GData Server - Lucene storage

2006-06-01 Thread Simon Willnauer
Hello folks, as I'm the only developer on the project due to the SummerOfCode program it is quiet a tough task to discuss all the architecture with you on the mailing list. For this reason I decided to create UML diagrams to discuss the main components. I will not attach the uml to the mails

[jira] Created: (LUCENE-587) Explanation.toHtml outputs invalid HTML

2006-06-01 Thread Trejkaz (JIRA)
Explanation.toHtml outputs invalid HTML --- Key: LUCENE-587 URL: http://issues.apache.org/jira/browse/LUCENE-587 Project: Lucene - Java Type: Bug Components: Search Versions: 2.0.0 Reporter: Trejkaz If you want