I think this could be done more general, such that every time a document
is changing is being indexed, e.g. also after editing, whereas there could
be one index for the authoring and one index for the live area.

If the document changes, it will be reindexed. I don't really see the need of a seperate index for every area.

within the authoring area the content can be quite different. Also there can be documents which don't exist within the live area. I think it definitely makes sense to have different indices or rather being able to search on "different versions"
re workflow status.

I don't think it will be much more work to implement, but rather keep the interface general enough and maybe just implement the live area if time it too limited for you.
But I'd suggest that you rather drop some othe features and focuse on this.


I don't think i have the time to change the proposal. The deadline is tomorrow. I don't think this is difficult to implement: add a field to Lucene with the area and add a hidden field to the search form. It would also need the indexing of the document in the submit step. This should be fairly easy to add when i am working on it.


Changing the fields would require a change of the 'obsolete' xml documents, but i think this is a rare case that should actually be avoided. Fields can be added or fields can become obsolete without a problem, but changing a field is something that is done rarely, if ever. Could you give me a scenario where this would be an urgent problem?

let's say you have one title field, and then one adopts the schema that one
has a maintitle and a subtitle and title will be gone, whereas the title is becoming
the maintitle

I would say: keep the title and add a field subtitle:)
I think changing a field is not something one should do often, also because of the iterative nature of the index building: the index isn't rebuilt but changed when a document changes. Making a complete change of the field in the index just does not fit into this idea. One could, of course, change all documents. This would also cause the document to be reindexed. For this obscure occasion, one could write a simple script to run trough the documents that have to be changed.

I am not sure if i should add it to my proposal, due to time constraints. Do you think this is a major issue?

<pr>
<title>
<lenya:index>title</lenya:index>
Lenya 14 release preponed
</title>
<content>
<lenya:index>contents</lenya:index>
The release of Lenya 1.4, the Apache Content Management System, ladila
</content>
</pr>

how do you want to mark attributes?

Hmm. Good question. I don't see the need for this in my current documents, but it should be possible, of course.

The lenya:index element could have an attribute to select the indexable attribute.

Besides this: the lenya:index element should have an attribute to define the selection of the child elements. One could ony index the content of the current element or also include the child nodes. This would be the case with the <body> element of an XHTML page.

<lenya:index fieldname="title" includechilds="false" indexattr="test">

When one wants to add some element to multiple Lucene fields, the lenya:index element should be added a number of times.

Any other remarks?



how do you want to treat these external links?

I was actually rather thinking about how to do you want to handle them within the index, because they won't have the same fields as the ones within Lenya. Do you want
to create a separate index?

Ah, ok! I want to add them to the same index. Having some field unfilled is no problem in Lucene. I could consider using the keywords of the document that contains the links, but this seems very nasty to me...


As far as i can see, it contains all the output one can ask for from a Lucene query.

also pagening?

Yep. Pretty nice implemented!


no problem, thanks very much for working on it. Please don't be afraid of my comments (in case you are), but I just want to make sure that various things are being
considered.

Quite the contrary! It is a delight to know that people are actually reading it thoroughly. Thanks!


I would like to send the proposal at the end of this afternoon. Are there some major issues that should be changed or added?

Regards, Robert

---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Reply via email to