Eddie Epstein wrote: [...] > With regards indexing information in the Cas XMI format, what is passed > is a list of FS that are indexed [in each view]. Today, without delta Cas, > the indexes are fully rebuilt when the Cas is returned. All sorted indexes > will retain the same iteration order. Non-sorted indexes may have a > different > order, but that has always been documented: "A bag index simply stores > everything, without any guaranteed order. " > > The only potential change in behavior that I am aware of has to do with > adding > an FS to the index repository multiple times: "... all FSs that are commited > > are entered, even if they are duplicates of already existing FSs." So yes, > that would be a change in behavior, as there would only be a single instance > of each FS in the index upon return from a remote component. Is this the > difference you were referring to, or is there more? > > Eddie >
Our current XMI implementation and what is on the table for OASIS are two different things. Let me quote from the standard proposal: <quote> Currently the Apache UIMA Component Metadata Descriptor includes the following elements that are not part of the proposed UIMA Specification. 1. Indexes: Defines the structure of indexes through which the analytic will access data. In some sense the actual indexing design is an Apache UIMA issue and so this may be an extension to the descriptor schema that is specific to Apache UIMA. However if we think of the index definitions as a component declaring the key features that it is going to use to query the data, we can make a case that this should be a UIMA standard, so that any framework could optimize based on this information. 2. Type Priorities: These are closely related to the index definitions and should probably be combined with them rather than represented as a separate element </quote> Maybe I'm wrong, but I think this has consequences for Apache UIMA flows that use OASIS compliant services, as indexing information is lost. In Apache UIMA, you explicitly need to add FSs to indexes (or not). This distinction is lost if indexes are not part of the spec. --Thilo
