Re: CasIOUtils class - some meta-questions

2016-08-03 Thread Richard Eckart de Castilho
I don't think the code should enforce the use of the "default" file extensions. It looks like the load methods can more-or-less auto-detect any format except xcas. Maybe we should consider adding a load method where the user explicitly requests a SerialFormat for cases were one wants to avoid au

Re: CasIOUtils class - some meta-questions

2016-08-03 Thread Marshall Schor
sorry, my bad English... :-) By casing, I didn't mean upper case, lower case, I meant the code uses the Java "switch" statement, with "case"s . (I think the code already is case-insensitive to upper/lower case for this.) And, yes, I mean the file extensions, the ".xmi", ".xcas", etc. -Marshall

Re: CasIOUtils class - some meta-questions

2016-08-03 Thread Richard Eckart de Castilho
On 03.08.2016, at 22:51, Marshall Schor wrote: > > The "load" methods have some attention paid to "casing" on the file type. You mean ".BCAS" and ".BcAs" should be treated equally to ".bcas"? > The "save" methods do not do any defaulting to set the "proper" file type that > would be needed by t

Re: CasIOUtils class - some meta-questions

2016-08-03 Thread Marshall Schor
The "load" methods have some attention paid to "casing" on the file type. The "save" methods do not do any defaulting to set the "proper" file type that would be needed by the load method. Should the save methods be architected to include this? -Marshall On 7/18/2016 2:30 PM, Marshall Schor wr

Re: CasIOUtils class - some meta-questions

2016-08-03 Thread Marshall Schor
The "mitigating" factor would be if we could easily imagine a significant sub-community of UIMA users appreciating these variants. In this particular case, I'm leaning toward agreeing with Richard, but am fine with having some variants if needed by Peter. -Marshall On 8/3/2016 4:32 PM, Richard

Re: CasIOUtils class - some meta-questions

2016-08-03 Thread Marshall Schor
The save in format COMPRESSED_FILTERED only makes sense if you pass in an additional type system to represent the "filter". Some choices: Disable this - throw unsupported operation exception for saving with COMPRESSED_FILTERED until we get a real need/use-case for this. I would also do this for C

Re: CasIOUtils class - some meta-questions

2016-08-03 Thread Richard Eckart de Castilho
Under that policy, should we really introduce all kinds of variants using File, Path, and URL not rather stick to InputStream (maybe to URL which incurs additional overhead opening/closing streams)? -- Richard > On 03.08.2016, at 22:29, Marshall Schor wrote: > > My general feeling: anytime we

Re: CasIOUtils class - some meta-questions

2016-08-03 Thread Marshall Schor
My general feeling: anytime we make something "public" it becomes "set in stone". So, it is best not to make things public ahead of clear needs/use-cases. That way, if at some point in the future, we find we now have a clear use-case for the loadBinary kind of thing, we can implement it then, wit

[jira] [Commented] (UIMA-5032) Support feature toggle using lazy annotator initialization

2016-08-03 Thread Eddie Epstein (JIRA)
[ https://issues.apache.org/jira/browse/UIMA-5032?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15406564#comment-15406564 ] Eddie Epstein commented on UIMA-5032: - A feature toggle would often be at an aggregate

Re: Testing uima v3

2016-08-03 Thread Marshall Schor
The answer depends on whether or not you make use of "JCas". Existing JCas classes need to be either converted or regenerated from the descriptors. The code for regenerating is in the uimaj-tools project. The code for converting existing is in the uimaj-v3migration-jcas project. The conversion

Jenkins build is back to normal : UIMA-uimaFIT_java8 #61

2016-08-03 Thread Apache Jenkins Server
See

Jenkins build is back to normal : UIMA-uimaFIT #242

2016-08-03 Thread Apache Jenkins Server
See

[jira] [Resolved] (UIMA-5039) Language and mimetype capabilities only added if type/sofa capabilities present

2016-08-03 Thread Richard Eckart de Castilho (JIRA)
[ https://issues.apache.org/jira/browse/UIMA-5039?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Richard Eckart de Castilho resolved UIMA-5039. -- Resolution: Fixed > Language and mimetype capabilities only added if type

Build failed in Jenkins: UIMA-uimaFIT #241

2016-08-03 Thread Apache Jenkins Server
See -- Started by user rec [EnvInject] - Loading node environment variables. Building remotely on windows-2012-1 (Windows) in workspace Cleaning local Directo

Build failed in Jenkins: UIMA-uimaFIT #240

2016-08-03 Thread Apache Jenkins Server
See -- Started by user rec [EnvInject] - Loading node environment variables. Building remotely on windows-2012-1 (Windows) in workspace Cleaning local Directo

[jira] [Created] (UIMA-5039) Language and mimetype capabilities only added if type/sofa capabilities present

2016-08-03 Thread Richard Eckart de Castilho (JIRA)
Richard Eckart de Castilho created UIMA-5039: Summary: Language and mimetype capabilities only added if type/sofa capabilities present Key: UIMA-5039 URL: https://issues.apache.org/jira/browse/UIMA-503

Jenkins build is back to normal : UIMA-uimaFIT #239

2016-08-03 Thread Apache Jenkins Server
See

Jenkins build is back to normal : UIMA-uimaFIT » Apache UIMA uimaFIT - Documentation #239

2016-08-03 Thread Apache Jenkins Server
See

Jenkins build is back to normal : UIMA-uimaFIT » Apache UIMA uimaFIT - Legacy uimaFIT support #239

2016-08-03 Thread Apache Jenkins Server
See

Jenkins build is back to normal : UIMA-uimaFIT » Apache UIMA uimaFIT #239

2016-08-03 Thread Apache Jenkins Server
See

Jenkins build is back to normal : UIMA-uimaFIT » Apache UIMA uimaFIT - Collection Processing Engine support #239

2016-08-03 Thread Apache Jenkins Server
See

CasIOUtils class - ZIP file scenario

2016-08-03 Thread Richard Eckart de Castilho
In the current design, I think the following code does not make much sense: // To support writing to ZIPs, the type system must be written separately from the CAS data if (typeOS != null && !typeSystemWritten) { writeTypeSystem(aCas, typeOS); typeSystemWritten = true; } O

Jenkins build is back to normal : UIMA-SDK #825

2016-08-03 Thread Apache Jenkins Server
See

Re: CasIOUtils class - some meta-questions

2016-08-03 Thread Richard Eckart de Castilho
It is a bit hard to see... do we have cases now where two headers are written to the file? E.g. in a form6 + TS, one before the type system and another one before the actual CAS data? -- Richard

Re: CasIOUtils class - some meta-questions

2016-08-03 Thread Richard Eckart de Castilho
After a quick look, I think the loadBinary methods could be private. There seems to be a problem with stream handling. All methods that internally create streams must make sure they are closed again. This does e.g. not happen in the URL-based methods. There openStream() is called in various places

Re: CasIOUtils class - some meta-questions

2016-08-03 Thread Peter Klügl
We can make them private, but I thought that maybe someone want to directly load a binary cas. Well that's not really necessary. Best, Peter Am 03.08.2016 um 18:10 schrieb Marshall Schor: > This class is a collection of static methods, mostly variants of "load" and > "save". > > There are 3

Re: CasIOUtils class - some meta-questions

2016-08-03 Thread Marshall Schor
This class is a collection of static methods, mostly variants of "load" and "save". There are 3 public methods "loadBinary" that appear to be called from various "load" methods. Could these loadBinary methods be changed to private, instead of public, or do they need to be part of the public API?

Re: moderating emails - maybe lost two

2016-08-03 Thread Richard Eckart de Castilho
Feel free to add me to the moderation mechanism. I hope there is a convenient web interface? ;) Cheers, -- Richard > On 03.08.2016, at 17:43, Marshall Schor wrote: > > Hi, > > While going thru my apache in-box (which, I admit, I have neglected for a > bit), > I saw two emails from july 27, w

moderating emails - maybe lost two

2016-08-03 Thread Marshall Schor
Hi, While going thru my apache in-box (which, I admit, I have neglected for a bit), I saw two emails from july 27, which I attempted to moderate through, but the moderation system says they were no longer there. That can happen if they're already thru (but I didn't see them) or if someone "reject

Re: CasIOUtils class - some meta-questions

2016-08-03 Thread Marshall Schor
just committed SerialFormat with re-ordering. -M On 8/3/2016 2:28 AM, Peter Klügl wrote: > Yes, sure go ahead. I changed the order so that the first one is the > default format (XMI), but forgot that it could break compatibility. > > I adapt the ruta code... > > > Am 02.08.2016 um 23:23 schrieb M

[jira] [Created] (UIMA-5038) UV3 refactor class loading for consistency / maintainability

2016-08-03 Thread Marshall Schor (JIRA)
Marshall Schor created UIMA-5038: Summary: UV3 refactor class loading for consistency / maintainability Key: UIMA-5038 URL: https://issues.apache.org/jira/browse/UIMA-5038 Project: UIMA Issu

Testing uima v3

2016-08-03 Thread Peter Klügl
Hi, I promised some time ago to test the experimental branch against the ruta implementation. Maven complains about some missing artifact uimaj-jcas. Which modules do I need? Just uimaj-core? Best, Peter