kinow commented on PR #59:
URL: https://github.com/apache/opennlp-sandbox/pull/59#issuecomment-1397192288

   > @kinow I'd like to point your interest to the test resources directory and 
the files I added herein. What's your opinion on the bunch of data. IMHO, it is 
required to have some sort of tests running...; on the other hand it's quite a 
mass. Any ideas how we can achieve a tradeoff here, that is, have it more 
lightweight?
   
   Thanks for pointing that out, @mawiesne. I am using the GitHub UI and 
skipped past the test files without noticing their sizes.
   
   Would compressing these files help here? If so, we could commit them 
gzip-compressed (or in some other compressed format) and decompress them when 
running the tests.
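
   To illustrate the compression idea, here is a minimal sketch that assumes the 
test data is line-oriented UTF-8 text stored as `.gz` files. The class name 
`GzipTestResources` and its method are hypothetical and not part of the PR; it 
only relies on `java.util.zip.GZIPInputStream` from the JDK.

```java
import java.io.BufferedReader;
import java.io.IOException;
import java.io.InputStream;
import java.io.InputStreamReader;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.List;
import java.util.stream.Collectors;
import java.util.zip.GZIPInputStream;

/** Hypothetical test helper: reads a gzip-compressed text file without unpacking it to disk. */
public final class GzipTestResources {

    private GzipTestResources() {
    }

    /** Decompresses the given gzip file on the fly and returns its lines (UTF-8 is an assumption). */
    public static List<String> readLines(Path gzipFile) throws IOException {
        try (InputStream raw = Files.newInputStream(gzipFile);
             BufferedReader reader = new BufferedReader(
                 new InputStreamReader(new GZIPInputStream(raw), StandardCharsets.UTF_8))) {
            return reader.lines().collect(Collectors.toList());
        }
    }
}
```

   A test would call `GzipTestResources.readLines(path)` in its setup. How much 
this saves depends on how well the data compresses, which I have not measured.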
   
   Otherwise, what I used in another project that needed some large files for 
tests was the following setup:
   
   - enable Git LFS for the repository
   - put the tests that use the large files in a separate suite
   - disable that suite by default, with a toggle in the build to enable it 
(i.e. you assume the user/dev has enabled LFS and pulled the large files)
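
   A rough sketch of such a toggle, assuming the build passes a system property 
to the test JVM. The property name `largeFileTests` and the class 
`LargeFileTestGuard` are made up for illustration. The guard also checks that 
the file is not just a Git LFS pointer, which is what ends up in the working 
tree when LFS content has not been pulled; pointer files start with the line 
`version https://git-lfs.github.com/spec/v1`.

```java
import java.io.IOException;
import java.io.InputStream;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.Arrays;

/** Hypothetical guard that decides whether tests needing large LFS files should run. */
public final class LargeFileTestGuard {

    /** Assumed name of the build toggle; not an existing project setting. */
    public static final String TOGGLE_PROPERTY = "largeFileTests";

    // First line of a Git LFS pointer file.
    private static final String LFS_POINTER_PREFIX = "version https://git-lfs.github.com/spec/v1";

    private LargeFileTestGuard() {
    }

    /** True only if the JVM was started with -DlargeFileTests=true. */
    public static boolean isEnabled() {
        return Boolean.getBoolean(TOGGLE_PROPERTY);
    }

    /** True if the file starts with the LFS pointer header, i.e. the real content was not pulled. */
    public static boolean isLfsPointer(Path file) throws IOException {
        byte[] expected = LFS_POINTER_PREFIX.getBytes(StandardCharsets.US_ASCII);
        try (InputStream in = Files.newInputStream(file)) {
            byte[] head = in.readNBytes(expected.length);
            return Arrays.equals(head, expected);
        }
    }

    /** Run the suite only when it is switched on and the real file content is present. */
    public static boolean shouldRun(Path largeFile) throws IOException {
        return isEnabled() && Files.isRegularFile(largeFile) && !isLfsPointer(largeFile);
    }
}
```

   With JUnit 5, a test in the separate suite could start with 
`Assumptions.assumeTrue(LargeFileTestGuard.shouldRun(path))`, so it is reported 
as skipped rather than failed when the toggle is off or the files are missing.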
   
   The final alternative I can think of is storing the files on some ASF host, 
if possible, and doing something similar to the Git LFS setup, but asking 
users to download these files before running the tests in certain suites.
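
   If the download should happen from the test setup rather than by hand, a 
sketch could look like the following. `TestDataFetcher` is a hypothetical 
helper, and the source URI would have to point at wherever the files end up 
being hosted, which is not decided here. It downloads once and reuses the local 
copy afterwards.

```java
import java.io.IOException;
import java.io.InputStream;
import java.net.URI;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.StandardCopyOption;

/** Hypothetical helper that fetches a test file once and caches it locally. */
public final class TestDataFetcher {

    private TestDataFetcher() {
    }

    /** Returns the target path, downloading from the source URI only if the target is missing. */
    public static Path fetchIfMissing(URI source, Path target) throws IOException {
        if (Files.isRegularFile(target)) {
            return target; // already cached from an earlier run
        }
        Path parent = target.toAbsolutePath().getParent();
        if (parent != null) {
            Files.createDirectories(parent);
        }
        // Download to a temporary sibling first so an aborted transfer never looks complete.
        Path partial = target.resolveSibling(target.getFileName() + ".part");
        try (InputStream in = source.toURL().openStream()) {
            Files.copy(in, partial, StandardCopyOption.REPLACE_EXISTING);
        }
        Files.move(partial, target, StandardCopyOption.REPLACE_EXISTING);
        return target;
    }
}
```

   This does not verify checksums; if the files were hosted externally, 
comparing a known hash after the download would be a sensible addition.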
   
   Not sure if any of this is really helpful.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
