[GitHub] nifi issue #397: NIFI-1815

2016-06-20 Thread olegz
Github user olegz commented on the issue:

https://github.com/apache/nifi/pull/397
  
back to reviewing this. It has some merge conflicts but i'll fox them as I 
go along


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---


[GitHub] nifi issue #397: NIFI-1815

2016-06-08 Thread olegz
Github user olegz commented on the issue:

https://github.com/apache/nifi/pull/397
  
I stepped away from it as I am trying to finish something else that is very 
involved. Will get back to it once I am finished, but I am committed to getting 
it in to both 0.7.0 and 1.0.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---


[GitHub] nifi issue #397: NIFI-1815

2016-06-08 Thread jdye64
Github user jdye64 commented on the issue:

https://github.com/apache/nifi/pull/397
  
Olegz any luck getting your local install of Tesseract to work?


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---


[GitHub] nifi issue #397: NIFI-1815

2016-06-06 Thread olegz
Github user olegz commented on the issue:

https://github.com/apache/nifi/pull/397
  
True, and I did the same ('brew install tesseract') and got this
```
Trying a mirror...
==> Downloading 
ftp://ftp.simplesystems.org/pub/libpng/png/src/libpng16/libpng-1.6.17.tar.xz

curl: (78) RETR response: 550
Error: Failed to download resource "libpng"
Download failed: 
ftp://ftp.simplesystems.org/pub/libpng/png/src/libpng16/libpng-1.6.17.tar.xz
```
So, I'll keep on trying 


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---


[GitHub] nifi issue #397: NIFI-1815

2016-06-06 Thread jdye64
Github user jdye64 commented on the issue:

https://github.com/apache/nifi/pull/397
  
Olegz - I'll certainly add the extra checking. As for installation on nom 
Windows I did the development on OS X and simply ran "brew install tesseract" 
rather than building from source

Sent from my iPhone

> On Jun 6, 2016, at 7:47 AM, Oleg Zhurakousky  
wrote:
> 
> Ok, while all is good on Windows I can't seem to have any success 
building those .so files on OSx. Normally I would not worry about it that much 
but given that Tesseract distribution includes DLLs inside the JAR means that 
for "all other" OS such native libraries will come from outside and need to be 
known to the processor, so we probably would need another property and 
definitely test with at least one non-Win system.
> So, I'll keep on trying (when I get a chance) to get/build those native 
libraries, but could use some help here as well
> 
> —
> You are receiving this because you were mentioned.
> Reply to this email directly, view it on GitHub, or mute the thread.
> 



---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---


[GitHub] nifi issue #397: NIFI-1815

2016-06-06 Thread olegz
Github user olegz commented on the issue:

https://github.com/apache/nifi/pull/397
  
Ok, while all is good on Windows I can't seem to have any success building 
those _.so_ files on OSx. Normally I would not worry about it that much but 
given that Tesseract distribution includes DLLs inside the JAR means that for 
"all other" OS such native libraries will come from outside and need to be 
known to the processor, so we probably would need another property and 
definitely test with at least one non-Win system.
So, I'll keep on trying (when I get a chance) to get/build those native 
libraries, but could use some help here as well


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---


[GitHub] nifi issue #397: NIFI-1815

2016-06-05 Thread jdye64
Github user jdye64 commented on the issue:

https://github.com/apache/nifi/pull/397
  
@olegz my initial intention was that the "TESS_DATA_PATH" and the 
"TessdataDirectoryValidator" being a required PropertyDescriptor would catch 
those instances where users tried to use this processor without the native 
system dependencies installed. You bring up a good point however since really 
all this would do is check that the Tesseract dictionary files are present 
which certainly doesn't mean that the native libraries will be there as well. 
This doesn't cover the unit test cases however. I like your point about 
creating a throw away Tesseract instance to check if the dependencies are 
present. I will add something like that.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---


[GitHub] nifi issue #397: NIFI-1815

2016-06-05 Thread joewitt
Github user joewitt commented on the issue:

https://github.com/apache/nifi/pull/397
  
@olegz definitely was thinking a customValidate check which determined 
whether tesseract was ready to roll.  If the check you mention does that then 
that sounds good.  We should be careful about ever call to customValidate 
resulting in an exception so probably best to periodically check if tesseract 
can work.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---


[GitHub] nifi issue #397: NIFI-1815

2016-06-05 Thread olegz
Github user olegz commented on the issue:

https://github.com/apache/nifi/pull/397
  
The DLLs are distributed with the Tesseract JAR, but not _.so_ or other OS 
specific libs. Basically the current JAR only supports Windows out-of-the-box.
@joewitt If I understand correctly we need to create a custom validator 
with something like this inside it
```
new Tesseract();
```
. . . basically a throw-away instance of Tesseract just to see if it loads 
successfully as that is where JNA mapping happens 


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---


[GitHub] nifi issue #397: NIFI-1815

2016-06-05 Thread joewitt
Github user joewitt commented on the issue:

https://github.com/apache/nifi/pull/397
  
as long as the processor is invalid and won't start until the 
environmentally required bits are present we're good.  And yes it is correct we 
cannot distribute the dlls in our source release but they would be fine for the 
convenience binary artifacts we put out presuming the licensing allows this.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---


[GitHub] nifi issue #397: NIFI-1815

2016-06-05 Thread olegz
Github user olegz commented on the issue:

https://github.com/apache/nifi/pull/397
  
@jdye64 I want to switch the discussion to a different topic now as I am 
starting to have second thoughts, specifically around JNA bindings and how to 
properly go about it in the context of NiFi and this processor. Basically out 
of the box the proc won't work until native libs are installed and I am not 
sure we can really 'make' and distribute them either. Any thoughts on this? 
Even tests depend on native libs and the only libs distributed with Tesseract 
are windows DLLs 


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---


[GitHub] nifi issue #397: NIFI-1815

2016-06-02 Thread olegz
Github user olegz commented on the issue:

https://github.com/apache/nifi/pull/397
  
@jdye64 I left one last comment. Once addressed I'll begin the merge 
procedure. I will also take care of LICENSE and NOTICE


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---