[jira] [Commented] (CONNECTORS-1079) the parsing in TikaExtractor always return empty result

2014-10-26 Thread Mingchun Zhao (JIRA)

[ 
https://issues.apache.org/jira/browse/CONNECTORS-1079?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14184445#comment-14184445
 ] 

Mingchun Zhao commented on CONNECTORS-1079:
---

Karl, thanks!
It does work pretty well.

 the parsing in TikaExtractor always return empty result
 ---

 Key: CONNECTORS-1079
 URL: https://issues.apache.org/jira/browse/CONNECTORS-1079
 Project: ManifoldCF
  Issue Type: Bug
  Components: Tika extractor
Affects Versions: ManifoldCF 2.0
Reporter: Mingchun Zhao
Assignee: Karl Wright
 Fix For: ManifoldCF 1.8, ManifoldCF 2.0


 When I use latest trunk source(2.0) to try the Tika content extractor,It did 
 not return any expected results.
 I looked at it using debugging tools, found that the parser of Tika content 
 extractor does not return any data.
 I've tried to move lib/tika-core-1.6.jar into connector-lib/, 
 Then, the Tika content extractor returned data as expected.
 My configurations are as below:
 ==
 Transformation:
  Type: Tika content extractor
 Output:
  Type:Solr(Use extract update handler=false)
 Repository:
  type: Web
 Job:
  1.type: repository
  2.type: transformation
  3.type: output
 ==
 Maybe, it is related to CONNECTORS-1074(?), 
 It looks like that the place of tika-core-1.6.jar affects the result of 
 TikaExtractor.
  



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (CONNECTORS-1077) Add activity logging for decision and exception events across all connectors

2014-10-26 Thread Muhammed Olgun (JIRA)

[ 
https://issues.apache.org/jira/browse/CONNECTORS-1077?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14184589#comment-14184589
 ] 

Muhammed Olgun commented on CONNECTORS-1077:


r1634373 (trunk) for GridFS

 Add activity logging for decision and exception events across all connectors
 

 Key: CONNECTORS-1077
 URL: https://issues.apache.org/jira/browse/CONNECTORS-1077
 Project: ManifoldCF
  Issue Type: Improvement
  Components: Alfresco connector
Affects Versions: ManifoldCF 2.0
Reporter: Karl Wright
Assignee: Karl Wright
 Fix For: ManifoldCF 2.0

 Attachments: Example.patch, capture, elasticsearch_review.patch, 
 elasticsearch_review2.patch


 Many document skip decisions or transient exceptions are only logged, and are 
 not recorded as history events.  This makes it necessary upon occasion to 
 refer to the manifoldcf log for basic diagnosis.  We should record activity 
 events for most decisions and exceptions in the history.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (CONNECTORS-1085) Introduce a mcf-connector-common.jar to save binary delivery space

2014-10-26 Thread Karl Wright (JIRA)

[ 
https://issues.apache.org/jira/browse/CONNECTORS-1085?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14184807#comment-14184807
 ] 

Karl Wright commented on CONNECTORS-1085:
-

This change trims our binary down to this:

{code}
10/27/2014  12:15 AM   168,748,568 apache-manifoldcf-2.0-dev-bin.zip
{code}


 Introduce a mcf-connector-common.jar to save binary delivery space
 --

 Key: CONNECTORS-1085
 URL: https://issues.apache.org/jira/browse/CONNECTORS-1085
 Project: ManifoldCF
  Issue Type: Improvement
  Components: Framework core
Affects Versions: ManifoldCF 2.0
Reporter: Karl Wright
Assignee: Karl Wright
 Fix For: ManifoldCF 2.0


 The ManifoldCF 2.0 deliverable provides a number of connector-only services 
 in mcf-core, such as:
 - ISO 8601 date parsing and formatting
 - Axis SOAP transport support via Httpcomponents Httpclient
 - extension to mime type mapping
 These functions have the unfortunate requirement that many (large) jar 
 packages wind up needing to be included at the root level, which since these 
 wind up in all of the various war files, really bloats the binary deliverable.
 For MCF 2.0, we can fix this by moving this functionality to a 
 mcf-connector-common.jar, which would be included in connector-lib rather 
 than at the root level.
 This can't be done for MCF 1.8, because of backwards compatibility reasons.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)