Build failed in Jenkins: ManifoldCF » ManifoldCF-ant-1x #5

2020-10-15 Thread Apache Jenkins Server
See 


Changes:


--
Started by an SCM change
Running as SYSTEM
[EnvInject] - Loading node environment variables.
Building remotely on H31 (ubuntu) in workspace 

Updating https://svn.apache.org/repos/asf/manifoldcf/branches/dev_1x at 
revision '2020-10-16T02:04:07.146 +'
At revision 1882571

[ManifoldCF-ant-1x] $ ant clean-core-deps make-core-deps clean
Exception in thread "main" java.lang.UnsupportedClassVersionError: 
org/apache/tools/ant/launch/Launcher : Unsupported major.minor version 52.0
at java.lang.ClassLoader.defineClass1(Native Method)
at java.lang.ClassLoader.defineClass(ClassLoader.java:800)
at 
java.security.SecureClassLoader.defineClass(SecureClassLoader.java:142)
at java.net.URLClassLoader.defineClass(URLClassLoader.java:449)
at java.net.URLClassLoader.access$100(URLClassLoader.java:71)
at java.net.URLClassLoader$1.run(URLClassLoader.java:361)
at java.net.URLClassLoader$1.run(URLClassLoader.java:355)
at java.security.AccessController.doPrivileged(Native Method)
at java.net.URLClassLoader.findClass(URLClassLoader.java:354)
at java.lang.ClassLoader.loadClass(ClassLoader.java:425)
at sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:308)
at java.lang.ClassLoader.loadClass(ClassLoader.java:358)
at sun.launcher.LauncherHelper.checkAndLoadMain(LauncherHelper.java:482)
Build step 'Invoke Ant' marked build as failure
Archiving artifacts
Publishing Javadoc


Build failed in Jenkins: ManifoldCF » ManifoldCF-mvn-1x #5

2020-10-15 Thread Apache Jenkins Server
See 


Changes:


--
[...truncated 355.02 KB...]
AU
site/src/documentation/resources/images/zh_CN/meridio-authority-status.PNG
AU
site/src/documentation/resources/images/zh_CN/meridio-authority-user-service-server.PNG
AU
site/src/documentation/resources/images/zh_CN/meridio-connection-credentials.PNG
AU
site/src/documentation/resources/images/zh_CN/meridio-connection-document-server.PNG
AU
site/src/documentation/resources/images/zh_CN/meridio-connection-records-server.PNG
AU
site/src/documentation/resources/images/zh_CN/meridio-connection-status.PNG
AU
site/src/documentation/resources/images/zh_CN/meridio-connection-web-client.PNG
AU
site/src/documentation/resources/images/zh_CN/metadataadjuster-job-add-metadata.PNG
AU
site/src/documentation/resources/images/zh_CN/metadataadjuster-job-move-metadata.PNG
AU
site/src/documentation/resources/images/zh_CN/opensearchserver-connection-parameters.PNG
AU
site/src/documentation/resources/images/zh_CN/opensearchserver-history-report.PNG
AU
site/src/documentation/resources/images/zh_CN/opensearchserver-job-parameters.PNG
AU
site/src/documentation/resources/images/zh_CN/opensearchserver-user.PNG
AUsite/src/documentation/resources/images/zh_CN/output-throttling.PNG
AUsite/src/documentation/resources/images/zh_CN/queue-status-example.PNG
AU
site/src/documentation/resources/images/zh_CN/queue-status-select-connection.PNG
AU
site/src/documentation/resources/images/zh_CN/queue-status-select-job.PNG
AU
site/src/documentation/resources/images/zh_CN/regexp-mapping-status.PNG
AU
site/src/documentation/resources/images/zh_CN/regexp-mapping-user-mapping.PNG
AU
site/src/documentation/resources/images/zh_CN/repository-throttling-with-throttle.PNG
AU
site/src/documentation/resources/images/zh_CN/repository-throttling.PNG
AU
site/src/documentation/resources/images/zh_CN/rss-configure-bandwidth.PNG
AUsite/src/documentation/resources/images/zh_CN/rss-configure-email.PNG
AUsite/src/documentation/resources/images/zh_CN/rss-configure-proxy.PNG
AUsite/src/documentation/resources/images/zh_CN/rss-configure-robots.PNG
AU
site/src/documentation/resources/images/zh_CN/rss-job-canonicalization.PNG
AU
site/src/documentation/resources/images/zh_CN/rss-job-dechromed-content.PNG
AUsite/src/documentation/resources/images/zh_CN/rss-job-exclusions.PNG
AUsite/src/documentation/resources/images/zh_CN/rss-job-mappings.PNG
AUsite/src/documentation/resources/images/zh_CN/rss-job-metadata.PNG
AUsite/src/documentation/resources/images/zh_CN/rss-job-security.PNG
AUsite/src/documentation/resources/images/zh_CN/rss-job-time-values.PNG
AUsite/src/documentation/resources/images/zh_CN/rss-job-urls.PNG
AUsite/src/documentation/resources/images/zh_CN/rss-status.PNG
AU
site/src/documentation/resources/images/zh_CN/sharepoint-configure-authoritytype.PNG
AU
site/src/documentation/resources/images/zh_CN/sharepoint-configure-server.PNG
AU
site/src/documentation/resources/images/zh_CN/sharepoint-job-metadata.PNG
AUsite/src/documentation/resources/images/zh_CN/sharepoint-job-paths.PNG
AU
site/src/documentation/resources/images/zh_CN/sharepoint-job-security.PNG
AUsite/src/documentation/resources/images/zh_CN/sharepoint-status.PNG
AU
site/src/documentation/resources/images/zh_CN/sharepointadauthority-configure-cache.PNG
AU
site/src/documentation/resources/images/zh_CN/sharepointadauthority-configure-dc.PNG
AU
site/src/documentation/resources/images/zh_CN/sharepointadauthority-status.PNG
AU
site/src/documentation/resources/images/zh_CN/sharepointnativeauthority-configure-cache.PNG
AU
site/src/documentation/resources/images/zh_CN/sharepointnativeauthority-configure-server.PNG
AU
site/src/documentation/resources/images/zh_CN/sharepointnativeauthority-status.PNG
AU
site/src/documentation/resources/images/zh_CN/simple-history-example.PNG
AU
site/src/documentation/resources/images/zh_CN/simple-history-select-activities.PNG
AU
site/src/documentation/resources/images/zh_CN/simple-history-select-connection.PNG
AU
site/src/documentation/resources/images/zh_CN/solr-configure-arguments.PNG
AU
site/src/documentation/resources/images/zh_CN/solr-configure-commits.PNG
AU
site/src/documentation/resources/images/zh_CN/solr-configure-documents.PNG
AU
site/src/documentation/resources/images/zh_CN/solr-configure-schema.PNG
AU
site/src/documentation/resources/images/zh_CN/solr-configure-server.PNG
AU
site/src/documentation/resources/images/zh_CN/solr-configure-solr-type.PNG
AU

[jira] [Commented] (CONNECTORS-1653) Solr ingester connector contribution

2020-10-15 Thread Olivier Tavard (Jira)


[ 
https://issues.apache.org/jira/browse/CONNECTORS-1653?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17214976#comment-17214976
 ] 

Olivier Tavard commented on CONNECTORS-1653:


No it is not relevant, sorry about that. It only needs the solr-solrj*.jar 
mentioned upper in the file.

> Solr ingester connector contribution
> 
>
> Key: CONNECTORS-1653
> URL: https://issues.apache.org/jira/browse/CONNECTORS-1653
> Project: ManifoldCF
>  Issue Type: New Feature
>Reporter: Olivier Tavard
>Assignee: Karl Wright
>Priority: Minor
> Attachments: solr_ingester_connector_patch.txt
>
>
> Hi,
> We developed a new repository connector for crawling data from Solr and we 
> would like to contribute to MCF by releasing the code into Apache v2 license.
> The goal of this connector is to crawl Solr instances and manage it in MCF 
> rather than using DIH for instance.
> So to do it, we send requests to Solr and we manage the large number of 
> results thanks to the cursormark. The Solr fields must be stored in order to 
> be gathered.
> By the way we do not use any specific libraries, all the dependencies are 
> already into MCF. We tested it so far for Solr 7 and 8 versions.
> The documentation is here : 
> https://datafari.atlassian.net/wiki/spaces/DATAFARI/pages/673742849/Solr+ingester+crawler+connector
> The code is attached.
> Best regards,
> Olivier Tavard



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (CONNECTORS-1655) Web connector - UnsupportedEncodingException utf-8

2020-10-15 Thread Karl Wright (Jira)


[ 
https://issues.apache.org/jira/browse/CONNECTORS-1655?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17214873#comment-17214873
 ] 

Karl Wright commented on CONNECTORS-1655:
-

So you are using a non-standard JVM that doesn't understand utf-8 character 
encoding.
Sorry, you don't get a fix for that. o_O  Use a standard JVM please.


> Web connector - UnsupportedEncodingException utf-8
> --
>
> Key: CONNECTORS-1655
> URL: https://issues.apache.org/jira/browse/CONNECTORS-1655
> Project: ManifoldCF
>  Issue Type: Bug
>  Components: Web connector
>Affects Versions: ManifoldCF 2.17
>Reporter: Julien Massiera
>Priority: Critical
>
> When crawling some sites (for instance this one: 
> [http://www.antibes-juanlespins.com/] ) the job manages to index some 
> documents, but the stops with the following error code:
> Error: IO error: utf-8; filename=rseventspro_rss20_56.xml
> Here is one the MCF stacktrace: 
> Exception tossed: IO error: utf-8; filename=rseventspro_rss20_56.xml
> org.apache.manifoldcf.core.interfaces.ManifoldCFException: IO error: utf-8; 
> filename=rseventspro_rss20_56.xml
> at 
> org.apache.manifoldcf.crawler.connectors.webcrawler.WebcrawlerConnector.handleXML(WebcrawlerConnector.java:4203)
>  ~[?:?]
> at 
> org.apache.manifoldcf.crawler.connectors.webcrawler.WebcrawlerConnector.extractLinks(WebcrawlerConnector.java:3855)
>  ~[?:?]
> at 
> org.apache.manifoldcf.crawler.connectors.webcrawler.WebcrawlerConnector.processDocuments(WebcrawlerConnector.java:746)
>  ~[?:?]
> at 
> org.apache.manifoldcf.crawler.system.WorkerThread.run(WorkerThread.java:399) 
> [mcf-pull-agent.jar:?]
> Caused by: java.io.UnsupportedEncodingException: utf-8; 
> filename=rseventspro_rss20_56.xml
> at sun.nio.cs.StreamDecoder.forInputStreamReader(StreamDecoder.java:71) 
> ~[?:1.8.0_212]
> at java.io.InputStreamReader.(InputStreamReader.java:100) ~[?:1.8.0_212]
> at 
> org.apache.manifoldcf.connectorcommon.fuzzyml.DecodingByteReceiver.dealWithBytes(DecodingByteReceiver.java:47)
>  ~[?:?]
> at 
> org.apache.manifoldcf.connectorcommon.fuzzyml.BOMEncodingDetector.dealWithRemainder(BOMEncodingDetector.java:250)
>  ~[?:?]
> at 
> org.apache.manifoldcf.connectorcommon.fuzzyml.SingleByteReceiver.dealWithBytes(SingleByteReceiver.java:52)
>  ~[?:?]
> at 
> org.apache.manifoldcf.connectorcommon.fuzzyml.Parser.parseWithCharsetDetection(Parser.java:74)
>  ~[?:?]
> at 
> org.apache.manifoldcf.crawler.connectors.webcrawler.WebcrawlerConnector.handleXML(WebcrawlerConnector.java:4174)
>  ~[?:?]
> ... 3 more



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (CONNECTORS-1655) Web connector - UnsupportedEncodingException utf-8

2020-10-15 Thread Julien Massiera (Jira)
Julien Massiera created CONNECTORS-1655:
---

 Summary: Web connector - UnsupportedEncodingException utf-8
 Key: CONNECTORS-1655
 URL: https://issues.apache.org/jira/browse/CONNECTORS-1655
 Project: ManifoldCF
  Issue Type: Bug
  Components: Web connector
Affects Versions: ManifoldCF 2.17
Reporter: Julien Massiera


When crawling some sites (for instance this one: 
[http://www.antibes-juanlespins.com/] ) the job manages to index some 
documents, but the stops with the following error code:
Error: IO error: utf-8; filename=rseventspro_rss20_56.xml

Here is one the MCF stacktrace: 
Exception tossed: IO error: utf-8; filename=rseventspro_rss20_56.xml
org.apache.manifoldcf.core.interfaces.ManifoldCFException: IO error: utf-8; 
filename=rseventspro_rss20_56.xml
at 
org.apache.manifoldcf.crawler.connectors.webcrawler.WebcrawlerConnector.handleXML(WebcrawlerConnector.java:4203)
 ~[?:?]
at 
org.apache.manifoldcf.crawler.connectors.webcrawler.WebcrawlerConnector.extractLinks(WebcrawlerConnector.java:3855)
 ~[?:?]
at 
org.apache.manifoldcf.crawler.connectors.webcrawler.WebcrawlerConnector.processDocuments(WebcrawlerConnector.java:746)
 ~[?:?]
at org.apache.manifoldcf.crawler.system.WorkerThread.run(WorkerThread.java:399) 
[mcf-pull-agent.jar:?]
Caused by: java.io.UnsupportedEncodingException: utf-8; 
filename=rseventspro_rss20_56.xml
at sun.nio.cs.StreamDecoder.forInputStreamReader(StreamDecoder.java:71) 
~[?:1.8.0_212]
at java.io.InputStreamReader.(InputStreamReader.java:100) ~[?:1.8.0_212]
at 
org.apache.manifoldcf.connectorcommon.fuzzyml.DecodingByteReceiver.dealWithBytes(DecodingByteReceiver.java:47)
 ~[?:?]
at 
org.apache.manifoldcf.connectorcommon.fuzzyml.BOMEncodingDetector.dealWithRemainder(BOMEncodingDetector.java:250)
 ~[?:?]
at 
org.apache.manifoldcf.connectorcommon.fuzzyml.SingleByteReceiver.dealWithBytes(SingleByteReceiver.java:52)
 ~[?:?]
at 
org.apache.manifoldcf.connectorcommon.fuzzyml.Parser.parseWithCharsetDetection(Parser.java:74)
 ~[?:?]
at 
org.apache.manifoldcf.crawler.connectors.webcrawler.WebcrawlerConnector.handleXML(WebcrawlerConnector.java:4174)
 ~[?:?]
... 3 more



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (CONNECTORS-1653) Solr ingester connector contribution

2020-10-15 Thread Karl Wright (Jira)


[ 
https://issues.apache.org/jira/browse/CONNECTORS-1653?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17214684#comment-17214684
 ] 

Karl Wright commented on CONNECTORS-1653:
-

Looked briefly at the code; looked good so far from what I see.

However, one question.  The connector build.xml has this in it:

{code}
+
+
+
+
+  
+
+
+
+  
+
+
{code}

These are the ManifoldCF solr security plugins.  Do they apply here?


> Solr ingester connector contribution
> 
>
> Key: CONNECTORS-1653
> URL: https://issues.apache.org/jira/browse/CONNECTORS-1653
> Project: ManifoldCF
>  Issue Type: New Feature
>Reporter: Olivier Tavard
>Assignee: Karl Wright
>Priority: Minor
> Attachments: solr_ingester_connector_patch.txt
>
>
> Hi,
> We developed a new repository connector for crawling data from Solr and we 
> would like to contribute to MCF by releasing the code into Apache v2 license.
> The goal of this connector is to crawl Solr instances and manage it in MCF 
> rather than using DIH for instance.
> So to do it, we send requests to Solr and we manage the large number of 
> results thanks to the cursormark. The Solr fields must be stored in order to 
> be gathered.
> By the way we do not use any specific libraries, all the dependencies are 
> already into MCF. We tested it so far for Solr 7 and 8 versions.
> The documentation is here : 
> https://datafari.atlassian.net/wiki/spaces/DATAFARI/pages/673742849/Solr+ingester+crawler+connector
> The code is attached.
> Best regards,
> Olivier Tavard



--
This message was sent by Atlassian Jira
(v8.3.4#803005)