Hi Karl and Mingchun,

Thanks for your work on the last few issues. I agree with you about this Solr testing problem.

That said, we've tested this new connector in our application with a FileShare job, and everything was OK.

I hope, and think, that the problem is specific to the test: a missing update or incompatible dependencies, for example.

On 20/10/2023 at 02:58, Mingchun Zhao wrote:
Hi Karl, Thanks!

so I wonder if, once again, there's a problem with dependencies for the
version of Solr they chose.

I'll take a look at this issue.

On Fri, Oct 20, 2023 at 9:50 AM Karl Wright <daddy...@gmail.com> wrote:

FranceLabs updated this connector.  The problem seems to occur at a basic
level during HTTP/2 communication, so I wonder if, once again, there's a
problem with dependencies for the version of Solr they chose.

Karl


On Thu, Oct 19, 2023 at 8:32 PM Mingchun Zhao <mingchun.zha...@gmail.com> wrote:

About the "SolrCrawlHSQLDBIT" test failure: the error
"IO exception during indexinghttp://test58.txt: frame_size_error/invalid_frame_length"
seems to be occurring on the ManifoldCF side.

- command:
```
manifoldcf/connectors/solr% ant run-IT-HSQLDB

run-IT-HSQLDB:
     [junit] Testsuite: org.apache.manifoldcf.agents.output.solr.tests.SolrCrawlHSQLDBIT
... ...
```

- I checked "connectors/solr/test-HSQLDB-output/manifoldcf.log":
```
  WARN 2023-10-20T09:14:56,635 (Worker thread '18') - IO exception during indexinghttp://test58.txt: frame_size_error/invalid_frame_length
java.io.IOException: frame_size_error/invalid_frame_length
at org.eclipse.jetty.http2.HTTP2Session.toFailure(HTTP2Session.java:566) ~[http2-common-9.4.48.v20220622.jar:9.4.48.v20220622]
at org.eclipse.jetty.http2.HTTP2Session.access$2700(HTTP2Session.java:80) ~[http2-common-9.4.48.v20220622.jar:9.4.48.v20220622]
at org.eclipse.jetty.http2.HTTP2Session$StreamsState.onSessionFailure(HTTP2Session.java:1857) ~[http2-common-9.4.48.v20220622.jar:9.4.48.v20220622]
at org.eclipse.jetty.http2.HTTP2Session$StreamsState.access$400(HTTP2Session.java:1436) ~[http2-common-9.4.48.v20220622.jar:9.4.48.v20220622]
at org.eclipse.jetty.http2.HTTP2Session.onSessionFailure(HTTP2Session.java:511) ~[http2-common-9.4.48.v20220622.jar:9.4.48.v20220622]
at org.eclipse.jetty.http2.HTTP2Session.onConnectionFailure(HTTP2Session.java:506) ~[http2-common-9.4.48.v20220622.jar:9.4.48.v20220622]
at org.eclipse.jetty.http2.parser.Parser$Listener$Wrapper.onConnectionFailure(Parser.java:414) ~[http2-common-9.4.48.v20220622.jar:9.4.48.v20220622]
at org.eclipse.jetty.http2.HTTP2Connection$ParserListener.onConnectionFailure(HTTP2Connection.java:397) ~[http2-common-9.4.48.v20220622.jar:9.4.48.v20220622]
at org.eclipse.jetty.http2.parser.BodyParser.notifyConnectionFailure(BodyParser.java:223) ~[http2-common-9.4.48.v20220622.jar:9.4.48.v20220622]
at org.eclipse.jetty.http2.parser.BodyParser.connectionFailure(BodyParser.java:215) ~[http2-common-9.4.48.v20220622.jar:9.4.48.v20220622]
at org.eclipse.jetty.http2.parser.Parser.connectionFailure(Parser.java:209) ~[http2-common-9.4.48.v20220622.jar:9.4.48.v20220622]
at org.eclipse.jetty.http2.parser.Parser.parseHeader(Parser.java:155) ~[http2-common-9.4.48.v20220622.jar:9.4.48.v20220622]
at org.eclipse.jetty.http2.parser.Parser.parse(Parser.java:121) ~[http2-common-9.4.48.v20220622.jar:9.4.48.v20220622]
at org.eclipse.jetty.http2.HTTP2Connection$HTTP2Producer.produce(HTTP2Connection.java:261) ~[http2-common-9.4.48.v20220622.jar:9.4.48.v20220622]
at org.eclipse.jetty.util.thread.strategy.EatWhatYouKill.produceTask(EatWhatYouKill.java:362) ~[jetty-util-9.4.48.v20220622.jar:9.4.48.v20220622]
at org.eclipse.jetty.util.thread.strategy.EatWhatYouKill.doProduce(EatWhatYouKill.java:186) ~[jetty-util-9.4.48.v20220622.jar:9.4.48.v20220622]
at org.eclipse.jetty.util.thread.strategy.EatWhatYouKill.tryProduce(EatWhatYouKill.java:173) ~[jetty-util-9.4.48.v20220622.jar:9.4.48.v20220622]
at org.eclipse.jetty.util.thread.strategy.EatWhatYouKill.produce(EatWhatYouKill.java:137) ~[jetty-util-9.4.48.v20220622.jar:9.4.48.v20220622]
at org.eclipse.jetty.http2.HTTP2Connection.produce(HTTP2Connection.java:183) ~[http2-common-9.4.48.v20220622.jar:9.4.48.v20220622]
at org.eclipse.jetty.http2.HTTP2Connection.onFillable(HTTP2Connection.java:138) ~[http2-common-9.4.48.v20220622.jar:9.4.48.v20220622]
at org.eclipse.jetty.http2.HTTP2Connection$FillableCallback.succeeded(HTTP2Connection.java:361) ~[http2-common-9.4.48.v20220622.jar:9.4.48.v20220622]
at org.eclipse.jetty.io.FillInterest.fillable(FillInterest.java:105) ~[jetty-io-9.4.48.v20220622.jar:9.4.48.v20220622]
at org.eclipse.jetty.io.ChannelEndPoint$1.run(ChannelEndPoint.java:104) ~[jetty-io-9.4.48.v20220622.jar:9.4.48.v20220622]
at org.eclipse.jetty.util.thread.Invocable.invokeNonBlocking(Invocable.java:69) ~[jetty-util-9.4.48.v20220622.jar:9.4.48.v20220622]
at org.eclipse.jetty.util.thread.strategy.EatWhatYouKill.invokeTask(EatWhatYouKill.java:350) ~[jetty-util-9.4.48.v20220622.jar:9.4.48.v20220622]
at org.eclipse.jetty.util.thread.strategy.EatWhatYouKill.doProduce(EatWhatYouKill.java:305) ~[jetty-util-9.4.48.v20220622.jar:9.4.48.v20220622]
at org.eclipse.jetty.util.thread.strategy.EatWhatYouKill.tryProduce(EatWhatYouKill.java:173) ~[jetty-util-9.4.48.v20220622.jar:9.4.48.v20220622]
at org.eclipse.jetty.util.thread.strategy.EatWhatYouKill.produce(EatWhatYouKill.java:137) ~[jetty-util-9.4.48.v20220622.jar:9.4.48.v20220622]
at org.apache.solr.common.util.ExecutorUtil$MDCAwareThreadPoolExecutor.lambda$execute$0(ExecutorUtil.java:271) ~[solr-solrj-9.1.0.jar:9.1.0 aa4f3d98ab19c201e7f3c74cd14c99174148616d - ishan - 2022-11-11 13:00:47]
at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1128) ~[?:?]
at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:628) ~[?:?]
at java.lang.Thread.run(Thread.java:829) ~[?:?]
```
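
For context on the error name only, and not as a diagnosis of this test: the trace passes through `Parser.parseHeader`, which reads the fixed 9-byte HTTP/2 frame header. Below is a minimal sketch of how the length field of such a header is decoded, assuming the default 16384-byte maximum frame size from the HTTP/2 specification (the settings in this setup may differ). The class and method names are hypothetical, and this is not Jetty's code. The second input is a hypothetical case in which the peer answers with HTTP/1.1 text rather than HTTP/2 frames; whether that is what happens here has not been verified.

```java
import java.nio.charset.StandardCharsets;

public class FrameHeaderSketch {
    // Default SETTINGS_MAX_FRAME_SIZE in HTTP/2 (2^14); a peer may advertise a larger value.
    static final int DEFAULT_MAX_FRAME_SIZE = 16_384;

    // The first 3 bytes of the 9-byte frame header hold the payload length (24-bit, big-endian).
    static int frameLength(byte[] header) {
        return ((header[0] & 0xFF) << 16) | ((header[1] & 0xFF) << 8) | (header[2] & 0xFF);
    }

    // True when the declared length is larger than the default limit.
    static boolean exceedsDefaultLimit(byte[] header) {
        return frameLength(header) > DEFAULT_MAX_FRAME_SIZE;
    }

    public static void main(String[] args) {
        // A well-formed header: length 5, type 0x0 (DATA), no flags, stream 1.
        byte[] dataFrame = {0, 0, 5, 0, 0, 0, 0, 0, 1};
        // Hypothetical input: plain HTTP/1.1 text read as if it were a frame header.
        byte[] http11 = "HTTP/1.1 400 Bad Request".getBytes(StandardCharsets.US_ASCII);

        System.out.println(frameLength(dataFrame) + " " + exceedsDefaultLimit(dataFrame));
        System.out.println(frameLength(http11) + " " + exceedsDefaultLimit(http11));
    }
}
```

In the second case the bytes `H`, `T`, `T` decode to a length of 4740180, far above the default limit, which is the kind of value a parser would reject with a frame size error.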

On Thu, Oct 19, 2023 at 8:37 PM Mingchun Zhao <mingchun.zha...@gmail.com> wrote:

I'm having trouble with the Solr test `ant run-IT-HSQLDB`.
It's been over an hour since I started the test and it still hasn't finished.
The problem is probably that the ManifoldCF job never finishes.
Do you have any ideas or advice? Thanks!

```
~/ManifoldCF/manifoldcf/connectors/solr% ant run-IT-HSQLDB
Buildfile: /Users/zhaomingchun/ManifoldCF/manifoldcf/connectors/solr/build.xml

... ...
run-IT-HSQLDB:
     [junit] Testsuite: org.apache.manifoldcf.agents.output.solr.tests.SolrCrawlHSQLDBIT
     [junit] Configuration file successfully read
     [junit] [main] INFO org.eclipse.jetty.util.log - Logging initialized @7416ms to org.eclipse.jetty.util.log.Slf4jLog
     [junit] [main] INFO org.eclipse.jetty.server.Server - jetty-9.4.48.v20220622; built: 2022-06-21T20:42:25.880Z; git: 6b67c5719d1f4371b33655ff2d047d24e171e49a; jvm 11.0.11+9
     [junit] [main] INFO org.eclipse.jetty.server.session - DefaultSessionIdManager workerName=node0
     [junit] [main] INFO org.eclipse.jetty.server.session - No SessionScavenger set, using defaults
     [junit] [main] INFO org.eclipse.jetty.server.session - node0 Scavenging every 600000ms
     [junit] [main] INFO org.eclipse.jetty.server.handler.ContextHandler - Started o.e.j.w.WebAppContext@4bdc8b5d{ManifoldCF Crawler Interface,/mcf-crawler-ui,file:///private/var/folders/zh/mx4q_qh93cv6jtp13ht8b1fr0000gn/T/jetty-0_0_0_0-8346-mcf-crawler-ui_war-_mcf-crawler-ui-any-17189990303852051874/webapp/,AVAILABLE}{/Users/zhaomingchun/ManifoldCF/manifoldcf/dist/web/war/mcf-crawler-ui.war}
     [junit] [main] INFO org.eclipse.jetty.server.handler.ContextHandler - Started o.e.j.w.WebAppContext@3bcd426c{ManifoldCF Authorities API Webapp,/mcf-authority-service,file:///private/var/folders/zh/mx4q_qh93cv6jtp13ht8b1fr0000gn/T/jetty-0_0_0_0-8346-mcf-authority-service_war-_mcf-authority-service-any-1541416273384984853/webapp/,AVAILABLE}{/Users/zhaomingchun/ManifoldCF/manifoldcf/dist/web/war/mcf-authority-service.war}
     [junit] Creating mock service
     [junit] [main] INFO org.eclipse.jetty.server.handler.ContextHandler - Started o.e.j.w.WebAppContext@5f14a673{ManifoldCF General API Webapp,/mcf-api-service,file:///private/var/folders/zh/mx4q_qh93cv6jtp13ht8b1fr0000gn/T/jetty-0_0_0_0-8346-mcf-api-service_war-_mcf-api-service-any-12421572948842353797/webapp/,AVAILABLE}{/Users/zhaomingchun/ManifoldCF/manifoldcf/dist/web/war/mcf-api-service.war}
     [junit] Mock service created
     [junit] [main] INFO org.eclipse.jetty.server.AbstractConnector - Started ServerConnector@70325d20{HTTP/1.1, (http/1.1)}{0.0.0.0:8346}
     [junit] [main] INFO org.eclipse.jetty.server.Server - Started @9616ms
     [junit] [main] INFO org.eclipse.jetty.server.Server - jetty-9.4.48.v20220622; built: 2022-06-21T20:42:25.880Z; git: 6b67c5719d1f4371b33655ff2d047d24e171e49a; jvm 11.0.11+9
     [junit] [main] INFO org.eclipse.jetty.server.session - DefaultSessionIdManager workerName=node0
     [junit] [main] INFO org.eclipse.jetty.server.session - No SessionScavenger set, using defaults
     [junit] [main] INFO org.eclipse.jetty.server.session - node0 Scavenging every 600000ms
     [junit] [main] INFO org.eclipse.jetty.server.handler.ContextHandler - Started o.e.j.s.ServletContextHandler@7808f638{/solr,null,AVAILABLE}
     [junit] [main] INFO org.eclipse.jetty.server.AbstractConnector - Started ServerConnector@2ab5afc7{HTTP/1.1, (http/1.1)}{0.0.0.0:8188}
     [junit] [main] INFO org.eclipse.jetty.server.Server - Started @9626ms
     [junit] [main] INFO org.eclipse.jetty.server.AbstractConnector - Stopped ServerConnector@2ab5afc7{HTTP/1.1, (http/1.1)}{0.0.0.0:8188}
     [junit] [main] INFO org.eclipse.jetty.server.session - node0 Stopped scavenging
     [junit] [main] INFO org.eclipse.jetty.server.handler.ContextHandler - Stopped o.e.j.s.ServletContextHandler@7808f638{/solr,null,STOPPED}
```

On Thu, Oct 19, 2023 at 8:05 PM Mingchun Zhao <mingchun.zha...@gmail.com> wrote:

Hi Karl,

I've tried updating Kafka and its dependencies, including ZooKeeper, to the
latest versions, and confirmed that the Kafka test `run-IT-HSQLDB` passes, as shown below:

```
~manifoldcf% cd connectors/kafka
~manifoldcf/connectors/kafka/% ant run-IT-HSQLDB

BUILD SUCCESSFUL
Total time: 1 minute 19 seconds
```

Also, I prepared a PR for this issue:
https://github.com/apache/manifoldcf/pull/155

On Thu, Oct 19, 2023 at 7:12 AM Karl Wright <daddy...@gmail.com> wrote:

Hi,

It looks like the latest release of Kafka is 3.6.0.

I'd try setting that version in the pom for connectors/kafka and doing
mvn install.  Then you can see what dependencies it wants by:
mvn dependency:tree

It may be that Kafka no longer even requires zookeeper - I didn't find it
in a cursory inspection. But the dependency:tree would be the final word.

Karl


On Sat, Oct 14, 2023 at 2:17 AM Mingchun Zhao <mingchun.zha...@gmail.com> wrote:

Karl, thanks!
Though I'm not familiar with Kafka, I'll do my best to find out what's
causing the error.

Kind Regards,
Mingchun


On Sat, Oct 14, 2023 at 2:07 PM Karl Wright <daddy...@gmail.com> wrote:

Yes, this seems to be something related to the zookeeper update and the
Kafka library version we're using.

Someone will need to dig into what is going wrong here before we can
release.  I don't know how widely used the kafka connector is, but if it is
lightly used we can perhaps not distribute the connector any longer.  But
that would be a last choice.

Karl


On Fri, Oct 13, 2023 at 12:12 PM Mingchun Zhao <mingchun.zha...@gmail.com> wrote:

By applying r1912939, I was able to confirm that the kafka test compile
error has disappeared when running `ant test`.
Thanks, Karl!

However, the following error occurred on subsequent test runs.
```
     [junit] Testsuite: org.apache.manifoldcf.agents.output.kafka.APISanityHSQLDBIT
     [junit] Tests run: 1, Failures: 0, Errors: 1, Skipped: 0, Time elapsed: 0 sec
     [junit]
     [junit] Testcase: org.apache.manifoldcf.agents.output.kafka.APISanityHSQLDBIT:sanityCheck:    Caused an ERROR
     [junit] Forked Java VM exited abnormally. Please note the time in the report does not reflect the time until the VM exit.
     [junit] junit.framework.AssertionFailedError: Forked Java VM exited abnormally. Please note the time in the report does not reflect the time until the VM exit.
     [junit]     at jdk.internal.reflect.GeneratedMethodAccessor4.invoke(Unknown Source)
     [junit]     at java.base/jdk.internal.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
     [junit]     at jdk.internal.reflect.GeneratedMethodAccessor4.invoke(Unknown Source)
     [junit]     at java.base/jdk.internal.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
     [junit]     at jdk.internal.reflect.GeneratedMethodAccessor4.invoke(Unknown Source)
     [junit]     at java.base/jdk.internal.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
     [junit]     at jdk.internal.reflect.GeneratedMethodAccessor4.invoke(Unknown Source)
     [junit]     at java.base/jdk.internal.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
     [junit]
     [junit]

BUILD FAILED
/Users/zhaomingchun/ManifoldCF/manifoldcf/build.xml:517: The following error occurred while executing this line:
/Users/zhaomingchun/ManifoldCF/manifoldcf/build.xml:471: The following error occurred while executing this line:
/Users/zhaomingchun/ManifoldCF/manifoldcf/dist/connector-build.xml:1102: Test org.apache.manifoldcf.agents.output.kafka.APISanityHSQLDBIT failed (crashed)
```


On Fri, Oct 13, 2023 at 9:56 PM Karl Wright <daddy...@gmail.com> wrote:

r1912939 fixes this but I need to spin a new RC.
Karl


On Fri, Oct 13, 2023 at 8:46 AM Karl Wright <daddy...@gmail.com> wrote:

Yes, I get the same thing; a test needs to be updated.

     [javac] C:\wip\mcf\release-2.26-branch\connectors\kafka\connector\src\test\java\org\apache\manifoldcf\agents\output\kafka\ZooKeeperLocal.java:45: error: unreported exception AdminServerException; must be caught or declared to be thrown
     [javac]   zooKeeperServer.runFromConfig(configuration);
     [javac]
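
The javac message above means that `runFromConfig` declares a checked exception that the calling code neither catches nor declares. A minimal sketch of the usual remedy follows, using hypothetical stand-in classes so that it compiles without ZooKeeper on the classpath. It only illustrates the language rule; it is not the actual change made to ZooKeeperLocal.java.

```java
public class CheckedExceptionSketch {
    // Hypothetical stand-in for the checked exception named in the compiler error.
    static class AdminServerException extends Exception {
        AdminServerException(String message) {
            super(message);
        }
    }

    // Hypothetical stand-in for the server class; the real one comes from ZooKeeper.
    static class StandInServer {
        void runFromConfig(String configuration) throws AdminServerException {
            if (configuration == null) {
                throw new AdminServerException("no configuration");
            }
        }
    }

    // The caller must either catch the checked exception (as here) or add it
    // to its own throws clause; otherwise javac reports "unreported exception".
    static String start(StandInServer server, String configuration) {
        try {
            server.runFromConfig(configuration);
            return "started";
        } catch (AdminServerException e) {
            return "failed: " + e.getMessage();
        }
    }

    public static void main(String[] args) {
        System.out.println(start(new StandInServer(), "zoo.cfg"));
        System.out.println(start(new StandInServer(), null));
    }
}
```

The alternative, declaring `throws AdminServerException` on the calling method, pushes the decision up to whoever calls it, which is often fine in test helpers.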

Karl


On Fri, Oct 13, 2023 at 8:35 AM Karl Wright <daddy...@gmail.com> wrote:

There was a Zookeeper dependency change this release.  I wonder if there
is a test that needs to be updated.  Let me try and see.

Karl


On Fri, Oct 13, 2023 at 4:51 AM Piergiorgio Lucidi <piergior...@apache.org> wrote:

Hi Mingchun,

Thank you for your message. I was trying to build ManifoldCF using
OpenJDK 17, so to support that version of Java in the future we should
probably include the JAXB libraries as well.

The build is OK now and I can compile and package everything correctly.
Unfortunately, when executing the tests I get the following error:

compile-tests:
     [javac] Compiling 1 source file to /Users/piergiorgiolucidi/Downloads/apache-manifoldcf-2.26/connectors/kafka/build/connector-tests/classes
     [javac] /Users/piergiorgiolucidi/Downloads/apache-manifoldcf-2.26/connectors/kafka/connector/src/test/java/org/apache/manifoldcf/agents/output/kafka/ZooKeeperLocal.java:45: error: unreported exception AdminServerException; must be caught or declared to be thrown
     [javac]   zooKeeperServer.runFromConfig(configuration);
     [javac]                                        ^
     [javac] 1 error

BUILD FAILED
/Users/piergiorgiolucidi/Downloads/apache-manifoldcf-2.26/build.xml:497: The following error occurred while executing this line:
/Users/piergiorgiolucidi/Downloads/apache-manifoldcf-2.26/build.xml:471: The following error occurred while executing this line:
/Users/piergiorgiolucidi/Downloads/apache-manifoldcf-2.26/dist/connector-build.xml:720: Compile failed; see the compiler error output for details.

Any ideas?

Thanks.

Cheers,
PG



On Thu, Oct 12, 2023 at 10:39 AM Mingchun Zhao <mingchun.zha...@gmail.com> wrote:

Hi Piergiorgio,

FYI, allow me to share the Java and Ant versions and the build steps I ran
in my environment.

```
$ java --version
openjdk 11.0.11 2021-04-20
OpenJDK Runtime Environment AdoptOpenJDK-11.0.11+9 (build 11.0.11+9)
OpenJDK 64-Bit Server VM AdoptOpenJDK-11.0.11+9 (build 11.0.11+9, mixed mode)

$ ant -version
Apache Ant(TM) version 1.10.0 compiled on December 27 2016
```
```
ant clean
ant clean-deps
ant clean-core-deps
ant make-core-deps
ant make-deps
ant build
```

Regards,
Mingchun

On Thu, Oct 12, 2023 at 5:32 PM Piergiorgio Lucidi <piergior...@apache.org> wrote:

Hi folks,

it seems that I can't compile the CswsConnector:

     [javac]     public List<? extends AttributeGroup> getAttributeGroups()
     [javac]                           ^
     [javac]   symbol:   class AttributeGroup
     [javac]   location: class CswsConnector.ObjectInformation
     [javac] /Volumes/BackupPJ/ManifoldCF-release/apache-manifoldcf-2.26/connectors/csws/connector/src/main/java/org/apache/manifoldcf/crawler/connectors/csws/CswsConnector.java:3966: error: cannot find symbol
     [javac]     public NodePermissions getPermissions()
     [javac]            ^
     [javac]   symbol:   class NodePermissions
     [javac]   location: class CswsConnector.ObjectInformation
     [javac] 100 errors
     [javac] 1 warning
     [javac] only showing the first 100 errors, of 123 total; use -Xmaxerrs if you would like to see more

BUILD FAILED
/Volumes/BackupPJ/ManifoldCF-release/apache-manifoldcf-2.26/build.xml:489: The following error occurred while executing this line:
/Volumes/BackupPJ/ManifoldCF-release/apache-manifoldcf-2.26/build.xml:471: The following error occurred while executing this line:
/Volumes/BackupPJ/ManifoldCF-release/apache-manifoldcf-2.26/dist/connector-build.xml:686: Compile failed; see the compiler error output for details.

Do you have any ideas?

Thanks,
PG

On Thu, Oct 12, 2023 at 9:03 AM Guylaine BASSETTE <guylaine.basse...@francelabs.com> wrote:

Hi,

+1 from France Labs

Regards,
Guylaine

France Labs – Your knowledge, now
Datafari Enterprise Search – Découvrez la version 5 / Discover our version 5
www.datafari.com <http://www.datafari.com>

On 11/10/2023 at 23:12, Furkan KAMACI wrote:

Hi,

+1!

Kind regards,
Furkan Kamaci

On Wed, Oct 11, 2023 at 11:13 PM Mingchun Zhao <mingchun.zha...@gmail.com> wrote:

Hi,
Ran all the tests.
+1 from me.

Kind regards,
Mingchun

On Thu, Oct 12, 2023 at 4:29 AM Cihad Guzel <cguz...@gmail.com> wrote:

Hi,

+1 from me

Regards,
Cihad Guzel

On Wed, Oct 11, 2023 at 3:08 PM Karl Wright <daddy...@gmail.com> wrote:

Please vote on whether to release Apache ManifoldCF 2.26, RC0.
This release is the first release that requires at least Java 11, and it
also includes a new CSV connector along with support for Solr 9.

The release artifact can be found at:
https://dist.apache.org/repos/dist/dev/manifoldcf/apache-manifoldcf-2.26
and there is also a release tag at:
https://svn.apache.org/repos/asf/manifoldcf/tags/release-2.26-RC0

Karl



--
Piergiorgio


--
Piergiorgio

--
Kind regards,
Guylaine

France Labs – Your knowledge, now
Datafari Enterprise Search – Découvrez la version 5 / Discover our version 5
www.datafari.com <http://www.datafari.com>
