Hi Mike,

This might be the issue indeed.  I'll look into it.
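
One possible direction, as a rough sketch only: split the id list into smaller batches and issue one SELECT ... FOR UPDATE per batch inside the same transaction, so that no single statement carries tens of thousands of parameters. The PostgreSQL wire protocol stores the parameter count of a Bind message in a 16-bit field, so 39,941 parameters may simply be more than the driver can send in one statement; I have not verified that this is what happens here. The class name IdBatcher, both method names, and the batch size of 1000 are assumptions made for illustration and are not part of the ManifoldCF code.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper, not part of ManifoldCF: splits a large id list into
// fixed-size batches and builds one "id=? OR id=? ..." query per batch.
final class IdBatcher {
  private IdBatcher() {}

  /** Splits ids into consecutive sublists of at most batchSize elements. */
  static <T> List<List<T>> partition(List<T> ids, int batchSize) {
    if (batchSize <= 0) {
      throw new IllegalArgumentException("batchSize must be positive");
    }
    List<List<T>> batches = new ArrayList<>();
    for (int start = 0; start < ids.size(); start += batchSize) {
      int end = Math.min(start + batchSize, ids.size());
      batches.add(new ArrayList<>(ids.subList(start, end)));
    }
    return batches;
  }

  /** Builds the query shape seen in the log, with one placeholder per id. */
  static String buildQuery(int idCount) {
    if (idCount <= 0) {
      throw new IllegalArgumentException("idCount must be positive");
    }
    StringBuilder sb = new StringBuilder("SELECT id,status FROM jobs WHERE (");
    for (int i = 0; i < idCount; i++) {
      if (i > 0) {
        sb.append(" OR ");
      }
      sb.append("id=?");
    }
    return sb.append(") FOR UPDATE").toString();
  }

  public static void main(String[] args) {
    // Stand-in ids; the real ones would come from the jobs table.
    List<Long> ids = new ArrayList<>();
    for (long i = 1; i <= 39941; i++) {
      ids.add(i);
    }
    // Batch size 1000 is an arbitrary assumption, not a tuned value.
    List<List<Long>> batches = partition(ids, 1000);
    System.out.println(batches.size() + " batches");
    System.out.println(buildQuery(3));
  }
}
```

Each batch would then be bound and executed separately. Because the row locks taken by FOR UPDATE are held until the transaction ends, running the batches within one transaction should lock the same set of rows as the single large query.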

Karl


On Mon, Jul 30, 2018 at 2:26 PM Mike Hugo <m...@piragua.com> wrote:

> I'm not sure what the solution is yet, but I think I may have found the
> culprit:
>
> JobManager.noteTransformationConnectionRegistration(List<String> list) is
> creating a pretty big query:
>
> SELECT id,status FROM jobs WHERE  (id=? OR id=? OR id=? OR id=? ........
> OR id=?) FOR UPDATE
>
> Replace the ellipsis with a list of 39,941 ids (it's a huge query when it
> prints out).
>
> It seems that the database doesn't like that query and closes the
> connection before returning with a response.
>
> As I mentioned, this instance of Manifold has nearly 40,000 web crawlers.
> Is that a high number for Manifold to handle?
>
> On Mon, Jul 30, 2018 at 10:58 AM, Karl Wright <daddy...@gmail.com> wrote:
>
>> Well, I have absolutely no idea what is wrong and I've never seen
>> anything like that before.  But postgres is complaining because the
>> communication with the JDBC client is being interrupted by something.
>>
>> Karl
>>
>>
>> On Mon, Jul 30, 2018 at 10:39 AM Mike Hugo <m...@piragua.com> wrote:
>>
>>> No, and manifold and postgres run on the same host.
>>>
>>> On Mon, Jul 30, 2018 at 9:35 AM, Karl Wright <daddy...@gmail.com> wrote:
>>>
>>>> ' LOG:  incomplete message from client'
>>>>
>>>> This shows a network issue.  Did your network configuration change
>>>> recently?
>>>>
>>>> Karl
>>>>
>>>>
>>>> On Mon, Jul 30, 2018 at 9:59 AM Mike Hugo <m...@piragua.com> wrote:
>>>>
>>>>> Tried a postgres vacuum and also a restart, but the problem persists.
>>>>> Here's the log again with some additional logging details added (below)
>>>>>
>>>>> I tried running the last query from the logs against the database and
>>>>> it works fine - I modified it to return a count and that also works.
>>>>>
>>>>> SELECT count(*) FROM jobs t1 WHERE EXISTS(SELECT 'x' FROM jobpipelines
>>>>> WHERE t1.id=ownerid AND transformationname='Tika');
>>>>>  count
>>>>> -------
>>>>>  39941
>>>>> (1 row)
>>>>>
>>>>>
>>>>> Is 39k jobs a high number?  I've run some other instances of Manifold
>>>>> with more like 1,000 jobs and those seem to be working fine.  That's the
>>>>> only thing I can think of that's different between this instance that
>>>>> won't start and the others.  Any ideas?
>>>>>
>>>>> Thanks for your help!
>>>>>
>>>>> Mike
>>>>>
>>>>> LOG:  duration: 0.079 ms  parse <unnamed>: SELECT connectionname FROM
>>>>> transformationconnections WHERE classname=$1
>>>>> LOG:  duration: 0.079 ms  bind <unnamed>: SELECT connectionname FROM
>>>>> transformationconnections WHERE classname=$1
>>>>> DETAIL:  parameters: $1 =
>>>>> 'org.apache.manifoldcf.agents.transformation.tika.TikaExtractor'
>>>>> LOG:  duration: 0.017 ms  execute <unnamed>: SELECT connectionname
>>>>> FROM transformationconnections WHERE classname=$1
>>>>> DETAIL:  parameters: $1 =
>>>>> 'org.apache.manifoldcf.agents.transformation.tika.TikaExtractor'
>>>>> LOG:  duration: 0.039 ms  parse <unnamed>: SELECT * FROM agents
>>>>> LOG:  duration: 0.040 ms  bind <unnamed>: SELECT * FROM agents
>>>>> LOG:  duration: 0.010 ms  execute <unnamed>: SELECT * FROM agents
>>>>> LOG:  duration: 0.084 ms  parse <unnamed>: SELECT id FROM jobs t1
>>>>> WHERE EXISTS(SELECT 'x' FROM jobpipelines WHERE t1.id=ownerid AND
>>>>> transformationname=$1)
>>>>> LOG:  duration: 0.359 ms  bind <unnamed>: SELECT id FROM jobs t1 WHERE
>>>>> EXISTS(SELECT 'x' FROM jobpipelines WHERE t1.id=ownerid AND
>>>>> transformationname=$1)
>>>>> DETAIL:  parameters: $1 = 'Tika'
>>>>> LOG:  duration: 77.622 ms  execute <unnamed>: SELECT id FROM jobs t1
>>>>> WHERE EXISTS(SELECT 'x' FROM jobpipelines WHERE t1.id=ownerid AND
>>>>> transformationname=$1)
>>>>> DETAIL:  parameters: $1 = 'Tika'
>>>>> LOG:  incomplete message from client
>>>>> LOG:  disconnection: session time: 0:00:06.574 user=REMOVED
>>>>> database=REMOVED host=127.0.0.1 port=45356
>>>>> >2018-07-30 12:36:09,415 [main] ERROR org.apache.manifoldcf.root -
>>>>> Exception: This connection has been closed.
>>>>> org.apache.manifoldcf.core.interfaces.ManifoldCFException: This
>>>>> connection has been closed.
>>>>> at
>>>>> org.apache.manifoldcf.core.database.DBInterfacePostgreSQL.reinterpretException(DBInterfacePostgreSQL.java:627)
>>>>> ~[mcf-core.jar:?]
>>>>> at
>>>>> org.apache.manifoldcf.core.database.DBInterfacePostgreSQL.rollbackCurrentTransaction(DBInterfacePostgreSQL.java:1296)
>>>>> ~[mcf-core.jar:?]
>>>>> at
>>>>> org.apache.manifoldcf.core.database.Database.endTransaction(Database.java:368)
>>>>> ~[mcf-core.jar:?]
>>>>> at
>>>>> org.apache.manifoldcf.core.database.DBInterfacePostgreSQL.endTransaction(DBInterfacePostgreSQL.java:1236)
>>>>> ~[mcf-core.jar:?]
>>>>> at
>>>>> org.apache.manifoldcf.crawler.system.ManifoldCF.registerConnectors(ManifoldCF.java:605)
>>>>> ~[mcf-pull-agent.jar:?]
>>>>> at
>>>>> org.apache.manifoldcf.crawler.system.ManifoldCF.reregisterAllConnectors(ManifoldCF.java:160)
>>>>> ~[mcf-pull-agent.jar:?]
>>>>> at
>>>>> org.apache.manifoldcf.jettyrunner.ManifoldCFJettyRunner.main(ManifoldCFJettyRunner.java:239)
>>>>> [mcf-jetty-runner.jar:?]
>>>>> Caused by: org.postgresql.util.PSQLException: This connection has been
>>>>> closed.
>>>>> at org.postgresql.jdbc.PgConnection.checkClosed(PgConnection.java:766)
>>>>> ~[postgresql-42.1.3.jar:42.1.3]
>>>>> at
>>>>> org.postgresql.jdbc.PgConnection.createStatement(PgConnection.java:1576)
>>>>> ~[postgresql-42.1.3.jar:42.1.3]
>>>>> at
>>>>> org.postgresql.jdbc.PgConnection.createStatement(PgConnection.java:367)
>>>>> ~[postgresql-42.1.3.jar:42.1.3]
>>>>> at
>>>>> org.apache.manifoldcf.core.database.Database.execute(Database.java:873)
>>>>> ~[mcf-core.jar:?]
>>>>> at
>>>>> org.apache.manifoldcf.core.database.Database$ExecuteQueryThread.run(Database.java:696)
>>>>> ~[mcf-core.jar:?]
>>>>> org.apache.manifoldcf.core.interfaces.ManifoldCFException: This
>>>>> connection has been closed.
>>>>> at
>>>>> org.apache.manifoldcf.core.database.DBInterfacePostgreSQL.reinterpretException(DBInterfacePostgreSQL.java:627)
>>>>> at
>>>>> org.apache.manifoldcf.core.database.DBInterfacePostgreSQL.rollbackCurrentTransaction(DBInterfacePostgreSQL.java:1296)
>>>>> at
>>>>> org.apache.manifoldcf.core.database.Database.endTransaction(Database.java:368)
>>>>> at
>>>>> org.apache.manifoldcf.core.database.DBInterfacePostgreSQL.endTransaction(DBInterfacePostgreSQL.java:1236)
>>>>> at
>>>>> org.apache.manifoldcf.crawler.system.ManifoldCF.registerConnectors(ManifoldCF.java:605)
>>>>> at
>>>>> org.apache.manifoldcf.crawler.system.ManifoldCF.reregisterAllConnectors(ManifoldCF.java:160)
>>>>> at
>>>>> org.apache.manifoldcf.jettyrunner.ManifoldCFJettyRunner.main(ManifoldCFJettyRunner.java:239)
>>>>> Caused by: org.postgresql.util.PSQLException: This connection has been
>>>>> closed.
>>>>> at org.postgresql.jdbc.PgConnection.checkClosed(PgConnection.java:766)
>>>>> at
>>>>> org.postgresql.jdbc.PgConnection.createStatement(PgConnection.java:1576)
>>>>> at
>>>>> org.postgresql.jdbc.PgConnection.createStatement(PgConnection.java:367)
>>>>> at
>>>>> org.apache.manifoldcf.core.database.Database.execute(Database.java:873)
>>>>> at
>>>>> org.apache.manifoldcf.core.database.Database$ExecuteQueryThread.run(Database.java:696)
>>>>> LOG:  disconnection: session time: 0:00:10.677 user=postgres
>>>>> database=template1 host=127.0.0.1 port=45354
>>>>>
>>>>>
>>>>>
>>>>> On Sun, Jul 29, 2018 at 8:09 AM, Karl Wright <daddy...@gmail.com>
>>>>> wrote:
>>>>>
>>>>>> It looks to me like your database server is not happy.  Maybe it's
>>>>>> out of resources?  Not sure but a restart may be in order.
>>>>>>
>>>>>> Karl
>>>>>>
>>>>>>
>>>>>> On Sun, Jul 29, 2018 at 9:06 AM Mike Hugo <m...@piragua.com> wrote:
>>>>>>
>>>>>>> Recently we started seeing this error when Manifold CF starts up.
>>>>>>> We had been running Manifold CF with many web connectors and a few RSS
>>>>>>> feeds for a while and it had been working fine.  The server got rebooted
>>>>>>> and since then we started seeing this error. I'm not sure exactly what
>>>>>>> changed.  Any ideas as to where to start looking and how to fix this?
>>>>>>>
>>>>>>> Thanks!
>>>>>>>
>>>>>>> Mike
>>>>>>>
>>>>>>>
>>>>>>> Initial repository connections already created.
>>>>>>> Configuration file successfully read
>>>>>>> Successfully unregistered all domains
>>>>>>> Successfully unregistered all output connectors
>>>>>>> Successfully unregistered all transformation connectors
>>>>>>> Successfully unregistered all mapping connectors
>>>>>>> Successfully unregistered all authority connectors
>>>>>>> Successfully unregistered all repository connectors
>>>>>>> WARNING:  there is already a transaction in progress
>>>>>>> WARNING:  there is no transaction in progress
>>>>>>> Successfully registered output connector
>>>>>>> 'org.apache.manifoldcf.agents.output.solr.SolrConnector'
>>>>>>> WARNING:  there is already a transaction in progress
>>>>>>> WARNING:  there is no transaction in progress
>>>>>>> Successfully registered output connector
>>>>>>> 'org.apache.manifoldcf.agents.output.searchblox.SearchBloxConnector'
>>>>>>> WARNING:  there is already a transaction in progress
>>>>>>> WARNING:  there is no transaction in progress
>>>>>>> Successfully registered output connector
>>>>>>> 'org.apache.manifoldcf.agents.output.opensearchserver.OpenSearchServerConnector'
>>>>>>> WARNING:  there is already a transaction in progress
>>>>>>> WARNING:  there is no transaction in progress
>>>>>>> Successfully registered output connector
>>>>>>> 'org.apache.manifoldcf.agents.output.nullconnector.NullConnector'
>>>>>>> WARNING:  there is already a transaction in progress
>>>>>>> WARNING:  there is no transaction in progress
>>>>>>> Successfully registered output connector
>>>>>>> 'org.apache.manifoldcf.agents.output.kafka.KafkaOutputConnector'
>>>>>>> WARNING:  there is already a transaction in progress
>>>>>>> WARNING:  there is no transaction in progress
>>>>>>> Successfully registered output connector
>>>>>>> 'org.apache.manifoldcf.agents.output.hdfs.HDFSOutputConnector'
>>>>>>> WARNING:  there is already a transaction in progress
>>>>>>> WARNING:  there is no transaction in progress
>>>>>>> Successfully registered output connector
>>>>>>> 'org.apache.manifoldcf.agents.output.gts.GTSConnector'
>>>>>>> WARNING:  there is already a transaction in progress
>>>>>>> WARNING:  there is no transaction in progress
>>>>>>> Successfully registered output connector
>>>>>>> 'org.apache.manifoldcf.agents.output.filesystem.FileOutputConnector'
>>>>>>> WARNING:  there is already a transaction in progress
>>>>>>> WARNING:  there is no transaction in progress
>>>>>>> Successfully registered output connector
>>>>>>> 'org.apache.manifoldcf.agents.output.elasticsearch.ElasticSearchConnector'
>>>>>>> WARNING:  there is already a transaction in progress
>>>>>>> WARNING:  there is no transaction in progress
>>>>>>> Successfully registered output connector
>>>>>>> 'org.apache.manifoldcf.agents.output.amazoncloudsearch.AmazonCloudSearchConnector'
>>>>>>> WARNING:  there is already a transaction in progress
>>>>>>> WARNING:  there is no transaction in progress
>>>>>>> Successfully registered transformation connector
>>>>>>> 'org.apache.manifoldcf.agents.transformation.tikaservice.TikaExtractor'
>>>>>>> WARNING:  there is already a transaction in progress
>>>>>>> LOG:  incomplete message from client
>>>>>>> >2018-07-29 13:02:06,659 [main] ERROR org.apache.manifoldcf.root -
>>>>>>> Exception: This connection has been closed.
>>>>>>> org.apache.manifoldcf.core.interfaces.ManifoldCFException: This
>>>>>>> connection has been closed.
>>>>>>> at
>>>>>>> org.apache.manifoldcf.core.database.DBInterfacePostgreSQL.reinterpretException(DBInterfacePostgreSQL.java:627)
>>>>>>> ~[mcf-core.jar:?]
>>>>>>> at
>>>>>>> org.apache.manifoldcf.core.database.DBInterfacePostgreSQL.rollbackCurrentTransaction(DBInterfacePostgreSQL.java:1296)
>>>>>>> ~[mcf-core.jar:?]
>>>>>>> at
>>>>>>> org.apache.manifoldcf.core.database.Database.endTransaction(Database.java:368)
>>>>>>> ~[mcf-core.jar:?]
>>>>>>> at
>>>>>>> org.apache.manifoldcf.core.database.DBInterfacePostgreSQL.endTransaction(DBInterfacePostgreSQL.java:1236)
>>>>>>> ~[mcf-core.jar:?]
>>>>>>> at
>>>>>>> org.apache.manifoldcf.crawler.system.ManifoldCF.registerConnectors(ManifoldCF.java:605)
>>>>>>> ~[mcf-pull-agent.jar:?]
>>>>>>> at
>>>>>>> org.apache.manifoldcf.crawler.system.ManifoldCF.reregisterAllConnectors(ManifoldCF.java:160)
>>>>>>> ~[mcf-pull-agent.jar:?]
>>>>>>> at
>>>>>>> org.apache.manifoldcf.jettyrunner.ManifoldCFJettyRunner.main(ManifoldCFJettyRunner.java:239)
>>>>>>> [mcf-jetty-runner.jar:?]
>>>>>>> Caused by: org.postgresql.util.PSQLException: This connection has
>>>>>>> been closed.
>>>>>>> at
>>>>>>> org.postgresql.jdbc.PgConnection.checkClosed(PgConnection.java:766)
>>>>>>> ~[postgresql-42.1.3.jar:42.1.3]
>>>>>>> at
>>>>>>> org.postgresql.jdbc.PgConnection.createStatement(PgConnection.java:1576)
>>>>>>> ~[postgresql-42.1.3.jar:42.1.3]
>>>>>>> at
>>>>>>> org.postgresql.jdbc.PgConnection.createStatement(PgConnection.java:367)
>>>>>>> ~[postgresql-42.1.3.jar:42.1.3]
>>>>>>> at
>>>>>>> org.apache.manifoldcf.core.database.Database.execute(Database.java:873)
>>>>>>> ~[mcf-core.jar:?]
>>>>>>> at
>>>>>>> org.apache.manifoldcf.core.database.Database$ExecuteQueryThread.run(Database.java:696)
>>>>>>> ~[mcf-core.jar:?]
>>>>>>> org.apache.manifoldcf.core.interfaces.ManifoldCFException: This
>>>>>>> connection has been closed.
>>>>>>> at
>>>>>>> org.apache.manifoldcf.core.database.DBInterfacePostgreSQL.reinterpretException(DBInterfacePostgreSQL.java:627)
>>>>>>> at
>>>>>>> org.apache.manifoldcf.core.database.DBInterfacePostgreSQL.rollbackCurrentTransaction(DBInterfacePostgreSQL.java:1296)
>>>>>>> at
>>>>>>> org.apache.manifoldcf.core.database.Database.endTransaction(Database.java:368)
>>>>>>> at
>>>>>>> org.apache.manifoldcf.core.database.DBInterfacePostgreSQL.endTransaction(DBInterfacePostgreSQL.java:1236)
>>>>>>> at
>>>>>>> org.apache.manifoldcf.crawler.system.ManifoldCF.registerConnectors(ManifoldCF.java:605)
>>>>>>> at
>>>>>>> org.apache.manifoldcf.crawler.system.ManifoldCF.reregisterAllConnectors(ManifoldCF.java:160)
>>>>>>> at
>>>>>>> org.apache.manifoldcf.jettyrunner.ManifoldCFJettyRunner.main(ManifoldCFJettyRunner.java:239)
>>>>>>> Caused by: org.postgresql.util.PSQLException: This connection has
>>>>>>> been closed.
>>>>>>> at
>>>>>>> org.postgresql.jdbc.PgConnection.checkClosed(PgConnection.java:766)
>>>>>>> at
>>>>>>> org.postgresql.jdbc.PgConnection.createStatement(PgConnection.java:1576)
>>>>>>> at
>>>>>>> org.postgresql.jdbc.PgConnection.createStatement(PgConnection.java:367)
>>>>>>> at
>>>>>>> org.apache.manifoldcf.core.database.Database.execute(Database.java:873)
>>>>>>> at
>>>>>>> org.apache.manifoldcf.core.database.Database$ExecuteQueryThread.run(Database.java:696)
>>>>>>>
>>>>>>
>>>>>
>>>
>
