Re: ManifoldCF database model

2018-10-16 Thread Karl Wright
Hi, you can look at ManifoldCF In Action. There's a link to it on the manifoldcf page. However, you should be aware that we consider it a severe bug if ManifoldCF doesn't clean up after itself. The only time that is not expected is when people write buggy connectors or mess with database tables

Re: Create documents from transformation connector

2018-10-16 Thread Karl Wright
Hi Julien, That is one thing you cannot do with the MCF pipeline. All documents must originate in a RepositoryConnector. The repository connector can create multiple subdocuments itself, if need be, but the rest of the pipeline does not allow further splitting. One way around this: If the

Create documents from transformation connector

2018-10-16 Thread Julien
Hi Karl, I was wondering if there is a simple way to generate multiple documents from a transformation connector. My use case is the following : I have some files that are archives files and I would like to create a transformation connector that will be able to extract the files within the

ManifoldCF database model

2018-10-16 Thread Gustavo Beneitez
Hi all, how do you do? I was wandering if there is any technical document about what is the meaning of each table in database, the relationship between documents, repositories, jobs and any other output connector (some kind of a database model). We are facing some "garbage issues", jobs are

[jira] [Commented] (CONNECTORS-1546) Optimize Elasticsearch performance by removing 'forcemerge'

2018-10-16 Thread Karl Wright (JIRA)
[ https://issues.apache.org/jira/browse/CONNECTORS-1546?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16651950#comment-16651950 ] Karl Wright commented on CONNECTORS-1546: - I agree with your decision. > Optimize

[jira] [Commented] (CONNECTORS-1546) Optimize Elasticsearch performance by removing 'forcemerge'

2018-10-16 Thread Steph van Schalkwyk (JIRA)
[ https://issues.apache.org/jira/browse/CONNECTORS-1546?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16651942#comment-16651942 ] Steph van Schalkwyk commented on CONNECTORS-1546: - Hans is correct. I would remove

[jira] [Commented] (CONNECTORS-1546) Optimize Elasticsearch performance by removing 'forcemerge'

2018-10-16 Thread Karl Wright (JIRA)
[ https://issues.apache.org/jira/browse/CONNECTORS-1546?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16651761#comment-16651761 ] Karl Wright commented on CONNECTORS-1546: - Hi [~st...@remcam.net], can you comment on this?

[jira] [Assigned] (CONNECTORS-1546) Optimize Elasticsearch performance by removing 'forcemerge'

2018-10-16 Thread Karl Wright (JIRA)
[ https://issues.apache.org/jira/browse/CONNECTORS-1546?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Karl Wright reassigned CONNECTORS-1546: --- Assignee: Steph van Schalkwyk > Optimize Elasticsearch performance by removing

[jira] [Created] (CONNECTORS-1546) Optimize Elasticsearch performance by removing 'forcemerge'

2018-10-16 Thread Hans Van Goethem (JIRA)
Hans Van Goethem created CONNECTORS-1546: Summary: Optimize Elasticsearch performance by removing 'forcemerge' Key: CONNECTORS-1546 URL: https://issues.apache.org/jira/browse/CONNECTORS-1546