[jira] [Commented] (CONNECTORS-1535) Documentum Connector cannot find dfc.properties

2019-01-17 Thread James Thomas (JIRA)


[ 
https://issues.apache.org/jira/browse/CONNECTORS-1535?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16745820#comment-16745820
 ] 

James Thomas commented on CONNECTORS-1535:
--

One other thing: I don't believe I have evidence that it's necessary, but it is 
now my practice to edit the CLASSPATH initialisation in the 
documentum-registry/run.sh script in the same way as documentum-server/run.sh, 
again to parallel the Windows run.bat scripts.

In the experiment I just described, I reverted to CLASSPATH="" in both scripts 
initially, but only changed documentum-server/run.sh before restarting MCF.

> Documentum Connector cannot find dfc.properties
> ---
>
> Key: CONNECTORS-1535
> URL: https://issues.apache.org/jira/browse/CONNECTORS-1535
> Project: ManifoldCF
>  Issue Type: Bug
>  Components: Documentum connector
>Affects Versions: ManifoldCF 2.10, ManifoldCF 2.11
> Environment: Manifold 2.11
> CentOS Linux release 7.5.1804 (Core)
> OpenJDK 64-Bit Server VM 18.9 (build 11+28, mixed mode)
>  
>Reporter: James Thomas
>Assignee: Karl Wright
>Priority: Major
> Fix For: ManifoldCF 2.13
>
>
> I have found that when installing a clean MCF instance I cannot get 
> Documentum repository connectors to connect to Documentum until I have added 
> this line to the processes/documentum-server/run.sh script before the call to 
> Java:
>  
> {code:java}
> CLASSPATH="$CLASSPATH""$PATHSEP""$DOCUMENTUM"{code}
> Until I do this, attempts to save the connector will result in this output to 
> the console:
>  
> {noformat}
> 4 [RMI TCP Connection(2)-127.0.0.1] ERROR 
> com.documentum.fc.common.impl.preferences.PreferencesManager  - 
> [DFC_PREFERENCE_LOAD_FAILED] Failed to load persistent preferences from null
> java.io.FileNotFoundException: dfc.properties
>     at 
> com.documentum.fc.common.impl.preferences.PreferencesManager.locateMainPersistentStore(PreferencesManager.java:378)
>     at 
> com.documentum.fc.common.impl.preferences.PreferencesManager.readPersistentProperties(PreferencesManager.java:329)
>     at 
> com.documentum.fc.common.impl.preferences.PreferencesManager.(PreferencesManager.java:37)
>     at 
> com.documentum.fc.common.DfPreferences.initialize(DfPreferences.java:64)
> ..{noformat}
> and this message in the MCF UI:
>  
> {noformat}
> Connection failed: Documentum error: No DocBrokers are configured{noformat}
>  
>  
> I mentioned this in #1512 for MCF 2.10 but it got lost in the other work done 
> in that ticket. While setting up 2.11 from scratch I encountered it again.
>  
> Once I have edited the run.sh script I get this in the console, showing that 
> (for whatever reason) the change is significant:
>  
> {noformat}
> Reading DFC configuration from 
> "file:/opt/manifold/apache-manifoldcf-2.11/processes/documentum-server/dfc.properties"
> {noformat}
>  



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Commented] (CONNECTORS-1535) Documentum Connector cannot find dfc.properties

2019-01-17 Thread James Thomas (JIRA)


[ 
https://issues.apache.org/jira/browse/CONNECTORS-1535?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16745336#comment-16745336
 ] 

James Thomas commented on CONNECTORS-1535:
--

I have found an alternative fix.

If you are prepared to call run.sh by cd-ing into the directory in which it 
exists, you can modify in a less intrusive way than I initially suggested.

change
{code:java}
CLASSPATH=""{code}
to
{code:java}
CLASSPATH ="."{code}
This makes the script more like the run.bat equivalent for Windows, which 
already has the working directory on the CLASSPATH.

> Documentum Connector cannot find dfc.properties
> ---
>
> Key: CONNECTORS-1535
> URL: https://issues.apache.org/jira/browse/CONNECTORS-1535
> Project: ManifoldCF
>  Issue Type: Bug
>  Components: Documentum connector
>Affects Versions: ManifoldCF 2.10, ManifoldCF 2.11
> Environment: Manifold 2.11
> CentOS Linux release 7.5.1804 (Core)
> OpenJDK 64-Bit Server VM 18.9 (build 11+28, mixed mode)
>  
>Reporter: James Thomas
>Priority: Major
>
> I have found that when installing a clean MCF instance I cannot get 
> Documentum repository connectors to connect to Documentum until I have added 
> this line to the processes/documentum-server/run.sh script before the call to 
> Java:
>  
> {code:java}
> CLASSPATH="$CLASSPATH""$PATHSEP""$DOCUMENTUM"{code}
> Until I do this, attempts to save the connector will result in this output to 
> the console:
>  
> {noformat}
> 4 [RMI TCP Connection(2)-127.0.0.1] ERROR 
> com.documentum.fc.common.impl.preferences.PreferencesManager  - 
> [DFC_PREFERENCE_LOAD_FAILED] Failed to load persistent preferences from null
> java.io.FileNotFoundException: dfc.properties
>     at 
> com.documentum.fc.common.impl.preferences.PreferencesManager.locateMainPersistentStore(PreferencesManager.java:378)
>     at 
> com.documentum.fc.common.impl.preferences.PreferencesManager.readPersistentProperties(PreferencesManager.java:329)
>     at 
> com.documentum.fc.common.impl.preferences.PreferencesManager.(PreferencesManager.java:37)
>     at 
> com.documentum.fc.common.DfPreferences.initialize(DfPreferences.java:64)
> ..{noformat}
> and this message in the MCF UI:
>  
> {noformat}
> Connection failed: Documentum error: No DocBrokers are configured{noformat}
>  
>  
> I mentioned this in #1512 for MCF 2.10 but it got lost in the other work done 
> in that ticket. While setting up 2.11 from scratch I encountered it again.
>  
> Once I have edited the run.sh script I get this in the console, showing that 
> (for whatever reason) the change is significant:
>  
> {noformat}
> Reading DFC configuration from 
> "file:/opt/manifold/apache-manifoldcf-2.11/processes/documentum-server/dfc.properties"
> {noformat}
>  



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Commented] (CONNECTORS-1521) Documentum Connector users ManifoldCF's local time in queries constraints against the Documentum server without reference to time zones

2018-12-17 Thread James Thomas (JIRA)


[ 
https://issues.apache.org/jira/browse/CONNECTORS-1521?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16722781#comment-16722781
 ] 

James Thomas commented on CONNECTORS-1521:
--

{quote}[~jamesthomas] Any update on this ticket? I'm moving it to 2.13...
{quote}
I submitted a ticket but didn't get a useful answer, I'm afraid.

> Documentum Connector users ManifoldCF's local time in queries constraints 
> against the Documentum server without reference to time zones
> ---
>
> Key: CONNECTORS-1521
> URL: https://issues.apache.org/jira/browse/CONNECTORS-1521
> Project: ManifoldCF
>  Issue Type: Bug
>  Components: Documentum connector
>Affects Versions: ManifoldCF 2.10
>Reporter: James Thomas
>Assignee: Karl Wright
>Priority: Major
> Fix For: ManifoldCF 2.13
>
>
> I find that the time/date constraints in queries to the Documentum server are 
> based on the "raw" local time of the ManifoldCF server but appear to take no 
> account of the time zones of the two servers.
> This can lead to recently modified files not being transferred to the output 
> repository when you would naturally expect them to be. I'd like the times to 
> be aligned, perhaps by including time zone in the query. In particular, is 
> there a way to use UTC perhaps?
> Here's an example ...
>  * create a folder in Documentum
>  * set up a job to point at the folder and output to the file system
>  * put two documents into a folder in Documentum
>  * Select them, right click and export as CSV (to show the timestamps):
> {noformat}
> 1.png,48489.0,Portable Network Graphics,8/7/2018 9:04 AM,
> 2.png,28620.0,Portable Network Graphics,8/7/2018 9:04 AM,,{noformat}
> Check the local time on the ManifoldCF server machine. Observe that it's 
> reporting consistent time with the DM server:
> {noformat}
> [james@manifold]$ date
> Tue Aug  7 09:07:25 BST 2018{noformat}
> Start the job and look for the query to Documentum in the manifoldcf.log file 
> (line break added for readability):
> {noformat}
> DEBUG 2018-08-07T08:07:47.297Z (Startup thread) - DCTM: About to execute 
> query= (select for READ distinct i_chronicle_id from dm_document where 
> r_modify_date >= date('01/01/1970 00:00:00','mm/dd/ hh:mi:ss') and
> r_modify_date<=date('08/07/2018 08:07:34','mm/dd/ hh:mi:ss') 
> AND (i_is_deleted=TRUE Or (i_is_deleted=FALSE AND a_full_text=TRUE AND 
> r_content_size>0)) AND ( Folder('/Administrator/james', DESCEND) ))
> ^C{noformat}
> Notice that the latest date asked for is *before* the modification date of 
> the files added to DM. (And is an hour out, see footnote.)
>   
>  See whether anything has been output by the File System connector. It hasn't:
> {noformat}
> [james@manifold]$ ls /bigdisc/source/PDFs/timezones/
> [james@manifold]$
> {noformat}
> Now:
>  * change the timezone on the ManifoldCF server machine
>  * restart the ManifoldCF server and the Documentum processes
>  * reseed the job
> Check the local time on the ManifoldCF server machine; it has changed:
> {noformat}
> [james@manifold]$ date
> Tue Aug  7 10:10:29 CEST 2018{noformat}
> Start the job again and notice that the query has changed by an hour, plus 
> the few minutes it took to change the date etc (and is still an hour out, see 
> footnote):
> {noformat}
> r_modify_date<=date('08/07/2018 09:11:02','mm/dd/ hh:mi:ss') 
> {noformat}
> Observe that the range of dates now covers the timestamps on the DM data, and 
> also that some data has now been transferred by the File System connector:
> {noformat}
> [james@manifold]$ ls 
> /bigdisc/source/PDFs/timezones/http/mfserver\:8080/da/component/
> drl?versionLabel=CURRENT=09018000e515
> drl?versionLabel=CURRENT=09018000e516
> {noformat}
>  
>  
> [Footnote] It appears that something is trying to take account of Daylight 
> Saving Time too.
> If I set the server date to a time outside of DST, the query is aligned with 
> the current time:
> {noformat}
> [i2e@i2ehost manifold]$ date
>  Mon Oct 29 00:01:13 CET 2018
> r_modify_date<=date('10/29/2018 00:01:39','mm/dd/ hh:mi:ss') 
> {noformat}
> But if I set the time inside DST, the time is an hour before:
> {noformat}
> [i2e@i2ehost manifold]$ date
>  Sat Oct 27 00:00:06 CEST 2018
> r_modify_date<=date('10/26/2018 23:00:26','mm/dd/ hh:mi:ss') 
> {noformat}
> This is perhaps a Java issue rather than a logic issue in the connector? See 
> e.g. [https://stackoverflow.com/questions/6392/java-time-zone-is-messed-up]



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Commented] (CONNECTORS-1524) Documentum Connector fails with NullPointerException

2018-10-02 Thread James Thomas (JIRA)


[ 
https://issues.apache.org/jira/browse/CONNECTORS-1524?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16635195#comment-16635195
 ] 

James Thomas commented on CONNECTORS-1524:
--

I have found another issue that manifests across Java versions.

If I start MCF 2.11 and Documentum processes using Java 9 or later I find that 
attempting to view or edit a Documentum connector will result in this error 
into the console:
{noformat}
3 [RMI TCP Connection(2)-127.0.0.1] ERROR 
com.documentum.fc.common.DfPreferences  - [DFC_PREFERENCE_BAD_VALUE] Bad value 
for preference "dfc.date_format", value="M/d/, h:mm:ss a"
com.documentum.fc.common.DfException: Illegal syntax found in the date format 
'M/d/, h:mm:ss a'. The default localized short date format will be used.

{noformat}
I can work around this by adding this line to my dfc.properties:
{noformat}
 dfc.date_format=.MM.dd HH:mm:s{noformat}
I don't see this issue with Java(TM) SE Runtime Environment (build 
1.8.0_181-b13)

> Documentum Connector fails with NullPointerException
> 
>
> Key: CONNECTORS-1524
> URL: https://issues.apache.org/jira/browse/CONNECTORS-1524
> Project: ManifoldCF
>  Issue Type: Bug
>  Components: Documentum connector
>Affects Versions: ManifoldCF 2.10
>Reporter: James Thomas
>Assignee: Karl Wright
>Priority: Major
> Fix For: ManifoldCF 2.11
>
> Attachments: 500_server_error.txt
>
>
> I am running ManifoldCF 2.10 patched for issues 1512 and 1517.
> I recently upgraded Java to 10.0.2 (for another server on the same machine) 
> and find that when I try to create jobs using the Documentum connector, 
> clicking on (e.g.) the Paths tab will give this error dialog:
> {code:java}
> HTTP ERROR 500
> Problem accessing /mcf-crawler-ui/execute.jsp. Reason:
>     Server Error
> Caused by:
> org.apache.jasper.JasperException: An exception occurred processing JSP page 
> /execute.jsp at line 1542
> 1539:   {
> 1540: threadContext.save("JobObject",job);
> 1541: %>
> 1542: 
> 1543: <%
> 1544:   }
> 1545:   else if (op.equals("Save")){code}
> There's a lot of stack trace (which I'll attach) but it includes some 
> ManifoldCF calls:
>  
> {code:java}
> Caused by: java.lang.RuntimeException: Unexpected exception type: 
> java.lang.NullPointerException: null
>     at 
> org.apache.manifoldcf.crawler.connectors.DCTM.DCTM$GetSessionThread.finishUp(DCTM.java:117)
>     at 
> org.apache.manifoldcf.crawler.connectors.DCTM.DCTM.getSession(DCTM.java:173)
> ...
> Caused by: java.lang.NullPointerException
>     at 
> com.documentum.fc.client.impl.docbase.DocbaseDateFormat.getSeparator(DocbaseDateFormat.java:119)
>     at ...
> {code}
> If I revert to OpenJDK 1.8.0_171 (the version I upgraded from) the issue is 
> not seen.
>  



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Created] (CONNECTORS-1535) Documentum Connector cannot find dfc.properties

2018-10-02 Thread James Thomas (JIRA)
James Thomas created CONNECTORS-1535:


 Summary: Documentum Connector cannot find dfc.properties
 Key: CONNECTORS-1535
 URL: https://issues.apache.org/jira/browse/CONNECTORS-1535
 Project: ManifoldCF
  Issue Type: Bug
  Components: Documentum connector
Affects Versions: ManifoldCF 2.10, ManifoldCF 2.11
 Environment: Manifold 2.11
CentOS Linux release 7.5.1804 (Core)
OpenJDK 64-Bit Server VM 18.9 (build 11+28, mixed mode)

 
Reporter: James Thomas


I have found that when installing a clean MCF instance I cannot get Documentum 
repository connectors to connect to Documentum until I have added this line to 
the processes/documentum-server/run.sh script before the call to Java:

 
{code:java}
CLASSPATH="$CLASSPATH""$PATHSEP""$DOCUMENTUM"{code}
Until I do this, attempts to save the connector will result in this output to 
the console:

 
{noformat}
4 [RMI TCP Connection(2)-127.0.0.1] ERROR 
com.documentum.fc.common.impl.preferences.PreferencesManager  - 
[DFC_PREFERENCE_LOAD_FAILED] Failed to load persistent preferences from null
java.io.FileNotFoundException: dfc.properties
    at 
com.documentum.fc.common.impl.preferences.PreferencesManager.locateMainPersistentStore(PreferencesManager.java:378)
    at 
com.documentum.fc.common.impl.preferences.PreferencesManager.readPersistentProperties(PreferencesManager.java:329)
    at 
com.documentum.fc.common.impl.preferences.PreferencesManager.(PreferencesManager.java:37)
    at 
com.documentum.fc.common.DfPreferences.initialize(DfPreferences.java:64)
..{noformat}
and this message in the MCF UI:

 
{noformat}
Connection failed: Documentum error: No DocBrokers are configured{noformat}
 

 

I mentioned this in #1512 for MCF 2.10 but it got lost in the other work done 
in that ticket. While setting up 2.11 from scratch I encountered it again.

 

Once I have edited the run.sh script I get this in the console, showing that 
(for whatever reason) the change is significant:

 
{noformat}
Reading DFC configuration from 
"file:/opt/manifold/apache-manifoldcf-2.11/processes/documentum-server/dfc.properties"
{noformat}
 



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Commented] (CONNECTORS-1532) Moving a file outside of the job's Paths is not the same as deleting it

2018-09-30 Thread James Thomas (JIRA)


[ 
https://issues.apache.org/jira/browse/CONNECTORS-1532?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16633624#comment-16633624
 ] 

James Thomas commented on CONNECTORS-1532:
--

Good news.

The repro now produces these two results of 'ls -l', and corresponding log 
messages:
{noformat}
-rw-r--r--. 1 root root 27 Oct  1 07:31 
drl?versionLabel=CURRENT=09018000fc6e
-rw-r--r--. 1 root root 0 Oct  1 07:33 
drl?versionLabel=CURRENT=09018000fc6e

INFO 2018-10-01T05:31:28.584Z (Startup thread) - Preparing non-continuous 
non-partial, either MODEL_ALL or fromBeginningOfTime, 1538299958323 for run: 
prepareFullScan
INFO 2018-10-01T05:32:58.717Z (Startup thread) - Preparing incremental scan for 
1538299958323: prepareIncrementalScan{noformat}
I have also sanity-checked that deletion behaves as it did before these changes.

[~kwri...@metacarta.com] in comment-1665 you mentioned that you might need 
to add an additional check. Does that need to be done? If so, in this ticket?

> Moving a file outside of the job's Paths is not the same as deleting it
> ---
>
> Key: CONNECTORS-1532
> URL: https://issues.apache.org/jira/browse/CONNECTORS-1532
> Project: ManifoldCF
>  Issue Type: Bug
>  Components: Documentum connector
>Affects Versions: ManifoldCF 2.10
> Environment: Manifold 2.10 patched for #1512, #1517
>Reporter: James Thomas
>Assignee: Karl Wright
>Priority: Major
> Fix For: ManifoldCF 2.12
>
> Attachments: 2018-09-19_1758.png, CONNECTORS-1532.patch, DCTM.java, 
> logging_patch.diff
>
>
> If I have a MF job which is connecting a specific folder, F, in Documentum to 
> a File System output then:
> 1. deleting files in Documentum shows them as zero size in the file system
> 2. moving files out of F does not remove them or zero them in the file system
> Note that moving a file from another folder (which the job is not looking at) 
> to F has the same effect as adding it to F by e.g. importing it in DM or 
> POSTing it to DM via the REST interface.
> Intuitively, I expect that moving a file out of the "view" of the Documentum 
> connector would have the same effect on the File System as deleting it. (My 
> model here is of MF synchronising content between the Paths (DM) and the 
> Output Path (File System) that I have specified in the job.)
> Starting point, I have run the MF job to fetch a bunch of files from a folder 
> - call it F - in DM (i.e. I have configured Paths in the job to be F). This 
> is what 'ls -l' on the file system looks like:
> {code:java}
> -rw-r--r--. 1 root i2e  12541 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c0
> -rw-r--r--. 1 root i2e 26 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7be
> -rw-r--r--. 1 root i2e  85772 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c7
> -rw-r--r--. 1 root i2e   8790 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c2
> -rw-r--r--. 1 root i2e 101888 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c3
> -rw-r--r--. 1 root i2e  32783 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c4
> -rw-r--r--. 1 root i2e  23040 Sep 19 07:22 
> drl?versionLabel=CURRENT=09018000f7c1
> -rw-r--r--. 1 root i2e  26112 Sep 19 07:22 
> drl?versionLabel=CURRENT=09018000f7bf{code}
> In DM, I delete one of the files in F and it shows as zero size, and the 
> modification date has changed:
> {code:java}
> -rw-r--r--. 1 root i2e  12541 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c0
> -rw-r--r--. 1 root i2e 26 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7be
> -rw-r--r--. 1 root i2e   8790 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c2
> -rw-r--r--. 1 root i2e 101888 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c3
> -rw-r--r--. 1 root i2e  32783 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c4
> -rw-r--r--. 1 root i2e  23040 Sep 19 07:22 
> drl?versionLabel=CURRENT=09018000f7c1
> -rw-r--r--. 1 root i2e  26112 Sep 19 07:22 
> drl?versionLabel=CURRENT=09018000f7bf
> -rw-r--r--. 1 root i2e  0 Sep 19 07:23 
> drl?versionLabel=CURRENT=09018000f7c7{code}
> In DM, I move a file from F to another folder. (Right click, add to 
> clipboard, go to new folder, Edit> Move here). 
> The file shows as modified (07:25), but is still apparently in F (i.e. in the 
> Path my MF job is looking at):
> {code:java}
> -rw-r--r--. 1 root i2e  12541 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c0
> -rw-r--r--. 1 root i2e 26 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7be
> -rw-r--r--. 1 root i2e   8790 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c2
> -rw-r--r--. 1 root i2e 101888 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c3
> -rw-r--r--. 1 root i2e  23040 Sep 19 07:22 

[jira] [Commented] (CONNECTORS-1532) Moving a file outside of the job's Paths is not the same as deleting it

2018-09-30 Thread James Thomas (JIRA)


[ 
https://issues.apache.org/jira/browse/CONNECTORS-1532?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16633371#comment-16633371
 ] 

James Thomas commented on CONNECTORS-1532:
--

I applied DCTM.java, rebuilt, patched my installation, restarted all processes, 
and re-did the repro.

Again, the file sizes remain unchanged across the two runs:

 
{noformat}
INFO 2018-09-30T13:14:35.380Z (Startup thread) - Preparing non-continuous 
non-partial, either MODEL_ALL or fromBeginningOfTime, 1538299958323 for run: 
prepareFullScan
INFO 2018-09-30T13:17:29.020Z (Startup thread) - Preparing incremental scan for 
1538299958323: prepareIncrementalScan

-rw-r--r--. 1 root root 27 Sep 30 15:14 
drl?versionLabel=CURRENT=09018000fc6d
-rw-r--r--. 1 root root 27 Sep 30 15:17 
drl?versionLabel=CURRENT=09018000fc6d{noformat}
There is stdout from the first run:

 
{noformat}
All mime types allowed
Found '/Administrator/45523' in specified set; match!{noformat}
and the second:
{noformat}
All mime types allowed
Did not find any matching paths{noformat}
 

> Moving a file outside of the job's Paths is not the same as deleting it
> ---
>
> Key: CONNECTORS-1532
> URL: https://issues.apache.org/jira/browse/CONNECTORS-1532
> Project: ManifoldCF
>  Issue Type: Bug
>  Components: Documentum connector
>Affects Versions: ManifoldCF 2.10
> Environment: Manifold 2.10 patched for #1512, #1517
>Reporter: James Thomas
>Assignee: Karl Wright
>Priority: Major
> Fix For: ManifoldCF 2.12
>
> Attachments: 2018-09-19_1758.png, CONNECTORS-1532.patch, DCTM.java, 
> logging_patch.diff
>
>
> If I have a MF job which is connecting a specific folder, F, in Documentum to 
> a File System output then:
> 1. deleting files in Documentum shows them as zero size in the file system
> 2. moving files out of F does not remove them or zero them in the file system
> Note that moving a file from another folder (which the job is not looking at) 
> to F has the same effect as adding it to F by e.g. importing it in DM or 
> POSTing it to DM via the REST interface.
> Intuitively, I expect that moving a file out of the "view" of the Documentum 
> connector would have the same effect on the File System as deleting it. (My 
> model here is of MF synchronising content between the Paths (DM) and the 
> Output Path (File System) that I have specified in the job.)
> Starting point, I have run the MF job to fetch a bunch of files from a folder 
> - call it F - in DM (i.e. I have configured Paths in the job to be F). This 
> is what 'ls -l' on the file system looks like:
> {code:java}
> -rw-r--r--. 1 root i2e  12541 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c0
> -rw-r--r--. 1 root i2e 26 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7be
> -rw-r--r--. 1 root i2e  85772 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c7
> -rw-r--r--. 1 root i2e   8790 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c2
> -rw-r--r--. 1 root i2e 101888 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c3
> -rw-r--r--. 1 root i2e  32783 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c4
> -rw-r--r--. 1 root i2e  23040 Sep 19 07:22 
> drl?versionLabel=CURRENT=09018000f7c1
> -rw-r--r--. 1 root i2e  26112 Sep 19 07:22 
> drl?versionLabel=CURRENT=09018000f7bf{code}
> In DM, I delete one of the files in F and it shows as zero size, and the 
> modification date has changed:
> {code:java}
> -rw-r--r--. 1 root i2e  12541 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c0
> -rw-r--r--. 1 root i2e 26 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7be
> -rw-r--r--. 1 root i2e   8790 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c2
> -rw-r--r--. 1 root i2e 101888 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c3
> -rw-r--r--. 1 root i2e  32783 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c4
> -rw-r--r--. 1 root i2e  23040 Sep 19 07:22 
> drl?versionLabel=CURRENT=09018000f7c1
> -rw-r--r--. 1 root i2e  26112 Sep 19 07:22 
> drl?versionLabel=CURRENT=09018000f7bf
> -rw-r--r--. 1 root i2e  0 Sep 19 07:23 
> drl?versionLabel=CURRENT=09018000f7c7{code}
> In DM, I move a file from F to another folder. (Right click, add to 
> clipboard, go to new folder, Edit> Move here). 
> The file shows as modified (07:25), but is still apparently in F (i.e. in the 
> Path my MF job is looking at):
> {code:java}
> -rw-r--r--. 1 root i2e  12541 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c0
> -rw-r--r--. 1 root i2e 26 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7be
> -rw-r--r--. 1 root i2e   8790 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c2
> -rw-r--r--. 1 root i2e 101888 Sep 19 07:21 
> 

[jira] [Commented] (CONNECTORS-1532) Moving a file outside of the job's Paths is not the same as deleting it

2018-09-30 Thread James Thomas (JIRA)


[ 
https://issues.apache.org/jira/browse/CONNECTORS-1532?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=1665#comment-1665
 ] 

James Thomas commented on CONNECTORS-1532:
--

{quote}I would also like to know if it is possible to change the mime type of a 
document after it has been created; if so, a check also needs to be added for 
that situation.
{quote}
[~kwri...@metacarta.com] I believe that it is possible to do this, assuming 
that the MIME type is what is shown in the Format field for a file when viewing 
it in  DM's Documentum Administrator.

To change that:
 * right-click on the file and choose Properties
 * Check Show all Properties
 * Search for "format" in the page
 * Edit the field associated with the format
 * Click OK

The format field value has now changed

If there is a specific query I can run for you to check that this corresponds 
to whatever the DM connector is looking for, please say.

> Moving a file outside of the job's Paths is not the same as deleting it
> ---
>
> Key: CONNECTORS-1532
> URL: https://issues.apache.org/jira/browse/CONNECTORS-1532
> Project: ManifoldCF
>  Issue Type: Bug
>  Components: Documentum connector
>Affects Versions: ManifoldCF 2.10
> Environment: Manifold 2.10 patched for #1512, #1517
>Reporter: James Thomas
>Assignee: Karl Wright
>Priority: Major
> Fix For: ManifoldCF 2.12
>
> Attachments: 2018-09-19_1758.png, CONNECTORS-1532.patch, 
> logging_patch.diff
>
>
> If I have a MF job which is connecting a specific folder, F, in Documentum to 
> a File System output then:
> 1. deleting files in Documentum shows them as zero size in the file system
> 2. moving files out of F does not remove them or zero them in the file system
> Note that moving a file from another folder (which the job is not looking at) 
> to F has the same effect as adding it to F by e.g. importing it in DM or 
> POSTing it to DM via the REST interface.
> Intuitively, I expect that moving a file out of the "view" of the Documentum 
> connector would have the same effect on the File System as deleting it. (My 
> model here is of MF synchronising content between the Paths (DM) and the 
> Output Path (File System) that I have specified in the job.)
> Starting point, I have run the MF job to fetch a bunch of files from a folder 
> - call it F - in DM (i.e. I have configured Paths in the job to be F). This 
> is what 'ls -l' on the file system looks like:
> {code:java}
> -rw-r--r--. 1 root i2e  12541 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c0
> -rw-r--r--. 1 root i2e 26 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7be
> -rw-r--r--. 1 root i2e  85772 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c7
> -rw-r--r--. 1 root i2e   8790 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c2
> -rw-r--r--. 1 root i2e 101888 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c3
> -rw-r--r--. 1 root i2e  32783 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c4
> -rw-r--r--. 1 root i2e  23040 Sep 19 07:22 
> drl?versionLabel=CURRENT=09018000f7c1
> -rw-r--r--. 1 root i2e  26112 Sep 19 07:22 
> drl?versionLabel=CURRENT=09018000f7bf{code}
> In DM, I delete one of the files in F and it shows as zero size, and the 
> modification date has changed:
> {code:java}
> -rw-r--r--. 1 root i2e  12541 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c0
> -rw-r--r--. 1 root i2e 26 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7be
> -rw-r--r--. 1 root i2e   8790 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c2
> -rw-r--r--. 1 root i2e 101888 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c3
> -rw-r--r--. 1 root i2e  32783 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c4
> -rw-r--r--. 1 root i2e  23040 Sep 19 07:22 
> drl?versionLabel=CURRENT=09018000f7c1
> -rw-r--r--. 1 root i2e  26112 Sep 19 07:22 
> drl?versionLabel=CURRENT=09018000f7bf
> -rw-r--r--. 1 root i2e  0 Sep 19 07:23 
> drl?versionLabel=CURRENT=09018000f7c7{code}
> In DM, I move a file from F to another folder. (Right click, add to 
> clipboard, go to new folder, Edit> Move here). 
> The file shows as modified (07:25), but is still apparently in F (i.e. in the 
> Path my MF job is looking at):
> {code:java}
> -rw-r--r--. 1 root i2e  12541 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c0
> -rw-r--r--. 1 root i2e 26 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7be
> -rw-r--r--. 1 root i2e   8790 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c2
> -rw-r--r--. 1 root i2e 101888 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c3
> -rw-r--r--. 1 root i2e  23040 Sep 19 07:22 
> drl?versionLabel=CURRENT=09018000f7c1
> -rw-r--r--. 1 root i2e  26112 

[jira] [Commented] (CONNECTORS-1532) Moving a file outside of the job's Paths is not the same as deleting it

2018-09-30 Thread James Thomas (JIRA)


[ 
https://issues.apache.org/jira/browse/CONNECTORS-1532?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=1663#comment-1663
 ] 

James Thomas commented on CONNECTORS-1532:
--

{quote}I have attached a patch for you to try. Please let me know if it 
addresses the folder issue
{quote}
[~kwri...@metacarta.com] I don't see any significant change in behaviour using 
the same repro steps as comment-16623888.

The file is still shown on the file system at full size after the second run of 
the job. Here are the log and file system details of this run:

 
{code:java}
INFO 2018-09-30T09:34:38.276Z (Startup thread) - Preparing non-continuous 
non-partial, either MODEL_ALL or fromBeginningOfTime, 1538299958323 for run: 
prepareFullScan
INFO 2018-09-30T09:35:58.444Z (Startup thread) - Preparing incremental scan for 
1538299958323: prepareIncrementalScan

-rw-r--r--. 1 root root 27 Sep 30 11:34 
drl?versionLabel=CURRENT=09018000fc6b
-rw-r--r--. 1 root root 27 Sep 30 11:36 
drl?versionLabel=CURRENT=09018000fc6b{code}
 

I went on and repeated the repro in the same state just to see what might 
happen (essentially the same), then I reset seeding on the job and ran it 
again. Here's the file system and logs for that:

 
{code:java}
 INFO 2018-09-30T09:48:51.303Z (Startup thread) - Preparing incremental scan 
for 1538299958323: prepareIncrementalScan
 INFO 2018-09-30T09:50:06.483Z (Startup thread) - Preparing incremental scan 
for 1538299958323: prepareIncrementalScan
 INFO 2018-09-30T09:51:06.800Z (Startup thread) - Preparing non-continuous 
non-partial, either MODEL_ALL or fromBeginningOfTime, 1538299958323 for run: 
prepareFullScan
$ ### after adding another file
-rw-r--r--. 1 root root 27 Sep 30 11:36 
drl?versionLabel=CURRENT=09018000fc6b
-rw-r--r--. 1 root root 27 Sep 30 11:49 
drl?versionLabel=CURRENT=09018000fc6c
$ ### after running the job again
-rw-r--r--. 1 root root 27 Sep 30 11:36 
drl?versionLabel=CURRENT=09018000fc6b
-rw-r--r--. 1 root root 27 Sep 30 11:50 
drl?versionLabel=CURRENT=09018000fc6c
$ ## after resetting seeding and running the job
-rw-r--r--. 1 root root 0 Sep 30 11:51 
drl?versionLabel=CURRENT=09018000fc6b
-rw-r--r--. 1 root root 0 Sep 30 11:51 
drl?versionLabel=CURRENT=09018000fc6c
{code}
So it appears that reseeding can give the desired outcome.

FYI, I applied this patch on top of the logging patch from this ticket, which 
is itself on top of 2.10 patched for #1512, #1517:

 
{code:java}
$ wget 
https://issues.apache.org/jira/secure/attachment/12940883/CONNECTORS-1532.patch
$ dos2unix CONNECTORS-1532.patch
$ dos2unix 
connectors/documentum/connector/src/main/java/org/apache/manifoldcf/crawler/connectors/DCTM/DCTM.java
$ patch -p0 -i CONNECTORS-1532.patch
$ ant build
$ find dist -type f -exec ls -l {} \; > /tmp/diff{code}
 

Here's the set of files that changed after I applied your patch:

 
{code:java}
$ grep Sep\ 30 /tmp/diff | grep jar
-rw-rw-r-- 1 james staff 12544 Sep 30 10:01 
dist/connector-lib/mcf-documentum-connector-rmistub.jar
-rw-rw-r-- 1 james staff 100864 Sep 30 10:01 
dist/connector-lib/mcf-documentum-connector.jar
-rw-rw-r-- 1 james staff 6292 Sep 30 10:01 
dist/connector-lib/mcf-filenet-connector-rmistub.jar
-rw-rw-r-- 1 james staff 3916082 Sep 30 10:02 
dist/connector-lib/mcf-meridio-connector.jar
-rw-rw-r-- 1 james staff 838582 Sep 30 10:02 
dist/connector-lib/mcf-sharepoint-connector.jar
-rw-rw-r-- 1 james staff 8567 Sep 30 10:01 
dist/processes/documentum-server/lib/mcf-documentum-connector-rmiskel.jar
-rw-rw-r-- 1 james staff 4494 Sep 30 10:01 
dist/processes/filenet-server/lib/mcf-filenet-connector-rmiskel.jar
{code}
I stopped my MFC instance and the DM processes, applied the changed files, and 
restarted DM processes and MFC server. Then attempted the repro I described 
above.

It's possible that I haven't applied the patch correctly. Is there something I 
can do to check?

 

 

 

> Moving a file outside of the job's Paths is not the same as deleting it
> ---
>
> Key: CONNECTORS-1532
> URL: https://issues.apache.org/jira/browse/CONNECTORS-1532
> Project: ManifoldCF
>  Issue Type: Bug
>  Components: Documentum connector
>Affects Versions: ManifoldCF 2.10
> Environment: Manifold 2.10 patched for #1512, #1517
>Reporter: James Thomas
>Assignee: Karl Wright
>Priority: Major
> Fix For: ManifoldCF 2.12
>
> Attachments: 2018-09-19_1758.png, CONNECTORS-1532.patch, 
> logging_patch.diff
>
>
> If I have a MF job which is connecting a specific folder, F, in Documentum to 
> a File System output then:
> 1. deleting files in Documentum shows them as zero size in the file system
> 2. moving files out of F does not remove them or zero them in the file system
> Note that moving a file 

[jira] [Commented] (CONNECTORS-1534) File System connector "deletes" files by making them zero bytes

2018-09-21 Thread James Thomas (JIRA)


[ 
https://issues.apache.org/jira/browse/CONNECTORS-1534?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16624497#comment-16624497
 ] 

James Thomas commented on CONNECTORS-1534:
--

{quote}For backwards compatibility reasons, you'd therefore want to add 
specification information to the connector describing how you want deletions to 
be handled, which is basically a significant enhancement to the connector, so I 
would ask if this is really something you need or just something you don't like.
{quote}
I find it (very) unexpected but it's not breaking anything for me.

Given what I know, I think that from the consumer of the FS connector's output 
it's (at best) ambiguous whether the file originated at zero size or existed 
and was then deleted. This may make a difference in some applications.

At the moment  – as you know :) – I'm primiarly working with the Documentum 
repository connector and it won't fetch zero size files 
(https://issues.apache.org/jira/browse/CONNECTORS-1526) so I don't have the 
ambiguity. I'm also only really using the FS connector as a tool for testing so 
far.

 

> File System connector "deletes" files by making them zero bytes
> ---
>
> Key: CONNECTORS-1534
> URL: https://issues.apache.org/jira/browse/CONNECTORS-1534
> Project: ManifoldCF
>  Issue Type: Bug
>  Components: File system connector
>Affects Versions: ManifoldCF 2.10
>Reporter: James Thomas
>Assignee: Karl Wright
>Priority: Major
> Fix For: ManifoldCF next
>
>
> In https://issues.apache.org/jira/browse/CONNECTORS-1532 I observed that 
> deleting a file from a repository does not (reliably?) delete it from a File 
> System output. Instead the output file remains but is given zero size and a 
> new modification time.
> I would expect that a file deletion in the repository results in file 
> deletion in the output.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Commented] (CONNECTORS-1532) Moving a file outside of the job's Paths is not the same as deleting it

2018-09-21 Thread James Thomas (JIRA)


[ 
https://issues.apache.org/jira/browse/CONNECTORS-1532?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16624487#comment-16624487
 ] 

James Thomas commented on CONNECTORS-1532:
--

OK. Thanks for continuing to pursue this.

I have made a new ticket for the File System connector apparently zeroing files 
instead of deleting them, https://issues.apache.org/jira/browse/CONNECTORS-1534

> Moving a file outside of the job's Paths is not the same as deleting it
> ---
>
> Key: CONNECTORS-1532
> URL: https://issues.apache.org/jira/browse/CONNECTORS-1532
> Project: ManifoldCF
>  Issue Type: Bug
>  Components: Documentum connector
>Affects Versions: ManifoldCF 2.10
> Environment: Manifold 2.10 patched for #1512, #1517
>Reporter: James Thomas
>Assignee: Karl Wright
>Priority: Major
> Fix For: ManifoldCF 2.12
>
> Attachments: 2018-09-19_1758.png, logging_patch.diff
>
>
> If I have a MF job which is connecting a specific folder, F, in Documentum to 
> a File System output then:
> 1. deleting files in Documentum shows them as zero size in the file system
> 2. moving files out of F does not remove them or zero them in the file system
> Note that moving a file from another folder (which the job is not looking at) 
> to F has the same effect as adding it to F by e.g. importing it in DM or 
> POSTing it to DM via the REST interface.
> Intuitively, I expect that moving a file out of the "view" of the Documentum 
> connector would have the same effect on the File System as deleting it. (My 
> model here is of MF synchronising content between the Paths (DM) and the 
> Output Path (File System) that I have specified in the job.)
> Starting point, I have run the MF job to fetch a bunch of files from a folder 
> - call it F - in DM (i.e. I have configured Paths in the job to be F). This 
> is what 'ls -l' on the file system looks like:
> {code:java}
> -rw-r--r--. 1 root i2e  12541 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c0
> -rw-r--r--. 1 root i2e 26 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7be
> -rw-r--r--. 1 root i2e  85772 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c7
> -rw-r--r--. 1 root i2e   8790 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c2
> -rw-r--r--. 1 root i2e 101888 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c3
> -rw-r--r--. 1 root i2e  32783 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c4
> -rw-r--r--. 1 root i2e  23040 Sep 19 07:22 
> drl?versionLabel=CURRENT=09018000f7c1
> -rw-r--r--. 1 root i2e  26112 Sep 19 07:22 
> drl?versionLabel=CURRENT=09018000f7bf{code}
> In DM, I delete one of the files in F and it shows as zero size, and the 
> modification date has changed:
> {code:java}
> -rw-r--r--. 1 root i2e  12541 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c0
> -rw-r--r--. 1 root i2e 26 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7be
> -rw-r--r--. 1 root i2e   8790 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c2
> -rw-r--r--. 1 root i2e 101888 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c3
> -rw-r--r--. 1 root i2e  32783 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c4
> -rw-r--r--. 1 root i2e  23040 Sep 19 07:22 
> drl?versionLabel=CURRENT=09018000f7c1
> -rw-r--r--. 1 root i2e  26112 Sep 19 07:22 
> drl?versionLabel=CURRENT=09018000f7bf
> -rw-r--r--. 1 root i2e  0 Sep 19 07:23 
> drl?versionLabel=CURRENT=09018000f7c7{code}
> In DM, I move a file from F to another folder. (Right click, add to 
> clipboard, go to new folder, Edit> Move here). 
> The file shows as modified (07:25), but is still apparently in F (i.e. in the 
> Path my MF job is looking at):
> {code:java}
> -rw-r--r--. 1 root i2e  12541 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c0
> -rw-r--r--. 1 root i2e 26 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7be
> -rw-r--r--. 1 root i2e   8790 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c2
> -rw-r--r--. 1 root i2e 101888 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c3
> -rw-r--r--. 1 root i2e  23040 Sep 19 07:22 
> drl?versionLabel=CURRENT=09018000f7c1
> -rw-r--r--. 1 root i2e  26112 Sep 19 07:22 
> drl?versionLabel=CURRENT=09018000f7bf
> -rw-r--r--. 1 root i2e  0 Sep 19 07:23 
> drl?versionLabel=CURRENT=09018000f7c7
> -rw-r--r--. 1 root i2e  32783 Sep 19 07:25 
> drl?versionLabel=CURRENT=09018000f7c4{code}
> In DM, I move a file from another folder to F and it shows up with the 
> timestamp of the move (07:28):
> {code:java}
> -rw-r--r--. 1 root i2e  12541 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c0
> -rw-r--r--. 1 root i2e 26 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7be
> -rw-r--r--. 1 root 

[jira] [Created] (CONNECTORS-1534) File System connector "deletes" files by making them zero bytes

2018-09-21 Thread James Thomas (JIRA)
James Thomas created CONNECTORS-1534:


 Summary: File System connector "deletes" files by making them zero 
bytes
 Key: CONNECTORS-1534
 URL: https://issues.apache.org/jira/browse/CONNECTORS-1534
 Project: ManifoldCF
  Issue Type: Bug
  Components: File system connector
Affects Versions: ManifoldCF 2.10
Reporter: James Thomas


In https://issues.apache.org/jira/browse/CONNECTORS-1532 I observed that 
deleting a file from a repository does not (reliably?) delete it from a File 
System output. Instead the output file remains but is given zero size and a new 
modification time.

I would expect that a file deletion in the repository results in file deletion 
in the output.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Commented] (CONNECTORS-1532) Moving a file outside of the job's Paths is not the same as deleting it

2018-09-21 Thread James Thomas (JIRA)


[ 
https://issues.apache.org/jira/browse/CONNECTORS-1532?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16624105#comment-16624105
 ] 

James Thomas commented on CONNECTORS-1532:
--

The data is for two runs of the job, as in the repro steps: there are two runs 
of 'ls -l' and two log messages. (Or perhaps I'm misunderstanding your question)

> Moving a file outside of the job's Paths is not the same as deleting it
> ---
>
> Key: CONNECTORS-1532
> URL: https://issues.apache.org/jira/browse/CONNECTORS-1532
> Project: ManifoldCF
>  Issue Type: Bug
>  Components: Documentum connector
>Affects Versions: ManifoldCF 2.10
> Environment: Manifold 2.10 patched for #1512, #1517
>Reporter: James Thomas
>Assignee: Karl Wright
>Priority: Major
> Fix For: ManifoldCF 2.12
>
> Attachments: 2018-09-19_1758.png, logging_patch.diff
>
>
> If I have a MF job which is connecting a specific folder, F, in Documentum to 
> a File System output then:
> 1. deleting files in Documentum shows them as zero size in the file system
> 2. moving files out of F does not remove them or zero them in the file system
> Note that moving a file from another folder (which the job is not looking at) 
> to F has the same effect as adding it to F by e.g. importing it in DM or 
> POSTing it to DM via the REST interface.
> Intuitively, I expect that moving a file out of the "view" of the Documentum 
> connector would have the same effect on the File System as deleting it. (My 
> model here is of MF synchronising content between the Paths (DM) and the 
> Output Path (File System) that I have specified in the job.)
> Starting point, I have run the MF job to fetch a bunch of files from a folder 
> - call it F - in DM (i.e. I have configured Paths in the job to be F). This 
> is what 'ls -l' on the file system looks like:
> {code:java}
> -rw-r--r--. 1 root i2e  12541 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c0
> -rw-r--r--. 1 root i2e 26 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7be
> -rw-r--r--. 1 root i2e  85772 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c7
> -rw-r--r--. 1 root i2e   8790 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c2
> -rw-r--r--. 1 root i2e 101888 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c3
> -rw-r--r--. 1 root i2e  32783 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c4
> -rw-r--r--. 1 root i2e  23040 Sep 19 07:22 
> drl?versionLabel=CURRENT=09018000f7c1
> -rw-r--r--. 1 root i2e  26112 Sep 19 07:22 
> drl?versionLabel=CURRENT=09018000f7bf{code}
> In DM, I delete one of the files in F and it shows as zero size, and the 
> modification date has changed:
> {code:java}
> -rw-r--r--. 1 root i2e  12541 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c0
> -rw-r--r--. 1 root i2e 26 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7be
> -rw-r--r--. 1 root i2e   8790 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c2
> -rw-r--r--. 1 root i2e 101888 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c3
> -rw-r--r--. 1 root i2e  32783 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c4
> -rw-r--r--. 1 root i2e  23040 Sep 19 07:22 
> drl?versionLabel=CURRENT=09018000f7c1
> -rw-r--r--. 1 root i2e  26112 Sep 19 07:22 
> drl?versionLabel=CURRENT=09018000f7bf
> -rw-r--r--. 1 root i2e  0 Sep 19 07:23 
> drl?versionLabel=CURRENT=09018000f7c7{code}
> In DM, I move a file from F to another folder. (Right click, add to 
> clipboard, go to new folder, Edit> Move here). 
> The file shows as modified (07:25), but is still apparently in F (i.e. in the 
> Path my MF job is looking at):
> {code:java}
> -rw-r--r--. 1 root i2e  12541 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c0
> -rw-r--r--. 1 root i2e 26 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7be
> -rw-r--r--. 1 root i2e   8790 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c2
> -rw-r--r--. 1 root i2e 101888 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c3
> -rw-r--r--. 1 root i2e  23040 Sep 19 07:22 
> drl?versionLabel=CURRENT=09018000f7c1
> -rw-r--r--. 1 root i2e  26112 Sep 19 07:22 
> drl?versionLabel=CURRENT=09018000f7bf
> -rw-r--r--. 1 root i2e  0 Sep 19 07:23 
> drl?versionLabel=CURRENT=09018000f7c7
> -rw-r--r--. 1 root i2e  32783 Sep 19 07:25 
> drl?versionLabel=CURRENT=09018000f7c4{code}
> In DM, I move a file from another folder to F and it shows up with the 
> timestamp of the move (07:28):
> {code:java}
> -rw-r--r--. 1 root i2e  12541 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c0
> -rw-r--r--. 1 root i2e 26 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7be
> -rw-r--r--. 1 root i2e   8790 Sep 19 07:21 
> 

[jira] [Commented] (CONNECTORS-1532) Moving a file outside of the job's Paths is not the same as deleting it

2018-09-21 Thread James Thomas (JIRA)


[ 
https://issues.apache.org/jira/browse/CONNECTORS-1532?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16623888#comment-16623888
 ] 

James Thomas commented on CONNECTORS-1532:
--

[~kwri...@metacarta.com] I had some difficulty getting your patch to show 
anything in the log file until I'd added an entry to the properties.xml file. 

Here's the config, the manifoldcf.log lines and the file system timestamps:

 
{noformat}
 {noformat}
 

 
{noformat}
 INFO 2018-09-21T16:51:08.645Z (Startup thread) - Preparing non-continuous 
non-partial, either MODEL_ALL or fromBeginningOfTime, 1537332478051 for run: 
prepareFullScan
 INFO 2018-09-21T16:55:48.781Z (Startup thread) - Preparing incremental scan 
for 1537332478051: prepareIncrementalScan
{noformat}
 

 
{noformat}
-rw-r--r--. 1 root i2e 27 Sep 21 18:51 
drl?versionLabel=CURRENT=09018000fc6a 
-rw-r--r--. 1 root i2e 27 Sep 21 18:55 
drl?versionLabel=CURRENT=09018000fc6a
{noformat}
My repro was as earlier:
 * In DM I created a folder F containing a single file X
 * In MF I configured a job to look at F and output to File System
 * In MF I ran the job
 * On the File System I ran 'ls -l'  and found F
 * In DM I moved X to a folder outside of F (Right click, add to clipboard, go 
to new folder, Edit> Move here).
 * In MF I ran the job again
 * On the File System I ran 'ls' again

> Moving a file outside of the job's Paths is not the same as deleting it
> ---
>
> Key: CONNECTORS-1532
> URL: https://issues.apache.org/jira/browse/CONNECTORS-1532
> Project: ManifoldCF
>  Issue Type: Bug
>  Components: Documentum connector
>Affects Versions: ManifoldCF 2.10
> Environment: Manifold 2.10 patched for #1512, #1517
>Reporter: James Thomas
>Assignee: Karl Wright
>Priority: Major
> Fix For: ManifoldCF 2.12
>
> Attachments: 2018-09-19_1758.png, logging_patch.diff
>
>
> If I have a MF job which is connecting a specific folder, F, in Documentum to 
> a File System output then:
> 1. deleting files in Documentum shows them as zero size in the file system
> 2. moving files out of F does not remove them or zero them in the file system
> Note that moving a file from another folder (which the job is not looking at) 
> to F has the same effect as adding it to F by e.g. importing it in DM or 
> POSTing it to DM via the REST interface.
> Intuitively, I expect that moving a file out of the "view" of the Documentum 
> connector would have the same effect on the File System as deleting it. (My 
> model here is of MF synchronising content between the Paths (DM) and the 
> Output Path (File System) that I have specified in the job.)
> Starting point, I have run the MF job to fetch a bunch of files from a folder 
> - call it F - in DM (i.e. I have configured Paths in the job to be F). This 
> is what 'ls -l' on the file system looks like:
> {code:java}
> -rw-r--r--. 1 root i2e  12541 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c0
> -rw-r--r--. 1 root i2e 26 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7be
> -rw-r--r--. 1 root i2e  85772 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c7
> -rw-r--r--. 1 root i2e   8790 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c2
> -rw-r--r--. 1 root i2e 101888 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c3
> -rw-r--r--. 1 root i2e  32783 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c4
> -rw-r--r--. 1 root i2e  23040 Sep 19 07:22 
> drl?versionLabel=CURRENT=09018000f7c1
> -rw-r--r--. 1 root i2e  26112 Sep 19 07:22 
> drl?versionLabel=CURRENT=09018000f7bf{code}
> In DM, I delete one of the files in F and it shows as zero size, and the 
> modification date has changed:
> {code:java}
> -rw-r--r--. 1 root i2e  12541 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c0
> -rw-r--r--. 1 root i2e 26 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7be
> -rw-r--r--. 1 root i2e   8790 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c2
> -rw-r--r--. 1 root i2e 101888 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c3
> -rw-r--r--. 1 root i2e  32783 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c4
> -rw-r--r--. 1 root i2e  23040 Sep 19 07:22 
> drl?versionLabel=CURRENT=09018000f7c1
> -rw-r--r--. 1 root i2e  26112 Sep 19 07:22 
> drl?versionLabel=CURRENT=09018000f7bf
> -rw-r--r--. 1 root i2e  0 Sep 19 07:23 
> drl?versionLabel=CURRENT=09018000f7c7{code}
> In DM, I move a file from F to another folder. (Right click, add to 
> clipboard, go to new folder, Edit> Move here). 
> The file shows as modified (07:25), but is still apparently in F (i.e. in the 
> Path my MF job is looking at):
> {code:java}
> -rw-r--r--. 1 root i2e  12541 Sep 19 07:21 
> 

[jira] [Commented] (CONNECTORS-1532) Moving a file outside of the job's Paths is not the same as deleting it

2018-09-20 Thread James Thomas (JIRA)


[ 
https://issues.apache.org/jira/browse/CONNECTORS-1532?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16622188#comment-16622188
 ] 

James Thomas commented on CONNECTORS-1532:
--

[~kwri...@metacarta.com] yep, I've patched and it's building right now.

> Moving a file outside of the job's Paths is not the same as deleting it
> ---
>
> Key: CONNECTORS-1532
> URL: https://issues.apache.org/jira/browse/CONNECTORS-1532
> Project: ManifoldCF
>  Issue Type: Bug
>  Components: Documentum connector
>Affects Versions: ManifoldCF 2.10
> Environment: Manifold 2.10 patched for #1512, #1517
>Reporter: James Thomas
>Assignee: Karl Wright
>Priority: Major
> Fix For: ManifoldCF 2.12
>
> Attachments: 2018-09-19_1758.png, logging_patch.diff
>
>
> If I have a MF job which is connecting a specific folder, F, in Documentum to 
> a File System output then:
> 1. deleting files in Documentum shows them as zero size in the file system
> 2. moving files out of F does not remove them or zero them in the file system
> Note that moving a file from another folder (which the job is not looking at) 
> to F has the same effect as adding it to F by e.g. importing it in DM or 
> POSTing it to DM via the REST interface.
> Intuitively, I expect that moving a file out of the "view" of the Documentum 
> connector would have the same effect on the File System as deleting it. (My 
> model here is of MF synchronising content between the Paths (DM) and the 
> Output Path (File System) that I have specified in the job.)
> Starting point, I have run the MF job to fetch a bunch of files from a folder 
> - call it F - in DM (i.e. I have configured Paths in the job to be F). This 
> is what 'ls -l' on the file system looks like:
> {code:java}
> -rw-r--r--. 1 root i2e  12541 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c0
> -rw-r--r--. 1 root i2e 26 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7be
> -rw-r--r--. 1 root i2e  85772 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c7
> -rw-r--r--. 1 root i2e   8790 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c2
> -rw-r--r--. 1 root i2e 101888 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c3
> -rw-r--r--. 1 root i2e  32783 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c4
> -rw-r--r--. 1 root i2e  23040 Sep 19 07:22 
> drl?versionLabel=CURRENT=09018000f7c1
> -rw-r--r--. 1 root i2e  26112 Sep 19 07:22 
> drl?versionLabel=CURRENT=09018000f7bf{code}
> In DM, I delete one of the files in F and it shows as zero size, and the 
> modification date has changed:
> {code:java}
> -rw-r--r--. 1 root i2e  12541 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c0
> -rw-r--r--. 1 root i2e 26 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7be
> -rw-r--r--. 1 root i2e   8790 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c2
> -rw-r--r--. 1 root i2e 101888 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c3
> -rw-r--r--. 1 root i2e  32783 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c4
> -rw-r--r--. 1 root i2e  23040 Sep 19 07:22 
> drl?versionLabel=CURRENT=09018000f7c1
> -rw-r--r--. 1 root i2e  26112 Sep 19 07:22 
> drl?versionLabel=CURRENT=09018000f7bf
> -rw-r--r--. 1 root i2e  0 Sep 19 07:23 
> drl?versionLabel=CURRENT=09018000f7c7{code}
> In DM, I move a file from F to another folder. (Right click, add to 
> clipboard, go to new folder, Edit> Move here). 
> The file shows as modified (07:25), but is still apparently in F (i.e. in the 
> Path my MF job is looking at):
> {code:java}
> -rw-r--r--. 1 root i2e  12541 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c0
> -rw-r--r--. 1 root i2e 26 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7be
> -rw-r--r--. 1 root i2e   8790 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c2
> -rw-r--r--. 1 root i2e 101888 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c3
> -rw-r--r--. 1 root i2e  23040 Sep 19 07:22 
> drl?versionLabel=CURRENT=09018000f7c1
> -rw-r--r--. 1 root i2e  26112 Sep 19 07:22 
> drl?versionLabel=CURRENT=09018000f7bf
> -rw-r--r--. 1 root i2e  0 Sep 19 07:23 
> drl?versionLabel=CURRENT=09018000f7c7
> -rw-r--r--. 1 root i2e  32783 Sep 19 07:25 
> drl?versionLabel=CURRENT=09018000f7c4{code}
> In DM, I move a file from another folder to F and it shows up with the 
> timestamp of the move (07:28):
> {code:java}
> -rw-r--r--. 1 root i2e  12541 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c0
> -rw-r--r--. 1 root i2e 26 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7be
> -rw-r--r--. 1 root i2e   8790 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c2
> -rw-r--r--. 1 root i2e 101888 Sep 19 07:21 
> 

[jira] [Commented] (CONNECTORS-1532) Moving a file outside of the job's Paths is not the same as deleting it

2018-09-20 Thread James Thomas (JIRA)


[ 
https://issues.apache.org/jira/browse/CONNECTORS-1532?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16621894#comment-16621894
 ] 

James Thomas commented on CONNECTORS-1532:
--

DQL experiment:
 * In DM I created a folder F containing a single file X
 * In DM's DQL too (From Document Administrator> Tools> Dql Editor), I ran the 
query you gave
 * An object Id was returned
 * I confirmed that was the ObjectId of X (via right-click> Properties on X)

 * In DM I moved X to a folder outside of F (By right click, add to clipboard, 
go to new folder, Edit> Move here).
 * In DM's DQL tool, I ran the query you gave
 * No object Id was returned

 

> Moving a file outside of the job's Paths is not the same as deleting it
> ---
>
> Key: CONNECTORS-1532
> URL: https://issues.apache.org/jira/browse/CONNECTORS-1532
> Project: ManifoldCF
>  Issue Type: Bug
>  Components: Documentum connector
>Affects Versions: ManifoldCF 2.10
> Environment: Manifold 2.10 patched for #1512, #1517
>Reporter: James Thomas
>Assignee: Karl Wright
>Priority: Major
> Fix For: ManifoldCF 2.12
>
> Attachments: 2018-09-19_1758.png
>
>
> If I have a MF job which is connecting a specific folder, F, in Documentum to 
> a File System output then:
> 1. deleting files in Documentum shows them as zero size in the file system
> 2. moving files out of F does not remove them or zero them in the file system
> Note that moving a file from another folder (which the job is not looking at) 
> to F has the same effect as adding it to F by e.g. importing it in DM or 
> POSTing it to DM via the REST interface.
> Intuitively, I expect that moving a file out of the "view" of the Documentum 
> connector would have the same effect on the File System as deleting it. (My 
> model here is of MF synchronising content between the Paths (DM) and the 
> Output Path (File System) that I have specified in the job.)
> Starting point, I have run the MF job to fetch a bunch of files from a folder 
> - call it F - in DM (i.e. I have configured Paths in the job to be F). This 
> is what 'ls -l' on the file system looks like:
> {code:java}
> -rw-r--r--. 1 root i2e  12541 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c0
> -rw-r--r--. 1 root i2e 26 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7be
> -rw-r--r--. 1 root i2e  85772 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c7
> -rw-r--r--. 1 root i2e   8790 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c2
> -rw-r--r--. 1 root i2e 101888 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c3
> -rw-r--r--. 1 root i2e  32783 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c4
> -rw-r--r--. 1 root i2e  23040 Sep 19 07:22 
> drl?versionLabel=CURRENT=09018000f7c1
> -rw-r--r--. 1 root i2e  26112 Sep 19 07:22 
> drl?versionLabel=CURRENT=09018000f7bf{code}
> In DM, I delete one of the files in F and it shows as zero size, and the 
> modification date has changed:
> {code:java}
> -rw-r--r--. 1 root i2e  12541 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c0
> -rw-r--r--. 1 root i2e 26 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7be
> -rw-r--r--. 1 root i2e   8790 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c2
> -rw-r--r--. 1 root i2e 101888 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c3
> -rw-r--r--. 1 root i2e  32783 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c4
> -rw-r--r--. 1 root i2e  23040 Sep 19 07:22 
> drl?versionLabel=CURRENT=09018000f7c1
> -rw-r--r--. 1 root i2e  26112 Sep 19 07:22 
> drl?versionLabel=CURRENT=09018000f7bf
> -rw-r--r--. 1 root i2e  0 Sep 19 07:23 
> drl?versionLabel=CURRENT=09018000f7c7{code}
> In DM, I move a file from F to another folder. (Right click, add to 
> clipboard, go to new folder, Edit> Move here). 
> The file shows as modified (07:25), but is still apparently in F (i.e. in the 
> Path my MF job is looking at):
> {code:java}
> -rw-r--r--. 1 root i2e  12541 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c0
> -rw-r--r--. 1 root i2e 26 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7be
> -rw-r--r--. 1 root i2e   8790 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c2
> -rw-r--r--. 1 root i2e 101888 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c3
> -rw-r--r--. 1 root i2e  23040 Sep 19 07:22 
> drl?versionLabel=CURRENT=09018000f7c1
> -rw-r--r--. 1 root i2e  26112 Sep 19 07:22 
> drl?versionLabel=CURRENT=09018000f7bf
> -rw-r--r--. 1 root i2e  0 Sep 19 07:23 
> drl?versionLabel=CURRENT=09018000f7c7
> -rw-r--r--. 1 root i2e  32783 Sep 19 07:25 
> drl?versionLabel=CURRENT=09018000f7c4{code}
> In DM, I move a file from another folder to F and it shows up with the 
> 

[jira] [Commented] (CONNECTORS-1521) Documentum Connector users ManifoldCF's local time in queries constraints against the Documentum server without reference to time zones

2018-09-20 Thread James Thomas (JIRA)


[ 
https://issues.apache.org/jira/browse/CONNECTORS-1521?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16621700#comment-16621700
 ] 

James Thomas commented on CONNECTORS-1521:
--

{quote}I didn't have the necessary credentials to open support tickets with 
OpenText, but I've organised that now, so I'll go ahead and do it.
{quote}
Getting the appropriate credentials is taking much longer than we initially 
thought it would.

> Documentum Connector users ManifoldCF's local time in queries constraints 
> against the Documentum server without reference to time zones
> ---
>
> Key: CONNECTORS-1521
> URL: https://issues.apache.org/jira/browse/CONNECTORS-1521
> Project: ManifoldCF
>  Issue Type: Bug
>  Components: Documentum connector
>Affects Versions: ManifoldCF 2.10
>Reporter: James Thomas
>Assignee: Karl Wright
>Priority: Major
> Fix For: ManifoldCF 2.12
>
>
> I find that the time/date constraints in queries to the Documentum server are 
> based on the "raw" local time of the ManifoldCF server but appear to take no 
> account of the time zones of the two servers.
> This can lead to recently modified files not being transferred to the output 
> repository when you would naturally expect them to be. I'd like the times to 
> be aligned, perhaps by including time zone in the query. In particular, is 
> there a way to use UTC perhaps?
> Here's an example ...
>  * create a folder in Documentum
>  * set up a job to point at the folder and output to the file system
>  * put two documents into a folder in Documentum
>  * Select them, right click and export as CSV (to show the timestamps):
> {noformat}
> 1.png,48489.0,Portable Network Graphics,8/7/2018 9:04 AM,
> 2.png,28620.0,Portable Network Graphics,8/7/2018 9:04 AM,,{noformat}
> Check the local time on the ManifoldCF server machine. Observe that it's 
> reporting consistent time with the DM server:
> {noformat}
> [james@manifold]$ date
> Tue Aug  7 09:07:25 BST 2018{noformat}
> Start the job and look for the query to Documentum in the manifoldcf.log file 
> (line break added for readability):
> {noformat}
> DEBUG 2018-08-07T08:07:47.297Z (Startup thread) - DCTM: About to execute 
> query= (select for READ distinct i_chronicle_id from dm_document where 
> r_modify_date >= date('01/01/1970 00:00:00','mm/dd/ hh:mi:ss') and
> r_modify_date<=date('08/07/2018 08:07:34','mm/dd/ hh:mi:ss') 
> AND (i_is_deleted=TRUE Or (i_is_deleted=FALSE AND a_full_text=TRUE AND 
> r_content_size>0)) AND ( Folder('/Administrator/james', DESCEND) ))
> ^C{noformat}
> Notice that the latest date asked for is *before* the modification date of 
> the files added to DM. (And is an hour out, see footnote.)
>   
>  See whether anything has been output by the File System connector. It hasn't:
> {noformat}
> [james@manifold]$ ls /bigdisc/source/PDFs/timezones/
> [james@manifold]$
> {noformat}
> Now:
>  * change the timezone on the ManifoldCF server machine
>  * restart the ManifoldCF server and the Documentum processes
>  * reseed the job
> Check the local time on the ManifoldCF server machine; it has changed:
> {noformat}
> [james@manifold]$ date
> Tue Aug  7 10:10:29 CEST 2018{noformat}
> Start the job again and notice that the query has changed by an hour, plus 
> the few minutes it took to change the date etc (and is still an hour out, see 
> footnote):
> {noformat}
> r_modify_date<=date('08/07/2018 09:11:02','mm/dd/ hh:mi:ss') 
> {noformat}
> Observe that the range of dates now covers the timestamps on the DM data, and 
> also that some data has now been transferred by the File System connector:
> {noformat}
> [james@manifold]$ ls 
> /bigdisc/source/PDFs/timezones/http/mfserver\:8080/da/component/
> drl?versionLabel=CURRENT=09018000e515
> drl?versionLabel=CURRENT=09018000e516
> {noformat}
>  
>  
> [Footnote] It appears that something is trying to take account of Daylight 
> Saving Time too.
> If I set the server date to a time outside of DST, the query is aligned with 
> the current time:
> {noformat}
> [i2e@i2ehost manifold]$ date
>  Mon Oct 29 00:01:13 CET 2018
> r_modify_date<=date('10/29/2018 00:01:39','mm/dd/ hh:mi:ss') 
> {noformat}
> But if I set the time inside DST, the time is an hour before:
> {noformat}
> [i2e@i2ehost manifold]$ date
>  Sat Oct 27 00:00:06 CEST 2018
> r_modify_date<=date('10/26/2018 23:00:26','mm/dd/ hh:mi:ss') 
> {noformat}
> This is perhaps a Java issue rather than a logic issue in the connector? See 
> e.g. [https://stackoverflow.com/questions/6392/java-time-zone-is-messed-up]



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Commented] (CONNECTORS-1532) Moving a file outside of the job's Paths is not the same as deleting it

2018-09-19 Thread James Thomas (JIRA)


[ 
https://issues.apache.org/jira/browse/CONNECTORS-1532?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16621531#comment-16621531
 ] 

James Thomas commented on CONNECTORS-1532:
--

{quote}I may have found the reason you see this behavior, though. If the folder 
affinity is versioned information, and I believe it is, then the seeding query 
will pick up the last version of the document that was in the right folder. 
That's because the seeding query uses the chronicle_id, which is really a 
specific document version:
{quote}
Thanks, is this consistent with my observation that re-seeding explicitly (from 
the Job's View page) forces the change to be recognised?

Is there some logging I can provide or other experiment I can do that will help 
you to narrow this down further or confirm your thoughts?

Also, this may be an unrelated question, and I can take it to a different 
ticket if you like, but why does the File System connector zero these files 
rather than deleting them from disk?

 

> Moving a file outside of the job's Paths is not the same as deleting it
> ---
>
> Key: CONNECTORS-1532
> URL: https://issues.apache.org/jira/browse/CONNECTORS-1532
> Project: ManifoldCF
>  Issue Type: Bug
>  Components: Documentum connector
>Affects Versions: ManifoldCF 2.10
> Environment: Manifold 2.10 patched for #1512, #1517
>Reporter: James Thomas
>Assignee: Karl Wright
>Priority: Major
> Attachments: 2018-09-19_1758.png
>
>
> If I have a MF job which is connecting a specific folder, F, in Documentum to 
> a File System output then:
> 1. deleting files in Documentum shows them as zero size in the file system
> 2. moving files out of F does not remove them or zero them in the file system
> Note that moving a file from another folder (which the job is not looking at) 
> to F has the same effect as adding it to F by e.g. importing it in DM or 
> POSTing it to DM via the REST interface.
> Intuitively, I expect that moving a file out of the "view" of the Documentum 
> connector would have the same effect on the File System as deleting it. (My 
> model here is of MF synchronising content between the Paths (DM) and the 
> Output Path (File System) that I have specified in the job.)
> Starting point, I have run the MF job to fetch a bunch of files from a folder 
> - call it F - in DM (i.e. I have configured Paths in the job to be F). This 
> is what 'ls -l' on the file system looks like:
> {code:java}
> -rw-r--r--. 1 root i2e  12541 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c0
> -rw-r--r--. 1 root i2e 26 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7be
> -rw-r--r--. 1 root i2e  85772 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c7
> -rw-r--r--. 1 root i2e   8790 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c2
> -rw-r--r--. 1 root i2e 101888 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c3
> -rw-r--r--. 1 root i2e  32783 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c4
> -rw-r--r--. 1 root i2e  23040 Sep 19 07:22 
> drl?versionLabel=CURRENT=09018000f7c1
> -rw-r--r--. 1 root i2e  26112 Sep 19 07:22 
> drl?versionLabel=CURRENT=09018000f7bf{code}
> In DM, I delete one of the files in F and it shows as zero size, and the 
> modification date has changed:
> {code:java}
> -rw-r--r--. 1 root i2e  12541 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c0
> -rw-r--r--. 1 root i2e 26 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7be
> -rw-r--r--. 1 root i2e   8790 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c2
> -rw-r--r--. 1 root i2e 101888 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c3
> -rw-r--r--. 1 root i2e  32783 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c4
> -rw-r--r--. 1 root i2e  23040 Sep 19 07:22 
> drl?versionLabel=CURRENT=09018000f7c1
> -rw-r--r--. 1 root i2e  26112 Sep 19 07:22 
> drl?versionLabel=CURRENT=09018000f7bf
> -rw-r--r--. 1 root i2e  0 Sep 19 07:23 
> drl?versionLabel=CURRENT=09018000f7c7{code}
> In DM, I move a file from F to another folder. (Right click, add to 
> clipboard, go to new folder, Edit> Move here). 
> The file shows as modified (07:25), but is still apparently in F (i.e. in the 
> Path my MF job is looking at):
> {code:java}
> -rw-r--r--. 1 root i2e  12541 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c0
> -rw-r--r--. 1 root i2e 26 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7be
> -rw-r--r--. 1 root i2e   8790 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c2
> -rw-r--r--. 1 root i2e 101888 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c3
> -rw-r--r--. 1 root i2e  23040 Sep 19 07:22 
> drl?versionLabel=CURRENT=09018000f7c1
> -rw-r--r--. 1 root i2e  26112 Sep 19 07:22 

[jira] [Commented] (CONNECTORS-1532) Moving a file outside of the job's Paths is not the same as deleting it

2018-09-19 Thread James Thomas (JIRA)


[ 
https://issues.apache.org/jira/browse/CONNECTORS-1532?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16620854#comment-16620854
 ] 

James Thomas commented on CONNECTORS-1532:
--

It's a "scan every document once" job, and I'm running by clicking Start.

!2018-09-19_1758.png!

> Moving a file outside of the job's Paths is not the same as deleting it
> ---
>
> Key: CONNECTORS-1532
> URL: https://issues.apache.org/jira/browse/CONNECTORS-1532
> Project: ManifoldCF
>  Issue Type: Bug
>  Components: Documentum connector
>Affects Versions: ManifoldCF 2.10
> Environment: Manifold 2.10 patched for #1512, #1517
>Reporter: James Thomas
>Assignee: Karl Wright
>Priority: Major
> Attachments: 2018-09-19_1758.png
>
>
> If I have a MF job which is connecting a specific folder, F, in Documentum to 
> a File System output then:
> 1. deleting files in Documentum shows them as zero size in the file system
> 2. moving files out of F does not remove them or zero them in the file system
> Note that moving a file from another folder (which the job is not looking at) 
> to F has the same effect as adding it to F by e.g. importing it in DM or 
> POSTing it to DM via the REST interface.
> Intuitively, I expect that moving a file out of the "view" of the Documentum 
> connector would have the same effect on the File System as deleting it. (My 
> model here is of MF synchronising content between the Paths (DM) and the 
> Output Path (File System) that I have specified in the job.)
> Starting point, I have run the MF job to fetch a bunch of files from a folder 
> - call it F - in DM (i.e. I have configured Paths in the job to be F). This 
> is what 'ls -l' on the file system looks like:
> {code:java}
> -rw-r--r--. 1 root i2e  12541 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c0
> -rw-r--r--. 1 root i2e 26 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7be
> -rw-r--r--. 1 root i2e  85772 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c7
> -rw-r--r--. 1 root i2e   8790 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c2
> -rw-r--r--. 1 root i2e 101888 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c3
> -rw-r--r--. 1 root i2e  32783 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c4
> -rw-r--r--. 1 root i2e  23040 Sep 19 07:22 
> drl?versionLabel=CURRENT=09018000f7c1
> -rw-r--r--. 1 root i2e  26112 Sep 19 07:22 
> drl?versionLabel=CURRENT=09018000f7bf{code}
> In DM, I delete one of the files in F and it shows as zero size, and the 
> modification date has changed:
> {code:java}
> -rw-r--r--. 1 root i2e  12541 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c0
> -rw-r--r--. 1 root i2e 26 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7be
> -rw-r--r--. 1 root i2e   8790 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c2
> -rw-r--r--. 1 root i2e 101888 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c3
> -rw-r--r--. 1 root i2e  32783 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c4
> -rw-r--r--. 1 root i2e  23040 Sep 19 07:22 
> drl?versionLabel=CURRENT=09018000f7c1
> -rw-r--r--. 1 root i2e  26112 Sep 19 07:22 
> drl?versionLabel=CURRENT=09018000f7bf
> -rw-r--r--. 1 root i2e  0 Sep 19 07:23 
> drl?versionLabel=CURRENT=09018000f7c7{code}
> In DM, I move a file from F to another folder. (Right click, add to 
> clipboard, go to new folder, Edit> Move here). 
> The file shows as modified (07:25), but is still apparently in F (i.e. in the 
> Path my MF job is looking at):
> {code:java}
> -rw-r--r--. 1 root i2e  12541 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c0
> -rw-r--r--. 1 root i2e 26 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7be
> -rw-r--r--. 1 root i2e   8790 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c2
> -rw-r--r--. 1 root i2e 101888 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c3
> -rw-r--r--. 1 root i2e  23040 Sep 19 07:22 
> drl?versionLabel=CURRENT=09018000f7c1
> -rw-r--r--. 1 root i2e  26112 Sep 19 07:22 
> drl?versionLabel=CURRENT=09018000f7bf
> -rw-r--r--. 1 root i2e  0 Sep 19 07:23 
> drl?versionLabel=CURRENT=09018000f7c7
> -rw-r--r--. 1 root i2e  32783 Sep 19 07:25 
> drl?versionLabel=CURRENT=09018000f7c4{code}
> In DM, I move a file from another folder to F and it shows up with the 
> timestamp of the move (07:28):
> {code:java}
> -rw-r--r--. 1 root i2e  12541 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c0
> -rw-r--r--. 1 root i2e 26 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7be
> -rw-r--r--. 1 root i2e   8790 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c2
> -rw-r--r--. 1 root i2e 101888 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c3
> 

[jira] [Updated] (CONNECTORS-1532) Moving a file outside of the job's Paths is not the same as deleting it

2018-09-19 Thread James Thomas (JIRA)


 [ 
https://issues.apache.org/jira/browse/CONNECTORS-1532?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

James Thomas updated CONNECTORS-1532:
-
Attachment: 2018-09-19_1758.png

> Moving a file outside of the job's Paths is not the same as deleting it
> ---
>
> Key: CONNECTORS-1532
> URL: https://issues.apache.org/jira/browse/CONNECTORS-1532
> Project: ManifoldCF
>  Issue Type: Bug
>  Components: Documentum connector
>Affects Versions: ManifoldCF 2.10
> Environment: Manifold 2.10 patched for #1512, #1517
>Reporter: James Thomas
>Assignee: Karl Wright
>Priority: Major
> Attachments: 2018-09-19_1758.png
>
>
> If I have a MF job which is connecting a specific folder, F, in Documentum to 
> a File System output then:
> 1. deleting files in Documentum shows them as zero size in the file system
> 2. moving files out of F does not remove them or zero them in the file system
> Note that moving a file from another folder (which the job is not looking at) 
> to F has the same effect as adding it to F by e.g. importing it in DM or 
> POSTing it to DM via the REST interface.
> Intuitively, I expect that moving a file out of the "view" of the Documentum 
> connector would have the same effect on the File System as deleting it. (My 
> model here is of MF synchronising content between the Paths (DM) and the 
> Output Path (File System) that I have specified in the job.)
> Starting point, I have run the MF job to fetch a bunch of files from a folder 
> - call it F - in DM (i.e. I have configured Paths in the job to be F). This 
> is what 'ls -l' on the file system looks like:
> {code:java}
> -rw-r--r--. 1 root i2e  12541 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c0
> -rw-r--r--. 1 root i2e 26 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7be
> -rw-r--r--. 1 root i2e  85772 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c7
> -rw-r--r--. 1 root i2e   8790 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c2
> -rw-r--r--. 1 root i2e 101888 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c3
> -rw-r--r--. 1 root i2e  32783 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c4
> -rw-r--r--. 1 root i2e  23040 Sep 19 07:22 
> drl?versionLabel=CURRENT=09018000f7c1
> -rw-r--r--. 1 root i2e  26112 Sep 19 07:22 
> drl?versionLabel=CURRENT=09018000f7bf{code}
> In DM, I delete one of the files in F and it shows as zero size, and the 
> modification date has changed:
> {code:java}
> -rw-r--r--. 1 root i2e  12541 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c0
> -rw-r--r--. 1 root i2e 26 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7be
> -rw-r--r--. 1 root i2e   8790 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c2
> -rw-r--r--. 1 root i2e 101888 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c3
> -rw-r--r--. 1 root i2e  32783 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c4
> -rw-r--r--. 1 root i2e  23040 Sep 19 07:22 
> drl?versionLabel=CURRENT=09018000f7c1
> -rw-r--r--. 1 root i2e  26112 Sep 19 07:22 
> drl?versionLabel=CURRENT=09018000f7bf
> -rw-r--r--. 1 root i2e  0 Sep 19 07:23 
> drl?versionLabel=CURRENT=09018000f7c7{code}
> In DM, I move a file from F to another folder. (Right click, add to 
> clipboard, go to new folder, Edit> Move here). 
> The file shows as modified (07:25), but is still apparently in F (i.e. in the 
> Path my MF job is looking at):
> {code:java}
> -rw-r--r--. 1 root i2e  12541 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c0
> -rw-r--r--. 1 root i2e 26 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7be
> -rw-r--r--. 1 root i2e   8790 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c2
> -rw-r--r--. 1 root i2e 101888 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c3
> -rw-r--r--. 1 root i2e  23040 Sep 19 07:22 
> drl?versionLabel=CURRENT=09018000f7c1
> -rw-r--r--. 1 root i2e  26112 Sep 19 07:22 
> drl?versionLabel=CURRENT=09018000f7bf
> -rw-r--r--. 1 root i2e  0 Sep 19 07:23 
> drl?versionLabel=CURRENT=09018000f7c7
> -rw-r--r--. 1 root i2e  32783 Sep 19 07:25 
> drl?versionLabel=CURRENT=09018000f7c4{code}
> In DM, I move a file from another folder to F and it shows up with the 
> timestamp of the move (07:28):
> {code:java}
> -rw-r--r--. 1 root i2e  12541 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c0
> -rw-r--r--. 1 root i2e 26 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7be
> -rw-r--r--. 1 root i2e   8790 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c2
> -rw-r--r--. 1 root i2e 101888 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c3
> -rw-r--r--. 1 root i2e  23040 Sep 19 07:22 
> drl?versionLabel=CURRENT=09018000f7c1
> -rw-r--r--. 1 root 

[jira] [Commented] (CONNECTORS-1532) Moving a file outside of the job's Paths is not the same as deleting it

2018-09-19 Thread James Thomas (JIRA)


[ 
https://issues.apache.org/jira/browse/CONNECTORS-1532?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16620830#comment-16620830
 ] 

James Thomas commented on CONNECTORS-1532:
--

Having not touched anything since I reported the cut-down experiment, I re-ran 
the job (I am not sure this is what you mean by a complete job?)

The File System view did not change, and the Simple History Report has a new 
entry:
{code:java}
-rw-r--r--. 1 root i2e 26 Sep 19 17:46 
drl?versionLabel=CURRENT=09018000f99a{code}
 
||Start Time||Activity||Identifier||Result Code||Bytes||Time||Result 
Description||
|09-19-2018 16:30:47.179|fetch|09018000f99a|OK|26|26| |

I then reset seeding on the job and re-ran it.

There is no additional History, but this time the file is zeroed:

 
{code:java}
-rw-r--r--. 1 root i2e 0 Sep 19 18:36 
drl?versionLabel=CURRENT=09018000f99a{code}
 

 

So, from my empirical data, I think that a move in DM requires reseeding before 
the output is synchronised with the repository? Does that fit with what you 
were saying?

You mentioned that continuous jobs would not delete - do you mean that they 
won't delete until the job ends, or until the next reseed, if a reseed interval 
is specified?

This may be an unrelated question, and I can take it to a different ticket if 
you like, but why does the File System connector zero files rather than 
deleting them from disk?

 

 

 

> Moving a file outside of the job's Paths is not the same as deleting it
> ---
>
> Key: CONNECTORS-1532
> URL: https://issues.apache.org/jira/browse/CONNECTORS-1532
> Project: ManifoldCF
>  Issue Type: Bug
>  Components: Documentum connector
>Affects Versions: ManifoldCF 2.10
> Environment: Manifold 2.10 patched for #1512, #1517
>Reporter: James Thomas
>Assignee: Karl Wright
>Priority: Major
>
> If I have a MF job which is connecting a specific folder, F, in Documentum to 
> a File System output then:
> 1. deleting files in Documentum shows them as zero size in the file system
> 2. moving files out of F does not remove them or zero them in the file system
> Note that moving a file from another folder (which the job is not looking at) 
> to F has the same effect as adding it to F by e.g. importing it in DM or 
> POSTing it to DM via the REST interface.
> Intuitively, I expect that moving a file out of the "view" of the Documentum 
> connector would have the same effect on the File System as deleting it. (My 
> model here is of MF synchronising content between the Paths (DM) and the 
> Output Path (File System) that I have specified in the job.)
> Starting point, I have run the MF job to fetch a bunch of files from a folder 
> - call it F - in DM (i.e. I have configured Paths in the job to be F). This 
> is what 'ls -l' on the file system looks like:
> {code:java}
> -rw-r--r--. 1 root i2e  12541 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c0
> -rw-r--r--. 1 root i2e 26 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7be
> -rw-r--r--. 1 root i2e  85772 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c7
> -rw-r--r--. 1 root i2e   8790 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c2
> -rw-r--r--. 1 root i2e 101888 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c3
> -rw-r--r--. 1 root i2e  32783 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c4
> -rw-r--r--. 1 root i2e  23040 Sep 19 07:22 
> drl?versionLabel=CURRENT=09018000f7c1
> -rw-r--r--. 1 root i2e  26112 Sep 19 07:22 
> drl?versionLabel=CURRENT=09018000f7bf{code}
> In DM, I delete one of the files in F and it shows as zero size, and the 
> modification date has changed:
> {code:java}
> -rw-r--r--. 1 root i2e  12541 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c0
> -rw-r--r--. 1 root i2e 26 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7be
> -rw-r--r--. 1 root i2e   8790 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c2
> -rw-r--r--. 1 root i2e 101888 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c3
> -rw-r--r--. 1 root i2e  32783 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c4
> -rw-r--r--. 1 root i2e  23040 Sep 19 07:22 
> drl?versionLabel=CURRENT=09018000f7c1
> -rw-r--r--. 1 root i2e  26112 Sep 19 07:22 
> drl?versionLabel=CURRENT=09018000f7bf
> -rw-r--r--. 1 root i2e  0 Sep 19 07:23 
> drl?versionLabel=CURRENT=09018000f7c7{code}
> In DM, I move a file from F to another folder. (Right click, add to 
> clipboard, go to new folder, Edit> Move here). 
> The file shows as modified (07:25), but is still apparently in F (i.e. in the 
> Path my MF job is looking at):
> {code:java}
> -rw-r--r--. 1 root i2e  12541 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c0
> -rw-r--r--. 1 root i2e 26 Sep 19 

[jira] [Commented] (CONNECTORS-1532) Moving a file outside of the job's Paths is not the same as deleting it

2018-09-19 Thread James Thomas (JIRA)


[ 
https://issues.apache.org/jira/browse/CONNECTORS-1532?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16620779#comment-16620779
 ] 

James Thomas commented on CONNECTORS-1532:
--

[~kwri...@metacarta.com], here's a simple experiment cut down from the original 
which I hope is what you're after.
 * In DM I created a folder F containing a single file X
 * In MF I configured a job to look at F and output to File System
 * In MF I ran the job
 * On the File System I ran 'ls -l'  and found F
 * In DM I moved X to a folder outside of F (Right click, add to clipboard, go 
to new folder, Edit> Move here).
 * In MF I ran the job again
 * On the File System I ran 'ls' again
 * In MF I ran Simple History Report

(In the data below, note that the time stamps are two hours different because 
of the time zone setup I have currently to work around #1521.)

 

Two calls to 'ls':

 
{code:java}
-rw-r--r--. 1 root i2e 26 Sep 19 17:44 
drl?versionLabel=CURRENT=09018000f99a
-rw-r--r--. 1 root i2e 26 Sep 19 17:46 
drl?versionLabel=CURRENT=09018000f99a
{code}
 

Simple History Report for object 09018000f99a:
||Start Time||Activity||Identifier||Result Code||Bytes||Time||Result 
Description||
|09-19-2018 15:46:04.940|fetch|09018000f99a|OK|26|284| |
|09-19-2018 15:44:30.228|fetch|09018000f99a|OK|26|34| |

 

> Moving a file outside of the job's Paths is not the same as deleting it
> ---
>
> Key: CONNECTORS-1532
> URL: https://issues.apache.org/jira/browse/CONNECTORS-1532
> Project: ManifoldCF
>  Issue Type: Bug
>  Components: Documentum connector
>Affects Versions: ManifoldCF 2.10
> Environment: Manifold 2.10 patched for #1512, #1517
>Reporter: James Thomas
>Assignee: Karl Wright
>Priority: Major
>
> If I have a MF job which is connecting a specific folder, F, in Documentum to 
> a File System output then:
> 1. deleting files in Documentum shows them as zero size in the file system
> 2. moving files out of F does not remove them or zero them in the file system
> Note that moving a file from another folder (which the job is not looking at) 
> to F has the same effect as adding it to F by e.g. importing it in DM or 
> POSTing it to DM via the REST interface.
> Intuitively, I expect that moving a file out of the "view" of the Documentum 
> connector would have the same effect on the File System as deleting it. (My 
> model here is of MF synchronising content between the Paths (DM) and the 
> Output Path (File System) that I have specified in the job.)
> Starting point, I have run the MF job to fetch a bunch of files from a folder 
> - call it F - in DM (i.e. I have configured Paths in the job to be F). This 
> is what 'ls -l' on the file system looks like:
> {code:java}
> -rw-r--r--. 1 root i2e  12541 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c0
> -rw-r--r--. 1 root i2e 26 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7be
> -rw-r--r--. 1 root i2e  85772 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c7
> -rw-r--r--. 1 root i2e   8790 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c2
> -rw-r--r--. 1 root i2e 101888 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c3
> -rw-r--r--. 1 root i2e  32783 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c4
> -rw-r--r--. 1 root i2e  23040 Sep 19 07:22 
> drl?versionLabel=CURRENT=09018000f7c1
> -rw-r--r--. 1 root i2e  26112 Sep 19 07:22 
> drl?versionLabel=CURRENT=09018000f7bf{code}
> In DM, I delete one of the files in F and it shows as zero size, and the 
> modification date has changed:
> {code:java}
> -rw-r--r--. 1 root i2e  12541 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c0
> -rw-r--r--. 1 root i2e 26 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7be
> -rw-r--r--. 1 root i2e   8790 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c2
> -rw-r--r--. 1 root i2e 101888 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c3
> -rw-r--r--. 1 root i2e  32783 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c4
> -rw-r--r--. 1 root i2e  23040 Sep 19 07:22 
> drl?versionLabel=CURRENT=09018000f7c1
> -rw-r--r--. 1 root i2e  26112 Sep 19 07:22 
> drl?versionLabel=CURRENT=09018000f7bf
> -rw-r--r--. 1 root i2e  0 Sep 19 07:23 
> drl?versionLabel=CURRENT=09018000f7c7{code}
> In DM, I move a file from F to another folder. (Right click, add to 
> clipboard, go to new folder, Edit> Move here). 
> The file shows as modified (07:25), but is still apparently in F (i.e. in the 
> Path my MF job is looking at):
> {code:java}
> -rw-r--r--. 1 root i2e  12541 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c0
> -rw-r--r--. 1 root i2e 26 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7be
> -rw-r--r--. 1 root i2e   8790 Sep 

[jira] [Updated] (CONNECTORS-1532) Moving a file outside of the job's Paths is not the same as deleting it

2018-09-19 Thread James Thomas (JIRA)


 [ 
https://issues.apache.org/jira/browse/CONNECTORS-1532?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

James Thomas updated CONNECTORS-1532:
-
Summary: Moving a file outside of the job's Paths is not the same as 
deleting it  (was: Moving a file outside of the jobs Paths is not the same as 
deleting it)

> Moving a file outside of the job's Paths is not the same as deleting it
> ---
>
> Key: CONNECTORS-1532
> URL: https://issues.apache.org/jira/browse/CONNECTORS-1532
> Project: ManifoldCF
>  Issue Type: Bug
>  Components: Documentum connector
>Affects Versions: ManifoldCF 2.10
> Environment: Manifold 2.10 patched for #1512, #1517
>Reporter: James Thomas
>Priority: Major
>
> If I have a MF job which is connecting a specific folder, F, in Documentum to 
> a File System output then:
> 1. deleting files in Documentum shows them as zero size in the file system
> 2. moving files out of F does not remove them or zero them in the file system
> Note that moving a file from another folder (which the job is not looking at) 
> to F has the same effect as adding it to F by e.g. importing it in DM or 
> POSTing it to DM via the REST interface.
> Intuitively, I expect that moving a file out of the "view" of the Documentum 
> connector would have the same effect on the File System as deleting it. (My 
> model here is of MF synchronising content between the Paths (DM) and the 
> Output Path (File System) that I have specified in the job.)
> Starting point, I have run the MF job to fetch a bunch of files from a folder 
> - call it F - in DM (i.e. I have configured Paths in the job to be F). This 
> is what 'ls -l' on the file system looks like:
> {code:java}
> -rw-r--r--. 1 root i2e  12541 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c0
> -rw-r--r--. 1 root i2e 26 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7be
> -rw-r--r--. 1 root i2e  85772 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c7
> -rw-r--r--. 1 root i2e   8790 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c2
> -rw-r--r--. 1 root i2e 101888 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c3
> -rw-r--r--. 1 root i2e  32783 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c4
> -rw-r--r--. 1 root i2e  23040 Sep 19 07:22 
> drl?versionLabel=CURRENT=09018000f7c1
> -rw-r--r--. 1 root i2e  26112 Sep 19 07:22 
> drl?versionLabel=CURRENT=09018000f7bf{code}
> In DM, I delete one of the files in F and it shows as zero size, and the 
> modification date has changed:
> {code:java}
> -rw-r--r--. 1 root i2e  12541 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c0
> -rw-r--r--. 1 root i2e 26 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7be
> -rw-r--r--. 1 root i2e   8790 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c2
> -rw-r--r--. 1 root i2e 101888 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c3
> -rw-r--r--. 1 root i2e  32783 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c4
> -rw-r--r--. 1 root i2e  23040 Sep 19 07:22 
> drl?versionLabel=CURRENT=09018000f7c1
> -rw-r--r--. 1 root i2e  26112 Sep 19 07:22 
> drl?versionLabel=CURRENT=09018000f7bf
> -rw-r--r--. 1 root i2e  0 Sep 19 07:23 
> drl?versionLabel=CURRENT=09018000f7c7{code}
> In DM, I move a file from F to another folder. (Right click, add to 
> clipboard, go to new folder, Edit> Move here). 
> The file shows as modified (07:25), but is still apparently in F (i.e. in the 
> Path my MF job is looking at):
> {code:java}
> -rw-r--r--. 1 root i2e  12541 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c0
> -rw-r--r--. 1 root i2e 26 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7be
> -rw-r--r--. 1 root i2e   8790 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c2
> -rw-r--r--. 1 root i2e 101888 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c3
> -rw-r--r--. 1 root i2e  23040 Sep 19 07:22 
> drl?versionLabel=CURRENT=09018000f7c1
> -rw-r--r--. 1 root i2e  26112 Sep 19 07:22 
> drl?versionLabel=CURRENT=09018000f7bf
> -rw-r--r--. 1 root i2e  0 Sep 19 07:23 
> drl?versionLabel=CURRENT=09018000f7c7
> -rw-r--r--. 1 root i2e  32783 Sep 19 07:25 
> drl?versionLabel=CURRENT=09018000f7c4{code}
> In DM, I move a file from another folder to F and it shows up with the 
> timestamp of the move (07:28):
> {code:java}
> -rw-r--r--. 1 root i2e  12541 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c0
> -rw-r--r--. 1 root i2e 26 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7be
> -rw-r--r--. 1 root i2e   8790 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c2
> -rw-r--r--. 1 root i2e 101888 Sep 19 07:21 
> drl?versionLabel=CURRENT=09018000f7c3
> -rw-r--r--. 1 root i2e  23040 Sep 19 07:22 
> 

[jira] [Created] (CONNECTORS-1532) Moving a file outside of the jobs Paths is not the same as deleting it

2018-09-19 Thread James Thomas (JIRA)
James Thomas created CONNECTORS-1532:


 Summary: Moving a file outside of the jobs Paths is not the same 
as deleting it
 Key: CONNECTORS-1532
 URL: https://issues.apache.org/jira/browse/CONNECTORS-1532
 Project: ManifoldCF
  Issue Type: Bug
  Components: Documentum connector
Affects Versions: ManifoldCF 2.10
 Environment: Manifold 2.10 patched for #1512, #1517
Reporter: James Thomas


If I have a MF job which is connecting a specific folder, F, in Documentum to a 
File System output then:

1. deleting files in Documentum shows them as zero size in the file system
2. moving files out of F does not remove them or zero them in the file system

Note that moving a file from another folder (which the job is not looking at) 
to F has the same effect as adding it to F by e.g. importing it in DM or 
POSTing it to DM via the REST interface.

Intuitively, I expect that moving a file out of the "view" of the Documentum 
connector would have the same effect on the File System as deleting it. (My 
model here is of MF synchronising content between the Paths (DM) and the Output 
Path (File System) that I have specified in the job.)

Starting point, I have run the MF job to fetch a bunch of files from a folder - 
call it F - in DM (i.e. I have configured Paths in the job to be F). This is 
what 'ls -l' on the file system looks like:


{code:java}
-rw-r--r--. 1 root i2e  12541 Sep 19 07:21 
drl?versionLabel=CURRENT=09018000f7c0
-rw-r--r--. 1 root i2e 26 Sep 19 07:21 
drl?versionLabel=CURRENT=09018000f7be
-rw-r--r--. 1 root i2e  85772 Sep 19 07:21 
drl?versionLabel=CURRENT=09018000f7c7
-rw-r--r--. 1 root i2e   8790 Sep 19 07:21 
drl?versionLabel=CURRENT=09018000f7c2
-rw-r--r--. 1 root i2e 101888 Sep 19 07:21 
drl?versionLabel=CURRENT=09018000f7c3
-rw-r--r--. 1 root i2e  32783 Sep 19 07:21 
drl?versionLabel=CURRENT=09018000f7c4
-rw-r--r--. 1 root i2e  23040 Sep 19 07:22 
drl?versionLabel=CURRENT=09018000f7c1
-rw-r--r--. 1 root i2e  26112 Sep 19 07:22 
drl?versionLabel=CURRENT=09018000f7bf{code}

In DM, I delete one of the files in F and it shows as zero size, and the 
modification date has changed:


{code:java}
-rw-r--r--. 1 root i2e  12541 Sep 19 07:21 
drl?versionLabel=CURRENT=09018000f7c0
-rw-r--r--. 1 root i2e 26 Sep 19 07:21 
drl?versionLabel=CURRENT=09018000f7be
-rw-r--r--. 1 root i2e   8790 Sep 19 07:21 
drl?versionLabel=CURRENT=09018000f7c2
-rw-r--r--. 1 root i2e 101888 Sep 19 07:21 
drl?versionLabel=CURRENT=09018000f7c3
-rw-r--r--. 1 root i2e  32783 Sep 19 07:21 
drl?versionLabel=CURRENT=09018000f7c4
-rw-r--r--. 1 root i2e  23040 Sep 19 07:22 
drl?versionLabel=CURRENT=09018000f7c1
-rw-r--r--. 1 root i2e  26112 Sep 19 07:22 
drl?versionLabel=CURRENT=09018000f7bf
-rw-r--r--. 1 root i2e  0 Sep 19 07:23 
drl?versionLabel=CURRENT=09018000f7c7{code}

In DM, I move a file from F to another folder. (Right click, add to clipboard, 
go to new folder, Edit> Move here). 

The file shows as modified (07:25), but is still apparently in F (i.e. in the 
Path my MF job is looking at):


{code:java}
-rw-r--r--. 1 root i2e  12541 Sep 19 07:21 
drl?versionLabel=CURRENT=09018000f7c0
-rw-r--r--. 1 root i2e 26 Sep 19 07:21 
drl?versionLabel=CURRENT=09018000f7be
-rw-r--r--. 1 root i2e   8790 Sep 19 07:21 
drl?versionLabel=CURRENT=09018000f7c2
-rw-r--r--. 1 root i2e 101888 Sep 19 07:21 
drl?versionLabel=CURRENT=09018000f7c3
-rw-r--r--. 1 root i2e  23040 Sep 19 07:22 
drl?versionLabel=CURRENT=09018000f7c1
-rw-r--r--. 1 root i2e  26112 Sep 19 07:22 
drl?versionLabel=CURRENT=09018000f7bf
-rw-r--r--. 1 root i2e  0 Sep 19 07:23 
drl?versionLabel=CURRENT=09018000f7c7
-rw-r--r--. 1 root i2e  32783 Sep 19 07:25 
drl?versionLabel=CURRENT=09018000f7c4{code}

In DM, I move a file from another folder to F and it shows up with the 
timestamp of the move (07:28):


{code:java}
-rw-r--r--. 1 root i2e  12541 Sep 19 07:21 
drl?versionLabel=CURRENT=09018000f7c0
-rw-r--r--. 1 root i2e 26 Sep 19 07:21 
drl?versionLabel=CURRENT=09018000f7be
-rw-r--r--. 1 root i2e   8790 Sep 19 07:21 
drl?versionLabel=CURRENT=09018000f7c2
-rw-r--r--. 1 root i2e 101888 Sep 19 07:21 
drl?versionLabel=CURRENT=09018000f7c3
-rw-r--r--. 1 root i2e  23040 Sep 19 07:22 
drl?versionLabel=CURRENT=09018000f7c1
-rw-r--r--. 1 root i2e  26112 Sep 19 07:22 
drl?versionLabel=CURRENT=09018000f7bf
-rw-r--r--. 1 root i2e  0 Sep 19 07:23 
drl?versionLabel=CURRENT=09018000f7c7
-rw-r--r--. 1 root i2e  32783 Sep 19 07:25 
drl?versionLabel=CURRENT=09018000f7c4
-rw-r--r--. 1 root i2e 191513 Sep 19 07:28 
drl?versionLabel=CURRENT=0901800045b9{code}

But if I immediately move it out in DM then, again, the timestamp (07:30) 
alters but the file apparently remains:


{code:java}
-rw-r--r--. 1 root i2e  12541 Sep 

[jira] [Commented] (CONNECTORS-1517) Documentum Connector uses different "unconstrained" a_content_type filters depending on whether the Content Types tab has been edited

2018-08-13 Thread James Thomas (JIRA)


[ 
https://issues.apache.org/jira/browse/CONNECTORS-1517?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16578610#comment-16578610
 ] 

James Thomas commented on CONNECTORS-1517:
--

{quote}Can you be more explicit? There are many conditionally enabled options 
in many connectors under MCF.
{quote}
Yes, I was thinking specifically of the kinds of things I mentioned earlier:
 * I wonder whether "No content type restriction" checked should disable all of 
the others?
 * It is surprising to be able to submit with no checkboxes checked 
 * Generalising the above, I think I'd prefer to see some more restriction on 
what combinations of box can be checked at the same time.

> Documentum Connector uses different "unconstrained" a_content_type filters 
> depending on whether the Content Types tab has been edited
> -
>
> Key: CONNECTORS-1517
> URL: https://issues.apache.org/jira/browse/CONNECTORS-1517
> Project: ManifoldCF
>  Issue Type: Bug
>  Components: Documentum connector
>Affects Versions: ManifoldCF 2.10
>Reporter: James Thomas
>Assignee: Karl Wright
>Priority: Major
> Fix For: ManifoldCF 2.11
>
> Attachments: CONNECTORS-1517-2.patch, CONNECTORS-1517.patch, Notes.txt
>
>
> I am using Manifold 2.10 patched for issue 
> https://issues.apache.org/jira/browse/CONNECTORS-1512
> I find that the "unconstrained" query submitted to Documentum differs 
> depending on whether the Content Types in the job have been edited or not. 
> This can dramatically affect which files are fetched. After editing, there 
> are likely to be fewer.
> For example, having simply created a job connecting to DM and setting only 
> the Paths value to Administrator/james the following request is generated. 
> (Taken from manifoldcf.log).
> Note that there are no a_content_type constraints (and my line break for 
> readibility):
> {code:java}
> DEBUG 2018-07-26T05:52:56,422 (Startup thread) - DCTM: About to execute 
> query= (select for READ distinct i_chronicle_id from dm_document where 
> r_modify_date >= date('01/01/1970 01:00:00','mm/dd/ hh:mi:ss') and 
> r_modify_date<=date('07/26/2018 05:52:56','mm/dd/ hh:mi:ss') AND 
> (i_is_deleted=TRUE Or (i_is_deleted=FALSE AND a_full_text=TRUE AND 
> r_content_size>0))
> AND ( Folder('/Administrator/james', DESCEND) ))
> {code}
> Once the Content Types tab has been edited (e.g. to remove the 123w type) it 
> looks like this, i.e. the search constrains to only the selected types (my 
> ellipsis for readibility):
> {code:java}
> DEBUG 2018-07-26T05:58:36,755 (Startup thread) - DCTM: About to execute 
> query= (select for READ distinct i_chronicle_id from dm_document where 
> r_modify_date >= date('01/01/1970 01:00:00','mm/dd/ hh:mi:ss') and 
> r_modify_date<=date('07/26/2018 05:58:36','mm/dd/ hh:mi:ss') AND 
> (i_is_deleted=TRUE Or (i_is_deleted=FALSE AND a_full_text=TRUE AND 
> r_content_size>0 
> AND a_content_type IN ('acad', ... 'zip_pub_html'))) 
> AND ( Folder('/Administrator/james', DESCEND) ))
> {code}
> If the 123w type is now reselected in the Content Types tab, the search adds 
> it to the list of a_content_type entries, but doesn't return to the 
> unconstrained initial search:
> {code:java}
> DEBUG 2018-07-26T05:59:16,863 (Startup thread) - DCTM: About to execute 
> query= (select for READ distinct i_chronicle_id from dm_document where 
> r_modify_date >= date('01/01/1970 01:00:00','mm/dd/ hh:mi:ss') and 
> r_modify_date<=date('07/26/2018 05:59:16','mm/dd/ hh:mi:ss') AND 
> (i_is_deleted=TRUE Or (i_is_deleted=FALSE AND a_full_text=TRUE AND 
> r_content_size>0 
> AND a_content_type IN ('123w', ... 'zip_pub_html'))) 
> AND ( Folder('/Administrator/james', DESCEND) ))
> {code}
> This means that running what appears to be an equivalent job several times 
> may not fetch the same set of documents from Documentum.
> I expect that the same configuration in the UI produces the same search to 
> Documentum, regardless of how the configuration was arrived at.
> If the selected items in the Content Types list is treated as the only set of 
> files to fetch (i,.e. the initial unconstrained search is considered 
> incorrect here) then I guess I might also like to have flexibility to fetch 
> file types not on the checklist in the Content Types tab.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Commented] (CONNECTORS-1517) Documentum Connector uses different "unconstrained" a_content_type filters depending on whether the Content Types tab has been edited

2018-08-13 Thread James Thomas (JIRA)


[ 
https://issues.apache.org/jira/browse/CONNECTORS-1517?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16578576#comment-16578576
 ] 

James Thomas commented on CONNECTORS-1517:
--

Great to hear that backwards compatibility has been covered.

I agree that there's a functional correctness case here: the behaviour _is_ 
what has been configured.  I guess I'm interested (with a user head on) in it 
being hard to configure something that isn't useful.

Forgive me if this is a stupid question, but is there much scope for 
conditional enabling/disabling of options or client-side validation in 
ManifoldCF's connectors as things stand?

> Documentum Connector uses different "unconstrained" a_content_type filters 
> depending on whether the Content Types tab has been edited
> -
>
> Key: CONNECTORS-1517
> URL: https://issues.apache.org/jira/browse/CONNECTORS-1517
> Project: ManifoldCF
>  Issue Type: Bug
>  Components: Documentum connector
>Affects Versions: ManifoldCF 2.10
>Reporter: James Thomas
>Assignee: Karl Wright
>Priority: Major
> Fix For: ManifoldCF 2.11
>
> Attachments: CONNECTORS-1517-2.patch, CONNECTORS-1517.patch, Notes.txt
>
>
> I am using Manifold 2.10 patched for issue 
> https://issues.apache.org/jira/browse/CONNECTORS-1512
> I find that the "unconstrained" query submitted to Documentum differs 
> depending on whether the Content Types in the job have been edited or not. 
> This can dramatically affect which files are fetched. After editing, there 
> are likely to be fewer.
> For example, having simply created a job connecting to DM and setting only 
> the Paths value to Administrator/james the following request is generated. 
> (Taken from manifoldcf.log).
> Note that there are no a_content_type constraints (and my line break for 
> readibility):
> {code:java}
> DEBUG 2018-07-26T05:52:56,422 (Startup thread) - DCTM: About to execute 
> query= (select for READ distinct i_chronicle_id from dm_document where 
> r_modify_date >= date('01/01/1970 01:00:00','mm/dd/ hh:mi:ss') and 
> r_modify_date<=date('07/26/2018 05:52:56','mm/dd/ hh:mi:ss') AND 
> (i_is_deleted=TRUE Or (i_is_deleted=FALSE AND a_full_text=TRUE AND 
> r_content_size>0))
> AND ( Folder('/Administrator/james', DESCEND) ))
> {code}
> Once the Content Types tab has been edited (e.g. to remove the 123w type) it 
> looks like this, i.e. the search constrains to only the selected types (my 
> ellipsis for readibility):
> {code:java}
> DEBUG 2018-07-26T05:58:36,755 (Startup thread) - DCTM: About to execute 
> query= (select for READ distinct i_chronicle_id from dm_document where 
> r_modify_date >= date('01/01/1970 01:00:00','mm/dd/ hh:mi:ss') and 
> r_modify_date<=date('07/26/2018 05:58:36','mm/dd/ hh:mi:ss') AND 
> (i_is_deleted=TRUE Or (i_is_deleted=FALSE AND a_full_text=TRUE AND 
> r_content_size>0 
> AND a_content_type IN ('acad', ... 'zip_pub_html'))) 
> AND ( Folder('/Administrator/james', DESCEND) ))
> {code}
> If the 123w type is now reselected in the Content Types tab, the search adds 
> it to the list of a_content_type entries, but doesn't return to the 
> unconstrained initial search:
> {code:java}
> DEBUG 2018-07-26T05:59:16,863 (Startup thread) - DCTM: About to execute 
> query= (select for READ distinct i_chronicle_id from dm_document where 
> r_modify_date >= date('01/01/1970 01:00:00','mm/dd/ hh:mi:ss') and 
> r_modify_date<=date('07/26/2018 05:59:16','mm/dd/ hh:mi:ss') AND 
> (i_is_deleted=TRUE Or (i_is_deleted=FALSE AND a_full_text=TRUE AND 
> r_content_size>0 
> AND a_content_type IN ('123w', ... 'zip_pub_html'))) 
> AND ( Folder('/Administrator/james', DESCEND) ))
> {code}
> This means that running what appears to be an equivalent job several times 
> may not fetch the same set of documents from Documentum.
> I expect that the same configuration in the UI produces the same search to 
> Documentum, regardless of how the configuration was arrived at.
> If the selected items in the Content Types list is treated as the only set of 
> files to fetch (i,.e. the initial unconstrained search is considered 
> incorrect here) then I guess I might also like to have flexibility to fetch 
> file types not on the checklist in the Content Types tab.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Commented] (CONNECTORS-1517) Documentum Connector uses different "unconstrained" a_content_type filters depending on whether the Content Types tab has been edited

2018-08-13 Thread James Thomas (JIRA)


[ 
https://issues.apache.org/jira/browse/CONNECTORS-1517?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16578357#comment-16578357
 ] 

James Thomas commented on CONNECTORS-1517:
--

Hi Karl, I have now had a chance to try out the patches. I'll attach a 
transcript which shows the queries executed (from manifoldcf.log) when I ran a 
job with particular configuration in the Content Types tab of the Documentum 
Connector.

My observations and thoughts:
 * The core bug that I reported - that editing the Content Types tab and then 
resetting it results in different semantics at search time appears fixed.
 * The default search is still unconstrained.
 * It is surprising to be able to have both "No content type restriction" and 
any other checkbox checked at the same time. I wonder whether "No content type 
restriction" checked should disable all of the others?
 * It is surprising to be able to submit with no checkboxes checked and have a 
query constrained with "1<0" as it looks like this can never succeed
 * Generalising the above, I think I'd prefer to see some more restriction on 
what combinations of box can be checked at the same time.
 * It would be convenient as a user to have a control for check all/uncheck all
 * I haven't been using ManifoldCF/Documentum long enough to know whether there 
are likely to be backwards compatibility issues in changing the UI this way

[^Notes.txt]

> Documentum Connector uses different "unconstrained" a_content_type filters 
> depending on whether the Content Types tab has been edited
> -
>
> Key: CONNECTORS-1517
> URL: https://issues.apache.org/jira/browse/CONNECTORS-1517
> Project: ManifoldCF
>  Issue Type: Bug
>  Components: Documentum connector
>Affects Versions: ManifoldCF 2.10
>Reporter: James Thomas
>Assignee: Karl Wright
>Priority: Major
> Fix For: ManifoldCF 2.11
>
> Attachments: CONNECTORS-1517-2.patch, CONNECTORS-1517.patch, Notes.txt
>
>
> I am using Manifold 2.10 patched for issue 
> https://issues.apache.org/jira/browse/CONNECTORS-1512
> I find that the "unconstrained" query submitted to Documentum differs 
> depending on whether the Content Types in the job have been edited or not. 
> This can dramatically affect which files are fetched. After editing, there 
> are likely to be fewer.
> For example, having simply created a job connecting to DM and setting only 
> the Paths value to Administrator/james the following request is generated. 
> (Taken from manifoldcf.log).
> Note that there are no a_content_type constraints (and my line break for 
> readibility):
> {code:java}
> DEBUG 2018-07-26T05:52:56,422 (Startup thread) - DCTM: About to execute 
> query= (select for READ distinct i_chronicle_id from dm_document where 
> r_modify_date >= date('01/01/1970 01:00:00','mm/dd/ hh:mi:ss') and 
> r_modify_date<=date('07/26/2018 05:52:56','mm/dd/ hh:mi:ss') AND 
> (i_is_deleted=TRUE Or (i_is_deleted=FALSE AND a_full_text=TRUE AND 
> r_content_size>0))
> AND ( Folder('/Administrator/james', DESCEND) ))
> {code}
> Once the Content Types tab has been edited (e.g. to remove the 123w type) it 
> looks like this, i.e. the search constrains to only the selected types (my 
> ellipsis for readibility):
> {code:java}
> DEBUG 2018-07-26T05:58:36,755 (Startup thread) - DCTM: About to execute 
> query= (select for READ distinct i_chronicle_id from dm_document where 
> r_modify_date >= date('01/01/1970 01:00:00','mm/dd/ hh:mi:ss') and 
> r_modify_date<=date('07/26/2018 05:58:36','mm/dd/ hh:mi:ss') AND 
> (i_is_deleted=TRUE Or (i_is_deleted=FALSE AND a_full_text=TRUE AND 
> r_content_size>0 
> AND a_content_type IN ('acad', ... 'zip_pub_html'))) 
> AND ( Folder('/Administrator/james', DESCEND) ))
> {code}
> If the 123w type is now reselected in the Content Types tab, the search adds 
> it to the list of a_content_type entries, but doesn't return to the 
> unconstrained initial search:
> {code:java}
> DEBUG 2018-07-26T05:59:16,863 (Startup thread) - DCTM: About to execute 
> query= (select for READ distinct i_chronicle_id from dm_document where 
> r_modify_date >= date('01/01/1970 01:00:00','mm/dd/ hh:mi:ss') and 
> r_modify_date<=date('07/26/2018 05:59:16','mm/dd/ hh:mi:ss') AND 
> (i_is_deleted=TRUE Or (i_is_deleted=FALSE AND a_full_text=TRUE AND 
> r_content_size>0 
> AND a_content_type IN ('123w', ... 'zip_pub_html'))) 
> AND ( Folder('/Administrator/james', DESCEND) ))
> {code}
> This means that running what appears to be an equivalent job several times 
> may not fetch the same set of documents from Documentum.
> I expect that the same configuration in the UI produces the same search to 
> Documentum, regardless of how the configuration 

[jira] [Updated] (CONNECTORS-1517) Documentum Connector uses different "unconstrained" a_content_type filters depending on whether the Content Types tab has been edited

2018-08-13 Thread James Thomas (JIRA)


 [ 
https://issues.apache.org/jira/browse/CONNECTORS-1517?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

James Thomas updated CONNECTORS-1517:
-
Attachment: Notes.txt

> Documentum Connector uses different "unconstrained" a_content_type filters 
> depending on whether the Content Types tab has been edited
> -
>
> Key: CONNECTORS-1517
> URL: https://issues.apache.org/jira/browse/CONNECTORS-1517
> Project: ManifoldCF
>  Issue Type: Bug
>  Components: Documentum connector
>Affects Versions: ManifoldCF 2.10
>Reporter: James Thomas
>Assignee: Karl Wright
>Priority: Major
> Fix For: ManifoldCF 2.11
>
> Attachments: CONNECTORS-1517-2.patch, CONNECTORS-1517.patch, Notes.txt
>
>
> I am using Manifold 2.10 patched for issue 
> https://issues.apache.org/jira/browse/CONNECTORS-1512
> I find that the "unconstrained" query submitted to Documentum differs 
> depending on whether the Content Types in the job have been edited or not. 
> This can dramatically affect which files are fetched. After editing, there 
> are likely to be fewer.
> For example, having simply created a job connecting to DM and setting only 
> the Paths value to Administrator/james the following request is generated. 
> (Taken from manifoldcf.log).
> Note that there are no a_content_type constraints (and my line break for 
> readibility):
> {code:java}
> DEBUG 2018-07-26T05:52:56,422 (Startup thread) - DCTM: About to execute 
> query= (select for READ distinct i_chronicle_id from dm_document where 
> r_modify_date >= date('01/01/1970 01:00:00','mm/dd/ hh:mi:ss') and 
> r_modify_date<=date('07/26/2018 05:52:56','mm/dd/ hh:mi:ss') AND 
> (i_is_deleted=TRUE Or (i_is_deleted=FALSE AND a_full_text=TRUE AND 
> r_content_size>0))
> AND ( Folder('/Administrator/james', DESCEND) ))
> {code}
> Once the Content Types tab has been edited (e.g. to remove the 123w type) it 
> looks like this, i.e. the search constrains to only the selected types (my 
> ellipsis for readibility):
> {code:java}
> DEBUG 2018-07-26T05:58:36,755 (Startup thread) - DCTM: About to execute 
> query= (select for READ distinct i_chronicle_id from dm_document where 
> r_modify_date >= date('01/01/1970 01:00:00','mm/dd/ hh:mi:ss') and 
> r_modify_date<=date('07/26/2018 05:58:36','mm/dd/ hh:mi:ss') AND 
> (i_is_deleted=TRUE Or (i_is_deleted=FALSE AND a_full_text=TRUE AND 
> r_content_size>0 
> AND a_content_type IN ('acad', ... 'zip_pub_html'))) 
> AND ( Folder('/Administrator/james', DESCEND) ))
> {code}
> If the 123w type is now reselected in the Content Types tab, the search adds 
> it to the list of a_content_type entries, but doesn't return to the 
> unconstrained initial search:
> {code:java}
> DEBUG 2018-07-26T05:59:16,863 (Startup thread) - DCTM: About to execute 
> query= (select for READ distinct i_chronicle_id from dm_document where 
> r_modify_date >= date('01/01/1970 01:00:00','mm/dd/ hh:mi:ss') and 
> r_modify_date<=date('07/26/2018 05:59:16','mm/dd/ hh:mi:ss') AND 
> (i_is_deleted=TRUE Or (i_is_deleted=FALSE AND a_full_text=TRUE AND 
> r_content_size>0 
> AND a_content_type IN ('123w', ... 'zip_pub_html'))) 
> AND ( Folder('/Administrator/james', DESCEND) ))
> {code}
> This means that running what appears to be an equivalent job several times 
> may not fetch the same set of documents from Documentum.
> I expect that the same configuration in the UI produces the same search to 
> Documentum, regardless of how the configuration was arrived at.
> If the selected items in the Content Types list is treated as the only set of 
> files to fetch (i,.e. the initial unconstrained search is considered 
> incorrect here) then I guess I might also like to have flexibility to fetch 
> file types not on the checklist in the Content Types tab.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Commented] (CONNECTORS-1524) Documentum Connector fails with NullPointerException

2018-08-10 Thread James Thomas (JIRA)


[ 
https://issues.apache.org/jira/browse/CONNECTORS-1524?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16577054#comment-16577054
 ] 

James Thomas commented on CONNECTORS-1524:
--

I'm prepared to believe that I might have a configuration error. However, I can 
show this issue and remove it by switching between Java versions. (After more 
experimentation, I can see that the issue manifests in at least Oracle Java 9 
too.)

I'm not going to pursue this further in this ManifoldCF ticket at this time. 
Instead I'll arrange for my different applications to use different Java 
variants to work around it.

I can't find supported Java version information in OpenText's online material 
so I've opened a support ticket there to ask the question.

> Documentum Connector fails with NullPointerException
> 
>
> Key: CONNECTORS-1524
> URL: https://issues.apache.org/jira/browse/CONNECTORS-1524
> Project: ManifoldCF
>  Issue Type: Bug
>  Components: Documentum connector
>Affects Versions: ManifoldCF 2.10
>Reporter: James Thomas
>Assignee: Karl Wright
>Priority: Major
> Fix For: ManifoldCF 2.11
>
> Attachments: 500_server_error.txt
>
>
> I am running ManifoldCF 2.10 patched for issues 1512 and 1517.
> I recently upgraded Java to 10.0.2 (for another server on the same machine) 
> and find that when I try to create jobs using the Documentum connector, 
> clicking on (e.g.) the Paths tab will give this error dialog:
> {code:java}
> HTTP ERROR 500
> Problem accessing /mcf-crawler-ui/execute.jsp. Reason:
>     Server Error
> Caused by:
> org.apache.jasper.JasperException: An exception occurred processing JSP page 
> /execute.jsp at line 1542
> 1539:   {
> 1540: threadContext.save("JobObject",job);
> 1541: %>
> 1542: 
> 1543: <%
> 1544:   }
> 1545:   else if (op.equals("Save")){code}
> There's a lot of stack trace (which I'll attach) but it includes some 
> ManifoldCF calls:
>  
> {code:java}
> Caused by: java.lang.RuntimeException: Unexpected exception type: 
> java.lang.NullPointerException: null
>     at 
> org.apache.manifoldcf.crawler.connectors.DCTM.DCTM$GetSessionThread.finishUp(DCTM.java:117)
>     at 
> org.apache.manifoldcf.crawler.connectors.DCTM.DCTM.getSession(DCTM.java:173)
> ...
> Caused by: java.lang.NullPointerException
>     at 
> com.documentum.fc.client.impl.docbase.DocbaseDateFormat.getSeparator(DocbaseDateFormat.java:119)
>     at ...
> {code}
> If I revert to OpenJDK 1.8.0_171 (the version I upgraded from) the issue is 
> not seen.
>  



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Commented] (CONNECTORS-1524) Documentum Connector fails with NullPointerException under OpenJDK 10

2018-08-10 Thread James Thomas (JIRA)


[ 
https://issues.apache.org/jira/browse/CONNECTORS-1524?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16576550#comment-16576550
 ] 

James Thomas commented on CONNECTORS-1524:
--

Thanks, Karl.

For the record, it does manifest for me under Oracle Java 10.0.2, Java(TM) SE 
Runtime Environment 18.3 (build 10.0.2+13).

There is no difference in the stack trace produced:

 
{code:java}
$ diff 500_server_error.txt 500_server_error_oracle_java.txt

$
{code}
I'll contact OpenText support.

> Documentum Connector fails with NullPointerException under OpenJDK 10
> -
>
> Key: CONNECTORS-1524
> URL: https://issues.apache.org/jira/browse/CONNECTORS-1524
> Project: ManifoldCF
>  Issue Type: Bug
>  Components: Documentum connector
>Affects Versions: ManifoldCF 2.10
>Reporter: James Thomas
>Assignee: Karl Wright
>Priority: Major
> Fix For: ManifoldCF 2.11
>
> Attachments: 500_server_error.txt
>
>
> I am running ManifoldCF 2.10 patched for issues 1512 and 1517.
> I recently upgraded Java to 10.0.2 (for another server on the same machine) 
> and find that when I try to create jobs using the Documentum connector, 
> clicking on (e.g.) the Paths tab will give this error dialog:
> {code:java}
> HTTP ERROR 500
> Problem accessing /mcf-crawler-ui/execute.jsp. Reason:
>     Server Error
> Caused by:
> org.apache.jasper.JasperException: An exception occurred processing JSP page 
> /execute.jsp at line 1542
> 1539:   {
> 1540: threadContext.save("JobObject",job);
> 1541: %>
> 1542: 
> 1543: <%
> 1544:   }
> 1545:   else if (op.equals("Save")){code}
> There's a lot of stack trace (which I'll attach) but it includes some 
> ManifoldCF calls:
>  
> {code:java}
> Caused by: java.lang.RuntimeException: Unexpected exception type: 
> java.lang.NullPointerException: null
>     at 
> org.apache.manifoldcf.crawler.connectors.DCTM.DCTM$GetSessionThread.finishUp(DCTM.java:117)
>     at 
> org.apache.manifoldcf.crawler.connectors.DCTM.DCTM.getSession(DCTM.java:173)
> ...
> Caused by: java.lang.NullPointerException
>     at 
> com.documentum.fc.client.impl.docbase.DocbaseDateFormat.getSeparator(DocbaseDateFormat.java:119)
>     at ...
> {code}
> If I revert to OpenJDK 1.8.0_171 (the version I upgraded from) the issue is 
> not seen.
>  



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Created] (CONNECTORS-1524) Documentum Connector fails with NullPointerException under OpenJDK 10

2018-08-10 Thread James Thomas (JIRA)
James Thomas created CONNECTORS-1524:


 Summary: Documentum Connector fails with NullPointerException 
under OpenJDK 10
 Key: CONNECTORS-1524
 URL: https://issues.apache.org/jira/browse/CONNECTORS-1524
 Project: ManifoldCF
  Issue Type: Bug
  Components: Documentum connector
Affects Versions: ManifoldCF 2.10
Reporter: James Thomas
 Attachments: 500_server_error.txt

I am running ManifoldCF 2.10 patched for issues 1512 and 1517.

I recently upgraded Java to 10.0.2 (for another server on the same machine) and 
find that when I try to create jobs using the Documentum connector, clicking on 
(e.g.) the Paths tab will give this error dialog:
{code:java}
HTTP ERROR 500
Problem accessing /mcf-crawler-ui/execute.jsp. Reason:
    Server Error
Caused by:

org.apache.jasper.JasperException: An exception occurred processing JSP page 
/execute.jsp at line 1542

1539:   {
1540: threadContext.save("JobObject",job);
1541: %>
1542: 
1543: <%
1544:   }
1545:   else if (op.equals("Save")){code}
There's a lot of stack trace (which I'll attach) but it includes some 
ManifoldCF calls:

 
{code:java}
Caused by: java.lang.RuntimeException: Unexpected exception type: 
java.lang.NullPointerException: null
    at 
org.apache.manifoldcf.crawler.connectors.DCTM.DCTM$GetSessionThread.finishUp(DCTM.java:117)
    at 
org.apache.manifoldcf.crawler.connectors.DCTM.DCTM.getSession(DCTM.java:173)
...
Caused by: java.lang.NullPointerException
    at 
com.documentum.fc.client.impl.docbase.DocbaseDateFormat.getSeparator(DocbaseDateFormat.java:119)
    at ...
{code}
If I revert to OpenJDK 1.8.0_171 (the version I upgraded from) the issue is not 
seen.

 



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Commented] (CONNECTORS-1521) Documentum Connector users ManifoldCF's local time in queries constraints against the Documentum server without reference to time zones

2018-08-09 Thread James Thomas (JIRA)


[ 
https://issues.apache.org/jira/browse/CONNECTORS-1521?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16575081#comment-16575081
 ] 

James Thomas commented on CONNECTORS-1521:
--

I didn't have the necessary credentials to open support tickets with OpenText, 
but I've organised that now, so I'll go ahead and do it.

I don't think we should do a horrible hack :)

And, yes, the consistent timezone approach is the one I'd taken internally 
after discovering this issue.

> Documentum Connector users ManifoldCF's local time in queries constraints 
> against the Documentum server without reference to time zones
> ---
>
> Key: CONNECTORS-1521
> URL: https://issues.apache.org/jira/browse/CONNECTORS-1521
> Project: ManifoldCF
>  Issue Type: Bug
>  Components: Documentum connector
>Affects Versions: ManifoldCF 2.10
>Reporter: James Thomas
>Assignee: Karl Wright
>Priority: Major
>
> I find that the time/date constraints in queries to the Documentum server are 
> based on the "raw" local time of the ManifoldCF server but appear to take no 
> account of the time zones of the two servers.
> This can lead to recently modified files not being transferred to the output 
> repository when you would naturally expect them to be. I'd like the times to 
> be aligned, perhaps by including time zone in the query. In particular, is 
> there a way to use UTC perhaps?
> Here's an example ...
>  * create a folder in Documentum
>  * set up a job to point at the folder and output to the file system
>  * put two documents into a folder in Documentum
>  * Select them, right click and export as CSV (to show the timestamps):
> {noformat}
> 1.png,48489.0,Portable Network Graphics,8/7/2018 9:04 AM,
> 2.png,28620.0,Portable Network Graphics,8/7/2018 9:04 AM,,{noformat}
> Check the local time on the ManifoldCF server machine. Observe that it's 
> reporting consistent time with the DM server:
> {noformat}
> [james@manifold]$ date
> Tue Aug  7 09:07:25 BST 2018{noformat}
> Start the job and look for the query to Documentum in the manifoldcf.log file 
> (line break added for readability):
> {noformat}
> DEBUG 2018-08-07T08:07:47.297Z (Startup thread) - DCTM: About to execute 
> query= (select for READ distinct i_chronicle_id from dm_document where 
> r_modify_date >= date('01/01/1970 00:00:00','mm/dd/ hh:mi:ss') and
> r_modify_date<=date('08/07/2018 08:07:34','mm/dd/ hh:mi:ss') 
> AND (i_is_deleted=TRUE Or (i_is_deleted=FALSE AND a_full_text=TRUE AND 
> r_content_size>0)) AND ( Folder('/Administrator/james', DESCEND) ))
> ^C{noformat}
> Notice that the latest date asked for is *before* the modification date of 
> the files added to DM. (And is an hour out, see footnote.)
>   
>  See whether anything has been output by the File System connector. It hasn't:
> {noformat}
> [james@manifold]$ ls /bigdisc/source/PDFs/timezones/
> [james@manifold]$
> {noformat}
> Now:
>  * change the timezone on the ManifoldCF server machine
>  * restart the ManifoldCF server and the Documentum processes
>  * reseed the job
> Check the local time on the ManifoldCF server machine; it has changed:
> {noformat}
> [james@manifold]$ date
> Tue Aug  7 10:10:29 CEST 2018{noformat}
> Start the job again and notice that the query has changed by an hour, plus 
> the few minutes it took to change the date etc (and is still an hour out, see 
> footnote):
> {noformat}
> r_modify_date<=date('08/07/2018 09:11:02','mm/dd/ hh:mi:ss') 
> {noformat}
> Observe that the range of dates now covers the timestamps on the DM data, and 
> also that some data has now been transferred by the File System connector:
> {noformat}
> [james@manifold]$ ls 
> /bigdisc/source/PDFs/timezones/http/mfserver\:8080/da/component/
> drl?versionLabel=CURRENT=09018000e515
> drl?versionLabel=CURRENT=09018000e516
> {noformat}
>  
>  
> [Footnote] It appears that something is trying to take account of Daylight 
> Saving Time too.
> If I set the server date to a time outside of DST, the query is aligned with 
> the current time:
> {noformat}
> [i2e@i2ehost manifold]$ date
>  Mon Oct 29 00:01:13 CET 2018
> r_modify_date<=date('10/29/2018 00:01:39','mm/dd/ hh:mi:ss') 
> {noformat}
> But if I set the time inside DST, the time is an hour before:
> {noformat}
> [i2e@i2ehost manifold]$ date
>  Sat Oct 27 00:00:06 CEST 2018
> r_modify_date<=date('10/26/2018 23:00:26','mm/dd/ hh:mi:ss') 
> {noformat}
> This is perhaps a Java issue rather than a logic issue in the connector? See 
> e.g. [https://stackoverflow.com/questions/6392/java-time-zone-is-messed-up]



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Commented] (CONNECTORS-1521) Documentum Connector users ManifoldCF's local time in queries constraints against the Documentum server without reference to time zones

2018-08-07 Thread James Thomas (JIRA)


[ 
https://issues.apache.org/jira/browse/CONNECTORS-1521?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16572701#comment-16572701
 ] 

James Thomas commented on CONNECTORS-1521:
--

Another potential approach: is it possible to request the time of the DM server 
(or perhaps parse it out of responses to other requests, as in HTTP headers?) 
and use that in the DQL instead of local time?

> Documentum Connector users ManifoldCF's local time in queries constraints 
> against the Documentum server without reference to time zones
> ---
>
> Key: CONNECTORS-1521
> URL: https://issues.apache.org/jira/browse/CONNECTORS-1521
> Project: ManifoldCF
>  Issue Type: Bug
>  Components: Documentum connector
>Affects Versions: ManifoldCF 2.10
>Reporter: James Thomas
>Assignee: Karl Wright
>Priority: Major
>
> I find that the time/date constraints in queries to the Documentum server are 
> based on the "raw" local time of the ManifoldCF server but appear to take no 
> account of the time zones of the two servers.
> This can lead to recently modified files not being transferred to the output 
> repository when you would naturally expect them to be. I'd like the times to 
> be aligned, perhaps by including time zone in the query. In particular, is 
> there a way to use UTC perhaps?
> Here's an example ...
>  * create a folder in Documentum
>  * set up a job to point at the folder and output to the file system
>  * put two documents into a folder in Documentum
>  * Select them, right click and export as CSV (to show the timestamps):
> {noformat}
> 1.png,48489.0,Portable Network Graphics,8/7/2018 9:04 AM,
> 2.png,28620.0,Portable Network Graphics,8/7/2018 9:04 AM,,{noformat}
> Check the local time on the ManifoldCF server machine. Observe that it's 
> reporting consistent time with the DM server:
> {noformat}
> [james@manifold]$ date
> Tue Aug  7 09:07:25 BST 2018{noformat}
> Start the job and look for the query to Documentum in the manifoldcf.log file 
> (line break added for readability):
> {noformat}
> DEBUG 2018-08-07T08:07:47.297Z (Startup thread) - DCTM: About to execute 
> query= (select for READ distinct i_chronicle_id from dm_document where 
> r_modify_date >= date('01/01/1970 00:00:00','mm/dd/ hh:mi:ss') and
> r_modify_date<=date('08/07/2018 08:07:34','mm/dd/ hh:mi:ss') 
> AND (i_is_deleted=TRUE Or (i_is_deleted=FALSE AND a_full_text=TRUE AND 
> r_content_size>0)) AND ( Folder('/Administrator/james', DESCEND) ))
> ^C{noformat}
> Notice that the latest date asked for is *before* the modification date of 
> the files added to DM. (And is an hour out, see footnote.)
>   
>  See whether anything has been output by the File System connector. It hasn't:
> {noformat}
> [james@manifold]$ ls /bigdisc/source/PDFs/timezones/
> [james@manifold]$
> {noformat}
> Now:
>  * change the timezone on the ManifoldCF server machine
>  * restart the ManifoldCF server and the Documentum processes
>  * reseed the job
> Check the local time on the ManifoldCF server machine; it has changed:
> {noformat}
> [james@manifold]$ date
> Tue Aug  7 10:10:29 CEST 2018{noformat}
> Start the job again and notice that the query has changed by an hour, plus 
> the few minutes it took to change the date etc (and is still an hour out, see 
> footnote):
> {noformat}
> r_modify_date<=date('08/07/2018 09:11:02','mm/dd/ hh:mi:ss') 
> {noformat}
> Observe that the range of dates now covers the timestamps on the DM data, and 
> also that some data has now been transferred by the File System connector:
> {noformat}
> [james@manifold]$ ls 
> /bigdisc/source/PDFs/timezones/http/mfserver\:8080/da/component/
> drl?versionLabel=CURRENT=09018000e515
> drl?versionLabel=CURRENT=09018000e516
> {noformat}
>  
>  
> [Footnote] It appears that something is trying to take account of Daylight 
> Saving Time too.
> If I set the server date to a time outside of DST, the query is aligned with 
> the current time:
> {noformat}
> [i2e@i2ehost manifold]$ date
>  Mon Oct 29 00:01:13 CET 2018
> r_modify_date<=date('10/29/2018 00:01:39','mm/dd/ hh:mi:ss') 
> {noformat}
> But if I set the time inside DST, the time is an hour before:
> {noformat}
> [i2e@i2ehost manifold]$ date
>  Sat Oct 27 00:00:06 CEST 2018
> r_modify_date<=date('10/26/2018 23:00:26','mm/dd/ hh:mi:ss') 
> {noformat}
> This is perhaps a Java issue rather than a logic issue in the connector? See 
> e.g. [https://stackoverflow.com/questions/6392/java-time-zone-is-messed-up]



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Commented] (CONNECTORS-1521) Documentum Connector users ManifoldCF's local time in queries constraints against the Documentum server without reference to time zones

2018-08-07 Thread James Thomas (JIRA)


[ 
https://issues.apache.org/jira/browse/CONNECTORS-1521?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16571832#comment-16571832
 ] 

James Thomas commented on CONNECTORS-1521:
--

No, there's no patterns with "Z"

> Documentum Connector users ManifoldCF's local time in queries constraints 
> against the Documentum server without reference to time zones
> ---
>
> Key: CONNECTORS-1521
> URL: https://issues.apache.org/jira/browse/CONNECTORS-1521
> Project: ManifoldCF
>  Issue Type: Bug
>  Components: Documentum connector
>Affects Versions: ManifoldCF 2.10
>Reporter: James Thomas
>Assignee: Karl Wright
>Priority: Major
>
> I find that the time/date constraints in queries to the Documentum server are 
> based on the "raw" local time of the ManifoldCF server but appear to take no 
> account of the time zones of the two servers.
> This can lead to recently modified files not being transferred to the output 
> repository when you would naturally expect them to be. I'd like the times to 
> be aligned, perhaps by including time zone in the query. In particular, is 
> there a way to use UTC perhaps?
> Here's an example ...
>  * create a folder in Documentum
>  * set up a job to point at the folder and output to the file system
>  * put two documents into a folder in Documentum
>  * Select them, right click and export as CSV (to show the timestamps):
> {noformat}
> 1.png,48489.0,Portable Network Graphics,8/7/2018 9:04 AM,
> 2.png,28620.0,Portable Network Graphics,8/7/2018 9:04 AM,,{noformat}
> Check the local time on the ManifoldCF server machine. Observe that it's 
> reporting consistent time with the DM server:
> {noformat}
> [james@manifold]$ date
> Tue Aug  7 09:07:25 BST 2018{noformat}
> Start the job and look for the query to Documentum in the manifoldcf.log file 
> (line break added for readability):
> {noformat}
> DEBUG 2018-08-07T08:07:47.297Z (Startup thread) - DCTM: About to execute 
> query= (select for READ distinct i_chronicle_id from dm_document where 
> r_modify_date >= date('01/01/1970 00:00:00','mm/dd/ hh:mi:ss') and
> r_modify_date<=date('08/07/2018 08:07:34','mm/dd/ hh:mi:ss') 
> AND (i_is_deleted=TRUE Or (i_is_deleted=FALSE AND a_full_text=TRUE AND 
> r_content_size>0)) AND ( Folder('/Administrator/james', DESCEND) ))
> ^C{noformat}
> Notice that the latest date asked for is *before* the modification date of 
> the files added to DM. (And is an hour out, see footnote.)
>   
>  See whether anything has been output by the File System connector. It hasn't:
> {noformat}
> [james@manifold]$ ls /bigdisc/source/PDFs/timezones/
> [james@manifold]$
> {noformat}
> Now:
>  * change the timezone on the ManifoldCF server machine
>  * restart the ManifoldCF server and the Documentum processes
>  * reseed the job
> Check the local time on the ManifoldCF server machine; it has changed:
> {noformat}
> [james@manifold]$ date
> Tue Aug  7 10:10:29 CEST 2018{noformat}
> Start the job again and notice that the query has changed by an hour, plus 
> the few minutes it took to change the date etc (and is still an hour out, see 
> footnote):
> {noformat}
> r_modify_date<=date('08/07/2018 09:11:02','mm/dd/ hh:mi:ss') 
> {noformat}
> Observe that the range of dates now covers the timestamps on the DM data, and 
> also that some data has now been transferred by the File System connector:
> {noformat}
> [james@manifold]$ ls 
> /bigdisc/source/PDFs/timezones/http/mfserver\:8080/da/component/
> drl?versionLabel=CURRENT=09018000e515
> drl?versionLabel=CURRENT=09018000e516
> {noformat}
>  
>  
> [Footnote] It appears that something is trying to take account of Daylight 
> Saving Time too.
> If I set the server date to a time outside of DST, the query is aligned with 
> the current time:
> {noformat}
> [i2e@i2ehost manifold]$ date
>  Mon Oct 29 00:01:13 CET 2018
> r_modify_date<=date('10/29/2018 00:01:39','mm/dd/ hh:mi:ss') 
> {noformat}
> But if I set the time inside DST, the time is an hour before:
> {noformat}
> [i2e@i2ehost manifold]$ date
>  Sat Oct 27 00:00:06 CEST 2018
> r_modify_date<=date('10/26/2018 23:00:26','mm/dd/ hh:mi:ss') 
> {noformat}
> This is perhaps a Java issue rather than a logic issue in the connector? See 
> e.g. [https://stackoverflow.com/questions/6392/java-time-zone-is-messed-up]



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Commented] (CONNECTORS-1521) Documentum Connector users ManifoldCF's local time in queries constraints against the Documentum server without reference to time zones

2018-08-07 Thread James Thomas (JIRA)


[ 
https://issues.apache.org/jira/browse/CONNECTORS-1521?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16571557#comment-16571557
 ] 

James Thomas commented on CONNECTORS-1521:
--

(I am not a developer but ...) I have explored the Javadoc for 
com.documentum.fc.common, Interface IDfTime in DFC 16.4

There are ~50 patterns, here's a couple of examples:

 
{noformat}
DF_TIME_PATTERN8 Represents time pattern string "month dd, ".
DF_TIME_PATTERN34 Represents time pattern string "mon- hh:mi:ss".
{noformat}
As far as I can see none of the patterns gives anything related to time zone or 
locale, or mentions UTC, DST etc.

This is consistent with the blog post that you found.

Googling around a little, I came across these:

[https://stackoverflow.com/a/8315831] which looks related to your comment about 
asString().

[https://msroth.wordpress.com/2011/05/15/the-content-server-and-time-zones|https://msroth.wordpress.com/2011/05/15/the-content-server-and-time-zones,m]
 and [https://blog.documentum.pro/2015/02/04/time-in-documentum] which are in 
this general area.

 

> Documentum Connector users ManifoldCF's local time in queries constraints 
> against the Documentum server without reference to time zones
> ---
>
> Key: CONNECTORS-1521
> URL: https://issues.apache.org/jira/browse/CONNECTORS-1521
> Project: ManifoldCF
>  Issue Type: Bug
>  Components: Documentum connector
>Affects Versions: ManifoldCF 2.10
>Reporter: James Thomas
>Assignee: Karl Wright
>Priority: Major
>
> I find that the time/date constraints in queries to the Documentum server are 
> based on the "raw" local time of the ManifoldCF server but appear to take no 
> account of the time zones of the two servers.
> This can lead to recently modified files not being transferred to the output 
> repository when you would naturally expect them to be. I'd like the times to 
> be aligned, perhaps by including time zone in the query. In particular, is 
> there a way to use UTC perhaps?
> Here's an example ...
>  * create a folder in Documentum
>  * set up a job to point at the folder and output to the file system
>  * put two documents into a folder in Documentum
>  * Select them, right click and export as CSV (to show the timestamps):
> {noformat}
> 1.png,48489.0,Portable Network Graphics,8/7/2018 9:04 AM,
> 2.png,28620.0,Portable Network Graphics,8/7/2018 9:04 AM,,{noformat}
> Check the local time on the ManifoldCF server machine. Observe that it's 
> reporting consistent time with the DM server:
> {noformat}
> [james@manifold]$ date
> Tue Aug  7 09:07:25 BST 2018{noformat}
> Start the job and look for the query to Documentum in the manifoldcf.log file 
> (line break added for readability):
> {noformat}
> DEBUG 2018-08-07T08:07:47.297Z (Startup thread) - DCTM: About to execute 
> query= (select for READ distinct i_chronicle_id from dm_document where 
> r_modify_date >= date('01/01/1970 00:00:00','mm/dd/ hh:mi:ss') and
> r_modify_date<=date('08/07/2018 08:07:34','mm/dd/ hh:mi:ss') 
> AND (i_is_deleted=TRUE Or (i_is_deleted=FALSE AND a_full_text=TRUE AND 
> r_content_size>0)) AND ( Folder('/Administrator/james', DESCEND) ))
> ^C{noformat}
> Notice that the latest date asked for is *before* the modification date of 
> the files added to DM. (And is an hour out, see footnote.)
>   
>  See whether anything has been output by the File System connector. It hasn't:
> {noformat}
> [james@manifold]$ ls /bigdisc/source/PDFs/timezones/
> [james@manifold]$
> {noformat}
> Now:
>  * change the timezone on the ManifoldCF server machine
>  * restart the ManifoldCF server and the Documentum processes
>  * reseed the job
> Check the local time on the ManifoldCF server machine; it has changed:
> {noformat}
> [james@manifold]$ date
> Tue Aug  7 10:10:29 CEST 2018{noformat}
> Start the job again and notice that the query has changed by an hour, plus 
> the few minutes it took to change the date etc (and is still an hour out, see 
> footnote):
> {noformat}
> r_modify_date<=date('08/07/2018 09:11:02','mm/dd/ hh:mi:ss') 
> {noformat}
> Observe that the range of dates now covers the timestamps on the DM data, and 
> also that some data has now been transferred by the File System connector:
> {noformat}
> [james@manifold]$ ls 
> /bigdisc/source/PDFs/timezones/http/mfserver\:8080/da/component/
> drl?versionLabel=CURRENT=09018000e515
> drl?versionLabel=CURRENT=09018000e516
> {noformat}
>  
>  
> [Footnote] It appears that something is trying to take account of Daylight 
> Saving Time too.
> If I set the server date to a time outside of DST, the query is aligned with 
> the current time:
> {noformat}
> [i2e@i2ehost manifold]$ date
>  Mon Oct 29 00:01:13 CET 2018

[jira] [Updated] (CONNECTORS-1521) Documentum Connector users ManifoldCF's local time in queries constraints against the Documentum server without reference to time zones

2018-08-07 Thread James Thomas (JIRA)


 [ 
https://issues.apache.org/jira/browse/CONNECTORS-1521?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

James Thomas updated CONNECTORS-1521:
-
Description: 
I find that the time/date constraints in queries to the Documentum server are 
based on the "raw" local time of the ManifoldCF server but appear to take no 
account of the time zones of the two servers.

This can lead to recently modified files not being transferred to the output 
repository when you would naturally expect them to be. I'd like the times to be 
aligned, perhaps by including time zone in the query. In particular, is there a 
way to use UTC perhaps?

Here's an example ...
 * create a folder in Documentum
 * set up a job to point at the folder and output to the file system
 * put two documents into a folder in Documentum
 * Select them, right click and export as CSV (to show the timestamps):

{noformat}
1.png,48489.0,Portable Network Graphics,8/7/2018 9:04 AM,
2.png,28620.0,Portable Network Graphics,8/7/2018 9:04 AM,,{noformat}
Check the local time on the ManifoldCF server machine. Observe that it's 
reporting consistent time with the DM server:
{noformat}
[james@manifold]$ date
Tue Aug  7 09:07:25 BST 2018{noformat}
Start the job and look for the query to Documentum in the manifoldcf.log file 
(line break added for readability):
{noformat}
DEBUG 2018-08-07T08:07:47.297Z (Startup thread) - DCTM: About to execute query= 
(select for READ distinct i_chronicle_id from dm_document where r_modify_date 
>= date('01/01/1970 00:00:00','mm/dd/ hh:mi:ss') and

r_modify_date<=date('08/07/2018 08:07:34','mm/dd/ hh:mi:ss') 

AND (i_is_deleted=TRUE Or (i_is_deleted=FALSE AND a_full_text=TRUE AND 
r_content_size>0)) AND ( Folder('/Administrator/james', DESCEND) ))
^C{noformat}
Notice that the latest date asked for is *before* the modification date of the 
files added to DM. (And is an hour out, see footnote.)
  
 See whether anything has been output by the File System connector. It hasn't:
{noformat}
[james@manifold]$ ls /bigdisc/source/PDFs/timezones/
[james@manifold]$
{noformat}
Now:
 * change the timezone on the ManifoldCF server machine
 * restart the ManifoldCF server and the Documentum processes
 * reseed the job

Check the local time on the ManifoldCF server machine; it has changed:
{noformat}
[james@manifold]$ date
Tue Aug  7 10:10:29 CEST 2018{noformat}
Start the job again and notice that the query has changed by an hour, plus the 
few minutes it took to change the date etc (and is still an hour out, see 
footnote):
{noformat}
r_modify_date<=date('08/07/2018 09:11:02','mm/dd/ hh:mi:ss') 
{noformat}
Observe that the range of dates now covers the timestamps on the DM data, and 
also that some data has now been transferred by the File System connector:
{noformat}
[james@manifold]$ ls 
/bigdisc/source/PDFs/timezones/http/mfserver\:8080/da/component/
drl?versionLabel=CURRENT=09018000e515
drl?versionLabel=CURRENT=09018000e516
{noformat}
 

 

[Footnote] It appears that something is trying to take account of Daylight 
Saving Time too.

If I set the server date to a time outside of DST, the query is aligned with 
the current time:
{noformat}
[i2e@i2ehost manifold]$ date
 Mon Oct 29 00:01:13 CET 2018

r_modify_date<=date('10/29/2018 00:01:39','mm/dd/ hh:mi:ss') 
{noformat}
But if I set the time inside DST, the time is an hour before:
{noformat}
[i2e@i2ehost manifold]$ date
 Sat Oct 27 00:00:06 CEST 2018

r_modify_date<=date('10/26/2018 23:00:26','mm/dd/ hh:mi:ss') 

{noformat}
This is perhaps a Java issue rather than a logic issue in the connector? See 
e.g. [https://stackoverflow.com/questions/6392/java-time-zone-is-messed-up]

  was:
I find that the time/date constraints in queries to the Documentum server are 
based on the "raw" local time of the ManifoldCF server but appear to take no 
account of the time zones of the two servers.

This can lead to recently modified files not being transferred to the output 
repository when you would naturally expect them to be. I'd like the times to be 
aligned, perhaps by including time zone in the query. In particular, is there a 
way to use UTC perhaps?

Here's an example ...
 * create a folder in Documentum
 * set up a job to point at the folder and output to the file system
 * put two documents into a folder in Documentum
 * Select them, right click and export as CSV (to show the timestamps):

{noformat}
1.png,48489.0,Portable Network Graphics,8/7/2018 9:04 AM,
2.png,28620.0,Portable Network Graphics,8/7/2018 9:04 AM,,{noformat}
Check the local time on the ManifoldCF server machine. Observe that it's 
reporting consistent time with the DM server:
{noformat}
[james@manifold]$ date
Tue Aug  7 09:07:25 BST 2018{noformat}
Start the job and look for the query to Documentum in the manifoldcf.log file 
(line break added for readability):
{noformat}
DEBUG 2018-08-07T08:07:47.297Z (Startup thread) - DCTM: About to execute 

[jira] [Created] (CONNECTORS-1521) Documentum Connector users ManifoldCF's local time in queries constraints against the Documentum server without reference to time zones

2018-08-07 Thread James Thomas (JIRA)
James Thomas created CONNECTORS-1521:


 Summary: Documentum Connector users ManifoldCF's local time in 
queries constraints against the Documentum server without reference to time 
zones
 Key: CONNECTORS-1521
 URL: https://issues.apache.org/jira/browse/CONNECTORS-1521
 Project: ManifoldCF
  Issue Type: Bug
  Components: Documentum connector
Affects Versions: ManifoldCF 2.10
Reporter: James Thomas


I find that the time/date constraints in queries to the Documentum server are 
based on the "raw" local time of the ManifoldCF server but appear to take no 
account of the time zones of the two servers.

This can lead to recently modified files not being transferred to the output 
repository when you would naturally expect them to be. I'd like the times to be 
aligned, perhaps by including time zone in the query. In particular, is there a 
way to use UTC perhaps?

Here's an example ...
 * create a folder in Documentum
 * set up a job to point at the folder and output to the file system
 * put two documents into a folder in Documentum
 * Select them, right click and export as CSV (to show the timestamps):

{noformat}
1.png,48489.0,Portable Network Graphics,8/7/2018 9:04 AM,
2.png,28620.0,Portable Network Graphics,8/7/2018 9:04 AM,,{noformat}
Check the local time on the ManifoldCF server machine. Observe that it's 
reporting consistent time with the DM server:
{noformat}
[james@manifold]$ date
Tue Aug  7 09:07:25 BST 2018{noformat}
Start the job and look for the query to Documentum in the manifoldcf.log file 
(line break added for readability):
{noformat}
DEBUG 2018-08-07T08:07:47.297Z (Startup thread) - DCTM: About to execute query= 
(select for READ distinct i_chronicle_id from dm_document where r_modify_date 
>= date('01/01/1970 00:00:00','mm/dd/ hh:mi:ss') and

r_modify_date<=date('08/07/2018 08:07:34','mm/dd/ hh:mi:ss') 

AND (i_is_deleted=TRUE Or (i_is_deleted=FALSE AND a_full_text=TRUE AND 
r_content_size>0)) AND ( Folder('/Administrator/james', DESCEND) ))
^C{noformat}
Notice that the latest date asked for is *before* the modification date of the 
files added to DM. (And is an hour out, see footnote.)
  
 See whether anything has been output by the File System connector. It hasn't:
{noformat}
[james@manifold]$ ls /bigdisc/source/PDFs/timezones/
[james@manifold]$
{noformat}
Now:
 * change the timezone on the ManifoldCF server machine
 * restart the ManifoldCF server and the Documentum processes
 * reseed the job

Check the local time on the ManifoldCF server machine; it has changed:
{noformat}
[james@manifold]$ date
Tue Aug  7 10:10:29 CEST 2018{noformat}
Start the job again and notice that the query has changed by an hour, plus the 
few minutes it took to change the date etc (and is still an hour out, see 
footnote):
{noformat}
r_modify_date<=date('08/07/2018 09:11:02','mm/dd/ hh:mi:ss') 
{noformat}
Observe that the range of dates now covers the timestamps on the DM data, and 
also that some data has now been transferred by the File System connector:
{noformat}
[james@manifold]$ ls 
/bigdisc/source/PDFs/timezones/http/mfserver\:8080/da/component/
drl?versionLabel=CURRENT=09018000e515
drl?versionLabel=CURRENT=09018000e516
{noformat}
 

 

[Footnote] It appears that something is trying to take account of Daylight 
Saving Time too.

If I set the server date to a time outside of DST, the query is aligned with 
the current time:

 
{noformat}
[i2e@i2ehost manifold]$ date
 Mon Oct 29 00:01:13 CET 2018

r_modify_date<=date('10/29/2018 00:01:39','mm/dd/ hh:mi:ss') 
{noformat}
But if I set the time inside DST, the time is an hour before:

 
{noformat}
[i2e@i2ehost manifold]$ date
 Sat Oct 27 00:00:06 CEST 2018
r_modify_date<=date('10/26/2018 23:00:26','mm/dd/ hh:mi:ss') 

{noformat}
This is perhaps a Java issue rather than a logic issue in the connector? See 
e.g. [https://stackoverflow.com/questions/6392/java-time-zone-is-messed-up]



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Commented] (CONNECTORS-1517) Documentum Connector uses different "unconstrained" a_content_type filters depending on whether the Content Types tab has been edited

2018-08-05 Thread James Thomas (JIRA)


[ 
https://issues.apache.org/jira/browse/CONNECTORS-1517?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16569399#comment-16569399
 ] 

James Thomas commented on CONNECTORS-1517:
--

I don't know DQL, I'm afraid.

Another alternative would be a "no whitelist" option, i.e. the equivalent of 
the unconstrained search that is currently happening in the first step I 
described in this ticket.

> Documentum Connector uses different "unconstrained" a_content_type filters 
> depending on whether the Content Types tab has been edited
> -
>
> Key: CONNECTORS-1517
> URL: https://issues.apache.org/jira/browse/CONNECTORS-1517
> Project: ManifoldCF
>  Issue Type: Bug
>  Components: Documentum connector
>Affects Versions: ManifoldCF 2.10
>Reporter: James Thomas
>Assignee: Karl Wright
>Priority: Major
> Fix For: ManifoldCF 2.11
>
>
> I am using Manifold 2.10 patched for issue 
> https://issues.apache.org/jira/browse/CONNECTORS-1512
> I find that the "unconstrained" query submitted to Documentum differs 
> depending on whether the Content Types in the job have been edited or not. 
> This can dramatically affect which files are fetched. After editing, there 
> are likely to be fewer.
> For example, having simply created a job connecting to DM and setting only 
> the Paths value to Administrator/james the following request is generated. 
> (Taken from manifoldcf.log).
> Note that there are no a_content_type constraints (and my line break for 
> readibility):
> {code:java}
> DEBUG 2018-07-26T05:52:56,422 (Startup thread) - DCTM: About to execute 
> query= (select for READ distinct i_chronicle_id from dm_document where 
> r_modify_date >= date('01/01/1970 01:00:00','mm/dd/ hh:mi:ss') and 
> r_modify_date<=date('07/26/2018 05:52:56','mm/dd/ hh:mi:ss') AND 
> (i_is_deleted=TRUE Or (i_is_deleted=FALSE AND a_full_text=TRUE AND 
> r_content_size>0))
> AND ( Folder('/Administrator/james', DESCEND) ))
> {code}
> Once the Content Types tab has been edited (e.g. to remove the 123w type) it 
> looks like this, i.e. the search constrains to only the selected types (my 
> ellipsis for readibility):
> {code:java}
> DEBUG 2018-07-26T05:58:36,755 (Startup thread) - DCTM: About to execute 
> query= (select for READ distinct i_chronicle_id from dm_document where 
> r_modify_date >= date('01/01/1970 01:00:00','mm/dd/ hh:mi:ss') and 
> r_modify_date<=date('07/26/2018 05:58:36','mm/dd/ hh:mi:ss') AND 
> (i_is_deleted=TRUE Or (i_is_deleted=FALSE AND a_full_text=TRUE AND 
> r_content_size>0 
> AND a_content_type IN ('acad', ... 'zip_pub_html'))) 
> AND ( Folder('/Administrator/james', DESCEND) ))
> {code}
> If the 123w type is now reselected in the Content Types tab, the search adds 
> it to the list of a_content_type entries, but doesn't return to the 
> unconstrained initial search:
> {code:java}
> DEBUG 2018-07-26T05:59:16,863 (Startup thread) - DCTM: About to execute 
> query= (select for READ distinct i_chronicle_id from dm_document where 
> r_modify_date >= date('01/01/1970 01:00:00','mm/dd/ hh:mi:ss') and 
> r_modify_date<=date('07/26/2018 05:59:16','mm/dd/ hh:mi:ss') AND 
> (i_is_deleted=TRUE Or (i_is_deleted=FALSE AND a_full_text=TRUE AND 
> r_content_size>0 
> AND a_content_type IN ('123w', ... 'zip_pub_html'))) 
> AND ( Folder('/Administrator/james', DESCEND) ))
> {code}
> This means that running what appears to be an equivalent job several times 
> may not fetch the same set of documents from Documentum.
> I expect that the same configuration in the UI produces the same search to 
> Documentum, regardless of how the configuration was arrived at.
> If the selected items in the Content Types list is treated as the only set of 
> files to fetch (i,.e. the initial unconstrained search is considered 
> incorrect here) then I guess I might also like to have flexibility to fetch 
> file types not on the checklist in the Content Types tab.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Created] (CONNECTORS-1517) Documentum Connector uses different "unconstrained" a_content_type filters depending on whether the Content Types tab has been edited

2018-07-25 Thread James Thomas (JIRA)
James Thomas created CONNECTORS-1517:


 Summary: Documentum Connector uses different "unconstrained" 
a_content_type filters depending on whether the Content Types tab has been 
edited
 Key: CONNECTORS-1517
 URL: https://issues.apache.org/jira/browse/CONNECTORS-1517
 Project: ManifoldCF
  Issue Type: Bug
  Components: Documentum connector
Affects Versions: ManifoldCF 2.10
Reporter: James Thomas


I am using Manifold 2.10 patched for issue 
https://issues.apache.org/jira/browse/CONNECTORS-1512

I find that the "unconstrained" query submitted to Documentum differs depending 
on whether the Content Types in the job have been edited or not. This can 
dramatically affect which files are fetched. After editing, there are likely to 
be fewer.

For example, having simply created a job connecting to DM and setting only the 
Paths value to Administrator/james the following request is generated. (Taken 
from manifoldcf.log).

Note that there are no a_content_type constraints (and my line break for 
readibility):
{code:java}
DEBUG 2018-07-26T05:52:56,422 (Startup thread) - DCTM: About to execute query= 
(select for READ distinct i_chronicle_id from dm_document where r_modify_date 
>= date('01/01/1970 01:00:00','mm/dd/ hh:mi:ss') and 
r_modify_date<=date('07/26/2018 05:52:56','mm/dd/ hh:mi:ss') AND 
(i_is_deleted=TRUE Or (i_is_deleted=FALSE AND a_full_text=TRUE AND 
r_content_size>0))

AND ( Folder('/Administrator/james', DESCEND) ))
{code}
Once the Content Types tab has been edited (e.g. to remove the 123w type) it 
looks like this, i.e. the search constrains to only the selected types (my 
ellipsis for readibility):
{code:java}
DEBUG 2018-07-26T05:58:36,755 (Startup thread) - DCTM: About to execute query= 
(select for READ distinct i_chronicle_id from dm_document where r_modify_date 
>= date('01/01/1970 01:00:00','mm/dd/ hh:mi:ss') and 
r_modify_date<=date('07/26/2018 05:58:36','mm/dd/ hh:mi:ss') AND 
(i_is_deleted=TRUE Or (i_is_deleted=FALSE AND a_full_text=TRUE AND 
r_content_size>0 

AND a_content_type IN ('acad', ... 'zip_pub_html'))) 
AND ( Folder('/Administrator/james', DESCEND) ))
{code}
If the 123w type is now reselected in the Content Types tab, the search adds it 
to the list of a_content_type entries, but doesn't return to the unconstrained 
initial search:
{code:java}
DEBUG 2018-07-26T05:59:16,863 (Startup thread) - DCTM: About to execute query= 
(select for READ distinct i_chronicle_id from dm_document where r_modify_date 
>= date('01/01/1970 01:00:00','mm/dd/ hh:mi:ss') and 
r_modify_date<=date('07/26/2018 05:59:16','mm/dd/ hh:mi:ss') AND 
(i_is_deleted=TRUE Or (i_is_deleted=FALSE AND a_full_text=TRUE AND 
r_content_size>0 

AND a_content_type IN ('123w', ... 'zip_pub_html'))) 
AND ( Folder('/Administrator/james', DESCEND) ))
{code}
This means that running what appears to be an equivalent job several times may 
not fetch the same set of documents from Documentum.

I expect that the same configuration in the UI produces the same search to 
Documentum, regardless of how the configuration was arrived at.

If the selected items in the Content Types list is treated as the only set of 
files to fetch (i,.e. the initial unconstrained search is considered incorrect 
here) then I guess I might also like to have flexibility to fetch file types 
not on the checklist in the Content Types tab.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Commented] (CONNECTORS-1512) Documentum Connector throws NPE when a job is run after a document is deleted from the repository

2018-07-02 Thread James Thomas (JIRA)


[ 
https://issues.apache.org/jira/browse/CONNECTORS-1512?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16530838#comment-16530838
 ] 

James Thomas commented on CONNECTORS-1512:
--

Thanks, yes, I highlighted the fact that it was the same lines because it was 
unexpected.

I believe that I have now got the patch deployed and the DM and MF servers 
running without NPE through my repro.

In case it's useful data, I think the build and deploy of the patch were 
actually OK. The things that got in my way were configuration:

* setting the MCF_HOME environment variable
* setting the DOCUMENTUM environment variable
* editing my CLASSPATH workaround (see earlier in this thread) for the 
Documentum scripts

My inexperience with your software is a barrier for me here. Thanks again for 
your help.

> Documentum Connector throws NPE when a job is run after a document is deleted 
> from the repository
> -
>
> Key: CONNECTORS-1512
> URL: https://issues.apache.org/jira/browse/CONNECTORS-1512
> Project: ManifoldCF
>  Issue Type: Bug
>  Components: Documentum connector
>Affects Versions: ManifoldCF 2.8.1, ManifoldCF 2.10
> Environment: Documentum 16.4
> Manifold 2.10, using dfc jars 16.4
> Manifold server machine details (although also seen on a Windows-based MF 
> 2.8.1):
> {code:java}
> $ hostnamectl
>  ...
> Chassis: vm
> Virtualization: kvm
> Operating System: CentOS Linux 7 (Core)
> CPE OS Name: cpe:/o:centos:centos:7
> Kernel: Linux 3.10.0-862.2.3.el7.x86_64
>  Architecture: x86-64
> $ java -version
> openjdk version "1.8.0_171"
> OpenJDK Runtime Environment (build 1.8.0_171-b10)
> OpenJDK 64-Bit Server VM (build 25.171-b10, mixed mode)
> {code}
> I have the following entry in properties.xml for verbose logging from 
> connectors:
> {code:java}
> 
> {code}
>Reporter: James Thomas
>Assignee: Karl Wright
>Priority: Major
> Fix For: ManifoldCF 2.11
>
> Attachments: CONNECTORS-1512.patch, repro_delete.log
>
>
> I see an NPE from the Documentum Connector when I re-run a job after deleting 
> a file from the Documentum repository using the Documentum Administrator:
> {code:java}
> FATAL 2018-06-29T08:36:54,847 (Worker thread '5') - Error tossed: null
> java.lang.NullPointerException
> at 
> org.apache.manifoldcf.crawler.common.DCTM.DocumentumObjectImpl.getContentSize(DocumentumObjectImpl.java:181)
>  ~[?:?]
> ...
>   at 
> org.apache.manifoldcf.crawler.connectors.DCTM.DCTM$ProcessDocumentThread.run(DCTM.java:1399)
>  ~[?:?]
> {code}
> The job then appears not to terminate in the Manifold UI.  If the job is not 
> aborted, the NPE will show again after restarting the Manifold server and 
> Documentum Connectors.
> I expect the connectors to handle deletion of source in the repository, 
> passing the appropriate status to the output connector, and not failing like 
> this.
> Reproduce:
> * Configure Manifold to use Documentum Connectors to talk to a Documentum 
> instance
> * Sanity check your configuration by 
> ** create a directory in Documentum, D, with two files F, G
> ** create a job in Manifold to take from D and write to the file system in 
> directory L
> ** run the job and observe that F, G appear in L
> * delete G from D in Documentum 
> ** in Documentum Administrator, right click on G and choose Delete, then OK
> * re-run your job
> * observe that the job doesn't end
> * inspect L 
> * observe that F and G are both still present
> * inspect the Manifold logs
> * observe NPE in the logs
>  
> In the log file I'll attach,  F and G are Book1.xlsx and Book1.xls



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Reopened] (CONNECTORS-1512) Documentum Connector throws NPE when a job is run after a document is deleted from the repository

2018-07-02 Thread James Thomas (JIRA)


 [ 
https://issues.apache.org/jira/browse/CONNECTORS-1512?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

James Thomas reopened CONNECTORS-1512:
--

I have just reported that I still see the issue after applying the proposed 
patch.

> Documentum Connector throws NPE when a job is run after a document is deleted 
> from the repository
> -
>
> Key: CONNECTORS-1512
> URL: https://issues.apache.org/jira/browse/CONNECTORS-1512
> Project: ManifoldCF
>  Issue Type: Bug
>  Components: Documentum connector
>Affects Versions: ManifoldCF 2.8.1, ManifoldCF 2.10
> Environment: Documentum 16.4
> Manifold 2.10, using dfc jars 16.4
> Manifold server machine details (although also seen on a Windows-based MF 
> 2.8.1):
> {code:java}
> $ hostnamectl
>  ...
> Chassis: vm
> Virtualization: kvm
> Operating System: CentOS Linux 7 (Core)
> CPE OS Name: cpe:/o:centos:centos:7
> Kernel: Linux 3.10.0-862.2.3.el7.x86_64
>  Architecture: x86-64
> $ java -version
> openjdk version "1.8.0_171"
> OpenJDK Runtime Environment (build 1.8.0_171-b10)
> OpenJDK 64-Bit Server VM (build 25.171-b10, mixed mode)
> {code}
> I have the following entry in properties.xml for verbose logging from 
> connectors:
> {code:java}
> 
> {code}
>Reporter: James Thomas
>Assignee: Karl Wright
>Priority: Major
> Fix For: ManifoldCF 2.11
>
> Attachments: CONNECTORS-1512.patch, repro_delete.log
>
>
> I see an NPE from the Documentum Connector when I re-run a job after deleting 
> a file from the Documentum repository using the Documentum Administrator:
> {code:java}
> FATAL 2018-06-29T08:36:54,847 (Worker thread '5') - Error tossed: null
> java.lang.NullPointerException
> at 
> org.apache.manifoldcf.crawler.common.DCTM.DocumentumObjectImpl.getContentSize(DocumentumObjectImpl.java:181)
>  ~[?:?]
> ...
>   at 
> org.apache.manifoldcf.crawler.connectors.DCTM.DCTM$ProcessDocumentThread.run(DCTM.java:1399)
>  ~[?:?]
> {code}
> The job then appears not to terminate in the Manifold UI.  If the job is not 
> aborted, the NPE will show again after restarting the Manifold server and 
> Documentum Connectors.
> I expect the connectors to handle deletion of source in the repository, 
> passing the appropriate status to the output connector, and not failing like 
> this.
> Reproduce:
> * Configure Manifold to use Documentum Connectors to talk to a Documentum 
> instance
> * Sanity check your configuration by 
> ** create a directory in Documentum, D, with two files F, G
> ** create a job in Manifold to take from D and write to the file system in 
> directory L
> ** run the job and observe that F, G appear in L
> * delete G from D in Documentum 
> ** in Documentum Administrator, right click on G and choose Delete, then OK
> * re-run your job
> * observe that the job doesn't end
> * inspect L 
> * observe that F and G are both still present
> * inspect the Manifold logs
> * observe NPE in the logs
>  
> In the log file I'll attach,  F and G are Book1.xlsx and Book1.xls



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Commented] (CONNECTORS-1512) Documentum Connector throws NPE when a job is run after a document is deleted from the repository

2018-07-02 Thread James Thomas (JIRA)


[ 
https://issues.apache.org/jira/browse/CONNECTORS-1512?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16530388#comment-16530388
 ] 

James Thomas commented on CONNECTORS-1512:
--

You were right about line endings. I ran dos2unix on the two files that you 
edited and was then able to patch and build.

These are the changed files (where dist.vanilla is the 2.10 build without the 
patch):

{code:java}
$ diff -r dist dist.vanilla/
Binary files dist/connector-lib/mcf-documentum-connector.jar and 
dist.vanilla/connector-lib/mcf-documentum-connector.jar differ
Binary files dist/connector-lib/mcf-documentum-connector-rmistub.jar and 
dist.vanilla/connector-lib/mcf-documentum-connector-rmistub.jar differ
Binary files dist/connector-lib/mcf-filenet-connector-rmistub.jar and 
dist.vanilla/connector-lib/mcf-filenet-connector-rmistub.jar differ
Binary files dist/connector-lib/mcf-meridio-connector.jar and 
dist.vanilla/connector-lib/mcf-meridio-connector.jar differ
Binary files dist/connector-lib/mcf-sharepoint-connector.jar and 
dist.vanilla/connector-lib/mcf-sharepoint-connector.jar differ
Binary files 
dist/processes/documentum-server/lib/mcf-documentum-connector-implementation.jar
 and 
dist.vanilla/processes/documentum-server/lib/mcf-documentum-connector-implementation.jar
 differ
Binary files 
dist/processes/documentum-server/lib/mcf-documentum-connector-rmiskel.jar and 
dist.vanilla/processes/documentum-server/lib/mcf-documentum-connector-rmiskel.jar
 differ
Binary files 
dist/processes/filenet-server/lib/mcf-filenet-connector-rmiskel.jar and 
dist.vanilla/processes/filenet-server/lib/mcf-filenet-connector-rmiskel.jar 
differ
Binary files dist/web/war/mcf-crawler-ui.war and 
dist.vanilla/web/war/mcf-crawler-ui.war differ
Binary files dist/web-proprietary/war/mcf-crawler-ui.war and 
dist.vanilla/web-proprietary/war/mcf-crawler-ui.war differ
{code}

I zipped up these files and deployed them into a copy of my 2.10 instance, 
removed the dbname files (to clear any bad history) and restarted Manfiold and 
the Documentum connector servers.

I re-ran my repro steps and am still seeing an NPE at the same lines in 
DocumentumObjectImpl.java, DCTM.java at the same point:

{code:java}
FATAL 2018-07-02T20:46:25,935 (Worker thread '13') - Error tossed: null
java.lang.NullPointerException
at 
org.apache.manifoldcf.crawler.common.DCTM.DocumentumObjectImpl.getContentSize(DocumentumObjectImpl.java:181)
 ~[?:?]
...
at 
org.apache.manifoldcf.crawler.connectors.DCTM.DCTM$ProcessDocumentThread.run(DCTM.java:1399)
 ~[?:?]
{code}

What information can I provide you with to help diagnose further or check my 
build?

For example, as a sanity check I can see that DocumentumtImpl.java is changed 
in the patched deployment:

{code:java}
$ unzip -l 
apache-manifoldcf-2.10/processes/documentum-server/lib/mcf-documentum-connector-implementation.jar
Archive:  
apache-manifoldcf-2.10/processes/documentum-server/lib/mcf-documentum-connector-implementation.jar
  Length  DateTimeName
-  -- -   
0  04-15-2018 11:43   META-INF/
  103  04-15-2018 11:43   META-INF/MANIFEST.MF
0  04-15-2018 11:43   org/
0  04-15-2018 11:43   org/apache/
0  04-15-2018 11:43   org/apache/manifoldcf/
0  04-15-2018 11:43   org/apache/manifoldcf/crawler/
0  04-15-2018 11:43   org/apache/manifoldcf/crawler/common/
0  04-15-2018 11:43   org/apache/manifoldcf/crawler/common/DCTM/
 1126  04-15-2018 11:43   
org/apache/manifoldcf/crawler/common/DCTM/DocumentumFactoryImpl.class
10676  04-15-2018 11:43   
org/apache/manifoldcf/crawler/common/DCTM/DocumentumImpl.class
 9555  04-15-2018 11:43   
org/apache/manifoldcf/crawler/common/DCTM/DocumentumObjectImpl.class
 2487  04-15-2018 11:43   
org/apache/manifoldcf/crawler/common/DCTM/DocumentumResultImpl.class
- ---
23947 12 files

$ unzip -l 
apache-manifoldcf-2.10_1512/processes/documentum-server/lib/mcf-documentum-connector-implementation.jar
Archive:  
apache-manifoldcf-2.10_1512/processes/documentum-server/lib/mcf-documentum-connector-implementation.jar
  Length  DateTimeName
-  -- -   
0  07-02-2018 10:25   META-INF/
  105  07-02-2018 10:25   META-INF/MANIFEST.MF
0  07-01-2018 11:15   org/
0  07-01-2018 11:15   org/apache/
0  07-01-2018 11:15   org/apache/manifoldcf/
0  07-01-2018 11:15   org/apache/manifoldcf/crawler/
0  07-01-2018 11:15   org/apache/manifoldcf/crawler/common/
0  07-01-2018 11:15   org/apache/manifoldcf/crawler/common/DCTM/
 1126  07-01-2018 11:15   
org/apache/manifoldcf/crawler/common/DCTM/DocumentumFactoryImpl.class
10858  07-02-2018 10:25   
org/apache/manifoldcf/crawler/common/DCTM/DocumentumImpl.class
 9555  07-01-2018 11:15   

[jira] [Commented] (CONNECTORS-1512) Documentum Connector throws NPE when a job is run after a document is deleted from the repository

2018-07-01 Thread James Thomas (JIRA)


[ 
https://issues.apache.org/jira/browse/CONNECTORS-1512?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16529047#comment-16529047
 ] 

James Thomas commented on CONNECTORS-1512:
--

I have followed your steps (without patching) and apparently successfully built 
- I got a bunch of jars in the dist directory.

These are the tarballs I used:

{code:java}
$ wget 
http://apache.mirror.anlx.net/manifoldcf/apache-manifoldcf-2.10/apache-manifoldcf-2.10-src.tar.gz
$  wget 
http://mirrors.ukfast.co.uk/sites/ftp.apache.org/manifoldcf/apache-manifoldcf-2.10/apache-manifoldcf-2.10-lib.tar.gz
{code}

I then tried to patch, but it's not applying:

{code:java}
$  wget 
https://issues.apache.org/jira/secure/attachment/12929706/CONNECTORS-1512.patch
--2018-07-01 11:36:34--  
https://issues.apache.org/jira/secure/attachment/12929706/CONNECTORS-1512.patch
Resolving issues.apache.org... 207.244.88.139
Connecting to issues.apache.org|207.244.88.139|:443... connected.
HTTP request sent, awaiting response... 200 200
Length: 2562 (2.5K) [text/plain]
Saving to: “CONNECTORS-1512.patch”

100%[>]
 2,562   --.-K/s   in 0s

2018-07-01 11:36:34 (36.0 MB/s) - “CONNECTORS-1512.patch” saved [2562/2562]

$  patch -p0 < CONNECTORS-1512.patch
(Stripping trailing CRs from patch.)
patching file 
connectors/documentum/connector/src/main/java/org/apache/manifoldcf/crawler/connectors/DCTM/DCTM.java
Hunk #1 FAILED at 1396.
Hunk #2 FAILED at 1463.
2 out of 2 hunks FAILED -- saving rejects to file 
connectors/documentum/connector/src/main/java/org/apache/manifoldcf/crawler/connectors/DCTM/DCTM.java.rej
(Stripping trailing CRs from patch.)
patching file 
connectors/documentum/implementation/src/main/java/org/apache/manifoldcf/crawler/common/DCTM/DocumentumImpl.java
Hunk #1 FAILED at 331.
1 out of 1 hunk FAILED -- saving rejects to file 
connectors/documentum/implementation/src/main/java/org/apache/manifoldcf/crawler/common/DCTM/DocumentumImpl.java.rej

$  md5sum 
connectors/documentum/connector/src/main/java/org/apache/manifoldcf/crawler/connectors/DCTM/DCTM.java
a8669dbba38ee038353b1ed0b0bb4302  
connectors/documentum/connector/src/main/java/org/apache/manifoldcf/crawler/connectors/DCTM/DCTM.java

$  md5sum 
connectors/documentum/implementation/src/main/java/org/apache/manifoldcf/crawler/common/DCTM/DocumentumImpl.java
303bee1eaf32ddb871c8fa4c4db636a7  
connectors/documentum/implementation/src/main/java/org/apache/manifoldcf/crawler/common/DCTM/DocumentumImpl.java
{code}

I wonder whether you're working from a revision that isn't quite the 2.10 
distro? Or am I doing something obviously wrong?

I tried -a few things with patch: -merge, -fuzz etc but I'm reluctant to monkey 
around with your patch too much.

P.S. I found some inconsistency in README between the src and lib distros. 
Where should I report that?




> Documentum Connector throws NPE when a job is run after a document is deleted 
> from the repository
> -
>
> Key: CONNECTORS-1512
> URL: https://issues.apache.org/jira/browse/CONNECTORS-1512
> Project: ManifoldCF
>  Issue Type: Bug
>  Components: Documentum connector
>Affects Versions: ManifoldCF 2.8.1, ManifoldCF 2.10
> Environment: Documentum 16.4
> Manifold 2.10, using dfc jars 16.4
> Manifold server machine details (although also seen on a Windows-based MF 
> 2.8.1):
> {code:java}
> $ hostnamectl
>  ...
> Chassis: vm
> Virtualization: kvm
> Operating System: CentOS Linux 7 (Core)
> CPE OS Name: cpe:/o:centos:centos:7
> Kernel: Linux 3.10.0-862.2.3.el7.x86_64
>  Architecture: x86-64
> $ java -version
> openjdk version "1.8.0_171"
> OpenJDK Runtime Environment (build 1.8.0_171-b10)
> OpenJDK 64-Bit Server VM (build 25.171-b10, mixed mode)
> {code}
> I have the following entry in properties.xml for verbose logging from 
> connectors:
> {code:java}
> 
> {code}
>Reporter: James Thomas
>Assignee: Karl Wright
>Priority: Major
> Fix For: ManifoldCF 2.11
>
> Attachments: CONNECTORS-1512.patch, repro_delete.log
>
>
> I see an NPE from the Documentum Connector when I re-run a job after deleting 
> a file from the Documentum repository using the Documentum Administrator:
> {code:java}
> FATAL 2018-06-29T08:36:54,847 (Worker thread '5') - Error tossed: null
> java.lang.NullPointerException
> at 
> org.apache.manifoldcf.crawler.common.DCTM.DocumentumObjectImpl.getContentSize(DocumentumObjectImpl.java:181)
>  ~[?:?]
> ...
>   at 
> org.apache.manifoldcf.crawler.connectors.DCTM.DCTM$ProcessDocumentThread.run(DCTM.java:1399)
>  ~[?:?]
> {code}
> The job then appears not to terminate in the Manifold UI.  

[jira] [Commented] (CONNECTORS-1512) Documentum Connector throws NPE when a job is run after a document is deleted from the repository

2018-06-29 Thread James Thomas (JIRA)


[ 
https://issues.apache.org/jira/browse/CONNECTORS-1512?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16527468#comment-16527468
 ] 

James Thomas commented on CONNECTORS-1512:
--

I downloaded 2.10 from 
http://www.apache.org/dyn/closer.lua/manifoldcf/apache-manifoldcf-2.10/apache-manifoldcf-2.10-bin.tar.gz
 and uncompressed it.

I'm running from the command line in the example directory using 

{code:java}
java -jar start.jar
{code}

I took the DFC distro for DM 16.4 from Open Text's forums and unpacked the 
relevant files into processes/documentum-server/lib-proprietary

I found that I needed to set CLASSPATH to get the DM connectors to work so I've 
edited the supplied run.sh scripts to do this by adding this line immediately 
before calls to Java, effectively adding the location of dfc.properties to the 
classpath:

{code:java}
CLASSPATH=$CLASSPATH:/opt/manifold/apache-manifoldcf-2.10/processes/documentum-server
{code}

So I've done no building from source. If it was possible to pick up a nightly 
build of the DM connector jars only (I guess?) I'd like to do that.

If you need other specific information please say. 



> Documentum Connector throws NPE when a job is run after a document is deleted 
> from the repository
> -
>
> Key: CONNECTORS-1512
> URL: https://issues.apache.org/jira/browse/CONNECTORS-1512
> Project: ManifoldCF
>  Issue Type: Bug
>  Components: Documentum connector
>Affects Versions: ManifoldCF 2.8.1, ManifoldCF 2.10
> Environment: Documentum 16.4
> Manifold 2.10, using dfc jars 16.4
> Manifold server machine details (although also seen on a Windows-based MF 
> 2.8.1):
> {code:java}
> $ hostnamectl
>  ...
> Chassis: vm
> Virtualization: kvm
> Operating System: CentOS Linux 7 (Core)
> CPE OS Name: cpe:/o:centos:centos:7
> Kernel: Linux 3.10.0-862.2.3.el7.x86_64
>  Architecture: x86-64
> $ java -version
> openjdk version "1.8.0_171"
> OpenJDK Runtime Environment (build 1.8.0_171-b10)
> OpenJDK 64-Bit Server VM (build 25.171-b10, mixed mode)
> {code}
> I have the following entry in properties.xml for verbose logging from 
> connectors:
> {code:java}
> 
> {code}
>Reporter: James Thomas
>Assignee: Karl Wright
>Priority: Major
> Fix For: ManifoldCF 2.11
>
> Attachments: CONNECTORS-1512.patch, repro_delete.log
>
>
> I see an NPE from the Documentum Connector when I re-run a job after deleting 
> a file from the Documentum repository using the Documentum Administrator:
> {code:java}
> FATAL 2018-06-29T08:36:54,847 (Worker thread '5') - Error tossed: null
> java.lang.NullPointerException
> at 
> org.apache.manifoldcf.crawler.common.DCTM.DocumentumObjectImpl.getContentSize(DocumentumObjectImpl.java:181)
>  ~[?:?]
> ...
>   at 
> org.apache.manifoldcf.crawler.connectors.DCTM.DCTM$ProcessDocumentThread.run(DCTM.java:1399)
>  ~[?:?]
> {code}
> The job then appears not to terminate in the Manifold UI.  If the job is not 
> aborted, the NPE will show again after restarting the Manifold server and 
> Documentum Connectors.
> I expect the connectors to handle deletion of source in the repository, 
> passing the appropriate status to the output connector, and not failing like 
> this.
> Reproduce:
> * Configure Manifold to use Documentum Connectors to talk to a Documentum 
> instance
> * Sanity check your configuration by 
> ** create a directory in Documentum, D, with two files F, G
> ** create a job in Manifold to take from D and write to the file system in 
> directory L
> ** run the job and observe that F, G appear in L
> * delete G from D in Documentum 
> ** in Documentum Administrator, right click on G and choose Delete, then OK
> * re-run your job
> * observe that the job doesn't end
> * inspect L 
> * observe that F and G are both still present
> * inspect the Manifold logs
> * observe NPE in the logs
>  
> In the log file I'll attach,  F and G are Book1.xlsx and Book1.xls



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Commented] (CONNECTORS-1512) Documentum Connector throws NPE when a job is run after a document is deleted from the repository

2018-06-29 Thread James Thomas (JIRA)


[ 
https://issues.apache.org/jira/browse/CONNECTORS-1512?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16527458#comment-16527458
 ] 

James Thomas commented on CONNECTORS-1512:
--

Thanks for the quick response (y) 

I'm pretty new to Manifold. Where can I get a build containing your fix, and 
what do I need to do to install it?


> Documentum Connector throws NPE when a job is run after a document is deleted 
> from the repository
> -
>
> Key: CONNECTORS-1512
> URL: https://issues.apache.org/jira/browse/CONNECTORS-1512
> Project: ManifoldCF
>  Issue Type: Bug
>  Components: Documentum connector
>Affects Versions: ManifoldCF 2.8.1, ManifoldCF 2.10
> Environment: Documentum 16.4
> Manifold 2.10, using dfc jars 16.4
> Manifold server machine details (although also seen on a Windows-based MF 
> 2.8.1):
> {code:java}
> $ hostnamectl
>  ...
> Chassis: vm
> Virtualization: kvm
> Operating System: CentOS Linux 7 (Core)
> CPE OS Name: cpe:/o:centos:centos:7
> Kernel: Linux 3.10.0-862.2.3.el7.x86_64
>  Architecture: x86-64
> $ java -version
> openjdk version "1.8.0_171"
> OpenJDK Runtime Environment (build 1.8.0_171-b10)
> OpenJDK 64-Bit Server VM (build 25.171-b10, mixed mode)
> {code}
> I have the following entry in properties.xml for verbose logging from 
> connectors:
> {code:java}
> 
> {code}
>Reporter: James Thomas
>Assignee: Karl Wright
>Priority: Major
> Fix For: ManifoldCF 2.11
>
> Attachments: CONNECTORS-1512.patch, repro_delete.log
>
>
> I see an NPE from the Documentum Connector when I re-run a job after deleting 
> a file from the Documentum repository using the Documentum Administrator:
> {code:java}
> FATAL 2018-06-29T08:36:54,847 (Worker thread '5') - Error tossed: null
> java.lang.NullPointerException
> at 
> org.apache.manifoldcf.crawler.common.DCTM.DocumentumObjectImpl.getContentSize(DocumentumObjectImpl.java:181)
>  ~[?:?]
> ...
>   at 
> org.apache.manifoldcf.crawler.connectors.DCTM.DCTM$ProcessDocumentThread.run(DCTM.java:1399)
>  ~[?:?]
> {code}
> The job then appears not to terminate in the Manifold UI.  If the job is not 
> aborted, the NPE will show again after restarting the Manifold server and 
> Documentum Connectors.
> I expect the connectors to handle deletion of source in the repository, 
> passing the appropriate status to the output connector, and not failing like 
> this.
> Reproduce:
> * Configure Manifold to use Documentum Connectors to talk to a Documentum 
> instance
> * Sanity check your configuration by 
> ** create a directory in Documentum, D, with two files F, G
> ** create a job in Manifold to take from D and write to the file system in 
> directory L
> ** run the job and observe that F, G appear in L
> * delete G from D in Documentum 
> ** in Documentum Administrator, right click on G and choose Delete, then OK
> * re-run your job
> * observe that the job doesn't end
> * inspect L 
> * observe that F and G are both still present
> * inspect the Manifold logs
> * observe NPE in the logs
>  
> In the log file I'll attach,  F and G are Book1.xlsx and Book1.xls



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Created] (CONNECTORS-1512) Documentum Connector throws NPE when a job is run after a document is deleted from the repository

2018-06-29 Thread James Thomas (JIRA)
James Thomas created CONNECTORS-1512:


 Summary: Documentum Connector throws NPE when a job is run after a 
document is deleted from the repository
 Key: CONNECTORS-1512
 URL: https://issues.apache.org/jira/browse/CONNECTORS-1512
 Project: ManifoldCF
  Issue Type: Bug
  Components: Documentum connector
Affects Versions: ManifoldCF 2.10, ManifoldCF 2.8.1
 Environment: Documentum 16.4
Manifold 2.10, using dfc jars 16.4
Manifold server machine details (although also seen on a Windows-based MF 
2.8.1):

{code:java}
$ hostnamectl
 ...
Chassis: vm
Virtualization: kvm
Operating System: CentOS Linux 7 (Core)
CPE OS Name: cpe:/o:centos:centos:7
Kernel: Linux 3.10.0-862.2.3.el7.x86_64
 Architecture: x86-64

$ java -version
openjdk version "1.8.0_171"
OpenJDK Runtime Environment (build 1.8.0_171-b10)
OpenJDK 64-Bit Server VM (build 25.171-b10, mixed mode)
{code}

I have the following entry in properties.xml for verbose logging from 
connectors:

{code:java}

{code}


Reporter: James Thomas
 Attachments: repro_delete.log

I see an NPE from the Documentum Connector when I re-run a job after deleting a 
file from the Documentum repository using the Documentum Administrator:

{code:java}
FATAL 2018-06-29T08:36:54,847 (Worker thread '5') - Error tossed: null
java.lang.NullPointerException
at 
org.apache.manifoldcf.crawler.common.DCTM.DocumentumObjectImpl.getContentSize(DocumentumObjectImpl.java:181)
 ~[?:?]
...
  at 
org.apache.manifoldcf.crawler.connectors.DCTM.DCTM$ProcessDocumentThread.run(DCTM.java:1399)
 ~[?:?]
{code}

The job then appears not to terminate in the Manifold UI.  If the job is not 
aborted, the NPE will show again after restarting the Manifold server and 
Documentum Connectors.

I expect the connectors to handle deletion of source in the repository, passing 
the appropriate status to the output connector, and not failing like this.

Reproduce:

* Configure Manifold to use Documentum Connectors to talk to a Documentum 
instance

* Sanity check your configuration by 
** create a directory in Documentum, D, with two files F, G
** create a job in Manifold to take from D and write to the file system in 
directory L
** run the job and observe that F, G appear in L

* delete G from D in Documentum 
** in Documentum Administrator, right click on G and choose Delete, then OK

* re-run your job
* observe that the job doesn't end
* inspect L 
* observe that F and G are both still present
* inspect the Manifold logs
* observe NPE in the logs
 
In the log file I'll attach,  F and G are Book1.xlsx and Book1.xls



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)