[jira] [Issue Comment Edited] (CONNECTORS-347) MySQL HopCount test sometimes fails

2012-04-20 Thread Karl Wright (Issue Comment Edited) (JIRA)

[ 
https://issues.apache.org/jira/browse/CONNECTORS-347?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13258202#comment-13258202
 ] 

Karl Wright edited comment on CONNECTORS-347 at 4/20/12 12:58 PM:
--

Here's a postmortem for a run that succeeded:

{code}
mysql select parentidhash,distance from hopcount;
+--+--+
| parentidhash | distance |
+--+--+
| 5432CFFCE5729BF1F3E695FFCF83B08E415694C6 |0 |
| 718564BC3D0C94B0F88B36B7FA2FC915F665A5E2 |3 |
| 1D9DB3181447F82677FB9AB91C56F33B1A2C75B0 |2 |
| B622AAB99E7BFF749E2C93E01009C3E02F90693C |2 |
| 9F6F59234CC36DF341184FAEC934BEEE516E1E60 |1 |
| CD31925B98C4B481F6523030A1C7C9D9DF0CC080 |1 |
| 1B26DBA4653B26B3151433EBB4C1CDCB8358C1F2 |1 |
+--+--+
7 rows in set (0.00 sec)

mysql select parentidhash,childidhash from intrinsiclink;
+--+--+
| parentidhash | childidhash  |
+--+--+
| 1B26DBA4653B26B3151433EBB4C1CDCB8358C1F2 | 
5432CFFCE5729BF1F3E695FFCF83B08E415694C6 |
| 1D9DB3181447F82677FB9AB91C56F33B1A2C75B0 | 
1B26DBA4653B26B3151433EBB4C1CDCB8358C1F2 |
| 5432CFFCE5729BF1F3E695FFCF83B08E415694C6 |  |
| 718564BC3D0C94B0F88B36B7FA2FC915F665A5E2 | 
1D9DB3181447F82677FB9AB91C56F33B1A2C75B0 |
| 9F6F59234CC36DF341184FAEC934BEEE516E1E60 | 
5432CFFCE5729BF1F3E695FFCF83B08E415694C6 |
| B622AAB99E7BFF749E2C93E01009C3E02F90693C | 
1B26DBA4653B26B3151433EBB4C1CDCB8358C1F2 |
| CD31925B98C4B481F6523030A1C7C9D9DF0CC080 | 
5432CFFCE5729BF1F3E695FFCF83B08E415694C6 |
+--+--+
7 rows in set (0.00 sec)

mysql select dochash,docid from jobqueue;
+--++
| dochash  | docid  
  |
+--++
| 5432CFFCE5729BF1F3E695FFCF83B08E415694C6 | 
C:\wip\mcf\trunk\tests\filesystem\test-mysql-output\testdata   |
| B622AAB99E7BFF749E2C93E01009C3E02F90693C | 
C:\wip\mcf\trunk\tests\filesystem\test-mysql-output\testdata\testdir\test3.txt |
| 1D9DB3181447F82677FB9AB91C56F33B1A2C75B0 | 
C:\wip\mcf\trunk\tests\filesystem\test-mysql-output\testdata\testdir\seconddir |
| CD31925B98C4B481F6523030A1C7C9D9DF0CC080 | 
C:\wip\mcf\trunk\tests\filesystem\test-mysql-output\testdata\test1.txt |
| 9F6F59234CC36DF341184FAEC934BEEE516E1E60 | 
C:\wip\mcf\trunk\tests\filesystem\test-mysql-output\testdata\test2.txt |
| 1B26DBA4653B26B3151433EBB4C1CDCB8358C1F2 | 
C:\wip\mcf\trunk\tests\filesystem\test-mysql-output\testdata\testdir   |
+--++
6 rows in set (0.00 sec)
{code}

Even in this one, the jobqueue table is inconsistent with the hopcount and 
intrinsiclink tables.  They should all be consistent.



  was (Author: kwri...@metacarta.com):
Here's a postmortem for a run that succeeded:

{code}
mysql select parentidhash,distance from hopcount;
+--+--+
| parentidhash | distance |
+--+--+
| 5432CFFCE5729BF1F3E695FFCF83B08E415694C6 |0 |
| 718564BC3D0C94B0F88B36B7FA2FC915F665A5E2 |3 |
| 1D9DB3181447F82677FB9AB91C56F33B1A2C75B0 |2 |
| B622AAB99E7BFF749E2C93E01009C3E02F90693C |2 |
| 9F6F59234CC36DF341184FAEC934BEEE516E1E60 |1 |
| CD31925B98C4B481F6523030A1C7C9D9DF0CC080 |1 |
| 1B26DBA4653B26B3151433EBB4C1CDCB8358C1F2 |1 |
+--+--+
7 rows in set (0.00 sec)

mysql select parentidhash,childidhash from intrinsiclink;
+--+--+
| parentidhash | childidhash  |
+--+--+
| 1B26DBA4653B26B3151433EBB4C1CDCB8358C1F2 | 
5432CFFCE5729BF1F3E695FFCF83B08E415694C6 |
| 1D9DB3181447F82677FB9AB91C56F33B1A2C75B0 | 
1B26DBA4653B26B3151433EBB4C1CDCB8358C1F2 |
| 5432CFFCE5729BF1F3E695FFCF83B08E415694C6 |  |
| 718564BC3D0C94B0F88B36B7FA2FC915F665A5E2 | 
1D9DB3181447F82677FB9AB91C56F33B1A2C75B0 |
| 9F6F59234CC36DF341184FAEC934BEEE516E1E60 | 

[jira] [Issue Comment Edited] (CONNECTORS-347) MySQL HopCount test sometimes fails

2012-04-20 Thread Karl Wright (Issue Comment Edited) (JIRA)

[ 
https://issues.apache.org/jira/browse/CONNECTORS-347?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13258237#comment-13258237
 ] 

Karl Wright edited comment on CONNECTORS-347 at 4/20/12 1:27 PM:
-

A postgresql run is as follows:

{code}
testdb=# select dochash,docid from jobqueue;
 dochash  |   docid
--+-
 EE6C0ED002CE9303183FEFEF111D6598203F6295 | 
C:\wip\mcf\trunk\tests\filesystem\test-postgresql-output\testdata
 E7DE0B8D3F153CECFD3C57B4D83E59073EC4DD60 | 
C:\wip\mcf\trunk\tests\filesystem\test-postgresql-output\testdata\testdir
 181B9C9FB7F62D24461967391B73FF7CB0B14CD9 | 
C:\wip\mcf\trunk\tests\filesystem\test-postgresql-output\testdata\test2.txt
 24081AC0B03B2394F38D3EB78A3BEDB6CCF1D981 | 
C:\wip\mcf\trunk\tests\filesystem\test-postgresql-output\testdata\test1.txt
 5AA0DB8B2B7D267FAD5E3E0E8926996DEFC4C666 | 
C:\wip\mcf\trunk\tests\filesystem\test-postgresql-output\testdata\testdir\seconddir
 4ED8AFCC917B604C1EE4C83458DB9B7265967F40 | 
C:\wip\mcf\trunk\tests\filesystem\test-postgresql-output\testdata\testdir\test3.txt
(6 rows)


testdb=# select parentidhash, childidhash from intrinsiclink;
   parentidhash   |   childidhash

--+--
 EE6C0ED002CE9303183FEFEF111D6598203F6295 |
 181B9C9FB7F62D24461967391B73FF7CB0B14CD9 | 
EE6C0ED002CE9303183FEFEF111D6598203F6295
 24081AC0B03B2394F38D3EB78A3BEDB6CCF1D981 | 
EE6C0ED002CE9303183FEFEF111D6598203F6295
 E7DE0B8D3F153CECFD3C57B4D83E59073EC4DD60 | 
EE6C0ED002CE9303183FEFEF111D6598203F6295
 4ED8AFCC917B604C1EE4C83458DB9B7265967F40 | 
E7DE0B8D3F153CECFD3C57B4D83E59073EC4DD60
 5AA0DB8B2B7D267FAD5E3E0E8926996DEFC4C666 | 
E7DE0B8D3F153CECFD3C57B4D83E59073EC4DD60
 7013D738BB77ED47763449997384046B20B77832 | 
5AA0DB8B2B7D267FAD5E3E0E8926996DEFC4C666
(7 rows)


testdb=# select parentidhash, distance from hopcount;
   parentidhash   | distance
--+--
 EE6C0ED002CE9303183FEFEF111D6598203F6295 |0
 24081AC0B03B2394F38D3EB78A3BEDB6CCF1D981 |1
 181B9C9FB7F62D24461967391B73FF7CB0B14CD9 |1
 E7DE0B8D3F153CECFD3C57B4D83E59073EC4DD60 |1
 5AA0DB8B2B7D267FAD5E3E0E8926996DEFC4C666 |2
 4ED8AFCC917B604C1EE4C83458DB9B7265967F40 |2
 7013D738BB77ED47763449997384046B20B77832 |3
(7 rows)


testdb=#
{code}


  was (Author: kwri...@metacarta.com):
A postgresql run is as follows:

{code}
testdb=# select dochash,docid from jobqueue;
 dochash  |
   docid
--+-

 EE6C0ED002CE9303183FEFEF111D6598203F6295 | C:\wip\mcf\trunk\tests\filesystem\te
st-postgresql-output\testdata
 E7DE0B8D3F153CECFD3C57B4D83E59073EC4DD60 | C:\wip\mcf\trunk\tests\filesystem\te
st-postgresql-output\testdata\testdir
 181B9C9FB7F62D24461967391B73FF7CB0B14CD9 | C:\wip\mcf\trunk\tests\filesystem\te
st-postgresql-output\testdata\test2.txt
 24081AC0B03B2394F38D3EB78A3BEDB6CCF1D981 | C:\wip\mcf\trunk\tests\filesystem\te
st-postgresql-output\testdata\test1.txt
 5AA0DB8B2B7D267FAD5E3E0E8926996DEFC4C666 | C:\wip\mcf\trunk\tests\filesystem\te
st-postgresql-output\testdata\testdir\seconddir
 4ED8AFCC917B604C1EE4C83458DB9B7265967F40 | C:\wip\mcf\trunk\tests\filesystem\te
st-postgresql-output\testdata\testdir\test3.txt
(6 rows)


testdb=# select parentidhash, childidhash from intrinsiclink;
   parentidhash   |   childidhash

--+-
-
 EE6C0ED002CE9303183FEFEF111D6598203F6295 |
 181B9C9FB7F62D24461967391B73FF7CB0B14CD9 | EE6C0ED002CE9303183FEFEF111D6598203F
6295
 24081AC0B03B2394F38D3EB78A3BEDB6CCF1D981 | EE6C0ED002CE9303183FEFEF111D6598203F
6295
 E7DE0B8D3F153CECFD3C57B4D83E59073EC4DD60 | EE6C0ED002CE9303183FEFEF111D6598203F
6295
 4ED8AFCC917B604C1EE4C83458DB9B7265967F40 | E7DE0B8D3F153CECFD3C57B4D83E59073EC4
DD60
 5AA0DB8B2B7D267FAD5E3E0E8926996DEFC4C666 | E7DE0B8D3F153CECFD3C57B4D83E59073EC4
DD60
 7013D738BB77ED47763449997384046B20B77832 | 5AA0DB8B2B7D267FAD5E3E0E8926996DEFC4
C666
(7 rows)


testdb=# select parentidhash, distance from hopcount;
   parentidhash   | distance
--+--
 EE6C0ED002CE9303183FEFEF111D6598203F6295 |0
 24081AC0B03B2394F38D3EB78A3BEDB6CCF1D981 |1
 181B9C9FB7F62D24461967391B73FF7CB0B14CD9 |1
 E7DE0B8D3F153CECFD3C57B4D83E59073EC4DD60 |1
 5AA0DB8B2B7D267FAD5E3E0E8926996DEFC4C666 |2
 4ED8AFCC917B604C1EE4C83458DB9B7265967F40 

[jira] [Issue Comment Edited] (CONNECTORS-347) MySQL HopCount test sometimes fails

2012-04-19 Thread Karl Wright (Issue Comment Edited) (JIRA)

[ 
https://issues.apache.org/jira/browse/CONNECTORS-347?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13257899#comment-13257899
 ] 

Karl Wright edited comment on CONNECTORS-347 at 4/19/12 11:23 PM:
--

Just saw it again; clearly it is real.
My working hypothesis is that the counts returned by count(x) in the SQL 
statements are in fact not synchronized with the transactions, so maybe the 
count can run behind.  The context in which this happens is an initial crawl 
of six documents, with a hopcount filter.  If the filter is not working the 
count would be 7; with a working filter, it's 6.  But how can we then get 4?


  was (Author: kwri...@metacarta.com):
Just saw it again; clearly it is real.
  
 MySQL HopCount test sometimes fails
 ---

 Key: CONNECTORS-347
 URL: https://issues.apache.org/jira/browse/CONNECTORS-347
 Project: ManifoldCF
  Issue Type: Bug
  Components: Framework core
Affects Versions: ManifoldCF 0.5
Reporter: Karl Wright
Assignee: Karl Wright
 Fix For: ManifoldCF next


 The HopcountMySQLIT test sometimes fails complaining that it is expecting to 
 see 6 documents but only sees 4.
 {code}
 [junit] org.apache.manifoldcf.core.interfaces.ManifoldCFException: Wrong 
 number of documents processed - expected 6, saw 4
 [junit] at 
 org.apache.manifoldcf.filesystem_tests.HopcountMySQLIT.hopcountCheck(HopcountMySQLIT.java:152)
 {code}
 This does not happen on every run.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Issue Comment Edited] (CONNECTORS-347) MySQL HopCount test sometimes fails

2012-04-19 Thread Karl Wright (Issue Comment Edited) (JIRA)

[ 
https://issues.apache.org/jira/browse/CONNECTORS-347?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13257899#comment-13257899
 ] 

Karl Wright edited comment on CONNECTORS-347 at 4/20/12 1:42 AM:
-

Just saw it again; clearly it is real.
My working hypothesis is that the counts returned by count() in the SQL 
statements are in fact not synchronized with the transactions, so maybe the 
count can run behind.  The context in which this happens is an initial crawl 
of six documents, with a hopcount filter.  If the filter is not working the 
count would be 7; with a working filter, it's 6.  But how can we then get 4?


  was (Author: kwri...@metacarta.com):
Just saw it again; clearly it is real.
My working hypothesis is that the counts returned by count(x) in the SQL 
statements are in fact not synchronized with the transactions, so maybe the 
count can run behind.  The context in which this happens is an initial crawl 
of six documents, with a hopcount filter.  If the filter is not working the 
count would be 7; with a working filter, it's 6.  But how can we then get 4?

  
 MySQL HopCount test sometimes fails
 ---

 Key: CONNECTORS-347
 URL: https://issues.apache.org/jira/browse/CONNECTORS-347
 Project: ManifoldCF
  Issue Type: Bug
  Components: Framework core
Affects Versions: ManifoldCF 0.5, ManifoldCF 0.6
Reporter: Karl Wright
Assignee: Karl Wright
 Fix For: ManifoldCF 0.6


 The HopcountMySQLIT test sometimes fails complaining that it is expecting to 
 see 6 documents but only sees 4.
 {code}
 [junit] org.apache.manifoldcf.core.interfaces.ManifoldCFException: Wrong 
 number of documents processed - expected 6, saw 4
 [junit] at 
 org.apache.manifoldcf.filesystem_tests.HopcountMySQLIT.hopcountCheck(HopcountMySQLIT.java:152)
 {code}
 This does not happen on every run.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Issue Comment Edited] (CONNECTORS-460) ManifoldCF authority service doesn't handle multi-domain environments

2012-04-11 Thread Karl Wright (Issue Comment Edited) (JIRA)

[ 
https://issues.apache.org/jira/browse/CONNECTORS-460?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13251493#comment-13251493
 ] 

Karl Wright edited comment on CONNECTORS-460 at 4/11/12 12:06 PM:
--

Hi Colin,

The prefixes are not the issue.  In ManifoldCF, there should be one authority 
that is capable of dealing with all the interacting domains, just like 
SharePoint understands all the domains by talking with just one domain 
controller.  An authority's job is to look up the SIDs for a given username, 
and since your SharePoint instance can do this by talking with only one DC, 
ManifoldCF's active directory authority connector ought to be able to do the 
same thing.

The issue, then, is that the active directory authority connector doesn't know 
how to deal with external, trusted domains.  The authority must locate the 
DC(s) for these domains within LDAP (I believe) and route requests to the 
appropriate DC.  Alternatively, if automatic discovery of the external DC is 
too hard, we can modify the AD authority connector to simply allow the user to 
describe multiple domain controllers.


Does this sound reasonable to you?


  was (Author: kwri...@metacarta.com):
Hi Colin,

The prefixes are not the issue.  In ManifoldCF, there should be one authority 
that is capable of dealing with all the interacting domains, just like 
SharePoint understands all the domains by talking with just one domain 
controller.  An authority's job is to look up the SIDs for a given username, 
and since your SharePoint instance can do this by talking with only one DC, 
ManifoldCF's active directory authority connector ought to be able to do the 
same thing.

The issue, then, is that the active directory authority connector doesn't know 
how to deal with external, trusted domains.  The authority must located the 
DC(s) for these domains within LDAP (I believe) and route requests to the 
appropriate DC.  Alternatively, if automatic discovery of the external DC is 
too hard, we can modify the AD authority connector to simply allow the user to 
describe multiple domain controllers.


Does this sound reasonable to you?

  
 ManifoldCF authority service doesn't handle multi-domain environments
 -

 Key: CONNECTORS-460
 URL: https://issues.apache.org/jira/browse/CONNECTORS-460
 Project: ManifoldCF
  Issue Type: Improvement
  Components: Active Directory authority, Authority Service
 Environment: Two Active Directory domains: {{internal.com}} and 
 {{external.com}}
 I'm indexing a Sharepoint site, where that site has permissions set 
 from_both_domains
Reporter: Colin Anderson
  Labels: active-directory, authorization, security

 The ManifoldCF authority service doesn't handle multi-domain environments.
 The authority service returns a list of SIDs for the specified user, from all 
 available ManifoldCF authorities, for example:
 {{TOKEN:InternalAD:S-1-5-21-1234567890-1234567890-1234567890-1234}}
 Note that the SID is prefixed with the name of the ManifoldCF authority.
 Here is my setup:
 Output connector: Solr
 Authority connector1: Active Directory ({{internal.com}} domain), named 
 {{InternalAD}}
 Authority connector2: Active Directory ({{external.com}} domain), named 
 {{ExternalAD}}
 Repository connector: Sharepoint
 If I set the Sharepoint repository connector to use the authority 'None 
 (Global Authority)', then {{allow_token_document}} will contain SIDs that are 
 _not_ prefixed with any authority name, for example:
 {{S-1-5-21-1234567890-1234567890-1234567890-1234}}
 It is therefore not possible to get any search results, because the authority 
 service tokens will not match the stored tokens (because they _are_ prefixed 
 with authority names).
 If I set the Sharepoint repository connector to use one of the AD authorities 
 'InternalAD', then {{allow_token_document}} will contain SIDs that are 
 prefixed with 'InternalAD', for example:
 {{TOKEN:InternalAD:S-1-5-21-1234567890-1234567890-1234567890-1234}}
 However, the prefix is _always_ 'InternalAD', even if the user/group actually 
 belongs to the {{external.com}} domain. Therefore it is not possible for 
 users in the {{external.com}} domain to get any search results, because the 
 authority service tokens will not match the stored tokens.
 In essence, there seems to be a mismatch between the tokens that the 
 authority service outputs, and those that repository connectors output.
 Perhaps one solution would be to use the authority 'None (Global Authority)', 
 and modify the authority service to take an extra query parameter that 
 prevents it from prefixing SIDs with the authority name.

--
This message is automatically generated by JIRA.
If you think it was sent 

[jira] [Issue Comment Edited] (CONNECTORS-430) An error should be returned if invalid seeds are typed into the seeds list for the web connector

2012-03-27 Thread Karl Wright (Issue Comment Edited) (JIRA)

[ 
https://issues.apache.org/jira/browse/CONNECTORS-430?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13239993#comment-13239993
 ] 

Karl Wright edited comment on CONNECTORS-430 at 3/27/12 10:08 PM:
--

For me it fails:

{code}
   [junit] Multipart posting url 
'http://localhost:8346/mcf-crawler-ui/execute.jsp' with parameters 
'outputname=MyOutputConnectionindex=recrawlinterval=1440description=MyJobstartmethod=2expirationinterval=jobid=1332885945254priority=5reseedinterval=60tabname=Connectionconnectionname=MyRepositoryConnectionschedulerecords=0scheduletype=1type=jobop=Continue'
 and 0 files...
[junit] Traceback (most recent call last):
[junit]   File test.py, line 166, in module
[junit] var124.click()
[junit]   File 
/home/kwright/wip/trunk/tests/webcrawler/test-derby-output/VirtualBrowser.py, 
line 120, in click
[junit] self.get_form( ).execute_javascript_expression( self.onclick )
[junit]   File 
/home/kwright/wip/trunk/tests/webcrawler/test-derby-output/VirtualBrowser.py, 
line 672, in execute_javascript_expression
[junit] return self.window_instance.execute_javascript_expression( 
javascript )
[junit]   File 
/home/kwright/wip/trunk/tests/webcrawler/test-derby-output/VirtualBrowser.py, 
line 920, in execute_javascript_expression
[junit] return tokenstream.evaluate_expr( self.jscontext, HTML )
[junit]   File 
/home/kwright/wip/trunk/tests/webcrawler/test-derby-output/Javascript.py, 
line 1085, in evaluate_expr
[junit] rval = self.evaluate_expr1( context, place, parse_only )
[junit]   File 
/home/kwright/wip/trunk/tests/webcrawler/test-derby-output/Javascript.py, 
line 1120, in evaluate_expr1
[junit] rval = self.evaluate_expr2( context, place, parse_only )
[junit]   File 
/home/kwright/wip/trunk/tests/webcrawler/test-derby-output/Javascript.py, 
line 1164, in evaluate_expr2
[junit] return self.evaluate_expr3( context, place, parse_only )
[junit]   File 
/home/kwright/wip/trunk/tests/webcrawler/test-derby-output/Javascript.py, 
line 1176, in evaluate_expr3
[junit] rval = self.evaluate_expr4( context, place, parse_only )
[junit]   File 
/home/kwright/wip/trunk/tests/webcrawler/test-derby-output/Javascript.py, 
line 1264, in evaluate_expr4
[junit] rval = self.evaluate_expr5( context, place, parse_only )
[junit]   File 
/home/kwright/wip/trunk/tests/webcrawler/test-derby-output/Javascript.py, 
line 1299, in evaluate_expr5
[junit] rval = self.evaluate_expr6( context, place, parse_only )
[junit]   File 
/home/kwright/wip/trunk/tests/webcrawler/test-derby-output/Javascript.py, 
line 1341, in evaluate_expr6
[junit] return self.evaluate_expr7( context, place, parse_only )
[junit]   File 
/home/kwright/wip/trunk/tests/webcrawler/test-derby-output/Javascript.py, 
line 1438, in evaluate_expr7
[junit] return reference_object.call( arguments, context )
[junit]   File 
/home/kwright/wip/trunk/tests/webcrawler/test-derby-output/Javascript.py, 
line 554, in call
[junit] return self.get_referenced_object().call(argset,context)
[junit]   File 
/home/kwright/wip/trunk/tests/webcrawler/test-derby-output/Javascript.py, 
line 186, in call
[junit] response = ts.evaluate_statement( context, method %s % 
self.name )
[junit]   File 
/home/kwright/wip/trunk/tests/webcrawler/test-derby-output/Javascript.py, 
line 752, in evaluate_statement
[junit] result = self.evaluate_statement( newscope, place )
[junit]   File 
/home/kwright/wip/trunk/tests/webcrawler/test-derby-output/Javascript.py, 
line 988, in evaluate_statement
[junit] if self.evaluate_expr( context, place ) == None:
[junit]   File 
/home/kwright/wip/trunk/tests/webcrawler/test-derby-output/Javascript.py, 
line 1085, in evaluate_expr
[junit] rval = self.evaluate_expr1( context, place, parse_only )
[junit]   File 
/home/kwright/wip/trunk/tests/webcrawler/test-derby-output/Javascript.py, 
line 1120, in evaluate_expr1
[junit] rval = self.evaluate_expr2( context, place, parse_only )
[junit]   File 
/home/kwright/wip/trunk/tests/webcrawler/test-derby-output/Javascript.py, 
line 1164, in evaluate_expr2
[junit] return self.evaluate_expr3( context, place, parse_only )
[junit]   File 
/home/kwright/wip/trunk/tests/webcrawler/test-derby-output/Javascript.py, 
line 1176, in evaluate_expr3
[junit] rval = self.evaluate_expr4( context, place, parse_only )
[junit]   File 
/home/kwright/wip/trunk/tests/webcrawler/test-derby-output/Javascript.py, 
line 1264, in evaluate_expr4
[junit] rval = self.evaluate_expr5( context, place, parse_only )
[junit]   File 
/home/kwright/wip/trunk/tests/webcrawler/test-derby-output/Javascript.py, 
line 1299, in evaluate_expr5
[junit] rval = self.evaluate_expr6( context, place, 

[jira] [Issue Comment Edited] (CONNECTORS-288) An ElasticSearch connector would be helpful

2012-02-26 Thread Karl Wright (Issue Comment Edited) (JIRA)

[ 
https://issues.apache.org/jira/browse/CONNECTORS-288?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13216851#comment-13216851
 ] 

Karl Wright edited comment on CONNECTORS-288 at 2/26/12 9:16 PM:
-

Looking at the actual test run, the history reports the following at the end:

{code}
02-26-2012 16:09:25.129 Indexation (ElasticSearch)  
http://localhost:9200/index/_optimize
OK  0   69  
02-26-2012 16:09:24.939 job end 1330290457146(Test Job)
0   1   
02-26-2012 16:09:14.909 Deletion (ElasticSearch)
http://localhost:9090/chemistry-opencmis-server-inmemory/atom...
/138/null
OK  0   7   
02-26-2012 16:09:07.787 Optimize (ElasticSearch)
http://localhost:9090/chemistry-opencmis-server-inmemory/atom...
/140/null
OK  27  6   
02-26-2012 16:09:07.778 Optimize (ElasticSearch)
http://localhost:9090/chemistry-opencmis-server-inmemory/atom...
/139/null
OK  27  7   
02-26-2012 16:09:07.769 Optimize (ElasticSearch)
http://localhost:9090/chemistry-opencmis-server-inmemory/atom...
/137/null
OK  27  15  
02-26-2012 16:09:05.278 job start   1330290457146(Test Job)
0   1   
02-26-2012 16:08:55.020 Indexation (ElasticSearch)  
http://localhost:9200/index/_optimize
OK  0   93  
02-26-2012 16:08:54.926 job end 1330290457146(Test Job)
0   1   
02-26-2012 16:08:47.678 Optimize (ElasticSearch)
http://localhost:9090/chemistry-opencmis-server-inmemory/atom...
/138/null
OK  27  10  
02-26-2012 16:08:47.666 Optimize (ElasticSearch)
http://localhost:9090/chemistry-opencmis-server-inmemory/atom...
/140/null
OK  27  6   
02-26-2012 16:08:47.652 Optimize (ElasticSearch)
http://localhost:9090/chemistry-opencmis-server-inmemory/atom...
/139/null
OK  27  11  
02-26-2012 16:08:47.646 Optimize (ElasticSearch)
http://localhost:9090/chemistry-opencmis-server-inmemory/atom...
/137/null
OK  27  13  
02-26-2012 16:08:45.192 job start   1330290457146(Test Job)
0   1   
02-26-2012 16:08:34.940 Indexation (ElasticSearch)  
http://localhost:9200/index/_optimize
OK  0   75  
02-26-2012 16:08:34.917 job end 1330290457146(Test Job)
0   1   
02-26-2012 16:08:29.502 Optimize (ElasticSearch)
http://localhost:9090/chemistry-opencmis-server-inmemory/atom...
/139/null
OK  27  10  
02-26-2012 16:08:29.491 Optimize (ElasticSearch)
http://localhost:9090/chemistry-opencmis-server-inmemory/atom...
/137/null
OK  27  8   
02-26-2012 16:08:29.412 Optimize (ElasticSearch)
http://localhost:9090/chemistry-opencmis-server-inmemory/atom...
/140/null
OK  27  66  
02-26-2012 16:08:29.404 Optimize (ElasticSearch)
http://localhost:9090/chemistry-opencmis-server-inmemory/atom...
/138/null
OK  27  68  
02-26-2012 16:08:25.097 job start   1330290457146(Test Job)
0   1   
02-26-2012 16:08:24.846 Indexation (ElasticSearch)  
http://localhost:9200/index/_optimize
OK  0   88  
02-26-2012 16:08:14.890 job stop1330290457146(Test Job)
0   1   
02-26-2012 16:08:09.041 Optimize (ElasticSearch)
http://localhost:9090/chemistry-opencmis-server-inmemory/atom...
/138/null
OK  27  868 
02-26-2012 16:08:04.900 job start   1330290457146(Test Job)
0   1   
{code}

The job at the end is stuck in the Cleaning up state, which indicates that it 
is trying to delete the documents from the index, but is not succeeding for 
some reason.  The jobstatus reports 4 documents at that time.

The CMIS connector is not helping here because it does not seem to record ANY 
activities.  It also looks like the activities being recorded for the 
ElasticSearch connector are backwards; it records Optimize when it should 
record Indexation, and visa versa.



  was (Author: kwri...@metacarta.com):
Looking at the actual test run, the history reports the following at the 
end:

{code}
02-26-2012 16:09:25.129 Indexation (ElasticSearch)  
http://localhost:9200/index/_optimize
OK  0   69  
02-26-2012 16:09:24.939 job end 1330290457146(Test Job)
0   1   
02-26-2012 16:09:14.909 Deletion (ElasticSearch)

[jira] [Issue Comment Edited] (CONNECTORS-288) An ElasticSearch connector would be helpful

2012-02-16 Thread Karl Wright (Issue Comment Edited) (JIRA)

[ 
https://issues.apache.org/jira/browse/CONNECTORS-288?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13209673#comment-13209673
 ] 

Karl Wright edited comment on CONNECTORS-288 at 2/16/12 8:17 PM:
-

I modified the ant build to properly build and run the tests.  Unfortunately 
the compilation of the IT tests fails:

{code}
compile-tests:
[javac] C:\wip\mcf\CONNECTORS-288\tests\test-build.xml:102: warning: 'includ
eantruntime' was not set, defaulting to build.sysclasspath=last; set to false fo
r repeatable builds
[javac] Compiling 4 source files to C:\wip\mcf\CONNECTORS-288\tests\elastics
earch\build\test\classes
[javac] C:\wip\mcf\CONNECTORS-288\tests\elasticsearch\src\test\java\org\apac
he\manifoldcf\elasticsearch_tests\BaseDerby.java:22: package org.elasticsearch.n
ode does not exist
[javac] import org.elasticsearch.node.Node;
[javac]   ^
[javac] C:\wip\mcf\CONNECTORS-288\tests\elasticsearch\src\test\java\org\apac
he\manifoldcf\elasticsearch_tests\BaseDerby.java:27: package org.elasticsearch.n
ode does not exist
[javac] import static org.elasticsearch.node.NodeBuilder.*;
[javac]  ^
[javac] C:\wip\mcf\CONNECTORS-288\tests\elasticsearch\src\test\java\org\apac
he\manifoldcf\elasticsearch_tests\BaseDerby.java:37: cannot find symbol
[javac] symbol  : class Node
[javac] location: class org.apache.manifoldcf.elasticsearch_tests.BaseDerby
[javac]   protected Node node = null;
[javac] ^
[javac] C:\wip\mcf\CONNECTORS-288\tests\elasticsearch\src\test\java\org\apac
he\manifoldcf\elasticsearch_tests\BaseDerby.java:92: cannot find symbol
[javac] symbol  : method nodeBuilder()
[javac] location: class org.apache.manifoldcf.elasticsearch_tests.BaseDerby
[javac] node = nodeBuilder().local(true).node();
[javac]^
[javac] 4 errors
{code}



  was (Author: kwri...@metacarta.com):
I modified the ant build to properly build and run the tests.  
Unfortunately the compilation of the IT tests fails.

  
 An ElasticSearch connector would be helpful
 ---

 Key: CONNECTORS-288
 URL: https://issues.apache.org/jira/browse/CONNECTORS-288
 Project: ManifoldCF
  Issue Type: New Feature
Affects Versions: ManifoldCF 0.5
Reporter: Piergiorgio Lucidi
Assignee: Piergiorgio Lucidi
  Labels: elasticsearch
 Fix For: ManifoldCF next

 Attachments: manifold-elasticsearch-patch, 
 manifold-elasticsearch-patch, manifold-elasticsearch-patch, 
 manifold-elasticsearch-patch, manifold-elasticsearch-patch, 
 manifold-elasticsearch-patch, manifoldcf-elasticsearch-project-patct

   Original Estimate: 120h
  Remaining Estimate: 120h

 An ElasticSearch connector could be very useful to spread the use of 
 ManifoldCF

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Issue Comment Edited] (CONNECTORS-402) Separate the proprietary / non-proprietary deliverables for each connector

2012-02-08 Thread Karl Wright (Issue Comment Edited) (JIRA)

[ 
https://issues.apache.org/jira/browse/CONNECTORS-402?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13204068#comment-13204068
 ] 

Karl Wright edited comment on CONNECTORS-402 at 2/8/12 11:06 PM:
-

I created a branch CONNECTORS-402 for this work.

Problems still outstanding:

(1) Since JDBC drivers must be built into the war files, they cannot be 
delivered into connector-lib.  This means we have no ability to prebuild the 
JDBC connector to include oracle, mssql, sybase, or mysql support.  We need a 
way of suppressing inclusion of these JDBC jars when building for release, even 
though we will likely have run the ant download-dependencies target in the 
release branch.

(2) The tests all need to be run and checked, since the place where proprietary 
materials are placed has now become lib-proprietary across the board.

(3) The how-to-build-and-deploy documentation needs significant changes.

(4) The process components of the FileNet and Documentum connectors are where 
the proprietary jars wind up; we need a solution for these.

(5) How to make commented-out connectors.xml entries when building for release, 
for those connectors that have proprietary components.


  was (Author: kwri...@metacarta.com):
I created a branch CONNECTORS-402 for this work.

Problems still outstanding:

(1) Since JDBC drivers must be built into the war files, they cannot be 
delivered into connector-lib.  This means we have no ability to prebuild the 
JDBC connector to include oracle, mssql, sybase, or mysql support.  We need a 
way of suppressing inclusion of these JDBC jars when building for release, even 
though we will likely have run the ant download-dependencies target in the 
release branch.

(2) The tests all need to be run and checked, since the place where proprietary 
materials are placed has now become lib-proprietary across the board.

(3) The how-to-build-and-deploy documentation needs significant changes.

(4) The process components of the FileNet and Documentum connectors are where 
the proprietary jars wind up; we need a solution for these.


  
 Separate the proprietary / non-proprietary deliverables for each connector
 --

 Key: CONNECTORS-402
 URL: https://issues.apache.org/jira/browse/CONNECTORS-402
 Project: ManifoldCF
  Issue Type: Improvement
  Components: Build
Affects Versions: ManifoldCF 0.4
Reporter: Piergiorgio Lucidi
Assignee: Karl Wright
 Fix For: ManifoldCF 0.5

   Original Estimate: 24h
  Remaining Estimate: 24h

 As we have spoken today this ticket is dedicated to try to work on the build 
 process to separate proprietary and non-proprietary libraries in the release 
 package. 
 In this way, for example, we could have a standard way to release a specific 
 library for the connector (mcf-alfresco-connector) without release 
 proprietary libraries and allow users to add proprietary libraries in a 
 specific folder connector-lib-proprietary that could be added to the 
 ManifoldCF classloader.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Issue Comment Edited] (CONNECTORS-387) Japanese fonts are missing from the PDF files generated for the site

2012-01-30 Thread Karl Wright (Issue Comment Edited) (JIRA)

[ 
https://issues.apache.org/jira/browse/CONNECTORS-387?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13195994#comment-13195994
 ] 

Karl Wright edited comment on CONNECTORS-387 at 1/30/12 8:43 AM:
-

Looks good, and the properly-generated PDFs are already live.

I've rearranged the site code now to give more equal emphasis to english and 
japanese, and also allow for the crawler UI help like to take you to the 
right help page for your language, but I have not published this yet.  The only 
downside here is that there's now need of a launch tab which does nothing 
more than tell you to click on the appropriate language tab.  I'm hoping Google 
will pick this up OK.

Hitoshi, could you synch up trunk and do ant download-dependencies doc, and 
see that the site navigation for the site at dist/doc now makes sense in 
Japanese?

The only outstanding work for this ticket now is trying to get different fonts 
used to generate PDFs for different parts of the site.


  was (Author: kwri...@metacarta.com):
Looks good, and the properly-generated PDFs are already live.
I've rearranged the site to give more equal emphasis to english and japanese, 
and also allow for the crawler UI help like to take you to the right help 
page for your language.  The only downside here is that there's now need of a 
launch tab which does nothing more than tell you to click on the appropriate 
language tab.  I'm hoping Google will pick this up OK.

Hitoshi, could you synch up trunk and do ant download-dependencies doc, and 
see that the site navigation for the site at dist/doc now makes sense in 
Japanese?

The only outstanding work for this ticket now is trying to get different fonts 
used to generate PDFs for different parts of the site.

  
 Japanese fonts are missing from the PDF files generated for the site
 

 Key: CONNECTORS-387
 URL: https://issues.apache.org/jira/browse/CONNECTORS-387
 Project: ManifoldCF
  Issue Type: Bug
  Components: Documentation
Affects Versions: ManifoldCF 0.5
Reporter: Karl Wright
Assignee: Karl Wright
 Fix For: ManifoldCF 0.5


 Japanese fonts are missing from the PDFs generated for the site.
 This page describes how to fix that:
 http://forrest.apache.org/pluginDocs/plugins_0_90/org.apache.forrest.plugin.output.pdf/

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Issue Comment Edited] (CONNECTORS-387) Japanese fonts are missing from the PDF files generated for the site

2012-01-30 Thread Karl Wright (Issue Comment Edited) (JIRA)

[ 
https://issues.apache.org/jira/browse/CONNECTORS-387?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13195994#comment-13195994
 ] 

Karl Wright edited comment on CONNECTORS-387 at 1/30/12 10:26 AM:
--

Looks good, and the properly-generated PDFs are already live.

I've rearranged the site code now to give more equal emphasis to english and 
japanese, and also allow for the crawler UI help link to take you to the 
right help page for your language, but I have not published this yet.  The only 
downside here is that there's now need of a launch tab which does nothing 
more than tell you to click on the appropriate language tab.  I'm hoping Google 
will pick this up OK.

Hitoshi, could you synch up trunk and do ant download-dependencies doc, and 
see that the site navigation for the site at dist/doc now makes sense in 
Japanese?

The only outstanding work for this ticket now is trying to get different fonts 
used to generate PDFs for different parts of the site.


  was (Author: kwri...@metacarta.com):
Looks good, and the properly-generated PDFs are already live.

I've rearranged the site code now to give more equal emphasis to english and 
japanese, and also allow for the crawler UI help like to take you to the 
right help page for your language, but I have not published this yet.  The only 
downside here is that there's now need of a launch tab which does nothing 
more than tell you to click on the appropriate language tab.  I'm hoping Google 
will pick this up OK.

Hitoshi, could you synch up trunk and do ant download-dependencies doc, and 
see that the site navigation for the site at dist/doc now makes sense in 
Japanese?

The only outstanding work for this ticket now is trying to get different fonts 
used to generate PDFs for different parts of the site.

  
 Japanese fonts are missing from the PDF files generated for the site
 

 Key: CONNECTORS-387
 URL: https://issues.apache.org/jira/browse/CONNECTORS-387
 Project: ManifoldCF
  Issue Type: Bug
  Components: Documentation
Affects Versions: ManifoldCF 0.5
Reporter: Karl Wright
Assignee: Karl Wright
 Fix For: ManifoldCF 0.5


 Japanese fonts are missing from the PDFs generated for the site.
 This page describes how to fix that:
 http://forrest.apache.org/pluginDocs/plugins_0_90/org.apache.forrest.plugin.output.pdf/

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Issue Comment Edited] (CONNECTORS-387) Japanese fonts are missing from the PDF files generated for the site

2012-01-29 Thread Karl Wright (Issue Comment Edited) (JIRA)

[ 
https://issues.apache.org/jira/browse/CONNECTORS-387?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13195860#comment-13195860
 ] 

Karl Wright edited comment on CONNECTORS-387 at 1/29/12 10:19 PM:
--

Upgrading to the trunk version of fop fixed the problem.  There was also an 
issue with a duplicate copy of the 0.95 jar hanging around, so the 1.0 fop jar 
might well have worked if I'd noticed that before.  r1237495.
Next step is to look into whether output.xmap can be conditionalized to use 
different fonts for different parts of the tree.


  was (Author: kwri...@metacarta.com):
Upgrading to the trunk version of fop fixed the problem.  r1237495.
Next step is to look into whether output.xmap can be conditionalized to use 
different fonts for different parts of the tree.

  
 Japanese fonts are missing from the PDF files generated for the site
 

 Key: CONNECTORS-387
 URL: https://issues.apache.org/jira/browse/CONNECTORS-387
 Project: ManifoldCF
  Issue Type: Bug
  Components: Documentation
Affects Versions: ManifoldCF 0.5
Reporter: Karl Wright
Assignee: Karl Wright
 Fix For: ManifoldCF 0.5


 Japanese fonts are missing from the PDFs generated for the site.
 This page describes how to fix that:
 http://forrest.apache.org/pluginDocs/plugins_0_90/org.apache.forrest.plugin.output.pdf/

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Issue Comment Edited] (CONNECTORS-387) Japanese fonts are missing from the PDF files generated for the site

2012-01-28 Thread Karl Wright (Issue Comment Edited) (JIRA)

[ 
https://issues.apache.org/jira/browse/CONNECTORS-387?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13195452#comment-13195452
 ] 

Karl Wright edited comment on CONNECTORS-387 at 1/28/12 8:17 AM:
-

I get the following when the PDFs are being built under the above setup:

{code}
 [exec] WARN - Font 'IPAPGothic,normal,400' not found. Substituting with 
'any,normal,400'.
 [exec] WARN - Font 'IPAPGothic,normal,700' not found. Substituting with 
'any,normal,700'.
 [exec] WARN - Font 'IPAPGothic,italic,400' not found. Substituting with 
'any,italic,400'.
{code}

It's not clear whether this is because none of the IPAP Gothic fonts are 
properly registered, or because only these three fonts are needed.  Looking 
further, it doesn't look like any of the IPAP fonts are being properly 
registered.  This is when I put config.xml at: 
src/documentation/resources/fonts/config.xml.  Clearly that's not the right 
place, or I've done something wrong in it.

I'm checking the whole thing in now, since it doesn't seem to break anything 
worse than it is already broken.



  was (Author: kwri...@metacarta.com):
I get the following when the PDFs are being built under the above setup:

{code}
 [exec] WARN - Font 'IPAPGothic,normal,400' not found. Substituting with 
'any,normal,400'.
 [exec] WARN - Font 'IPAPGothic,normal,700' not found. Substituting with 
'any,normal,700'.
 [exec] WARN - Font 'IPAPGothic,italic,400' not found. Substituting with 
'any,italic,400'.
{code}

It's not clear whether this is because none of the IPAP Gothic fonts are 
properly registered, or because only these three fonts are needed.  Looking 
further...

  
 Japanese fonts are missing from the PDF files generated for the site
 

 Key: CONNECTORS-387
 URL: https://issues.apache.org/jira/browse/CONNECTORS-387
 Project: ManifoldCF
  Issue Type: Bug
  Components: Documentation
Affects Versions: ManifoldCF 0.5
Reporter: Karl Wright
Assignee: Karl Wright
 Fix For: ManifoldCF 0.5


 Japanese fonts are missing from the PDFs generated for the site.
 This page describes how to fix that:
 http://forrest.apache.org/pluginDocs/plugins_0_90/org.apache.forrest.plugin.output.pdf/

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Issue Comment Edited] (CONNECTORS-387) Japanese fonts are missing from the PDF files generated for the site

2012-01-28 Thread Karl Wright (Issue Comment Edited) (JIRA)

[ 
https://issues.apache.org/jira/browse/CONNECTORS-387?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13195452#comment-13195452
 ] 

Karl Wright edited comment on CONNECTORS-387 at 1/28/12 8:23 AM:
-

I get the following when the PDFs are being built under the above setup:

{code}
 [exec] WARN - Font 'IPAPGothic,normal,400' not found. Substituting with 
'any,normal,400'.
 [exec] WARN - Font 'IPAPGothic,normal,700' not found. Substituting with 
'any,normal,700'.
 [exec] WARN - Font 'IPAPGothic,italic,400' not found. Substituting with 
'any,italic,400'.
{code}

It's not clear whether this is because none of the IPAP Gothic fonts are 
properly registered, or because only these three fonts are needed.  Looking 
further, it doesn't look like any of the IPAP fonts are being properly 
registered.  This is where I put config.xml at: 
src/documentation/resources/fonts/config.xml.  Clearly that's not the right 
place, or I've done something wrong in it.

I'm checking the whole thing in now, since it doesn't seem to break anything 
worse than it is already broken.



  was (Author: kwri...@metacarta.com):
I get the following when the PDFs are being built under the above setup:

{code}
 [exec] WARN - Font 'IPAPGothic,normal,400' not found. Substituting with 
'any,normal,400'.
 [exec] WARN - Font 'IPAPGothic,normal,700' not found. Substituting with 
'any,normal,700'.
 [exec] WARN - Font 'IPAPGothic,italic,400' not found. Substituting with 
'any,italic,400'.
{code}

It's not clear whether this is because none of the IPAP Gothic fonts are 
properly registered, or because only these three fonts are needed.  Looking 
further, it doesn't look like any of the IPAP fonts are being properly 
registered.  This is when I put config.xml at: 
src/documentation/resources/fonts/config.xml.  Clearly that's not the right 
place, or I've done something wrong in it.

I'm checking the whole thing in now, since it doesn't seem to break anything 
worse than it is already broken.


  
 Japanese fonts are missing from the PDF files generated for the site
 

 Key: CONNECTORS-387
 URL: https://issues.apache.org/jira/browse/CONNECTORS-387
 Project: ManifoldCF
  Issue Type: Bug
  Components: Documentation
Affects Versions: ManifoldCF 0.5
Reporter: Karl Wright
Assignee: Karl Wright
 Fix For: ManifoldCF 0.5


 Japanese fonts are missing from the PDFs generated for the site.
 This page describes how to fix that:
 http://forrest.apache.org/pluginDocs/plugins_0_90/org.apache.forrest.plugin.output.pdf/

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Issue Comment Edited] (CONNECTORS-387) Japanese fonts are missing from the PDF files generated for the site

2012-01-25 Thread Karl Wright (Issue Comment Edited) (JIRA)

[ 
https://issues.apache.org/jira/browse/CONNECTORS-387?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13193606#comment-13193606
 ] 

Karl Wright edited comment on CONNECTORS-387 at 1/26/12 7:23 AM:
-

Hitoshi,

I don't see any sign that Apache Legal has ever ruled on the IPA license, but 
several clauses in there give me reason to question whether they would permit 
us to redistribute it.  I'm happy to post the license and have them rule but it 
will take some time.  What we'll also need to do is figure out a way to switch 
between fonts based on the path of the PDF being generated if we stick with 
IPA, because a clause in the license definitely forbids relabeling or modifying 
the IPA fonts, so we could not redistribute a merged font family.

That's why I've been exploring the Dejavu route, since the Forrest page 
explicitly mentioned CJK support in that family.  Too bad it isn't true.


  was (Author: kwri...@metacarta.com):
Hitoshi,

I don't see any sign that Apache Legal has ever ruled on the IPA license, but 
several clauses in there give me reason to question whether they would permit 
us to redistribute it.  I'm happy to post the license and have them rule but it 
will take some time.

If IPA is the only available choice I think we're going to have to make do 
without Japanese PDFs for the time being.  What we'll also need to do is figure 
out a way to switch between fonts based on the path of the PDF being generated 
if we stick with IPA.  That's why I've been exploring the Dejavu route, since 
the Forrest page explicitly mentioned CJK support in that family.

  
 Japanese fonts are missing from the PDF files generated for the site
 

 Key: CONNECTORS-387
 URL: https://issues.apache.org/jira/browse/CONNECTORS-387
 Project: ManifoldCF
  Issue Type: Bug
  Components: Documentation
Affects Versions: ManifoldCF 0.5
Reporter: Karl Wright
Assignee: Karl Wright
 Fix For: ManifoldCF 0.5


 Japanese fonts are missing from the PDFs generated for the site.
 This page describes how to fix that:
 http://forrest.apache.org/pluginDocs/plugins_0_90/org.apache.forrest.plugin.output.pdf/

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Issue Comment Edited] (CONNECTORS-356) Active Directory connector should by fully I18N/L10N

2012-01-09 Thread Karl Wright (Issue Comment Edited) (JIRA)

[ 
https://issues.apache.org/jira/browse/CONNECTORS-356?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13182492#comment-13182492
 ] 

Karl Wright edited comment on CONNECTORS-356 at 1/9/12 1:52 PM:


I've looked at this patch.  It embeds the Javascript etc. quotation marks right 
in the translation.  I believe this to be bad practice since the localization 
is then based on the context in which it is used.

For example:

{code}
+alert( + 
Messages.getString(locale,ActiveDirectoryAuthority.EnterADomainControllerServerName)
 + );\n+
{code}

...combined with...

{code}
+ActiveDirectoryAuthority.EnterADomainControllerServerName=\Enter a domain 
controller server name\
{code}

I think this should be:

{code}
+alert(\ + 
Messages.getBodyJavascriptString(locale,ActiveDirectoryAuthority.EnterADomainControllerServerName)
 + \);\n+
{code}

... and ...

{code}
+ActiveDirectoryAuthority.EnterADomainControllerServerName=Enter a domain 
controller server name
{code}

Do you agree?


  was (Author: kwri...@metacarta.com):
I've looked at this patch.  It embeds the Javascript etc. quotation marks 
right in the translation.  I believe this to be bad practice since the 
localization is then based on the context in which it is used.

For example:

{code}
+alert( + 
Messages.getString(locale,ActiveDirectoryAuthority.EnterADomainControllerServerName)
 + );\n+
{code}

...combined with...

{code}
+ActiveDirectoryAuthority.EnterADomainControllerServerName=\Enter a domain 
controller server name\
{code}

I think this should be:

{code}
+alert(\ + 
Messages.getJavascriptBodyString(locale,ActiveDirectoryAuthority.EnterADomainControllerServerName)
 + \);\n+
{code}

... and ...

{code}
+ActiveDirectoryAuthority.EnterADomainControllerServerName=Enter a domain 
controller server name
{code}

Do you agree?

  
 Active Directory connector should by fully I18N/L10N
 

 Key: CONNECTORS-356
 URL: https://issues.apache.org/jira/browse/CONNECTORS-356
 Project: ManifoldCF
  Issue Type: Improvement
  Components: Active Directory authority
Affects Versions: ManifoldCF 0.5
Reporter: Hitoshi Ozawa
Assignee: Karl Wright
Priority: Minor
  Labels: I18N
 Fix For: ManifoldCF 0.5

 Attachments: CONNECTORS-356.patch


 All messages in Active Directory Connector should be extracted out to 
 properties file.
 Also, should create Japanese L10N properties file for all messages.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Issue Comment Edited] (CONNECTORS-336) If there's a Japanese localization, we should also have some Japanese documentation

2012-01-06 Thread Karl Wright (Issue Comment Edited) (JIRA)

[ 
https://issues.apache.org/jira/browse/CONNECTORS-336?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13181387#comment-13181387
 ] 

Karl Wright edited comment on CONNECTORS-336 at 1/6/12 3:26 PM:


bq. I may have messed up my svn on Ubuntu by copying files I've edited on my 
Windows7

Copying files shouldn't mess up svn, but copying workspaces/hidden files 
certainly could.  If you're not sure it may be best to create a new workspace 
at this point.

Once you do that, you should notice that there's no ja_JP/index*.xml file.  
Then you can add yours back and do an svn diff to get the next patch.



  was (Author: kwri...@metacarta.com):
bq. I may have messed up my svn on Ubuntu by copying files I've edited on 
my Windows7

Copying files shouldn't mess up svn, but copying workspaces/hidden files 
certainly could.  If you're not sure it may be best to create a new workspace 
at this point.

  
 If there's a Japanese localization, we should also have some Japanese 
 documentation
 ---

 Key: CONNECTORS-336
 URL: https://issues.apache.org/jira/browse/CONNECTORS-336
 Project: ManifoldCF
  Issue Type: Improvement
  Components: Documentation
Affects Versions: ManifoldCF 0.5
Reporter: Karl Wright
Assignee: Karl Wright
 Fix For: ManifoldCF 0.5

 Attachments: CONNECTORS-336-20120106.patch, 
 CONNECTORS-336_20120104A.patch, CONNECTORS-336_20120104B.patch, 
 CONNECTORS-336_20120105.patch, ja_JP_A.zip, ja_JP_B.zip, ja_JP_C.zip


 We should provide some Japanese documentation if we have a Japanese 
 localization.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Issue Comment Edited] (CONNECTORS-336) If there's a Japanese localization, we should also have some Japanese documentation

2012-01-05 Thread Karl Wright (Issue Comment Edited) (JIRA)

[ 
https://issues.apache.org/jira/browse/CONNECTORS-336?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13180356#comment-13180356
 ] 

Karl Wright edited comment on CONNECTORS-336 at 1/5/12 1:20 PM:


The new patch fails on tab.xml and site.xml:

patching file xdocs/tabs.xml
Reversed (or previously applied) patch detected!  Assume -R? [n] y
patching file xdocs/site.xml
Reversed (or previously applied) patch detected!  Assume -R? [n] y
Hunk #2 FAILED at 75.
1 out of 2 hunks FAILED -- saving rejects to file xdocs/site.xml.rej

Are you sure you did an svn diff against the current contents of the branch?  
Try svn update before you do the svn diff.



  was (Author: kwri...@metacarta.com):
The new patch fails on tab.xml and site.xml:

patching file xdocs/tabs.xml
Reversed (or previously applied) patch detected!  Assume -R? [n] y
patching file xdocs/site.xml
Reversed (or previously applied) patch detected!  Assume -R? [n] y
Hunk #2 FAILED at 75.
1 out of 2 hunks FAILED -- saving rejects to file xdocs/site.xml.rej

Are you sure you did an svn diff against the current contents of the branch?

  
 If there's a Japanese localization, we should also have some Japanese 
 documentation
 ---

 Key: CONNECTORS-336
 URL: https://issues.apache.org/jira/browse/CONNECTORS-336
 Project: ManifoldCF
  Issue Type: Improvement
  Components: Documentation
Affects Versions: ManifoldCF 0.5
Reporter: Karl Wright
Assignee: Karl Wright
 Fix For: ManifoldCF 0.5

 Attachments: CONNECTORS-336_20120104A.patch, 
 CONNECTORS-336_20120104B.patch, CONNECTORS-336_20120105.patch, ja_JP_A.zip, 
 ja_JP_B.zip, ja_JP_C.zip


 We should provide some Japanese documentation if we have a Japanese 
 localization.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Issue Comment Edited] (CONNECTORS-286) Get ManifoldCF to run on top of a key/value store like Voldemort, for potential massive scalability improvements and speed gains

2012-01-05 Thread Karl Wright (Issue Comment Edited) (JIRA)

[ 
https://issues.apache.org/jira/browse/CONNECTORS-286?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13181155#comment-13181155
 ] 

Karl Wright edited comment on CONNECTORS-286 at 1/6/12 6:23 AM:


bq. Not sure what Wharthog is...

See 
https://svn.apache.org/repos/asf/incubator/lcf/branches/CONNECTORS-286/warthog. 
 Warthog is a potential future spinoff technology of ManifoldCF - if all this 
stuff actually works. ;-)

bq. But I wanted to suggest you consider using HBase for underlying storage.

Maybe.  One of the apparent requirements of ManifoldCF is the ability run on 
top of practically anything.  I don't think we'd be able to throw away 
PostgreSQL and MySQL and HSQLDB and Derby support for instance.  So if Warthog 
is the actual API layer ManifoldCF uses then we'd need implementations of 
Warthog for as many backends as possible.  Right now I've only got one going 
for a testing key-value store, but almost certainly the next step would be a 
SQL database.  Then moving on from then HBase (or Hive/Pig) may also be 
possibilities.  Still proving the concept however...


  was (Author: kwri...@metacarta.com):
bq. Not sure what Wharthog is...

See 
https://svn.apache.org/repos/asf/incubator/lcf/branches/CONNECTORS-286/warthog. 
 Warthog is a potential future spinoff technology of ManifoldCF - if all this 
stuff actually works. ;-)

bq. But I wanted to suggest you consider using HBase for underlying storage.

Maybe.  One of the apparent requirements of ManifoldCF is the ability run on 
top of practically anything.  I don't think we'd be able to throw away 
PostgreSQL and MySQL and HSQLDB and Derby support for instance.  So if Warthog 
is the actual API layer ManifoldCF uses then we'd need implementations of 
Warthog for as many backends as possible.  Right now I've only going one for a 
testing key-value store, but almost certainly the next step would be a SQL 
database.  Then moving on from then HBase (or Hive/Pig) may also be 
possibilities.  Still proving the concept however...

  
 Get ManifoldCF to run on top of a key/value store like Voldemort, for 
 potential massive scalability improvements and speed gains
 

 Key: CONNECTORS-286
 URL: https://issues.apache.org/jira/browse/CONNECTORS-286
 Project: ManifoldCF
  Issue Type: New Feature
  Components: Framework core
Reporter: Karl Wright
Assignee: Karl Wright
 Fix For: ManifoldCF next


 ManifoldCF's reliance on a relational database limits its throughput and 
 scalability.  I am now convinced it is possible to build all the structures 
 we need within a distributed key-value store like Voldemort, which has the 
 nice side effect of permitting massive scaling.  I envision there will be 
 several layers to this project, some of which may have broader utility in the 
 open-source community at large:
 (1) An atomic serialization layer, which adds serialization capabilities to 
 an non-transactional substrate;
 (2) A transaction layer, which uses atomic serialization to build a notion of 
 light transactions;
 (3) A table and index layer, which defines SQL-like concepts of tables and 
 btree indexes on top of the transaction layer, via a Java API;
 (4) A generic database abstraction layer, which is capable of representing 
 both standard SQL databases as well as this NoSQL variant, so that ManifoldCF 
 can support both models.
 This is obviously a major development task, and as such is not envisioned to 
 be completed by the next standard release.  Work will indeed need to be done 
 in a branch.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Issue Comment Edited] (CONNECTORS-336) If there's a Japanese localization, we should also have some Japanese documentation

2012-01-04 Thread Karl Wright (Issue Comment Edited) (JIRA)

[ 
https://issues.apache.org/jira/browse/CONNECTORS-336?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13179461#comment-13179461
 ] 

Karl Wright edited comment on CONNECTORS-336 at 1/4/12 1:18 PM:


bq. I can't attach a zip here. It seems to hangup every time I try.

How large is the zip?  I believe the limit is 10mb (or maybe even 5mb), so if 
it's bigger than that you'll need to break it up into pieces.


  was (Author: kwri...@metacarta.com):
bq. I can't attach a zip here. It seems to hangup every time I try.

How large is the zip?

  
 If there's a Japanese localization, we should also have some Japanese 
 documentation
 ---

 Key: CONNECTORS-336
 URL: https://issues.apache.org/jira/browse/CONNECTORS-336
 Project: ManifoldCF
  Issue Type: Improvement
  Components: Documentation
Affects Versions: ManifoldCF 0.5
Reporter: Karl Wright
Assignee: Karl Wright
 Fix For: ManifoldCF 0.5

 Attachments: CONNECTORS-336_20120104A.patch, 
 CONNECTORS-336_20120104B.patch


 We should provide some Japanese documentation if we have a Japanese 
 localization.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Issue Comment Edited] (CONNECTORS-336) If there's a Japanese localization, we should also have some Japanese documentation

2012-01-04 Thread Karl Wright (Issue Comment Edited) (JIRA)

[ 
https://issues.apache.org/jira/browse/CONNECTORS-336?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13179920#comment-13179920
 ] 

Karl Wright edited comment on CONNECTORS-336 at 1/4/12 10:02 PM:
-

bq. The zip file I want to attach is about 8MB so it's under 10mb.

Other people have uploaded zips to Jira successfully.  Can you break it up into 
two roughly equal pieces and try that?  5Mb may be the limit.

If that doesn't work just email the zip to me at daddy...@gmail.com, and I'll 
check it in.



  was (Author: kwri...@metacarta.com):
bq. The zip file I want to attach is about 8MB so it's under 10mb.

Other people have uploaded zips to Jira successfully.  Can you break it up into 
two roughly equal pieces and try that?  5Mb may be the limit.

  
 If there's a Japanese localization, we should also have some Japanese 
 documentation
 ---

 Key: CONNECTORS-336
 URL: https://issues.apache.org/jira/browse/CONNECTORS-336
 Project: ManifoldCF
  Issue Type: Improvement
  Components: Documentation
Affects Versions: ManifoldCF 0.5
Reporter: Karl Wright
Assignee: Karl Wright
 Fix For: ManifoldCF 0.5

 Attachments: CONNECTORS-336_20120104A.patch, 
 CONNECTORS-336_20120104B.patch


 We should provide some Japanese documentation if we have a Japanese 
 localization.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Issue Comment Edited] (CONNECTORS-341) override config for test cases

2011-12-29 Thread Karl Wright (Issue Comment Edited) (JIRA)

[ 
https://issues.apache.org/jira/browse/CONNECTORS-341?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13177124#comment-13177124
 ] 

Karl Wright edited comment on CONNECTORS-341 at 12/29/11 11:53 AM:
---

I had been toying with moving these embedded configuration files to 
src/test/resource instead.  It sounds like that would mess up your use case 
entirely though.

I like the fact that initialize() can be separately overridden, but I'd like 
perhaps to take it a step further and provide a base class method that is 
specifically meant to be overridden such that the values written can be 
extended rather than completely replaced, e.g.:

{code}
protected void populateConfigurationXML(StringBuilder sb)
{
...
}
{code}

... which is called from the base class implementation of initialize().  Also a 
similar method for logging.ini.  Do you understand what I mean, and if so do 
you want to modify your patch accordingly?


  was (Author: kwri...@metacarta.com):
I had been toying with moving these embedded configuration files to 
src/test/resource instead.  It sounds like that would mess up your use case 
entirely though.

I like the fact that initialize() can be separately overridden, but I'd like 
perhaps to take it a step further and provide a base class method that is 
specifically meant to be overridden such that the values written can be 
extended rather than completely replaced, e.g.:

protected void populateConfigurationXML(StringBuilder sb)
{
...
}

... which is called from the base class implementation of initialize().  Also a 
similar method for logging.ini.  Do you understand what I mean, and if so do 
you want to modify your patch accordingly?

  
 override config for test cases
 --

 Key: CONNECTORS-341
 URL: https://issues.apache.org/jira/browse/CONNECTORS-341
 Project: ManifoldCF
  Issue Type: Improvement
  Components: Tests
Reporter: Martin Goldhahn
 Attachments: CONNECTORS-341.patch


 It is hard to write a test case for a repository connector using the existing 
 *.Base classes that uses a different config file.
 The config files are actually written in 
 org.apache.manifoldcf.core.tests.Base.localSetup().
 The localSetup method is called by all sub-classes in their overridden 
 localSetup function. each sub-class adds something specific for the test 
 case. If I wanted to put something different in the config files I had to 
 override localSetup() and copy all content of the superclasses' localSetup to 
 it. This can be avoided by moving the code that writes the config file in the 
 initialize() method. It makes also more sense in that the creation of the 
 config files and the provision of their content are now in the same function.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Issue Comment Edited] (CONNECTORS-314) Combined MySQL and i18n/Japanese contribution

2011-12-28 Thread Karl Wright (Issue Comment Edited) (JIRA)

[ 
https://issues.apache.org/jira/browse/CONNECTORS-314?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13177047#comment-13177047
 ] 

Karl Wright edited comment on CONNECTORS-314 at 12/29/11 7:50 AM:
--

bq. Just want to make sure again. Should I be getting the source code from the 
trunk now?  Where are fixes for the CONNECTORS-338 and CONNECTORS-340 being 
committed?

All fixes have now been committed to trunk EXCEPT for the Japanese 
documentation, which is still going into branches/CONNECTORS-336.

I'm also working on automated tests for the UI.  Currently I'm testing just 
en_US, but I hope to add a set of ja_JP tests as well.  As I go I am running 
into and fixing various minor UI bugs.  That ticket is CONNECTORS-339, which is 
still open.


  was (Author: kwri...@metacarta.com):
bq. Just want to make sure again. Should I be getting the source code from 
the trunk now?  Where are fixes for the CONNECTORS-338 and CONNECTORS-340 being 
committed?

All fixes have now been committed to trunk EXCEPT for the Japanese 
documentation, which is still going into branches/CONNECTORS-336.


  
 Combined MySQL and i18n/Japanese contribution
 -

 Key: CONNECTORS-314
 URL: https://issues.apache.org/jira/browse/CONNECTORS-314
 Project: ManifoldCF
  Issue Type: Improvement
  Components: Framework agents process, Framework core, Framework 
 crawler agent
Affects Versions: ManifoldCF 0.5
Reporter: Karl Wright
Assignee: Karl Wright
  Labels: I18N, mysql
 Fix For: ManifoldCF 0.5

 Attachments: CONNECTORS-314-20111225.patch, 
 CONNECTORS-314-doc20111220.patch, CONNECTORS-314.patch, CONNECTORS-314.patch, 
 connectors.patch, framework.patch


 Hitoshi Ozawa wishes to contribute i18n support, a Japanese localization, and 
 MySQL database support, all in one patch.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Issue Comment Edited] (CONNECTORS-314) Combined MySQL and i18n/Japanese contribution

2011-12-19 Thread Karl Wright (Issue Comment Edited) (JIRA)

[ 
https://issues.apache.org/jira/browse/CONNECTORS-314?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13172193#comment-13172193
 ] 

Karl Wright edited comment on CONNECTORS-314 at 12/19/11 11:14 AM:
---

bq. Is it necessary to include Message.java in each connector or is it possible 
to just leave it as a common utility class?

A common utility class is fine.

bq. There's information schema in MySQL. Just didn't have time this time to 
use it because it'll require more work. I wanted to show that it was possible 
to use MySQL with ManifoldCF and do the revisions afterwards. Somebody else may 
want to pitch in now since it's clear that it possible.

As long as that is the case, I can lift the code from the hsqldb driver for the 
missing DBInterfaceMySQL methods.  I will do that and check everything in when 
I've also coded the conditional download of the mysql driver in the build 
process.


  was (Author: kwri...@metacarta.com):
bq. Is it necessary to include Message.java in each connector or is it 
possible to just leave it as a common utility class?

A common utility class is fine.  I'd move it to 
org.apache.manifoldcf.uicore.i18n though, I think, in the project 
framework/uicore.

bq. There's information schema in MySQL. Just didn't have time this time to 
use it because it'll require more work. I wanted to show that it was possible 
to use MySQL with ManifoldCF and do the revisions afterwards. Somebody else may 
want to pitch in now since it's clear that it possible.

As long as that is the case, I can lift the code from the hsqldb driver for the 
missing DBInterfaceMySQL methods.  I will do that and check everything in when 
I've also coded the conditional download of the mysql driver in the build 
process.

  
 Combined MySQL and i18n/Japanese contribution
 -

 Key: CONNECTORS-314
 URL: https://issues.apache.org/jira/browse/CONNECTORS-314
 Project: ManifoldCF
  Issue Type: Improvement
  Components: Framework agents process, Framework core, Framework 
 crawler agent
Affects Versions: ManifoldCF 0.5
Reporter: Karl Wright
Assignee: Karl Wright
  Labels: I18N, mysql
 Fix For: ManifoldCF 0.5

 Attachments: CONNECTORS-314.patch, CONNECTORS-314.patch, 
 connectors.patch, framework.patch


 Hitoshi Ozawa wishes to contribute i18n support, a Japanese localization, and 
 MySQL database support, all in one patch.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Issue Comment Edited] (CONNECTORS-330) HSQLDB external database tests fail

2011-12-17 Thread Karl Wright (Issue Comment Edited) (JIRA)

[ 
https://issues.apache.org/jira/browse/CONNECTORS-330?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13171527#comment-13171527
 ] 

Karl Wright edited comment on CONNECTORS-330 at 12/17/11 12:12 PM:
---

The story points back to HSQLDB again, unfortunately.  Here's a trace:

{code}
[junit] Created HSQLDB instance 
org.apache.manifoldcf.core.database.DBInterfaceHSQLDB@113beb5 with user 
manifoldcf which is meant to work with schema dbname
[junit] Created HSQLDB instance 
org.apache.manifoldcf.core.database.DBInterfaceHSQLDB@747fa2 with user 
manifoldcf which is meant to work with schema dbname
[junit] Created HSQLDB instance 
org.apache.manifoldcf.core.database.DBInterfaceHSQLDB@6782a9 with user sa which 
is meant to work with schema administration
[junit] Created HSQLDB instance 
org.apache.manifoldcf.core.database.DBInterfaceHSQLDB@17200b4 with user 
manifoldcf which is meant to work with schema dbname
[junit] Created HSQLDB instance 
org.apache.manifoldcf.core.database.DBInterfaceHSQLDB@18c3679 with user sa 
which is meant to work with schema administration
[junit] Creating schema DBNAME using instance 
org.apache.manifoldcf.core.database.DBInterfaceHSQLDB@18c3679
[junit] Altering user manifoldcf
[junit]  Alter User succeeded! ALTER USER manifoldcf SET INITIAL SCHEMA 
DBNAME
[junit] Created HSQLDB instance 
org.apache.manifoldcf.core.database.DBInterfaceHSQLDB@289d2e with user 
manifoldcf which is meant to work with schema dbname
[junit] Checking for table in schema DBNAME in instance 
org.apache.manifoldcf.core.database.DBInterfaceHSQLDB@289d2e
[junit] Creating table in schema DBNAME in instance 
org.apache.manifoldcf.core.database.DBInterfaceHSQLDB@289d2e
[junit] org.apache.manifoldcf.core.interfaces.ManifoldCFException: Database 
exception: Exception doing query: user lacks privilege or object not found: 
PUBLIC in statement [CREATE CACHED TABLE agents(classname VARCHAR(255) NOT NULL 
PRIMARY KEY)]
[junit] at 
org.apache.manifoldcf.core.database.Database.executeViaThread(Database.java:672)
...
{code}


  was (Author: kwri...@metacarta.com):
The story points back to HSQLDB again, unfortunately.  Here's an annotated 
trace:

{code}
Here's the schema creation
   [junit] Created HSQLDB instance
org.apache.manifoldcf.core.database.DBInterfaceHSQLDB@18c3679 with user sa 
which is meant to work with schema administration
   [junit] Creating schema DBNAME using instance 
org.apache.manifoldcf.core.database.DBInterfaceHSQLDB@18c3679
Here's the alter user
   [junit] Altering user manifoldcf
   [junit]  Alter User succeeded! ALTER USER manifoldcf SET INITIAL SCHEMA 
DBNAME
Here's the table create
   [junit] Created HSQLDB instance 
org.apache.manifoldcf.core.database.DBInterfaceHSQLDB@289d2e with user 
manifoldcf which is meant to work with schema dbname
   [junit] Checking for table in schema DBNAME in instance 
org.apache.manifoldcf.core.database.DBInterfaceHSQLDB@289d2e
   [junit] Creating table in schema DBNAME in instance 
org.apache.manifoldcf.core.database.DBInterfaceHSQLDB@289d2e
   [junit] org.apache.manifoldcf.core.interfaces.ManifoldCFException: Database 
exception: Exception doing query: user lacks privilege or object not found: 
PUBLIC in statement [CREATE CACHED TABLE agents(classname VARCHAR(255) NOT NULL 
PRIMARY KEY)]
   [junit] at 
org.apache.manifoldcf.core.database.Database.executeViaThread(Database.java:672)
   [junit] at 
org.apache.manifoldcf.core.database.Database.executeUncachedQuery(Database.java:716)
   [junit] at 
org.apache.manifoldcf.core.database.Database$QueryCacheExecutor.create(Database.java:1363)
   [junit] at 
org.apache.manifoldcf.core.cachemanager.CacheManager.findObjectsAndExecute(CacheManager.java:144)
   [junit] at 
org.apache.manifoldcf.core.database.Database.executeQuery(Database.java:179)
   [junit] at 
org.apache.manifoldcf.core.database.DBInterfaceHSQLDB.performModification(DBInterfaceHSQLDB.java:752)
   [junit] at 
org.apache.manifoldcf.core.database.DBInterfaceHSQLDB.performCreate(DBInterfaceHSQLDB.java:364)
   [junit] at 
org.apache.manifoldcf.core.database.BaseTable.performCreate(BaseTable.java:111)
   [junit] at
org.apache.manifoldcf.agents.agentmanager.AgentManager.install(AgentManager.java:72)
   [junit] at
org.apache.manifoldcf.agents.system.ManifoldCF.installTables(ManifoldCF.java:73)
   [junit] at
org.apache.manifoldcf.agents.tests.BaseHSQLDBext.localSetUp(BaseHSQLDBext.java:65)
   [junit] at
org.apache.manifoldcf.crawler.tests.BaseHSQLDBext.localSetUp(BaseHSQLDBext.java:60)
   [junit] at
org.apache.manifoldcf.crawler.tests.BaseHSQLDBext.setUp(BaseHSQLDBext.java:48)
   [junit] at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
   [junit] at

[jira] [Issue Comment Edited] (CONNECTORS-323) *.sh in multiprocess-example doesn't work

2011-12-15 Thread Karl Wright (Issue Comment Edited) (JIRA)

[ 
https://issues.apache.org/jira/browse/CONNECTORS-323?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13170683#comment-13170683
 ] 

Karl Wright edited comment on CONNECTORS-323 at 12/16/11 1:58 AM:
--

Looks good!
Please go ahead and commit to trunk.  I'll pull it up to the release branch.


  was (Author: kwri...@metacarta.com):
Looks good!
Please go ahead and commit.

  
 *.sh in multiprocess-example doesn't work
 -

 Key: CONNECTORS-323
 URL: https://issues.apache.org/jira/browse/CONNECTORS-323
 Project: ManifoldCF
  Issue Type: Bug
  Components: Framework agents process
Affects Versions: ManifoldCF 0.4
Reporter: Shinichiro Abe
Priority: Minor
 Fix For: ManifoldCF 0.4

 Attachments: CONNECTORS-323.patch


 *.sh in multiprocess-example doesn't work.Here is a patch.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Issue Comment Edited] (CONNECTORS-313) An example multi-process properties.xml delivered to the dist folder would be very helpful

2011-12-14 Thread Karl Wright (Issue Comment Edited) (JIRA)

[ 
https://issues.apache.org/jira/browse/CONNECTORS-313?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13169916#comment-13169916
 ] 

Karl Wright edited comment on CONNECTORS-313 at 12/15/11 2:05 AM:
--

I think I see the problem.  The class extends 
TransactionalCrawlerInitializationCommand, which presumes that the database 
instance already exists.  I don't think you'll be able to extend that class if 
you are attempting database creation.

I don't think it is necessary to do everything in one class anyway.  The only 
new command functionality is the part that reads connectors.xml and registers 
the connectors.  If you create a command that does just that, and a script that 
calls DBCreate, Initialize, RegisterAgent, and that, I think that would be fine.


  was (Author: kwri...@metacarta.com):
I think I see the problem.  The class extends 
TransactionalCrawlerInitializationCommand, which presumes that the database 
instance already exists.  I don't think you'll be able to extend that class.

I don't think it is necessary to do everything in one class anyway.  The only 
new command functionality is the part that reads connectors.xml and registers 
the connectors.  If you create a command that does just that, and a script that 
calls DBCreate, Initialize, RegisterAgent, and that, I think that would be fine.

  
 An example multi-process properties.xml delivered to the dist folder would 
 be very helpful
 

 Key: CONNECTORS-313
 URL: https://issues.apache.org/jira/browse/CONNECTORS-313
 Project: ManifoldCF
  Issue Type: Improvement
  Components: Build
Affects Versions: ManifoldCF 0.4
Reporter: Shinichiro Abe
Assignee: Shinichiro Abe
Priority: Minor
 Fix For: ManifoldCF 0.4

 Attachments: CONNECTOR-313.patch, CONNECTOR-313.patch, 
 register-draft.sh


 The multiprocess setup does not have an example properties.xml file.  We 
 should deliver one, in the right place so that all the scripts find it (the 
 dist directory).  It would also be helpful to deliver into this directory 
 scripts for:
 - Registering all the connectors that were built
 - Starting the agents process

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Issue Comment Edited] (CONNECTORS-313) An example multi-process properties.xml delivered to the dist folder would be very helpful

2011-12-14 Thread Karl Wright (Issue Comment Edited) (JIRA)

[ 
https://issues.apache.org/jira/browse/CONNECTORS-313?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13169950#comment-13169950
 ] 

Karl Wright edited comment on CONNECTORS-313 at 12/15/11 3:12 AM:
--

Oh, noticed one other thing.  There's no un-registration of connectors.  This 
is necessary because otherwise it is impossible to remove a connector using 
just RegisterAll.  If you look at what the single process example does, it 
first deregisters all connectors before registering the new ones, for this 
reason.

You may think that this is never going to happen with RegisterAll, but actually 
RegisterAll will be called not just for fresh installs but also for upgrades.


  was (Author: kwri...@metacarta.com):
Oh, noticed one other thing.  There's no un-registration of connectors.  
This is necessary because otherwise it is impossible to remove a connector 
using just RegisterAll.  If you look at what the single process example does, 
it first deregisters all connectors before registering the new ones, for this 
reason.

  
 An example multi-process properties.xml delivered to the dist folder would 
 be very helpful
 

 Key: CONNECTORS-313
 URL: https://issues.apache.org/jira/browse/CONNECTORS-313
 Project: ManifoldCF
  Issue Type: Improvement
  Components: Build
Affects Versions: ManifoldCF 0.4
Reporter: Shinichiro Abe
Assignee: Shinichiro Abe
Priority: Minor
 Fix For: ManifoldCF 0.4

 Attachments: CONNECTOR-313.patch, CONNECTOR-313.patch, 
 register-draft.sh


 The multiprocess setup does not have an example properties.xml file.  We 
 should deliver one, in the right place so that all the scripts find it (the 
 dist directory).  It would also be helpful to deliver into this directory 
 scripts for:
 - Registering all the connectors that were built
 - Starting the agents process

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Issue Comment Edited] (CONNECTORS-313) An example multi-process properties.xml delivered to the dist folder would be very helpful

2011-12-12 Thread Karl Wright (Issue Comment Edited) (JIRA)

[ 
https://issues.apache.org/jira/browse/CONNECTORS-313?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13167421#comment-13167421
 ] 

Karl Wright edited comment on CONNECTORS-313 at 12/12/11 9:17 AM:
--

Hi Abe-san,

connectors.xml is read by the QuickStart java startup class only at this time.  
You have two choices:

(1) Modify the main build.xml to dynamically build your register.sh script, in 
a manner similar to the way it dynamically builds connectors.xml;
(2) Create a new command class in 
framework/pull-agent/src/main/java/org/apache/manifoldcf/crawler, which parses 
a connectors.xml and performs the corresponding modification (and nothing 
else), and then call that command class from your script.  You may be able to 
move the code for this from framework/jettyrunner, and change the jettyrunner 
code so it is not duplicated.

I'm not sure which is better.  It depends on how people typically deploy the 
multiprocess version.

  was (Author: kwri...@metacarta.com):
Hi Abe-san,

connectors.xml is read by the QuickStart java startup class only at this time.  
You have two choices:

(1) Modify the main build.xml to dynamically build your register.sh script, in 
a manner similar to the way it dynamically builds connectors.xml;
(2) Create a new command class in 
framework/pull-agent/src/main/java/org/apache/manifoldcf/crawler, which parses 
a connectors.xml and performs the corresponding modification (and nothing 
else).  You may be able to move the code for this from framework/jettyrunner, 
and change the jettyrunner code so it is not duplicated.

I'm not sure which is better.  It depends on how people typically deploy the 
multiprocess version.
  
 An example multi-process properties.xml delivered to the dist folder would 
 be very helpful
 

 Key: CONNECTORS-313
 URL: https://issues.apache.org/jira/browse/CONNECTORS-313
 Project: ManifoldCF
  Issue Type: Improvement
  Components: Build
Affects Versions: ManifoldCF 0.4
Reporter: Shinichiro Abe
Assignee: Shinichiro Abe
Priority: Minor
 Fix For: ManifoldCF next

 Attachments: CONNECTOR-313.patch, CONNECTOR-313.patch, 
 register-draft.sh


 The multiprocess setup does not have an example properties.xml file.  We 
 should deliver one, in the right place so that all the scripts find it (the 
 dist directory).  It would also be helpful to deliver into this directory 
 scripts for:
 - Registering all the connectors that were built
 - Starting the agents process

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Issue Comment Edited] (CONNECTORS-308) ant test fails due to Alfresco test problem

2011-12-07 Thread Karl Wright (Issue Comment Edited) (JIRA)

[ 
https://issues.apache.org/jira/browse/CONNECTORS-308?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13164435#comment-13164435
 ] 

Karl Wright edited comment on CONNECTORS-308 at 12/7/11 2:48 PM:
-

Some research on the alfresco.war 3.4.d community edition:  It is not 
downloadable without considerable work.  It's embedded in a zip file that can 
only be downloaded from a server protected by basic auth, whose credentials are 
supplied in Javascript in encoded form.  It's under web-server/webapps in this 
zip.

I'd therefore recommend that we put the onus on the user to supply this war 
file in order for the integration tests to run.  Any objection to this approach?

I'm not really sure what the Maven implications would be either, but clearly 
the current approach of building the war ourselves using Alfresco materials is 
certainly a concern.  The license for the war itself is GPL, so we cannot just 
check it into svn and be done with it either.

  was (Author: kwri...@metacarta.com):
Some research on the alfresco.war 3.4.d community edition:  It is not 
downloadable without considerable work.  It's embedded in a zip file that can 
only be downloaded from a server protected by basic auth, whose credentials are 
supplied in Javascript in encoded form.  It's under web-server/webapps in this 
zip.

I'd therefore recommend that we put the onus on the user to supply this war 
file in order for the integration tests to run.  Any objection to this approach?

I'm not really sure what the Maven implications would be either, but clearly 
the current approach of building the war ourselves using Alfresco materials is 
certainly a concern.
  
 ant test fails due to Alfresco test problem
 -

 Key: CONNECTORS-308
 URL: https://issues.apache.org/jira/browse/CONNECTORS-308
 Project: ManifoldCF
  Issue Type: Bug
  Components: Alfresco connector
Affects Versions: ManifoldCF 0.4
 Environment: Windows (although this shouldn't matter)
Reporter: Karl Wright
Assignee: Piergiorgio Lucidi
 Fix For: ManifoldCF 0.4


 I get this error when I do ant test:
 {code}
 ...
 setup-alfresco-connector-tests:
  [copy] Copying 6 files to C:\wip\mcf\trunk\connectors\alfresco\lib
 BUILD FAILED
 C:\wip\mcf\trunk\build.xml:316: The following error occurred while executing 
 this line:
 C:\wip\mcf\trunk\tests\alfresco-war\build.xml:25: The following error 
 occurred while executing this line:
 C:\wip\mcf\trunk\tests\alfresco-war\tools\ant\tasks\tomcat-tasks.xml:30: 
 C:\wip\mcf\trunk\${appserver.dir}\server\lib does not exist.
 Total time: 18 seconds
 C:\wip\mcf\trunk
 {code}

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Issue Comment Edited] (CONNECTORS-287) An Alfresco connector would be helpful

2011-11-28 Thread Karl Wright (Issue Comment Edited) (JIRA)

[ 
https://issues.apache.org/jira/browse/CONNECTORS-287?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13158868#comment-13158868
 ] 

Karl Wright edited comment on CONNECTORS-287 at 11/28/11 10:34 PM:
---

I built it and tried to run, w/o a repository to connect to, and got this:

{code}
org.alfresco.webservice.util.WebServiceException: Error creating authentication 
service: java.net.MalformedURLException: no protocol: balh/AuthenticationService

at 
org.alfresco.webservice.util.WebServiceFactory.getAuthenticationService(WebServiceFactory.java:167)
at 
org.alfresco.webservice.util.AuthenticationUtils.startSession(AuthenticationUtils.java:73)
at 
org.apache.manifoldcf.crawler.connectors.alfresco.AlfrescoRepositoryConnector.getSession(AlfrescoRepositoryConnector.java:147)
at 
org.apache.manifoldcf.crawler.connectors.alfresco.AlfrescoRepositoryConnector.checkConnection(AlfrescoRepositoryConnector.java:179)
at 
org.apache.manifoldcf.crawler.connectors.alfresco.AlfrescoRepositoryConnector.check(AlfrescoRepositoryConnector.java:115)
at 
org.apache.jsp.viewconnection_jsp._jspService(viewconnection_jsp.java:326)
at org.apache.jasper.runtime.HttpJspBase.service(HttpJspBase.java:70)
at javax.servlet.http.HttpServlet.service(HttpServlet.java:820)
at 
org.apache.jasper.servlet.JspServletWrapper.service(JspServletWrapper.java:377)
at 
org.apache.jasper.servlet.JspServlet.serviceJspFile(JspServlet.java:313)
at org.apache.jasper.servlet.JspServlet.service(JspServlet.java:260)
at javax.servlet.http.HttpServlet.service(HttpServlet.java:820)
at 
org.mortbay.jetty.servlet.ServletHolder.handle(ServletHolder.java:511)
at 
org.mortbay.jetty.servlet.ServletHandler.handle(ServletHandler.java:401)
at 
org.mortbay.jetty.security.SecurityHandler.handle(SecurityHandler.java:216)
at 
org.mortbay.jetty.servlet.SessionHandler.handle(SessionHandler.java:182)
at 
org.mortbay.jetty.handler.ContextHandler.handle(ContextHandler.java:766)
at org.mortbay.jetty.webapp.WebAppContext.handle(WebAppContext.java:450)
at org.mortbay.jetty.servlet.Dispatcher.forward(Dispatcher.java:327)
at org.mortbay.jetty.servlet.Dispatcher.forward(Dispatcher.java:126)
at 
org.apache.jasper.runtime.PageContextImpl.doForward(PageContextImpl.java:706)
at 
org.apache.jasper.runtime.PageContextImpl.forward(PageContextImpl.java:677)
...
{code}

This caused a blank screen to come back from the view connection page.

So I recommend the following:

(1) Since it seems like you need to enter a URL for the endpoint field, you 
might want to consider requesting a protocol, server, port, and path instead of 
a free-form URL.  This will guarantee that the URL you construct will be valid.

(2) You should be catching any exceptions thrown by the API methods.  I'm not 
quite sure how you are getting something thrown that is not being caught, 
unless the exception thrown is derived from RuntimeException or Error.  In any 
case you will need to figure out how to correctly catch and handle whatever 
exceptions can be thrown by the API, and convert or wrap these in ManifoldCF 
exceptions.


  was (Author: kwri...@metacarta.com):
I built it and tried to run, w/o a repository to connect to, and got this:

{code}
org.alfresco.webservice.util.WebServiceException: Error creating authentication 
service: java.net.MalformedURLException: no protocol: balh/AuthenticationService

at 
org.alfresco.webservice.util.WebServiceFactory.getAuthenticationService(WebServiceFactory.java:167)
at 
org.alfresco.webservice.util.AuthenticationUtils.startSession(AuthenticationUtils.java:73)
at 
org.apache.manifoldcf.crawler.connectors.alfresco.AlfrescoRepositoryConnector.getSession(AlfrescoRepositoryConnector.java:147)
at 
org.apache.manifoldcf.crawler.connectors.alfresco.AlfrescoRepositoryConnector.checkConnection(AlfrescoRepositoryConnector.java:179)
at 
org.apache.manifoldcf.crawler.connectors.alfresco.AlfrescoRepositoryConnector.check(AlfrescoRepositoryConnector.java:115)
at 
org.apache.jsp.viewconnection_jsp._jspService(viewconnection_jsp.java:326)
at org.apache.jasper.runtime.HttpJspBase.service(HttpJspBase.java:70)
at javax.servlet.http.HttpServlet.service(HttpServlet.java:820)
at 
org.apache.jasper.servlet.JspServletWrapper.service(JspServletWrapper.java:377)
at 
org.apache.jasper.servlet.JspServlet.serviceJspFile(JspServlet.java:313)
at org.apache.jasper.servlet.JspServlet.service(JspServlet.java:260)
at javax.servlet.http.HttpServlet.service(HttpServlet.java:820)
at 
org.mortbay.jetty.servlet.ServletHolder.handle(ServletHolder.java:511)
at 

[jira] [Issue Comment Edited] (CONNECTORS-287) An Alfresco connector would be helpful

2011-11-28 Thread Karl Wright (Issue Comment Edited) (JIRA)

[ 
https://issues.apache.org/jira/browse/CONNECTORS-287?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13158868#comment-13158868
 ] 

Karl Wright edited comment on CONNECTORS-287 at 11/28/11 10:35 PM:
---

I built it and tried to run, w/o a repository to connect to, and got this:

{code}
org.alfresco.webservice.util.WebServiceException: Error creating authentication 
service: java.net.MalformedURLException: no protocol: balh/AuthenticationService

at 
org.alfresco.webservice.util.WebServiceFactory.getAuthenticationService(WebServiceFactory.java:167)
at 
org.alfresco.webservice.util.AuthenticationUtils.startSession(AuthenticationUtils.java:73)
at 
org.apache.manifoldcf.crawler.connectors.alfresco.AlfrescoRepositoryConnector.getSession(AlfrescoRepositoryConnector.java:147)
at 
org.apache.manifoldcf.crawler.connectors.alfresco.AlfrescoRepositoryConnector.checkConnection(AlfrescoRepositoryConnector.java:179)
at 
org.apache.manifoldcf.crawler.connectors.alfresco.AlfrescoRepositoryConnector.check(AlfrescoRepositoryConnector.java:115)
at 
org.apache.jsp.viewconnection_jsp._jspService(viewconnection_jsp.java:326)
at org.apache.jasper.runtime.HttpJspBase.service(HttpJspBase.java:70)
at javax.servlet.http.HttpServlet.service(HttpServlet.java:820)
at 
org.apache.jasper.servlet.JspServletWrapper.service(JspServletWrapper.java:377)
at 
org.apache.jasper.servlet.JspServlet.serviceJspFile(JspServlet.java:313)
at org.apache.jasper.servlet.JspServlet.service(JspServlet.java:260)
at javax.servlet.http.HttpServlet.service(HttpServlet.java:820)
at 
org.mortbay.jetty.servlet.ServletHolder.handle(ServletHolder.java:511)
at 
org.mortbay.jetty.servlet.ServletHandler.handle(ServletHandler.java:401)
at 
org.mortbay.jetty.security.SecurityHandler.handle(SecurityHandler.java:216)
at 
org.mortbay.jetty.servlet.SessionHandler.handle(SessionHandler.java:182)
at 
org.mortbay.jetty.handler.ContextHandler.handle(ContextHandler.java:766)
at org.mortbay.jetty.webapp.WebAppContext.handle(WebAppContext.java:450)
at org.mortbay.jetty.servlet.Dispatcher.forward(Dispatcher.java:327)
at org.mortbay.jetty.servlet.Dispatcher.forward(Dispatcher.java:126)
at 
org.apache.jasper.runtime.PageContextImpl.doForward(PageContextImpl.java:706)
at 
org.apache.jasper.runtime.PageContextImpl.forward(PageContextImpl.java:677)
...
{code}

This caused a blank screen to come back from the view connection page.

So I recommend the following:

(1) Since it seems like you need to enter a URL for the endpoint field, you 
might want to consider requesting a protocol, server, port, and path instead of 
a free-form URL.  Pick the protocol from a drop-down, and use javascript to 
check that the server has no / characters and the port is either blank or a 
number.  This will guarantee that the URL you construct will be valid.

(2) You should be catching any exceptions thrown by the API methods.  I'm not 
quite sure how you are getting something thrown that is not being caught, 
unless the exception thrown is derived from RuntimeException or Error.  In any 
case you will need to figure out how to correctly catch and handle whatever 
exceptions can be thrown by the API, and convert or wrap these in ManifoldCF 
exceptions.


  was (Author: kwri...@metacarta.com):
I built it and tried to run, w/o a repository to connect to, and got this:

{code}
org.alfresco.webservice.util.WebServiceException: Error creating authentication 
service: java.net.MalformedURLException: no protocol: balh/AuthenticationService

at 
org.alfresco.webservice.util.WebServiceFactory.getAuthenticationService(WebServiceFactory.java:167)
at 
org.alfresco.webservice.util.AuthenticationUtils.startSession(AuthenticationUtils.java:73)
at 
org.apache.manifoldcf.crawler.connectors.alfresco.AlfrescoRepositoryConnector.getSession(AlfrescoRepositoryConnector.java:147)
at 
org.apache.manifoldcf.crawler.connectors.alfresco.AlfrescoRepositoryConnector.checkConnection(AlfrescoRepositoryConnector.java:179)
at 
org.apache.manifoldcf.crawler.connectors.alfresco.AlfrescoRepositoryConnector.check(AlfrescoRepositoryConnector.java:115)
at 
org.apache.jsp.viewconnection_jsp._jspService(viewconnection_jsp.java:326)
at org.apache.jasper.runtime.HttpJspBase.service(HttpJspBase.java:70)
at javax.servlet.http.HttpServlet.service(HttpServlet.java:820)
at 
org.apache.jasper.servlet.JspServletWrapper.service(JspServletWrapper.java:377)
at 
org.apache.jasper.servlet.JspServlet.serviceJspFile(JspServlet.java:313)
at org.apache.jasper.servlet.JspServlet.service(JspServlet.java:260)
at 

[jira] [Issue Comment Edited] (CONNECTORS-256) Connector for crawling Wikis

2011-10-14 Thread Karl Wright (Issue Comment Edited) (JIRA)

[ 
https://issues.apache.org/jira/browse/CONNECTORS-256?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13127534#comment-13127534
 ] 

Karl Wright edited comment on CONNECTORS-256 at 10/14/11 1:30 PM:
--

bq. Also, information about the last_modified date was missing completely.

The last_modified date I did not include in the metadata indexed; this is why I 
wanted other folks to try it out.  I've opened a new ticket to capture that 
enhancement: CONNECTORS-273.


  was (Author: kwri...@metacarta.com):
bq. Also, information about the last_modified date was missing completely.

The last_modified date I did not include in the metadata indexed; this is why I 
wanted other folks to try it out.  I'll open a new ticket to capture that 
enhancement.

  
 Connector for crawling Wikis
 

 Key: CONNECTORS-256
 URL: https://issues.apache.org/jira/browse/CONNECTORS-256
 Project: ManifoldCF
  Issue Type: New Feature
  Components: Wiki connector
Affects Versions: ManifoldCF 0.4
Reporter: Karl Wright
Assignee: Karl Wright
 Fix For: ManifoldCF 0.4


 People have been trying to crawl wikis with ManifoldCF, but using the generic 
 crawler is not a good way to do this.  Instead, it looks like we really could 
 use a wiki connector, which would understand the wiki API and thus crawl wiki 
 content quickly and effectively.
 Some pertinent API references follow:
 I don't know if it is possible to link to a wiki document with just the 
 pageid, but it is possible to to get the url for the referring pageid via api:
 http://en.wikipedia.org/w/api.php?action=queryprop=infopageids=27697087inprop=url
 It is possible to get the metadata of a document using the pages id (instead 
 of title) directly:
 Titel - 
 http://en.wikipedia.org/w/api.php?action=queryprop=revisionstitles=APIrvprop=timestamp|user|comment|content
 PageID - 
 http://en.wikipedia.org/w/api.php?action=queryprop=revisionspageids=27697087rvprop=timestamp|user|comment|content
 - There needs to be some notion of an overall list of pages:
- http://www.mediawiki.org/wiki/API:Allpages
- Example: 
 http://en.wikipedia.org/w/api.php?action=querylist=allpagesapfrom=Kreaplimit=5
 - Metadata information (author and pub date) also needs to be separated out 
 in some way:
- http://www.mediawiki.org/wiki/API:Properties#Revisions:_Example
- Example:  
 http://en.wikipedia.org/w/api.php?action=queryprop=revisionstitles=API|Main%20Pagervprop=timestamp|user|comment|content

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira