Status of CONNECTORS-474, LEGAL-137, and infrastructure

2012-06-06 Thread Karl Wright
Hi Folks, I've gotten direct feedback from Joe Becknell of qBase, who has agreed to test connectors built against my stubs against qBase's bevy of repository instances. So far, the LiveLink connector passes, but he is still grappling with the FileNet and Documentum connectors. With luck,

Svn root moving today or tomorrow

2012-06-13 Thread Karl Wright
Folks, I'm told our svn root is about to move. I'll post the appropriate svn switch command once I've tried it out and made sure I'm not giving bad advice. Karl

Site down

2012-06-15 Thread Karl Wright
Hi, The SVN move messed up our site update cron job and we've got no site at the moment. FWIW, I have fixed the problem and dropped the necessary bits on people.apache.org, but it has not yet replicated and I don't know when it will. Karl

Our site has moved!

2012-06-15 Thread Karl Wright
The new base URL is: http://manifoldcf.apache.org It's svnpubsub'd, so our process for pushing the documentation out does not change at all (except I turned off my crontab b/c it is no longer helpful). I'll update the wiki shortly as well, and put an .htaccess file in place in the old location

Re: ManifoldCF 0.6 release

2012-06-20 Thread Karl Wright
installing SharePoint :-P This afternoon I'll try to take a look at all these issues to understand where I can contribute. Cheers, PJ 2012/6/19 Karl Wright daddy...@gmail.com Hi folks, The 0.6 release date is coming up (June 30) and we still have quite a number of open tickets rattling

[VOTE] Release apache-manifoldcf-solr-3.x-plugin-0.2 RC0

2012-06-27 Thread Karl Wright
This plugin has been brought up to date with the Solr/Lucene 3.6 release. I've also updated the README.txt and removed the incubation disclaimer. You can download the artifacts from http://people.apache.org/~kwright/apache-manifoldcf-solr-3.x-plugin-0.2, or find a tag in SVN at

RE: Exporting crawler configuration easier?

2012-06-28 Thread Karl Wright
seems reasonable, I can work on this issue. Since I'm going to Shanghai tomorrow, I'm afraid that I have to finish my contribution when I'm back. Erlend On 27.06.12 13.33, Karl Wright wrote: The fact that the export is a zip is not supposed to be used to actually edit the stored information

[RESULT][VOTE] Release Apache ManifoldCF 0.6 RC0

2012-07-13 Thread Karl Wright
Three +1's, 72 hours. Vote passes! Karl On Fri, Jul 13, 2012 at 7:50 AM, Jukka Zitting jukka.zitt...@gmail.com wrote: Hi, On Mon, Jul 9, 2012 at 2:34 AM, Karl Wright daddy...@gmail.com wrote: All 0.6 tickets are now resolved. I've therefore created a release candidate that you can look

Re: Going emeritus

2012-07-18 Thread Karl Wright
A big thank you, Tommaso, for all your help! Best of luck on your other endeavors. Karl On Wed, Jul 18, 2012 at 6:13 AM, Tommaso Teofili tommaso.teof...@gmail.com wrote: Hi all, when I stepped up as a mentor for Incubating ManifoldCF back some months ago my personal goal was mostly helping

[RESULT][VOTE] Release ManifoldCF SharePoint 2007 Plugin 0.2, RC0

2012-07-23 Thread Karl Wright
Three +1's, 72 hours. Vote passes! Karl On Mon, Jul 23, 2012 at 4:12 AM, Karl Wright daddy...@gmail.com wrote: Yes, I remember now the discussion last time about the .sha1 files. I'll modify the scripts now so that's not missed again. Karl On Mon, Jul 23, 2012 at 3:58 AM, Jukka Zitting

Proposed August 2012 board report

2012-07-30 Thread Karl Wright
Please let me know if you would like to see any changes. [REPORT] ManifoldCF Board Report, ManifoldCF PMC ManifoldCF PMC Chair: Karl Wright (kwri...@apache.org) Date: August 2012 Project description == ManifoldCF is an effort to provide an open source framework for connecting

[VOTE] Release Apache ManifoldCF SharePoint 2007 Plugin 0.3, RC0

2012-08-09 Thread Karl Wright
Hi all, Sorry for the short release cycle for this plugin; I know we just released 0.2. However, a critical bug was discovered by a team in Turkey, which prevented the plug-in from working correctly in some locales. See CHANGES.txt for details. (I am also expecting the first release of the

RE: Unknown state; please contact ManifoldCF developer group

2012-08-13 Thread Karl Wright
Hi Ahmet, Can you create a jira ticket for this? I will look at it when I get home. Thanks Karl Sent from my Windows Phone From: Ahmet Arslan Sent: 8/13/2012 4:03 PM To: dev@manifoldcf.apache.org Subject: Unknown state; please contact ManifoldCF developer group Hello, I just 'svn co' fresh

Re: Unknown state; please contact ManifoldCF developer group

2012-08-13 Thread Karl Wright
I've checked in a fix. Can you synch up and see whether it works for you? On Mon, Aug 13, 2012 at 4:37 PM, Ahmet Arslan iori...@yahoo.com wrote: Can you create a jira ticket for this? Karl, I created CONNECTORS-505 for this.

Re: SharePoint: Error closing connection to file

2012-08-13 Thread Karl Wright
There are two different issues here. The first one is that you are having a connection close on you; not sure the reason why, but could potentially be caused by a Tika exception in Solr. The second is that the refactored WorkerThread code I checked in Sunday might have a bug in handling

Re: SharePoint: Error closing connection to file

2012-08-14 Thread Karl Wright
) at org.apache.manifoldcf.crawler.system.SeedingActivity.doneSeeding(SeedingActivity.java:165) at org.apache.manifoldcf.crawler.system.StartupThread.run(StartupThread.java:181) --- On Tue, 8/14/12, Karl Wright daddy...@gmail.com wrote: From: Karl Wright daddy...@gmail.com Subject: Re: SharePoint

Winding down the 0.7 release, already??

2012-08-27 Thread Karl Wright
Hi Folks, It's already time to start winding down the 0.7 release. Before this is done, I think we need the following: (1) Voting on the current outstanding SharePoint-2007 plugin release. Still need 2 votes. (2) Completion of, and voting on the new SharePoint-2010 plugin release. (3)

Re: maven build/support

2012-09-03 Thread Karl Wright
Hi Ahmet, Yes, I saw your proposed contribution, but the dependency on running ant first was a problem for me, because for some things (notably the alfresco war) we have to run Maven first to build it. :-/. However, there is an ant plugin for Maven that you might be able to make use of to do the

[WITHDRAW][VOTE] Release Apache ManifoldCF SharePoint 2010 plugin 0.1 RC0

2012-09-06 Thread Karl Wright
I conclude that the plugin is not handling paging properly - there's no other explanation. So I am canceling the vote and will try to check in a fix. Karl On Thu, Sep 6, 2012 at 1:11 PM, Karl Wright daddy...@gmail.com wrote: It looks like two problems here. First, it looks like Solr

Re: [WITHDRAW][VOTE] Release Apache ManifoldCF SharePoint 2010 plugin 0.1 RC0

2012-09-06 Thread Karl Wright
Thanks for trying this. Just as a check I increased the number of documents that will be requested on the connector side to 1. If you synch up trunk and try again, it should give me an idea whether the failing logic is on the connector side or the server side. (I suspect it is still the

Re: [WITHDRAW][VOTE] Release Apache ManifoldCF SharePoint 2010 plugin 0.1 RC0

2012-09-06 Thread Karl Wright
I also just updated the plugin at sharepoint-2010/trunk to precisely follow a Microsoft example I found. Could you give this a try as well? Karl On Thu, Sep 6, 2012 at 5:08 PM, Karl Wright daddy...@gmail.com wrote: Thanks for trying this. Just as a check I increased the number of documents

Re: [VOTE] Release Apache ManifoldCF SharePoint 2007 Plugin 0.3, RC0

2012-09-06 Thread Karl Wright
+1 from me as well. Karl On Fri, Sep 7, 2012 at 1:34 AM, Jukka Zitting jukka.zitt...@gmail.com wrote: Hi, On Wed, Sep 5, 2012 at 7:33 PM, Karl Wright daddy...@gmail.com wrote: Still need 2 votes for this... +1 from me BR, Jukka Zitting

Warning: need to rerun ant make-core-deps

2012-09-07 Thread Karl Wright
Hello all, A warning: The content of the lib directory on trunk has changed! You will need to rerun ant make-core-deps if you are using a trunk checkout. Thanks, Karl

Re: [WITHDRAW][VOTE] Release Apache ManifoldCF SharePoint 2010 plugin 0.1 RC0

2012-09-07 Thread Karl Wright
of libraries that you have configured. In the document history I only see three libraries shown in the Document Status view: - / - /My Custom Library 1// - /My Custom Library 2// Is this correct? Hope this helps. Piergiorgio 2012/9/6 Karl Wright daddy...@gmail.com I also just updated

RE: [WITHDRAW][VOTE] Release Apache ManifoldCF SharePoint 2010

2012-09-08 Thread Karl Wright
Library 2// Is this correct? Hope this helps. Piergiorgio 2012/9/6 Karl Wright daddy...@gmail.com I also just updated the plugin at sharepoint-2010/trunk to precisely follow a Microsoft example I found. Could you give this a try as well? Karl On Thu, Sep 6

[VOTE] Release Apache ManifoldCF Sharepoint 2010 plugin 0.1, RC1

2012-09-09 Thread Karl Wright
Please vote +1 if you think the SharePoint 2010 plugin is ready for release. Tag in the usual place (https://svn.apache.org/repos/asf/manifoldcf/integration/sharepoint-2010/tags/release-0.1-RC1). Artifact at http://people.apache.org/~kwright/apache-manifoldcf-sharepoint-2010-plugin-0.1. Fixed:

Re: [VOTE] Release Apache ManifoldCF 1.0, RC0

2012-09-21 Thread Karl Wright
test on OS X. - My new file encryption function. I'll wait with my vote until the above behaviour is explained. Erlend On 21.09.12 02.46, Karl Wright wrote: Please vote +1 to release ManifoldCF 1.0, RC0. The release artifact can be found at: http://people.apache.org/~kwright/apache

Re: [VOTE] Release Apache ManifoldCF 1.0, RC0

2012-09-21 Thread Karl Wright
e.f.gara...@usit.uio.no wrote: On 21.09.12 14.07, Karl Wright wrote: If you think the agents process is running, and yet you cannot delete a job, you should see stuff in the manifoldcf log that would indicate what the trouble likely is. solr-test02 mcf-1 $ /www/var/data/mcf/mcf-1/agentctl start

Re: [VOTE] Release Apache ManifoldCF 1.0, RC0

2012-09-21 Thread Karl Wright
and connections when you upgrade. Nor is unregistering/reregistering your connectors. I'm going to spin a new RC with the NPE fix and a bunch of other things - but I'm pretty certain the mystery is solved. Thanks, Karl On Fri, Sep 21, 2012 at 10:17 AM, Karl Wright daddy...@gmail.com wrote: Thanks

[WITHDRAW][VOTE] Release Apache ManifoldCF 1.0, RC0

2012-09-21 Thread Karl Wright
Withdrawn; spinning RC1 now. Karl On Fri, Sep 21, 2012 at 2:55 PM, Karl Wright daddy...@gmail.com wrote: Hi, I see what has happened here. You unregistered the connectors before you deleted the job. That basically meant that the job cleanup can't take place until the connector(s

[VOTE] Release Apache ManifoldCF 1.0, RC1

2012-09-21 Thread Karl Wright
Please vote +1 to release ManifoldCF 1.0, RC1. The release artifact can be found at: http://people.apache.org/~kwright/apache-manifoldcf-1.0 There is also an SVN tag at: https://svn.apache.org/repos/asf/manifoldcf/tags/release-1.0-RC1 Fixes since RC0: CONNECTORS-532 CONNECTORS-533

Re: [VOTE] Release Apache ManifoldCF 1.0, RC1

2012-09-24 Thread Karl Wright
resolved this issue. I tested other functionalities and it seems all ok. So I think that if we build another RC, this time it could be ok :) Piergiorgio 2012/9/23 Karl Wright daddy...@gmail.com Examined the distribution for leakage of files that shouldn't be there, ran ant rat-sources

[VOTE] Release Apache ManifoldCF 1.0, RC3

2012-09-25 Thread Karl Wright
Please vote +1 to release ManifoldCF 1.0, RC3. The release artifact can be found at: http://people.apache.org/~kwright/apache-manifoldcf-1.0 There is also an SVN tag at: https://svn.apache.org/repos/asf/manifoldcf/tags/release-1.0-RC3 Fixes since RC2: CONNECTORS-540

Re: [VOTE] Release Apache ManifoldCF 1.0, RC3

2012-09-25 Thread Karl Wright
binary in the same way - checked signatures +1 from me Piergiorgio 2012/9/25 Karl Wright daddy...@gmail.com Please vote +1 to release ManifoldCF 1.0, RC3. The release artifact can be found at: http://people.apache.org/~kwright/apache-manifoldcf-1.0 There is also an SVN tag at: https

Re: [VOTE] Release Apache ManifoldCF 1.0, RC3

2012-09-27 Thread Karl Wright
On 26.09.12 17.55, Karl Wright wrote: Usually application servers unpack the war somewhere. Unless you remove the place where it is unpacked you will continue to have the applications even after the war is gone. Karl On Wed, Sep 26, 2012 at 11:52 AM, Erlend Garåsen e.f.gara...@usit.uio.no wrote

[VOTE] Release Apache ManifoldCF 1.0, RC4

2012-09-27 Thread Karl Wright
Please vote +1 to release ManifoldCF 1.0, RC4. The release artifact can be found at: http://people.apache.org/~kwright/apache-manifoldcf-1.0 There is also an SVN tag at: https://svn.apache.org/repos/asf/manifoldcf/tags/release-1.0-RC4 Fixes since RC3: CONNECTORS-544 (also added example

Re: [VOTE] Release Apache ManifoldCF 1.0, RC4

2012-09-27 Thread Karl Wright
Ran all tests, tried the combined war, tried single-process example and multiprocess example. +1 (and I really hope this is the last RC for this release) Karl On Thu, Sep 27, 2012 at 5:30 PM, Karl Wright daddy...@gmail.com wrote: Please vote +1 to release ManifoldCF 1.0, RC4. The release

[VOTE] Release Apache ManifoldCF 1.0, RC5

2012-09-28 Thread Karl Wright
Please vote +1 to release ManifoldCF 1.0, RC5. The release artifact can be found at: http://people.apache.org/~kwright/apache-manifoldcf-1.0 There is also an SVN tag at: https://svn.apache.org/repos/asf/manifoldcf/tags/release-1.0-RC5 Fixes since RC4: CONNECTORS-545 Fixes since RC3:

Re: [VOTE] Release Apache ManifoldCF 1.0, RC5

2012-09-28 Thread Karl Wright
these documents over and over again during this weekend. Erlend On 28.09.12 09.58, Karl Wright wrote: Please vote +1 to release ManifoldCF 1.0, RC5. The release artifact can be found at: http://people.apache.org/~kwright/apache-manifoldcf-1.0 There is also an SVN tag at: https

Re: [VOTE] Release Apache ManifoldCF 1.0, RC5

2012-09-28 Thread Karl Wright
I leave because I'm afraid that MCF will try to fetch these documents over and over again during this weekend. Erlend On 28.09.12 09.58, Karl Wright wrote: Please vote +1 to release ManifoldCF 1.0, RC5. The release artifact can be found at: http://people.apache.org/~kwright/apache

Re: [VOTE] Release Apache ManifoldCF 1.0, RC5

2012-09-28 Thread Karl Wright
CONNECTORS-547 (index out of bounds) CONNECTORS-548 (cannot build with maven) Karl On Fri, Sep 28, 2012 at 7:26 AM, Karl Wright daddy...@gmail.com wrote: Meanwhile, the following is filling up my log: FATAL 2012-09-28 11:42:32,112 (Worker thread '29') - Error tossed: String index out of range

Re: [VOTE] Release Apache ManifoldCF 1.0, RC6

2012-09-28 Thread Karl Wright
Exercised it as I have before, and added postgresql and mysql tests. +1 Karl On Fri, Sep 28, 2012 at 9:38 AM, Karl Wright daddy...@gmail.com wrote: Please vote +1 to release ManifoldCF 1.0, RC6. The release artifact can be found at: http://people.apache.org/~kwright/apache-manifoldcf-1.0

[WITHDRAW][VOTE] Release Apache ManifoldCF 1.0, RC6

2012-09-30 Thread Karl Wright
Withdrawn because of CONNECTORS-549. Karl On Sun, Sep 30, 2012 at 8:07 AM, Piergiorgio Lucidi piergior...@apache.org wrote: -1 from me. I found an issue on the CMIS Connector (CONNECTORS-549), I'm committing the patch to fix the problem. We need another RC. Piergiorgio 2012/9/28 Karl

[VOTE] Release Apache ManifoldCF 1.0, RC7

2012-09-30 Thread Karl Wright
Please vote +1 to release ManifoldCF 1.0, RC7. The release artifact can be found at: http://people.apache.org/~kwright/apache-manifoldcf-1.0 There is also an SVN tag at: https://svn.apache.org/repos/asf/manifoldcf/tags/release-1.0-RC7 Fixes since RC6: CONNECTORS-549 Fixes since RC5:

Re: [VOTE] Release Apache ManifoldCF 1.0, RC7

2012-09-30 Thread Karl Wright
Karl Wright daddy...@gmail.com Please vote +1 to release ManifoldCF 1.0, RC7. The release artifact can be found at: http://people.apache.org/~kwright/apache-manifoldcf-1.0 There is also an SVN tag at: https://svn.apache.org/repos/asf/manifoldcf/tags/release-1.0-RC7 Fixes since RC6

Re: [VOTE] Release Apache ManifoldCF 1.0, RC7

2012-10-02 Thread Karl Wright
/release-1.0-RC7/build.xml:2529: A zip file cannot include itself Ahmet --- On Sun, 9/30/12, Karl Wright daddy...@gmail.com wrote: From: Karl Wright daddy...@gmail.com Subject: Re: [VOTE] Release Apache ManifoldCF 1.0, RC7 To: dev@manifoldcf.apache.org Date: Sunday, September 30, 2012, 9

Re: [VOTE] Release Apache ManifoldCF 1.0, RC7

2012-10-02 Thread Karl Wright
Oh, and by the way, I used ant image to create the release, which calls the create-source-zip target. So it's still working for me just fine. Hmmm. Karl On Tue, Oct 2, 2012 at 2:41 PM, Karl Wright daddy...@gmail.com wrote: The ant build hasn't changed in this regard, but maybe ant has

Re: [VOTE] Release Apache ManifoldCF 1.0, RC7

2012-10-02 Thread Karl Wright
Mine is: C:\wip\mcf\trunk>which ant c:\ant\apache-ant-1.8.4\bin/ant.bat Are you running on Windows, or Linux? Karl On Tue, Oct 2, 2012 at 2:54 PM, Ahmet Arslan iori...@yahoo.com wrote: This used to exclude the zip from itself. What version of ant are you using? Apache Ant(TM) version 1.8.2

Re: [VOTE] Release Apache ManifoldCF 1.0, RC7

2012-10-02 Thread Karl Wright
Can you try changing the exclude .../ clause I noted earlier to remove the leading / from /apache-manifoldcf-*, and see if that works? I'd hesitate to spin a new kit here unless there's evidence there's a viable fix. Karl On Tue, Oct 2, 2012 at 3:01 PM, Ahmet Arslan iori...@yahoo.com wrote:

Re: [VOTE] Release Apache ManifoldCF 1.0, RC7

2012-10-03 Thread Karl Wright
) at org.eclipse.jetty.util.thread.QueuedThreadPool$3.run(QueuedThreadPool.java:533) at java.lang.Thread.run(Thread.java:680) On 30.09.12 15.12, Karl Wright wrote: Please vote +1 to release ManifoldCF 1.0, RC7. The release artifact can be found at: http://people.apache.org/~kwright/apache

Re: getMaxDocumentRequest problem

2012-10-03 Thread Karl Wright
Hi Maciej, It sounds like your loop condition must be somehow incorrect. You may not receive the full number of documents specified by getMaxDocumentRequest(), but rather a number less than that. We have a number of connectors that use document batches > 1, e.g. the LiveLink connector, so this
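The point about short batches can be illustrated with a minimal Java sketch. It is not ManifoldCF code: the class BatchLoopSketch, the constant MAX_BATCH, and the helper nextBatch are hypothetical names made up for illustration, and the source here deliberately hands back fewer items than the maximum. The only idea taken from the message is that the loop bound should be the size of the batch actually received, not the configured maximum.

```java
import java.util.ArrayList;
import java.util.List;

public class BatchLoopSketch {
    // Hypothetical stand-in for a connector's maximum batch size (not ManifoldCF API).
    static final int MAX_BATCH = 10;

    /** Hypothetical source that may return fewer identifiers than requested. */
    static List<String> nextBatch(List<String> pending, int maxCount) {
        // Deliberately caps each batch at 3 to mimic receiving a short batch.
        int count = Math.min(Math.min(maxCount, 3), pending.size());
        List<String> batch = new ArrayList<>(pending.subList(0, count));
        pending.subList(0, count).clear();
        return batch;
    }

    /** Processes every pending identifier, sizing each pass by what actually arrived. */
    static int processAll(List<String> pending) {
        int processed = 0;
        while (!pending.isEmpty()) {
            List<String> batch = nextBatch(pending, MAX_BATCH);
            // Loop over batch.size(), never over MAX_BATCH.
            for (int i = 0; i < batch.size(); i++) {
                processed++;
            }
        }
        return processed;
    }

    public static void main(String[] args) {
        List<String> ids = new ArrayList<>();
        for (int i = 0; i < 25; i++) {
            ids.add("doc-" + i);
        }
        System.out.println(processAll(ids));
    }
}
```

A loop written against MAX_BATCH instead of batch.size() would either run past the end of a short batch or wait for documents that never arrive, which is the kind of faulty loop condition the reply is guessing at.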

[ANNOUNCE] Apache ManifoldCF 1.0 has been released!

2012-10-04 Thread Karl Wright
To all, Apache ManifoldCF 1.0 has been released. This release introduces support for SharePoint 2010, a new LDAP authority, many bug fixes, a new experimental deployment model (single-process combined war), and full MySQL support. Details can be found at:

Re: question about multiple languages

2012-10-08 Thread Karl Wright
Hi Maciej, Did you intend to send this to the Solr/Lucene dev list? This really isn't a ManifoldCF question. I can help a little perhaps. You are correct that stemming and normalization rules might well differ from language to language, but it is worth noting that for at least normalization it

Re: getMaxDocumentRequest problem

2012-10-09 Thread Karl Wright
in aborting state)... Problem is that it happens irregularly (sometime 10 documents, sometime 1600 and sometime all documents are indexed). Tried to check that locally but on first pass everything went ok... really strange... 2012/10/3 Karl Wright daddy...@gmail.com: Hi Maciej, It sounds

Re: getMaxDocumentRequest problem

2012-10-09 Thread Karl Wright
FWIW, getting thread dumps from the process running the agents process when it is hung may (or may not) help determine the underlying cause. Karl On Tue, Oct 9, 2012 at 9:21 AM, Karl Wright daddy...@gmail.com wrote: What is your deployment model? Is this a multiprocess deployment? What

Re: getMaxDocumentRequest problem

2012-10-09 Thread Karl Wright
logs so I could see verbose output from core functions? 2012/10/9 Karl Wright daddy...@gmail.com: FWIW, getting thread dumps from the process running the agents process when it is hung may (or may not) help determine the underlying cause. Karl On Tue, Oct 9, 2012 at 9:21 AM, Karl Wright

Re: getMaxDocumentRequest problem

2012-10-09 Thread Karl Wright
/9 Karl Wright daddy...@gmail.com: - all worker threads are gone, ??? Really?? yes... really.. this is why I am also writing that this is strange... this is list of currently active threads: system Reference Handler Waiting Finalizer Waiting Signal Dispatcher Running

Re: getMaxDocumentRequest problem

2012-10-09 Thread Karl Wright
...@gmail.com wrote: Well... that is possible :) what exactly ManifoldCFException.INTERRUPTED is doing that could cause such effects? 2012/10/9 Karl Wright daddy...@gmail.com: What JVM are you using? Because frankly this cannot logically happen. The only other possibility is that your code is somehow

[PROPOSAL] Release a ManifoldCF 1.0.1 release

2012-10-09 Thread Karl Wright
Hi folks, Due to the potential severity of CONNECTORS-551, I think it might be a good idea to release a ManifoldCF 1.0.1 release which contains the fix for this ticket. Please can I have a show of hands as to whether people agree that this is serious enough to warrant such a release. Thanks!

Re: [PROPOSAL] Release a ManifoldCF 1.0.1 release

2012-10-10 Thread Karl Wright
Ok, it looks like there is consensus. I will prepare the release-1.0-branch appropriately, and create a release candidate. Karl On Wed, Oct 10, 2012 at 5:28 AM, Erlend Garåsen e.f.gara...@usit.uio.no wrote: +1 Erlend On 09.10.12 22.53, Karl Wright wrote: Hi folks, Due to the potential

[VOTE] Release Apache ManifoldCF 1.0.1, RC0

2012-10-13 Thread Karl Wright
Vote +1 to release Apache ManifoldCF 1.0.1, RC0. You can find a tag at: https://svn.apache.org/repos/asf/manifoldcf/tags/release-1.0.1-RC0 The artifact can be downloaded from: http://people.apache.org/~kwright/apache-manifoldcf-1.0.1 This patch release fixes the critical bug CONNECTORS-551.

Re: My talk about Apache ManifoldCF at LinuxDay Rome 2012

2012-10-15 Thread Karl Wright
Looks great - hope it goes well! I wish I could come but I *just* got back from Berlin yesterday, and I have responsibilities here thru Oct 31. Karl On Mon, Oct 15, 2012 at 4:17 AM, Piergiorgio Lucidi piergior...@apache.org wrote: Hi guys, I would like to share with you that at the next

[WITHDRAW][VOTE] Release Apache ManifoldCF 1.0.1, RC0

2012-10-15 Thread Karl Wright
Looks like maven build is busted, see CONNECTORS-555. Karl On Sun, Oct 14, 2012 at 4:32 AM, Karl Wright daddy...@gmail.com wrote: Ran tests and checked documentation. +1 from me. Karl On Sat, Oct 13, 2012 at 6:16 PM, Karl Wright daddy...@gmail.com wrote: Vote +1 to release Apache

Re: Developing an Email Connector

2012-10-15 Thread Karl Wright
Sounds great! I can't wait to see it. Karl On Mon, Oct 15, 2012 at 6:31 AM, Erlend Garåsen e.f.gara...@usit.uio.no wrote: Karl and I had a short discussion about such a connector in Cambridge some months ago. Now I have created the following ticket regarding an Email Connector:

Re: [DISCUSS] Java coding style

2012-10-15 Thread Karl Wright
As far as I'm concerned, either coding style is OK. But having the wrong number of spaces for indent is not. Nor is using tabs instead of spaces. The only other rule I think we should really enforce is for any significant changes to the style of a class to be checked in or patched independently

[VOTE] Release Apache ManifoldCF 1.0.1, RC1

2012-10-15 Thread Karl Wright
Vote +1 to release Apache ManifoldCF 1.0.1, RC1. You can find a tag at: https://svn.apache.org/repos/asf/manifoldcf/tags/release-1.0.1-RC1 The artifact can be downloaded from: http://people.apache.org/~kwright/apache-manifoldcf-1.0.1 This patch release fixes the critical bug CONNECTORS-551.

New committer: Maciej Liżewski

2012-10-16 Thread Karl Wright
Please join me in welcoming Maciej as our newest committer and PMC member. Maciej brings with him knowledge of wikis, LDAP, and mail protocols, among many other skills. Welcome, Maciej! Karl

Some dev-related questions

2012-10-17 Thread Karl Wright
Hi Maciej, First advice is to post questions of this kind to dev@manifoldcf.apache.org. This functions in part as a repository of general knowledge, and it is searchable, so in the future others can maybe refer to answers there. Please see below for detailed answers. On Wed, Oct 17, 2012 at

Re: Some dev-related questions

2012-10-17 Thread Karl Wright
directory if changes are only in this single connector? You always need to create a whole copy of trunk. svn is very efficient about copies so this is not a problem. Karl 2012/10/17 Karl Wright daddy...@gmail.com Hi Maciej, First advice is to post questions of this kind to dev

[RESULT][VOTE] Release Apache ManifoldCF 1.0.1, RC1

2012-10-18 Thread Karl Wright
+1 from me PJ 2012/10/16 Karl Wright daddy...@gmail.com Downloaded artifacts, looked for leakage of maven target directories, and tried a maven build, which worked. +1 from me. Karl On Mon, Oct 15, 2012 at 7:34 PM, Karl Wright daddy...@gmail.com wrote: Vote +1 to release

Re: My talk about Apache ManifoldCF at LinuxDay Rome 2012

2012-10-29 Thread Karl Wright
://www.open4dev.com/journal/2012/10/28/apache-manifoldcf-at-linuxday-slides.html 2012/10/15 Karl Wright daddy...@gmail.com Looks great - hope it goes well! I wish I could come but I *just* got back from Berlin yesterday, and I have responsibilities here thru Oct 31. Karl On Mon, Oct 15, 2012 at 4:17

Re: My talk about Apache ManifoldCF at LinuxDay Rome 2012

2012-10-31 Thread Karl Wright
of the next week, after the Alfresco DevCon in Berlin, I can work on this task for updating the website ;) Piergiorgio [1] - http://www.open4dev.com/journal/2012/10/28/apache-manifoldcf-at-linuxday-slides.html 2012/10/15 Karl Wright daddy...@gmail.com Looks great - hope it goes well! I

Re: Anybody with a working SharePoint instance?

2012-11-10 Thread Karl Wright
longer than 6 MILLISECONDS DEBUG 2012-11-10 19:37:14,479 (Idle cleanup thread) - Closing connections idle longer than 6 MILLISECONDS --- On Sat, 11/10/12, Karl Wright daddy...@gmail.com wrote: From: Karl Wright daddy...@gmail.com Subject: Anybody with a working SharePoint instance

Re: Anybody with a working SharePoint instance?

2012-11-10 Thread Karl Wright
Ok, can you try it now? I think it is fixed. Karl On Sat, Nov 10, 2012 at 1:20 PM, Karl Wright daddy...@gmail.com wrote: Thanks - it looks like I will need to have it go through a temporary local file in order to work right. I'll make that change and let you know. Karl On Sat, Nov 10

Re: Anybody with a working SharePoint instance?

2012-11-10 Thread Karl Wright
longer than 6 MILLISECONDS --- On Sat, 11/10/12, Karl Wright daddy...@gmail.com wrote: From: Karl Wright daddy...@gmail.com Subject: Re: Anybody with a working SharePoint instance? To: dev@manifoldcf.apache.org Date: Saturday, November 10, 2012, 8:50 PM Ok, can you try it now? I think

Re: Anybody with a working SharePoint instance?

2012-11-11 Thread Karl Wright
Great! I've pulled this code up into trunk. If anyone notices any problems, please let me know. Karl On Sat, Nov 10, 2012 at 10:10 PM, Ahmet Arslan iori...@yahoo.com wrote: Hi Karl, It's all working for me now :) Thanks, Ahmet --- On Sun, 11/11/12, Karl Wright daddy...@gmail.com

Anyone out there using RSS connector, who wants to help?

2012-11-16 Thread Karl Wright
Hi all, The branch https://svn.apache.org/repos/asf/manifoldcf/branches/CONNECTORS-120 contains an RSS connector that has been updated to use httpcomponents 4.2.2. I'd love for people who are in a position to do significant RSS crawling to try it out before I pull it into trunk. Any takers?

RE: Anyone out there using RSS connector, who wants to help?

2012-11-18 Thread Karl Wright
- failure getting document version at org.apache.manifoldcf.crawler.system.WorkerThread.run(WorkerThread.java:339) By the way in Dechromed Content tab (Job Setting UI) I see four &nbsp; Thanks, Ahmet --- On Fri, 11/16/12, Karl Wright daddy...@gmail.com wrote: From: Karl Wright daddy

Re: Anyone out there using RSS connector, who wants to help?

2012-11-18 Thread Karl Wright
, Karl Wright daddy...@gmail.com wrote: Odd. The problem is obviously the port of -1. But the code does not attach a specific port to the URL in that case. I will try your example exactly when I have access to internet again. Karl Sent from my Windows Phone From: Ahmet Arslan Sent: 11/17

Time to try out the web connector in branches/CONNECTORS-120

2012-11-21 Thread Karl Wright
I've ported the web connector, finally, to httpcomponents 4.2.2. This was a lot of work. The areas I anticipate there will be problems will be in exception handling and in session login. If anyone has session-protected sites they typically crawl, it would be great if you could try this code

Re: Anyone out there using RSS connector, who wants to help?

2012-11-24 Thread Karl Wright
! Karl On Tue, Nov 20, 2012 at 7:11 AM, Karl Wright daddy...@gmail.com wrote: Thanks for the update! I'm working on the web connector now. That's going to require a bit more work. Karl On Tue, Nov 20, 2012 at 7:09 AM, Maciej Liżewski maciej.lizew...@gmail.com wrote: CONNECTORS-120

Next release scheduled for 12/31

2012-12-06 Thread Karl Wright
Hello all committers, This is a reminder that our next release is scheduled for 12/31/2012. There are a number of fairly major open tickets out there that I think are almost ready to be included in the release. I am especially thinking of the ldap authority enhancements, and the simple JDBC

Re: Largest crawl

2012-12-12 Thread Karl Wright
ManifoldCF scales based on how well the underlying database handles two kinds of queries - direct access to a row via an index, and reading from an index in ordered fashion. Both of these go up as log(n) assuming b-trees. I have personally done web crawls on the order of 5 million actual content
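The log(n) claim can be made concrete with a small Java sketch. It is a simplified model, not a description of any particular database: the class IndexDepthSketch and the fanout of 100 keys per index page are assumptions chosen for illustration. It counts how many b-tree levels are needed to address a given number of rows, which is roughly the number of page visits for a direct index lookup.

```java
public class IndexDepthSketch {
    /**
     * Smallest number of levels h such that fanout^h >= rows.
     * Simplified model: every page is assumed to hold exactly `fanout` keys.
     */
    static int levels(long rows, int fanout) {
        int h = 1;
        long capacity = fanout;
        while (capacity < rows) {
            capacity *= fanout;
            h++;
        }
        return h;
    }

    public static void main(String[] args) {
        int fanout = 100; // assumed page fanout, for illustration only
        long[] sizes = {5_000L, 5_000_000L, 5_000_000_000L};
        for (long rows : sizes) {
            System.out.println(rows + " rows -> " + levels(rows, fanout) + " levels");
        }
    }
}
```

Under these assumptions, each thousand-fold increase in row count adds only one or two levels to the index, which is why lookup cost grows so slowly as the crawl gets larger.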

Heads up - Derby crawling broken on trunk

2012-12-12 Thread Karl Wright
The derby database support is apparently not able to discover database indexes properly at this time, and that is causing derby crawls to fail. I will be looking at this in detail this morning. Until then, tests don't work; they hang. Karl

Re: Heads up - Derby crawling broken on trunk

2012-12-12 Thread Karl Wright
Ok, this is now fixed. Karl On Wed, Dec 12, 2012 at 5:22 AM, Karl Wright daddy...@gmail.com wrote: The derby database support is apparently not able to discover database indexes properly at this time, and that is causing derby crawls to fail. I will be looking at this in detail this morning

Re: Do we need an org.apache.manifoldcf.core.DBClean command class?

2012-12-18 Thread Karl Wright
DBDrop is in fact used internally by the tests we run. But you have to do the uninstall sequence in the correct order, otherwise, as you say, you are left with table dependencies. The correct order is this: org.apache.manifoldcf.crawler.UnRegisterAll

Re: Do we need an org.apache.manifoldcf.core.DBClean command class?

2012-12-18 Thread Karl Wright
Hmm, somehow you lost a connector jar out of the connector-lib or connector-lib-proprietary area. Deleting the jars before you clean up the database is not going to work. ;-) Karl On Tue, Dec 18, 2012 at 10:26 AM, Erlend Garåsen e.f.gara...@usit.uio.no wrote: Yes, I know the order is

RE: Can't download elastic search

2012-12-23 Thread Karl Wright
Hmm, do you have a firewall where you are? I did a download just yesterday and it worked fine. Karl Sent from my Windows Phone -Original Message- From: Minoru Osuka Sent: 12/23/2012 7:01 AM To: dev@manifoldcf.apache.org Subject: Can't download elastic search

RE: Can't download elastic search

2012-12-23 Thread Karl Wright
Could be. I will look into modifying trunk accordingly if this seems to have changed. In the meantime, you can just download the package by hand and put it in the right place... Karl Sent from my Windows Phone From: Lukáš Vlček Sent: 12/23/2012 10:52 AM To: dev@manifoldcf.apache.org Subject:

Re: Can't download elastic search

2012-12-24 Thread Karl Wright
-materials-proprietary/elasticsearch-0.19.0/plugins/mapper-attachments.zip dest=test-materials-proprietary/elasticsearch-0.19.0/plugins/mapper-attachments/ Fixed download URLs, I have succeeded in make-deps. Thanks, Minoru On Mon, Dec 24, 2012 at 6:50 AM, Karl Wright daddy...@gmail.com wrote

[VOTE] Release Apache ManifoldCF Solr 4.x Plugin, 0.3, RC0

2013-01-01 Thread Karl Wright
Please vote on whether to release the 0.3 version of the Apache ManifoldCF Solr 4.x plugin, RC0. This release simply fixes the following: (1) It builds against the tagged 4.0.0 final release of Lucene/Solr (2) It fixes the test infrastructure to be compatible with late changes made to

[VOTE] Release Apache ManifoldCF Solr 3.x Plugin, 0.3, RC0

2013-01-01 Thread Karl Wright
Please vote on whether to release the 0.3 version of the Apache ManifoldCF Solr 3.x plugin, RC0. This release simply fixes the following: (1) It builds against the tagged 3.6.2 final release of Lucene/Solr The artifact can be downloaded from:

Re: [VOTE] Release Apache ManifoldCF Solr 4.x Plugin, 0.3, RC0

2013-01-01 Thread Karl Wright
Ran tests, which pass. +1 from me. Karl On Tue, Jan 1, 2013 at 3:09 AM, Karl Wright daddy...@gmail.com wrote: Please vote on whether to release the 0.3 version of the Apache ManifoldCF Solr 4.x plugin, RC0. This release simply fixes the following: (1) It builds against the tagged 4.0.0

Re: [VOTE] Release Apache ManifoldCF Solr 3.x Plugin, 0.3, RC0

2013-01-01 Thread Karl Wright
Ran the tests, which of course still pass. +1 from me. Karl On Tue, Jan 1, 2013 at 3:12 AM, Karl Wright daddy...@gmail.com wrote: Please vote on whether to release the 0.3 version of the Apache ManifoldCF Solr 3.x plugin, RC0. This release simply fixes the following: (1) It builds against

Re: [VOTE] Release Apache ManifoldCF Solr 4.x Plugin, 0.3, RC0

2013-01-10 Thread Karl Wright
Is anyone else willing to sign off on this release? Abe-san? Ahmet? Maciej? Erlend? Help! ;-) Karl On Tue, Jan 8, 2013 at 5:30 AM, Piergiorgio Lucidi piergior...@apache.org wrote: Checked signatures. +1 from me. Piergiorgio 2013/1/1 Karl Wright daddy...@gmail.com Ran tests, which

[RESULT][VOTE] Release Apache ManifoldCF Solr 4.x Plugin, 0.3, RC0

2013-01-11 Thread Karl Wright
Three +1's, no -1's, 72 hrs. Vote passes. Karl On Thu, Jan 10, 2013 at 9:25 PM, Shinichiro Abe shinichiro.ab...@gmail.com wrote: Checked filtering documents, it works fine. +1 from me. Shinichiro On 2013/01/10, at 22:22, Karl Wright wrote: Is anyone else willing to sign off

Re: Repeated service interruptions - failure processing document: null

2013-01-14 Thread Karl Wright
Hi Ahmet, The exception that seems to be causing the abort is a socket exception coming from a socket write: Caused by: java.net.SocketException: Broken pipe This makes sense in light of the http code returned from Solr, which was 413: http://www.checkupdown.com/status/E413.html . So there

Re: Repeated service interruptions - failure processing document: null

2013-01-14 Thread Karl Wright
CONNECTORS-609 Karl On Mon, Jan 14, 2013 at 8:30 AM, Karl Wright daddy...@gmail.com wrote: Hi Ahmet, The exception that seems to be causing the abort is a socket exception coming from a socket write: Caused by: java.net.SocketException: Broken pipe This makes sense in light of the http

Re: Repeated service interruptions - failure processing document: null

2013-01-14 Thread Karl Wright
Wright daddy...@gmail.com wrote: From: Karl Wright daddy...@gmail.com Subject: Re: Repeated service interruptions - failure processing document: null To: dev@manifoldcf.apache.org Date: Monday, January 14, 2013, 3:30 PM Hi Ahmet, The exception that seems to be causing the abort

Re: Repeated service interruptions - failure processing document: null

2013-01-14 Thread Karl Wright
I checked in a fix for this ticket on trunk. Please let me know if it resolves this issue. Karl On Mon, Jan 14, 2013 at 10:20 AM, Karl Wright daddy...@gmail.com wrote: This is because httpclient is retrying on error three times by default. This has to be disabled in the Solr connector
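
The retry behaviour described in this thread can be illustrated with a small, language-neutral sketch. It is not the ManifoldCF fix and does not use the httpclient API; `PayloadTooLarge`, `send_with_retries`, `make_oversized_sender`, and `count_attempts` are hypothetical names, and the retry budget of three is taken from the message above. The point it tries to show is that a request the server rejects as too large (HTTP 413) fails the same way on every attempt, so a retry budget only multiplies the failed uploads, while a budget of zero sends the document once.

```python
class PayloadTooLarge(Exception):
    """Hypothetical stand-in for an HTTP 413 response from the server."""


def send_with_retries(send, max_retries):
    """Call send() until it succeeds or the retry budget is spent.

    Returns (result, attempts). Re-raises the last error when every
    attempt has failed.
    """
    attempts = 0
    while True:
        attempts += 1
        try:
            return send(), attempts
        except PayloadTooLarge:
            # One initial attempt plus max_retries retries are allowed.
            if attempts > max_retries:
                raise


def make_oversized_sender(log):
    """Build a sender that always fails, recording each attempt in log."""
    def send():
        log.append("POST oversized document")
        raise PayloadTooLarge("413 Request Entity Too Large")
    return send


def count_attempts(max_retries):
    """Return how many times the oversized document is sent."""
    log = []
    try:
        send_with_retries(make_oversized_sender(log), max_retries)
    except PayloadTooLarge:
        pass
    return len(log)


if __name__ == "__main__":
    print("attempts with 3 retries:", count_attempts(3))
    print("attempts with retries disabled:", count_attempts(0))
```

Under these assumptions, disabling retries does not make the oversized document succeed; it only stops the client from resending a body the server has already refused.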
