correctly. Please review and comment if necessary.
Thanks,
Erlend
On 22.06.12 17.03, Erlend Garåsen wrote:
On 22.06.12 15.08, Karl Wright wrote:
That's OK. Any improvement welcome. ;-)
Shinichiro Abe: Will you assist me in order to translate the
documentation? I have made Japanese screenshots
each
document within a certain time in milliseconds (e.g. 1 for
committing within 10 seconds). The a
href=http://wiki.apache.org/solr/CommitWithin;commit within/a
strategy will leave the responsibility to Solr instead of ManifoldCF.
The tab looks like:
Erlend
--
Erlend Garåsen
Center
be modified to just remove it. Let me check.
Karl
On Wed, Jun 27, 2012 at 12:28 PM, Karl Wright daddy...@gmail.com wrote:
You need to run the mvn-bootstrap.sh script first.
Karl
On Wed, Jun 27, 2012 at 11:09 AM, Erlend Garåsen
e.f.gara...@usit.uio.no wrote:
mvn eclipse:eclipse fails, probably
Since I'm traveling a lot, I'm curious about where you committers live
in the world. I have already met Karl in Boston in May this year, and I
would like to meet other committers as well.
I live in Oslo, Norway's capital and largest city.
Erlend
--
Erlend Garåsen
Center for Information
in the area of SharePoint
development to the project, and we look forward to his continuing
contribution in this area, and any other area he wishes to address.
Please join me in welcoming Ahmet to the community!
Karl
--
Erlend Garåsen
Center for Information Technology Services
University of Oslo
. Basically I'm responsible for promoting
Oslo Solr Community, but I have a lot of time to recommend ManifoldCF to
people who need a new open source search engine for their business.
http://jz12.java.no/
Thanks for doing this!
Karl
Erlend
--
Erlend Garåsen
Center for Information Technology
to
your configuration file. The same passcode along with the seed value are
used to decrypt the file with the ImportConfiguration command class. See
the documentation for the commands and properties above to find the
correct arguments and settings.
Thanks,
Erlend
--
Erlend Garåsen
Center
, execute the
lock-clean procedure, and start everything back up, and see if that
fixed the issue.
Karl
On Fri, Sep 21, 2012 at 9:01 AM, Erlend Garåsen e.f.gara...@usit.uio.no wrote:
On 21.09.12 14.47, Karl Wright wrote:
A temporary error should not block a (non running) job from getting
an agents.Uninstall command, then reinstall everything
and finally import the configuration.
Still I cannot delete my jobs since their statuses are cleaning up.
And the reason is because I didn't delete my jobs prior to executing
crawler.UnRegisterAll?
Erlend
--
Erlend Garåsen
Center
, but then I couldn't
connect to our PostgreSQL server because it seems to have a new SSL
certificate I haven't installed into my local keystore.
I guess ut is possible to configure HSQLDB, but I'm afraid that my time
is running out. Sorry.
Erlend
On 26.09.12 18.00, Erlend Garåsen wrote:
Yes
at:
http://people.apache.org/~kwright/apache-manifoldcf-1.0
There is also an SVN tag at:
https://svn.apache.org/repos/asf/manifoldcf/tags/release-1.0-RC5
Fixes since RC4:
CONNECTORS-545
Fixes since RC3:
CONNECTORS-544
--
Erlend Garåsen
Center for Information Technology Services
University
you described before, but it could be. I will create
a ticket for it though.
Karl
On Fri, Sep 28, 2012 at 5:49 AM, Erlend Garåsen e.f.gara...@usit.uio.no wrote:
I'm trying to start a crawl before I have to run to the airport. I just
discovered that MCF recrawls the same host over and over again
On 28.09.12 13.31, Erlend Garåsen wrote:
OK, I will give you a stack trace in the beginning of next week.
Do you still need the stack trace? If you do, I need to adjust the log
level and/or change the source code in order to print it out.
I'm still a little bit worried about how MCF deals
, 2012 at 4:38 AM, Erlend Garåsen e.f.gara...@usit.uio.no wrote:
On 28.09.12 13.31, Erlend Garåsen wrote:
OK, I will give you a stack trace in the beginning of next week.
Do you still need the stack trace? If you do, I need to adjust the log level
and/or change the source code in order to print
(documentation fix)
Fixes since RC4:
CONNECTORS-545
Fixes since RC3:
CONNECTORS-544
--
Erlend Garåsen
Center for Information Technology Services
University of Oslo
P.O. Box 1086 Blindern, N-0317 OSLO, Norway
Ph: (+47) 22840193, Fax: (+47) 22852970, Mobile: (+47) 91380968, VIP: 31050
etc. Many web application frameworks have
support for this. Then you may give (at query time) a higher boost to
the fields belonging to the language detected.
Erlend
--
Erlend Garåsen
Center for Information Technology Services
University of Oslo
P.O. Box 1086 Blindern, N-0317 OSLO, Norway
Ph
that this is serious enough to warrant such a release.
Thanks!
Karl
--
Erlend Garåsen
Center for Information Technology Services
University of Oslo
P.O. Box 1086 Blindern, N-0317 OSLO, Norway
Ph: (+47) 22840193, Fax: (+47) 22852970, Mobile: (+47) 91380968, VIP: 31050
(a):
Sounds great! I can't wait to see it.
Karl
On Mon, Oct 15, 2012 at 6:31 AM, Erlend Garåsen e.f.gara...@usit.uio.no
wrote:
Me and Karl had a short discussion about such a connector in Cambridge
for
some months ago. Now I have created the following ticket regarding an
Email
Connector:
https
to some
third party web-mail system but there is a number of such systems and
document ID should be customizable to support as many of them as
possible... what do you think?
2012/10/15 Erlend Garåsen e.f.gara...@usit.uio.no
Sounds like a good idea.
I didn't even think about attachments, even
of tickets that were hanging around marked fix in
ManifoldCF next. You may want to do the same...
Karl
--
Erlend Garåsen
Center for Information Technology Services
University of Oslo
P.O. Box 1086 Blindern, N-0317 OSLO, Norway
Ph: (+47) 22840193, Fax: (+47) 22852970, Mobile: (+47) 91380968, VIP
that we do not need the functionality I was mentioning? As long
as one is removing things in the correct order, problems will not show up.
Erlend
--
Erlend Garåsen
Center for Information Technology Services
University of Oslo
P.O. Box 1086 Blindern, N-0317 OSLO, Norway
Ph: (+47) 22840193, Fax
features of Solr, and has been
instrumental in bringing our Solr connector into the modern era.
Please join me in welcoming Minoru to the Apache ManifoldCF project!
Karl
--
Erlend Garåsen
Center for Information Technology Services
University of Oslo
P.O. Box 1086 Blindern, N-0317 OSLO, Norway
connector, so that
SolrCloud is supported
- Improved NTLM support
- Partial Kerberos support
- Many other improvements, which are summarized in CHANGES.txt
Karl
--
Erlend Garåsen
Center for Information Technology Services
University of Oslo
P.O. Box 1086 Blindern, N-0317 OSLO, Norway
Ph: (+47
.), it should be ready to release. Smaller issues like
CONNECTORS-622 can and IMHO should be fixed in the following release
instead of holding up the current one.
It's better to release early and often than to wait for perfection.
BR,
Jukka Zitting
--
Erlend Garåsen
Center for Information Technology
support
- Partial Kerberos support
- Many other improvements, which are summarized in CHANGES.txt
--
Erlend Garåsen
Center for Information Technology Services
University of Oslo
P.O. Box 1086 Blindern, N-0317 OSLO, Norway
Ph: (+47) 22840193, Fax: (+47) 22852970, Mobile: (+47) 91380968, VIP: 31050
, with httpmime.jar just as we deliver
it in the connector-lib directory, and I did not see this issue. It
is almost certainly configuration, seems likely.
Karl
On Tue, Jan 29, 2013 at 11:26 AM, Erlend Garåsen
e.f.gara...@usit.uio.no wrote:
I have to run now, but I will investigate this further. BTW, I have
in CHANGES.txt
--
Erlend Garåsen
Center for Information Technology Services
University of Oslo
P.O. Box 1086 Blindern, N-0317 OSLO, Norway
Ph: (+47) 22840193, Fax: (+47) 22852970, Mobile: (+47) 91380968, VIP: 31050
complete this job before I conclude that
we have got rid of the problem. This may take some time, probably about
30 hours.
Erlend
--
Erlend Garåsen
Center for Information Technology Services
University of Oslo
P.O. Box 1086 Blindern, N-0317 OSLO, Norway
Ph: (+47) 22840193, Fax: (+47) 22852970, Mobile
--
Erlend Garåsen
Center for Information Technology Services
University of Oslo
P.O. Box 1086 Blindern, N-0317 OSLO, Norway
Ph: (+47) 22840193, Fax: (+47) 22852970, Mobile: (+47) 91380968, VIP: 31050
suggestions to the PostgreSQL admins and try to get more
information.
Erlend
--
Erlend Garåsen
Center for Information Technology Services
University of Oslo
P.O. Box 1086 Blindern, N-0317 OSLO, Norway
Ph: (+47) 22840193, Fax: (+47) 22852970, Mobile: (+47) 91380968, VIP: 31050
starts it - MCF just fetches and fetches
without posting anything to Solr.
E
On 12.02.13 13.38, Erlend Garåsen wrote:
I have changed some settings in MCF which will reduce the heavy load on
our PG server (changed hop count mode to Keep unreachable documents,
forever).
I will start a new crawl
On 12.02.13 19.06, Karl Wright wrote:
If this problem is non-critical, and has been around a long time, it
is not necessary to cancel a release in order to fix it. The logic in
question has not changed since probably ManifoldCF 0.3 or so.
Karl
On Tue, Feb 12, 2013 at 1:04 PM, Erlend Garåsen
doubt it will be too challenging to fix.
Karl
On Tue, Feb 12, 2013 at 1:22 PM, Erlend Garåsen e.f.gara...@usit.uio.no wrote:
You are probably right. I can withdraw my vote, but I'm unsure whether I
should wait and see what happens with the crawl I just started on our test
server with new hop
;
}
});
Unfortunately it does not seem to have actually worked; we are still
seeing non-reusable stream retry errors in some cases. Has anybody
seen this before, and what
are we doing wrong?
Karl
--
Erlend Garåsen
Center for Information Technology Services
University of Oslo
P.O. Box 1086 Blindern, N-0317
://community.apache.org/gsoc.html
[5] http://s.apache.org/gsoclabels
--
Erlend Garåsen
Center for Information Technology Services
University of Oslo
P.O. Box 1086 Blindern, N-0317 OSLO, Norway
Ph: (+47) 22840193, Fax: (+47) 22852970, Mobile: (+47) 91380968, VIP: 31050
with a lot of information stored in LDAP.
Erlend
--
Erlend Garåsen
Center for Information Technology Services
University of Oslo
P.O. Box 1086 Blindern, N-0317 OSLO, Norway
Ph: (+47) 22840193, Fax: (+47) 22852970, Mobile: (+47) 91380968, VIP: 31050
think I will bring with me my laptop to US since I want to carry as
little as possible, only my iPad, so it will be difficult to work more
on this issue from tomorrow. I'll be back on Monday next week.
Erlend
--
Erlend Garåsen
Center for Information Technology Services
University of Oslo
P.O. Box
functionality for minimal
crawls, with better support for ADD_CHANGE_DELETE models of crawling. (See
CHANGES.txt for a complete list.)
Karl
--
Erlend Garåsen
Center for Information Technology Services
University of Oslo
P.O. Box 1086 Blindern, N-0317 OSLO, Norway
Ph: (+47) 22840193, Fax: (+47
On 08.05.13 18.00, Erlend Garåsen wrote:
I will withdraw my vote if CONNECTORS-682 is still not resolved. So far
so good, the job has been running for 7 hours. I will check it again on
Friday since will be away from my computer till then.
I just want to inform that the job completed without
I'm sorry to inform that I haven't worked with the Hydra connector the
last month. I have been busy with a major release of our search project
at the university and my summer vaccation starts tomorrow. And I have
also been busy with creating patches for Solr.
Erlend
On 6/24/13 5:13 PM,
-1
I'm getting an NPE when running ant test.
I will of course withdraw my vote in case I have forgot a crucial step
before I ran the tests, but I don't think that is the case.
Otherwise, I completed the following tests successfully:
1. Deployed the binary version on Resin and did a test
+1
Looks good.
1. Deployed the binary version on Resin and did a test crawl.
2. Built the source version using Ant
3. Ran UI tests
4. Built docs (ant doc) using Forest 0.9
5. Ran ant test
6. Ran the single process model within the example dir, started the web
crawler and posted to Solr 4
7.
On 1/29/14 3:57 PM, Karl Wright wrote:
Thanks - this shows that threads are all waiting on connection throttling.
How many simultaneous connections did you make available to the site you
are crawling, and can you look at the simple history report to confirm that
there is no activity? I'll dig
We're still having problems with this release on our test server. It
runs stable and does not hang anymore, but nothing gets sent to Solr.
Since there was a problem with the SSL certificate in previous RCs,
maybe there is a similar problem related to the Solr Output Connector?
We have
On 06.02.14 12:41, Erlend Garåsen wrote:
p://www.ibsen.uio.no/diktsamlinger.xhtml]} 0 16
select * from repohistory where entityid like
'%www.ibsen.uio.no/diktsamlinger.xhtml%'
11227;1391439283905;1391439277790;1391439283890;http://www.ibsen.uio.no/diktsamlinger.xhtml;Web;fetch;200
a look at the log; the query should be there
Karl
On Thu, Feb 6, 2014 at 6:59 AM, Erlend Garåsen e.f.gara...@usit.uio.nowrote:
On 06.02.14 12:41, Erlend Garåsen wrote:
p://www.ibsen.uio.no/diktsamlinger.xhtml]} 0 16
select * from repohistory where entityid like '%www.ibsen.uio.no
On 06.02.14 15:25, Karl Wright wrote:
So I conclude that simple history is working fine, but since it is only
returning indexing results within the last hour by default it is confusing
you. I also think it is likely that documents are getting skipped because
you've crawled this set before with
And why do I get the following result from pgAdmin when I run the
following SQL?:
select * from repohistory where entityid =
'http://www.ibsen.uio.no/brevmottakere.xhtml?bokstav=H'
On 06.02.14 15:53, Karl Wright wrote:
Hi Erlend,
Please go into the Simple History, and change the start time of the query
to be one day earlier than the default. By default, Simple History only
reports the last hour's worth of events.
Then it only displays the crawl which completed tonight
On 06.02.14 18:18, Karl Wright wrote:
Actually yes, I found it. Only exceptions/errors are recorded by the solr
connector.
CONNECTORS-884. However, I don't think this rises to the level of needing
to respin the RC. Do you agree?
Since we are on RC7 now, I agree. I'll start a complete crawl
+1
- Ran ant test | uitest | doc
- Installed binary version and ran single process model
- Installed source version, built and ran multi-process model and a huge
crawl
- Deployed on Resin application server and ran a huge crawl
Erlend
On 04.02.14 13:33, Karl Wright wrote:
This is a major
Greetings from Oslo, Norway, and welcome aboard, Graeme!
Erlend
On 10.03.14 08:18, Karl Wright wrote:
The Project Management Committee (PMC) for Apache ManifoldCFhas asked
Graeme Seaton to become a committer and we are pleased to announce
that they have accepted.
Graeme has be instrumental in
I'm getting the following error after I upgraded to version 1.6. I think
HttpClient is the source of the problem and that the following ticket
describes the issue in detail:
https://issues.apache.org/jira/browse/CONNECTORS-661
I have turned on HttpClient logging and placed the manifoldcf.log
. So in its current form it's not
very helpful. I can see that there are two 401 responses, but that's about
it.
Karl
On Wed, May 21, 2014 at 6:39 AM, Erlend Garåsen e.f.gara...@usit.uio.nowrote:
I'm getting the following error after I upgraded to version 1.6. I think
HttpClient
The complete log is not available here:
http://folk.uio.no/erlendfg/manifoldcf/manifoldcf.log
Erlend
On 21.05.14 15:09, Erlend Garåsen wrote:
Thanks for looking at this, Karl.
I have sent you the output from tcpdump directly to you.
Erlend
On 21.05.14 14:42, Karl Wright wrote:
Looking
+1 from me.
1. Ran test, uitest
2. Ran single process example, registered a Solr server and performed a
web crawl
2. Deployed on Resin, ran huge crawl with Multiprocess/Zookeeper model
Looks good!
Erlend
On 29.05.14 10:51, Karl Wright wrote:
This minor release of ManifoldCF fixes a number
On 12.08.14 05:13, Mingchun Zhao wrote:
Hi all,
Please vote on whether to release the ManifoldCF, version 1.7, RC0.
You can find the artifact at:
http://people.apache.org/~mingchun/apache-manifoldcf-1.7-RC0
There is also a tag at:
-1
All my first tests pass, but I think I found a blocker when I ran the
last one.
By running MCF using FileLockManager, I'm getting the following error
and MCF just tries to run this task over and over again. My synch folder
now contains a lot of files and it still grows. I think MCF
: File name too long
Erlend
On 15.08.14 09:46, Erlend Garåsen wrote:
-1
All my first tests pass, but I think I found a blocker when I ran the
last one.
By running MCF using FileLockManager, I'm getting the following error
and MCF just tries to run this task over and over again. My synch folder
+1
- Deployed binary dist on Caucho Resin on Linux and ran:
- a huge crawl using FileLockManager
- Built source dist on OS X and:
- Ran single-process version under example directory
- Ran ant uitest and test
Erlend
On 20.08.14 02:58, Mingchun Zhao wrote:
Hi all,
Please vote on
+1
As an exception to the rule, I will deploy a patched version on our
production server just to be sure that we have fixed the problem. For
some reason, I'm not able to reproduce the Zookeeper problem on our test
server, so I'll go ahead on our prod server instead. I'll let you know
I got the following on both my test and prod server. The error also
shows up in simple history:
Error: KeeperErrorCode = NoNode for
/org.apache.manifoldcf.locks-_Cache_OUTPUTCONNECTION_Solr/read-0001039554
I guess it is related to the shutdown process - either when I stopped
the Resin
On 17.09.14 14:55, Karl Wright wrote:
Hi Erlend,
Yes, this is shutdown related. The patch file did not include the fix for
this particular problem. The release candidate, however, does.
This is not from the patch, but from 1.7.1. I just meant to say that I
did not had any problems using
Both servers are running now. Not sure about what caused the problems on
prod. The only thing I did different was to do a lock clean on prod
prior to startup.
I'll keep both servers up and running in 24 hours and vote thereafter.
Erlend
On 17.09.14 15:05, Erlend Garåsen wrote:
On 17.09.14
I tried to restart the job dealing with www.duo.no on our test server,
but it does not seem to touch the robots.txt file at all. That's the
reason why it's able to continue. Both servers are set up to obey the
rules of such files.
Erlend
On 18.09.14 11:12, Erlend Garåsen wrote:
I'm
:24 AM, Erlend Garåsen e.f.gara...@usit.uio.no
wrote:
I tried to restart the job dealing with www.duo.no on our test server,
but it does not seem to touch the robots.txt file at all. That's the reason
why it's able to continue. Both servers are set up to obey the rules of
such files.
Erlend
On 18.09.14 13:00, Karl Wright wrote:
Hi Erlend,
please can you also add the manifoldcf log as well?
Yes, I will, but it includes entries from RC0 as well.
MCF works perfectly using the other jobs for the other hosts. Take a
look at the following once again. MCF is being interrupted:
INFO
1, or better yet, find out why you periodically lose the ability
to transmit pings from MCF to your zookeeper process.
Thanks,
Karl
On Thu, Sep 18, 2014 at 7:15 AM, Erlend Garåsen e.f.gara...@usit.uio.no
wrote:
On 18.09.14 13:00, Karl Wright wrote:
Hi Erlend,
please can you also add
On Thu, Sep 18, 2014 at 8:16 AM, Erlend Garåsen
e.f.gara...@usit.uio.no
wrote:
I tried to fetch documents by using curl from our prod server just in
case a webmaster had blocked access. No problem. Maybe I should ask
the
webmaster of that host anyway, just to be sure.
The interrupted
, Erlend Garåsen wrote:
I can verify an eventually network problem by using file-based
synchronization instead.
I'll do that right away and test RC2 as well, even though you already
have three +1's.
The three other jobs I started before I left my office on Thursday did
all complete successfully
is running
on, and then do a crawl. I will wager, well, quite a lot of money, that
you will see periods of packet loss. ;-)
Karl
On Mon, Sep 22, 2014 at 5:05 AM, Erlend Garåsen e.f.gara...@usit.uio.no
wrote:
I'm able to fetch documents from www.duo.uio.no using file-based
synchronization, so
Local tests are running fine, but there is a problem with a table which
is not properly installed on our Resin Deployment server.
I guess the following command should install the needpriority table? Not
errors shown runnins this command.
$MCF_HOME/executecommand.sh
72 matches
Mail list logo