What not using the Lucene PHP version provided by Zend?
File formats are compatibles, so that you can crawl with Java based nutch
and use Zend Lucene PHP for your frontend.
Jérôme
On 6/30/07, Tsengtan A Shuy <[EMAIL PROTECTED]> wrote:
I think I only can purchase the private tomcat hosting plan not a shared
tomcat hosting plan. Because nutch-0.9 only running under the root
directory
of tomcat. Am I right?
Adam Shuy, President
ePacific Web Design & Hosting
Professional Web/Software developer
TEL: 408-272-6946
www.epacificweb.com
-----Original Message-----
From: Enzo Michelangeli [mailto:[EMAIL PROTECTED]
Sent: Friday, June 29, 2007 6:58 PM
To: [EMAIL PROTECTED]
Subject: Re: integrate Nutch into my php front page
Ah, excellent, thanks: I had missed it!
Enzo
----- Original Message -----
From: "Roger Dunk" <[EMAIL PROTECTED]>
To: <[EMAIL PROTECTED]>
Sent: Saturday, June 30, 2007 9:18 AM
Subject: Re: integrate Nutch into my php front page
> It's called OpenSearch (http://en.wikipedia.org/wiki/OpenSearch). Nutch
> has supported this for a long time already, and many people make good
use
> of it.
>
> -Roger
>
> ----- Original Message -----
> From: "Enzo Michelangeli" <[EMAIL PROTECTED]>
> To: <[EMAIL PROTECTED]>
> Sent: Saturday, June 30, 2007 11:08 AM
> Subject: Re: integrate Nutch into my php front page
>
>
>> Another way would be to rewrite a search.jsp so that it return XML or
>> JSON
>> rather than HTML, and then have the PHP code place a GET to that page
and
>> parse the results (the SOLR approach, so to speak). The JVM (and
Tomcat)
>> should obviously be run, but that could be done on a different machine.
>>
>> (BTW, has anybody written such JSP page already?)
>>
>> Enzo
>>
>> ----- Original Message -----
>> From: "Matthew A. Bockol" <[EMAIL PROTECTED]>
>> To: <[EMAIL PROTECTED]>
>> Sent: Saturday, June 30, 2007 7:51 AM
>> Subject: Re: integrate Nutch into my php front page
>>
>>> Hi Tsengtan,
>>>
>>> I've written a php wrapper class for nutch that permits querying the
>>> jsp.
>>> It works reasonably well, but you still need to run the JVM. Let me
know
>>> if you're interested.
>>>
>>> Matt
>>>
>>>
>>> ----- Original Message -----
>>> From: "Tsengtan A Shuy" <[EMAIL PROTECTED]>
>>> To: [EMAIL PROTECTED]
>>> Sent: Friday, June 29, 2007 5:34:45 PM (GMT-0600) America/Chicago
>>> Subject: integrate Nutch into my php front page
>>>
>>> I like to integrate the nutch search engine into my php front page of
my
>>> website www.epacificweb.com . Is there a way to do it without running
on
>>> JVM?
>>>
>>> Adam Shuy, President
>>> ePacific Web Design & Hosting
>>> Professional Web/Software developer
>>> TEL: 408-272-6946
>>> www.epacificweb.com
>>> -----Original Message-----
>>> From: Tsengtan A Shuy [mailto:[EMAIL PROTECTED]
>>> Sent: Friday, June 29, 2007 2:55 PM
>>> To: [EMAIL PROTECTED]
>>> Subject: RE: windows eclipse run
>>>
>>> Please ignore my last email.
>>>
>>> I run both nutch-0.8.1 and nutch-0.9 with my windows eclipse
>>> environment.
>>> I got all the result folders: crawldb, index, indexs, linkdb and
>>> segments
>>> from nutch-0.8.1, but I only got crawldb folder from the nutch-0.9.
>>> Am I getting the right result?
>>> Any feedback will be much appreciated.
>>>
>>> Adam Shuy, President
>>> ePacific Web Design & Hosting
>>> Professional Web/Software developer
>>> TEL: 408-272-6946
>>> www.epacificweb.com
>>>
>>> -----Original Message-----
>>> From: Tsengtan A Shuy [mailto:[EMAIL PROTECTED]
>>> Sent: Friday, June 29, 2007 2:44 PM
>>> To: '[EMAIL PROTECTED]'
>>> Subject: RE: NoRouteToHostException
>>>
>>> I got the same error when I ran in my cygwin environment.
>>> So I ran it in the windows eclipse environment, it ran OK but I still
>>> have
>>> some other nutch-0.9 issue to deal with.
>>> Please read the following web page:
>>> http://wiki.apache.org/nutch/RunNutchInEclipse, and
>>> http://lucene.apache.org/nutch/tutorial8.html
>>> Then ran it again.
>>>
>>> Adam Shuy, President
>>> ePacific Web Design & Hosting
>>> Professional Web/Software developer
>>> TEL: 408-272-6946
>>> www.epacificweb.com
>>> -----Original Message-----
>>> From: DANIEL CLARK [mailto:[EMAIL PROTECTED]
>>> Sent: Friday, June 29, 2007 1:07 PM
>>> To: Nutch List
>>> Subject: NoRouteToHostException
>>>
>>> I'm running 0.8.1 and I'm getting the following exception. Any help
>>> would
>>> be appreciated.
>>>
>>> $ bin/nutch crawl urls -dir crawl -depth 3
>>> crawl started in: crawl
>>> rootUrlDir = urls
>>> threads = 10
>>> depth = 3
>>> Injector: starting
>>> Injector: crawlDb: crawl/crawldb
>>> Injector: urlDir: urls
>>> Injector: Converting injected urls to crawl db entries.
>>> Exception in thread "main" java.net.NoRouteToHostException: No route
to
>>> host
>>> at java.net.PlainSocketImpl.socketConnect(Native Method)
>>> at java.net.PlainSocketImpl.doConnect(PlainSocketImpl.java:333)
>>> at
>>> java.net.PlainSocketImpl.connectToAddress(PlainSocketImpl.java:195)
>>> at java.net.PlainSocketImpl.connect(PlainSocketImpl.java:182)
>>> at java.net.SocksSocketImpl.connect(SocksSocketImpl.java:366)
>>> at java.net.Socket.connect(Socket.java:519)
>>> at java.net.Socket.connect(Socket.java:469)
>>> at java.net.Socket.<init>(Socket.java:366)
>>> at java.net.Socket.<init>(Socket.java:208)
>>> at
>>> org.apache.hadoop.ipc.Client$Connection.<init>(Client.java:113)
>>> at org.apache.hadoop.ipc.Client.getConnection(Client.java:359)
>>> at org.apache.hadoop.ipc.Client.call(Client.java:297)
>>> at org.apache.hadoop.ipc.RPC$Invoker.invoke(RPC.java:150)
>>> at org.apache.hadoop.mapred.$Proxy1.getFilesystemName(Unknown
>>> Source)
>>> at org.apache.hadoop.mapred.JobClient.getFs(JobClient.java:214)
>>> at
>>> org.apache.hadoop.mapred.JobClient.submitJob(JobClient.java:248)
>>> at org.apache.hadoop.mapred.JobClient.runJob(JobClient.java
:327)
>>> at org.apache.nutch.crawl.Injector.inject(Injector.java:138)
>>> at org.apache.nutch.crawl.Crawl.main(Crawl.java:105)
>>>
>>>
>>> ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
>>> Daniel Clark, President
>>> DAC Systems, Inc.
>>> 5209 Nanticoke Court
>>> Centreville, VA 20120
>>> Cell - (703) 403-0340
>>> Email - [EMAIL PROTECTED]
>>> ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
>>>
>>>
>>
>>
>
--
Jérôme Charron
Directeur Technique @ WebPulse
Tel: +33673716743 - [EMAIL PROTECTED]
http://blog.shopreflex.com/
Tous les goûts sont dans la nature, les vôtres sont sur
http://www.shopreflex.com
-------------------------------------------------------------------------
This SF.net email is sponsored by DB2 Express
Download DB2 Express C - the FREE version of DB2 express and take
control of your XML. No limits. Just data. Click to get it now.
http://sourceforge.net/powerbar/db2/
_______________________________________________
Nutch-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-general