http://www.mozdex.com/developer/#opensearch
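
For anyone wanting to try the OpenSearch route from a non-Java front end, here is a rough sketch of the idea: request the RSS feed over HTTP, then pull the title, link, and description out of each item. It is written in Python rather than PHP purely for illustration. The endpoint path (`/opensearch`), the parameter names (`query`, `hitsPerPage`), the host, and the sample feed are all assumptions for this example, not confirmed details of any particular deployment, so check them against your own installation.

```python
import urllib.parse
import xml.etree.ElementTree as ET


def build_search_url(base_url, query, hits_per_page=10):
    """Build a search URL. The parameter names here are assumptions."""
    params = urllib.parse.urlencode(
        {"query": query, "hitsPerPage": hits_per_page}
    )
    return f"{base_url}?{params}"


def parse_rss_items(xml_text):
    """Extract title, link, and description from each RSS <item>."""
    root = ET.fromstring(xml_text)
    results = []
    for item in root.iter("item"):
        results.append(
            {
                "title": item.findtext("title", default=""),
                "link": item.findtext("link", default=""),
                "description": item.findtext("description", default=""),
            }
        )
    return results


# Hypothetical feed, hand-written for this example (not real server output).
SAMPLE_FEED = """<rss version="2.0">
  <channel>
    <title>Search results: nutch php</title>
    <item>
      <title>Example page one</title>
      <link>http://example.com/one</link>
      <description>First hypothetical hit.</description>
    </item>
    <item>
      <title>Example page two</title>
      <link>http://example.com/two</link>
      <description>Second hypothetical hit.</description>
    </item>
  </channel>
</rss>"""

if __name__ == "__main__":
    # Hypothetical host and path; substitute the address of your own server.
    print(build_search_url("http://localhost:8080/opensearch", "nutch php"))
    for hit in parse_rss_items(SAMPLE_FEED):
        print(hit["title"], hit["link"])
```

In a live setup the front end would fetch the URL from `build_search_url` (for example with `urllib.request.urlopen`) and pass the response body to `parse_rss_items`. The JVM and Tomcat still have to run somewhere, but they can sit on a different machine from the PHP site.
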

Tsengtan A Shuy wrote:
> Is there a Nutch search engine which provides the RSS feed?
> Then I can use my php front end to publish the search result, and use it as
> back end.
>
> Adam Shuy, President
> ePacific Web Design & Hosting
> Professional Web/Software developer
> TEL: 408-272-6946
> www.epacificweb.com
> -----Original Message-----
> From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED] On Behalf
> Of Jérôme Charron
> Sent: Saturday, June 30, 2007 6:52 AM
> To: [EMAIL PROTECTED]
> Subject: Re: integrate Nutch into my php front page
>
> What not using the Lucene PHP version provided by Zend?
> File formats are compatibles, so that you can crawl with Java based nutch
> and use Zend Lucene PHP for your frontend.
>
> Jérôme
>
> On 6/30/07, Tsengtan A Shuy <[EMAIL PROTECTED]> wrote:
>
>> I think I only can purchase the private tomcat hosting plan not a shared
>> tomcat hosting plan. Because nutch-0.9 only running under the root
>> directory of tomcat. Am I right?
>>
>> Adam Shuy, President
>> ePacific Web Design & Hosting
>> Professional Web/Software developer
>> TEL: 408-272-6946
>> www.epacificweb.com
>> -----Original Message-----
>> From: Enzo Michelangeli [mailto:[EMAIL PROTECTED]
>> Sent: Friday, June 29, 2007 6:58 PM
>> To: [EMAIL PROTECTED]
>> Subject: Re: integrate Nutch into my php front page
>>
>> Ah, excellent, thanks: I had missed it!
>>
>> Enzo
>>
>> ----- Original Message -----
>> From: "Roger Dunk" <[EMAIL PROTECTED]>
>> To: <[EMAIL PROTECTED]>
>> Sent: Saturday, June 30, 2007 9:18 AM
>> Subject: Re: integrate Nutch into my php front page
>>
>>> It's called OpenSearch (http://en.wikipedia.org/wiki/OpenSearch). Nutch
>>> has supported this for a long time already, and many people make good
>>> use of it.
>>>
>>> -Roger
>>>
>>> ----- Original Message -----
>>> From: "Enzo Michelangeli" <[EMAIL PROTECTED]>
>>> To: <[EMAIL PROTECTED]>
>>> Sent: Saturday, June 30, 2007 11:08 AM
>>> Subject: Re: integrate Nutch into my php front page
>>>
>>>> Another way would be to rewrite a search.jsp so that it return XML or
>>>> JSON rather than HTML, and then have the PHP code place a GET to that
>>>> page and parse the results (the SOLR approach, so to speak). The JVM
>>>> (and Tomcat) should obviously be run, but that could be done on a
>>>> different machine.
>>>>
>>>> (BTW, has anybody written such JSP page already?)
>>>>
>>>> Enzo
>>>>
>>>> ----- Original Message -----
>>>> From: "Matthew A. Bockol" <[EMAIL PROTECTED]>
>>>> To: <[EMAIL PROTECTED]>
>>>> Sent: Saturday, June 30, 2007 7:51 AM
>>>> Subject: Re: integrate Nutch into my php front page
>>>>
>>>>> Hi Tsengtan,
>>>>>
>>>>> I've written a php wrapper class for nutch that permits querying the
>>>>> jsp. It works reasonably well, but you still need to run the JVM. Let
>>>>> me know if you're interested.
>>>>>
>>>>> Matt
>>>>>
>>>>> ----- Original Message -----
>>>>> From: "Tsengtan A Shuy" <[EMAIL PROTECTED]>
>>>>> To: [EMAIL PROTECTED]
>>>>> Sent: Friday, June 29, 2007 5:34:45 PM (GMT-0600) America/Chicago
>>>>> Subject: integrate Nutch into my php front page
>>>>>
>>>>> I like to integrate the nutch search engine into my php front page of
>>>>> my website www.epacificweb.com . Is there a way to do it without
>>>>> running on JVM?
>>>>>
>>>>> Adam Shuy, President
>>>>> ePacific Web Design & Hosting
>>>>> Professional Web/Software developer
>>>>> TEL: 408-272-6946
>>>>> www.epacificweb.com
>>>>> -----Original Message-----
>>>>> From: Tsengtan A Shuy [mailto:[EMAIL PROTECTED]
>>>>> Sent: Friday, June 29, 2007 2:55 PM
>>>>> To: [EMAIL PROTECTED]
>>>>> Subject: RE: windows eclipse run
>>>>>
>>>>> Please ignore my last email.
>>>>>
>>>>> I run both nutch-0.8.1 and nutch-0.9 with my windows eclipse
>>>>> environment.
>>>>> I got all the result folders: crawldb, index, indexs, linkdb and
>>>>> segments from nutch-0.8.1, but I only got crawldb folder from the
>>>>> nutch-0.9.
>>>>> Am I getting the right result?
>>>>> Any feedback will be much appreciated.
>>>>>
>>>>> Adam Shuy, President
>>>>> ePacific Web Design & Hosting
>>>>> Professional Web/Software developer
>>>>> TEL: 408-272-6946
>>>>> www.epacificweb.com
>>>>>
>>>>> -----Original Message-----
>>>>> From: Tsengtan A Shuy [mailto:[EMAIL PROTECTED]
>>>>> Sent: Friday, June 29, 2007 2:44 PM
>>>>> To: '[EMAIL PROTECTED]'
>>>>> Subject: RE: NoRouteToHostException
>>>>>
>>>>> I got the same error when I ran in my cygwin environment.
>>>>> So I ran it in the windows eclipse environment, it ran OK but I still
>>>>> have some other nutch-0.9 issue to deal with.
>>>>> Please read the following web page:
>>>>> http://wiki.apache.org/nutch/RunNutchInEclipse, and
>>>>> http://lucene.apache.org/nutch/tutorial8.html
>>>>> Then ran it again.
>>>>>
>>>>> Adam Shuy, President
>>>>> ePacific Web Design & Hosting
>>>>> Professional Web/Software developer
>>>>> TEL: 408-272-6946
>>>>> www.epacificweb.com
>>>>> -----Original Message-----
>>>>> From: DANIEL CLARK [mailto:[EMAIL PROTECTED]
>>>>> Sent: Friday, June 29, 2007 1:07 PM
>>>>> To: Nutch List
>>>>> Subject: NoRouteToHostException
>>>>>
>>>>> I'm running 0.8.1 and I'm getting the following exception. Any help
>>>>> would be appreciated.
>>>>>
>>>>> $ bin/nutch crawl urls -dir crawl -depth 3
>>>>> crawl started in: crawl
>>>>> rootUrlDir = urls
>>>>> threads = 10
>>>>> depth = 3
>>>>> Injector: starting
>>>>> Injector: crawlDb: crawl/crawldb
>>>>> Injector: urlDir: urls
>>>>> Injector: Converting injected urls to crawl db entries.
>>>>> Exception in thread "main" java.net.NoRouteToHostException: No route to host
>>>>>   at java.net.PlainSocketImpl.socketConnect(Native Method)
>>>>>   at java.net.PlainSocketImpl.doConnect(PlainSocketImpl.java:333)
>>>>>   at java.net.PlainSocketImpl.connectToAddress(PlainSocketImpl.java:195)
>>>>>   at java.net.PlainSocketImpl.connect(PlainSocketImpl.java:182)
>>>>>   at java.net.SocksSocketImpl.connect(SocksSocketImpl.java:366)
>>>>>   at java.net.Socket.connect(Socket.java:519)
>>>>>   at java.net.Socket.connect(Socket.java:469)
>>>>>   at java.net.Socket.<init>(Socket.java:366)
>>>>>   at java.net.Socket.<init>(Socket.java:208)
>>>>>   at org.apache.hadoop.ipc.Client$Connection.<init>(Client.java:113)
>>>>>   at org.apache.hadoop.ipc.Client.getConnection(Client.java:359)
>>>>>   at org.apache.hadoop.ipc.Client.call(Client.java:297)
>>>>>   at org.apache.hadoop.ipc.RPC$Invoker.invoke(RPC.java:150)
>>>>>   at org.apache.hadoop.mapred.$Proxy1.getFilesystemName(Unknown Source)
>>>>>   at org.apache.hadoop.mapred.JobClient.getFs(JobClient.java:214)
>>>>>   at org.apache.hadoop.mapred.JobClient.submitJob(JobClient.java:248)
>>>>>   at org.apache.hadoop.mapred.JobClient.runJob(JobClient.java:327)
>>>>>   at org.apache.nutch.crawl.Injector.inject(Injector.java:138)
>>>>>   at org.apache.nutch.crawl.Crawl.main(Crawl.java:105)
>>>>>
>>>>> ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
>>>>> Daniel Clark, President
>>>>> DAC Systems, Inc.
>>>>> 5209 Nanticoke Court
>>>>> Centreville, VA 20120
>>>>> Cell - (703) 403-0340
>>>>> Email - [EMAIL PROTECTED]
>>>>> ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

--
Glenn Cooke
Insurance Squared Inc.
1-866-662-5433
Agent discussion forum: http://www.americaninsurancebroker.com
Free US broker directory: http://directory.americaninsurancebroker.com
Free Canadian broker directory: http://www.canadianinsurancebroker.com
Search engine optimization
Directory submission: http://www.insurancesquared.com/directorysubmission
Advertise on blogs: http://www.insurancesquared.com/blogs
Presell pages: http://www.insurancesquared.com/insurancepresellpages

_______________________________________________
Nutch-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-general
