Ah, excellent, thanks: I had missed it! Enzo
----- Original Message ----- From: "Roger Dunk" <[EMAIL PROTECTED]> To: <[EMAIL PROTECTED]> Sent: Saturday, June 30, 2007 9:18 AM Subject: Re: integrate Nutch into my php front page > It's called OpenSearch (http://en.wikipedia.org/wiki/OpenSearch). Nutch > has supported this for a long time already, and many people make good use > of it. > > -Roger > > ----- Original Message ----- > From: "Enzo Michelangeli" <[EMAIL PROTECTED]> > To: <[EMAIL PROTECTED]> > Sent: Saturday, June 30, 2007 11:08 AM > Subject: Re: integrate Nutch into my php front page > > >> Another way would be to rewrite a search.jsp so that it return XML or >> JSON >> rather than HTML, and then have the PHP code place a GET to that page and >> parse the results (the SOLR approach, so to speak). The JVM (and Tomcat) >> should obviously be run, but that could be done on a different machine. >> >> (BTW, has anybody written such JSP page already?) >> >> Enzo >> >> ----- Original Message ----- >> From: "Matthew A. Bockol" <[EMAIL PROTECTED]> >> To: <[EMAIL PROTECTED]> >> Sent: Saturday, June 30, 2007 7:51 AM >> Subject: Re: integrate Nutch into my php front page >> >>> Hi Tsengtan, >>> >>> I've written a php wrapper class for nutch that permits querying the >>> jsp. >>> It works reasonably well, but you still need to run the JVM. Let me know >>> if you're interested. >>> >>> Matt >>> >>> >>> ----- Original Message ----- >>> From: "Tsengtan A Shuy" <[EMAIL PROTECTED]> >>> To: [EMAIL PROTECTED] >>> Sent: Friday, June 29, 2007 5:34:45 PM (GMT-0600) America/Chicago >>> Subject: integrate Nutch into my php front page >>> >>> I like to integrate the nutch search engine into my php front page of my >>> website www.epacificweb.com . Is there a way to do it without running on >>> JVM? >>> >>> Adam Shuy, President >>> ePacific Web Design & Hosting >>> Professional Web/Software developer >>> TEL: 408-272-6946 >>> www.epacificweb.com >>> -----Original Message----- >>> From: Tsengtan A Shuy [mailto:[EMAIL PROTECTED] >>> Sent: Friday, June 29, 2007 2:55 PM >>> To: [EMAIL PROTECTED] >>> Subject: RE: windows eclipse run >>> >>> Please ignore my last email. >>> >>> I run both nutch-0.8.1 and nutch-0.9 with my windows eclipse >>> environment. >>> I got all the result folders: crawldb, index, indexs, linkdb and >>> segments >>> from nutch-0.8.1, but I only got crawldb folder from the nutch-0.9. >>> Am I getting the right result? >>> Any feedback will be much appreciated. >>> >>> Adam Shuy, President >>> ePacific Web Design & Hosting >>> Professional Web/Software developer >>> TEL: 408-272-6946 >>> www.epacificweb.com >>> >>> -----Original Message----- >>> From: Tsengtan A Shuy [mailto:[EMAIL PROTECTED] >>> Sent: Friday, June 29, 2007 2:44 PM >>> To: '[EMAIL PROTECTED]' >>> Subject: RE: NoRouteToHostException >>> >>> I got the same error when I ran in my cygwin environment. >>> So I ran it in the windows eclipse environment, it ran OK but I still >>> have >>> some other nutch-0.9 issue to deal with. >>> Please read the following web page: >>> http://wiki.apache.org/nutch/RunNutchInEclipse, and >>> http://lucene.apache.org/nutch/tutorial8.html >>> Then ran it again. >>> >>> Adam Shuy, President >>> ePacific Web Design & Hosting >>> Professional Web/Software developer >>> TEL: 408-272-6946 >>> www.epacificweb.com >>> -----Original Message----- >>> From: DANIEL CLARK [mailto:[EMAIL PROTECTED] >>> Sent: Friday, June 29, 2007 1:07 PM >>> To: Nutch List >>> Subject: NoRouteToHostException >>> >>> I'm running 0.8.1 and I'm getting the following exception. Any help >>> would >>> be appreciated. >>> >>> $ bin/nutch crawl urls -dir crawl -depth 3 >>> crawl started in: crawl >>> rootUrlDir = urls >>> threads = 10 >>> depth = 3 >>> Injector: starting >>> Injector: crawlDb: crawl/crawldb >>> Injector: urlDir: urls >>> Injector: Converting injected urls to crawl db entries. >>> Exception in thread "main" java.net.NoRouteToHostException: No route to >>> host >>> at java.net.PlainSocketImpl.socketConnect(Native Method) >>> at java.net.PlainSocketImpl.doConnect(PlainSocketImpl.java:333) >>> at >>> java.net.PlainSocketImpl.connectToAddress(PlainSocketImpl.java:195) >>> at java.net.PlainSocketImpl.connect(PlainSocketImpl.java:182) >>> at java.net.SocksSocketImpl.connect(SocksSocketImpl.java:366) >>> at java.net.Socket.connect(Socket.java:519) >>> at java.net.Socket.connect(Socket.java:469) >>> at java.net.Socket.<init>(Socket.java:366) >>> at java.net.Socket.<init>(Socket.java:208) >>> at >>> org.apache.hadoop.ipc.Client$Connection.<init>(Client.java:113) >>> at org.apache.hadoop.ipc.Client.getConnection(Client.java:359) >>> at org.apache.hadoop.ipc.Client.call(Client.java:297) >>> at org.apache.hadoop.ipc.RPC$Invoker.invoke(RPC.java:150) >>> at org.apache.hadoop.mapred.$Proxy1.getFilesystemName(Unknown >>> Source) >>> at org.apache.hadoop.mapred.JobClient.getFs(JobClient.java:214) >>> at >>> org.apache.hadoop.mapred.JobClient.submitJob(JobClient.java:248) >>> at org.apache.hadoop.mapred.JobClient.runJob(JobClient.java:327) >>> at org.apache.nutch.crawl.Injector.inject(Injector.java:138) >>> at org.apache.nutch.crawl.Crawl.main(Crawl.java:105) >>> >>> >>> ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ >>> Daniel Clark, President >>> DAC Systems, Inc. >>> 5209 Nanticoke Court >>> Centreville, VA 20120 >>> Cell - (703) 403-0340 >>> Email - [EMAIL PROTECTED] >>> ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ >>> >>> >> >> > ------------------------------------------------------------------------- This SF.net email is sponsored by DB2 Express Download DB2 Express C - the FREE version of DB2 express and take control of your XML. No limits. Just data. Click to get it now. http://sourceforge.net/powerbar/db2/ _______________________________________________ Nutch-general mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/nutch-general
