Re: Nutch-Selenium in Nutch 1.10

2015-02-19 Thread Jaydeep Bagrecha
at 9:39 PM To: dev@nutch.apache.org dev@nutch.apache.org Subject: Nutch-Selenium in Nutch 1.10 Hi Li, Shuo. You are so right. I finished installing and successfully run the butch with selenium and Firefox. I have a question though, does your Firefox plug out for always all the urls we crawled

Re: Nutch-Selenium in Nutch 1.10

2015-02-19 Thread Jiaxin Ye
++ -Original Message- From: Jiaxin Ye jiaxi...@usc.edu Reply-To: dev@nutch.apache.org dev@nutch.apache.org Date: Thursday, February 12, 2015 at 9:39 PM To: dev@nutch.apache.org dev@nutch.apache.org Subject: Nutch-Selenium in Nutch 1.10 Hi Li, Shuo. You are so right. I finished installing

Re: Nutch-Selenium in Nutch 1.10

2015-02-18 Thread Jaydeep Bagrecha
, 2015 at 9:39 PM To: dev@nutch.apache.org dev@nutch.apache.org Subject: Nutch-Selenium in Nutch 1.10 Hi Li, Shuo. You are so right. I finished installing and successfully run the butch with selenium and Firefox. I have a question though, does your Firefox plug out for always all the urls we

Re: Nutch-Selenium in Nutch 1.10

2015-02-17 Thread Jaydeep Bagrecha
: Nutch-Selenium in Nutch 1.10 Hi Li, Shuo. You are so right. I finished installing and successfully run the butch with selenium and Firefox. I have a question though, does your Firefox plug out for always all the urls we crawled? Hi Prof Mattmann. I think here is the way we install selenium

Re: Nutch-Selenium in Nutch 1.10

2015-02-17 Thread Jiaxin Ye
++ -Original Message- From: Jiaxin Ye jiaxi...@usc.edu Reply-To: dev@nutch.apache.org dev@nutch.apache.org Date: Thursday, February 12, 2015 at 9:39 PM To: dev@nutch.apache.org dev@nutch.apache.org Subject: Nutch-Selenium in Nutch 1.10 Hi Li, Shuo. You are so right. I

Re: Nutch-Selenium in Nutch 1.10

2015-02-12 Thread Shuo Li
, February 12, 2015, Mattmann, Chris A (3980) chris.a.mattm...@jpl.nasa.gov wrote: You need Selenium Jiaxin, in order to crawl dynamic pages in the polar dataset you have been assigned in my CSCI 572 search engines class. The instructions for integrating Selenium with Nutch 1.10-trunk are here

Re: Nutch-Selenium in Nutch 1.10

2015-02-12 Thread Jiaxin Ye
into this patch for Selenium on Nutch 1.10 : https://issues.apache.org/jira/browse/NUTCH-1933. Hope this helps! Thanks, Sapna On Tue, Feb 10, 2015 at 9:36 PM, Shuo Li sli...@usc.edu wrote: Yop, I'm trying to install selenium in Nutch 1.10. However, this error pops out: *error: package

Re: Nutch-Selenium in Nutch 1.10

2015-02-12 Thread Shuo Li
Interestingly, I'm a mac user but I don't want to screw my laptop so I'm using vagrant with Ubuntu Trusty. It doesn't have GUI but Xvfb can still be installed properly. The issue would be I don't know how to integrate Selenium with Nutch 1.10. On Thu, Feb 12, 2015 at 12:04 AM, Jiaxin Ye jiaxi

Nutch-Selenium in Nutch 1.10

2015-02-12 Thread Jiaxin Ye
Selenium with Nutch 1.10-trunk are here: https://issues.apache.org/jira/browse/NUTCH-1933 Cheers, Chris ++ Chris Mattmann, Ph.D. Chief Architect Instrument Software and Science Data Systems Section (398) NASA Jet Propulsion

Re: Nutch-Selenium in Nutch 1.10

2015-02-12 Thread Mattmann, Chris A (3980)
: Nutch-Selenium in Nutch 1.10 Hi Li, Shuo. You are so right. I finished installing and successfully run the butch with selenium and Firefox. I have a question though, does your Firefox plug out for always all the urls we crawled? Hi Prof Mattmann. I think here is the way we install selenium on MAC

Re: Nutch-Selenium in Nutch 1.10

2015-02-12 Thread Jiaxin Ye
jiaxi...@usc.edu javascript:; Reply-To: dev@nutch.apache.org javascript:; dev@nutch.apache.org javascript:; Date: Thursday, February 12, 2015 at 9:39 PM To: dev@nutch.apache.org javascript:; dev@nutch.apache.org javascript:; Subject: Nutch-Selenium in Nutch 1.10 Hi Li, Shuo. You are so

Re: Nutch-Selenium in Nutch 1.10

2015-02-12 Thread Mattmann, Chris A (3980)
You need Selenium Jiaxin, in order to crawl dynamic pages in the polar dataset you have been assigned in my CSCI 572 search engines class. The instructions for integrating Selenium with Nutch 1.10-trunk are here: https://issues.apache.org/jira/browse/NUTCH-1933 Cheers, Chris

Re: Nutch-Selenium in Nutch 1.10

2015-02-12 Thread Jiaxin Ye
. The instructions for integrating Selenium with Nutch 1.10-trunk are here: https://issues.apache.org/jira/browse/NUTCH-1933 Cheers, Chris ++ Chris Mattmann, Ph.D. Chief Architect Instrument Software and Science Data Systems Section (398

Re: Nutch-Selenium in Nutch 1.10

2015-02-12 Thread Mattmann, Chris A (3980)
, Chris A (3980) chris.a.mattm...@jpl.nasa.govmailto:chris.a.mattm...@jpl.nasa.gov wrote: You need Selenium Jiaxin, in order to crawl dynamic pages in the polar dataset you have been assigned in my CSCI 572 search engines class. The instructions for integrating Selenium with Nutch 1.10-trunk are here

Nutch-Selenium in Nutch 1.10

2015-02-10 Thread Shuo Li
Yop, I'm trying to install selenium in Nutch 1.10. However, this error pops out: *error: package org.apache.nutch.storage does not exist* I can only find this package in Nutch 2.x. Is there a way to use Selenium in 1.10? Any advice would be appreciated. Regards, Shuo Li

Re: Nutch-Selenium in Nutch 1.10

2015-02-10 Thread Sapnashri Suresh
Hi Shuo Li, We were facing a similar issue. Prof. Mattman suggested we look into this patch for Selenium on Nutch 1.10 : https://issues.apache.org/jira/browse/NUTCH-1933. Hope this helps! Thanks, Sapna On Tue, Feb 10, 2015 at 9:36 PM, Shuo Li sli...@usc.edu wrote: Yop, I'm trying

Re: Nutch-Selenium in Nutch 1.10

2015-02-10 Thread Mattmann, Chris A (3980)
++ -Original Message- From: Sapnashri Suresh sapna...@usc.edu Reply-To: dev@nutch.apache.org dev@nutch.apache.org Date: Tuesday, February 10, 2015 at 9:42 PM To: dev@nutch.apache.org dev@nutch.apache.org Subject: Re: Nutch-Selenium in Nutch 1.10 Hi Shuo Li