Hello, Mr. Karl Wright:
Thank you for quick response.
As you mentioned, yes I am so writing my Repository Connector to access the
REST api I want to use.
If I need to do more scraping than provided html-extractor, then I should
write a transformer connector that works as I want.
Is the statement
Hi Kaya,
You should be able to use the existing Solr connector to index documents
into Solr.
You will probably need to write a Repository connector to access the REST
api you describe.
If the kind of scraping you need to do can be covered by the html-extractor
transformer in its current form,
Hello, falks:
I have a question about crawling and scraping in Manifold CF.
I want to the following sequence of tasks by using MCF.
1. crawling data from RESTful api
2. scraping data
3. insert the data to Apache Solr
In this case, how I need to setup Manifold CF is:
1. define output connector
It's also possible that the jtds driver now needs another jar included as a
dependency for working with mssql.
Unfortunately, you'll probably need to figure this out on your own. Please
let me know what you find so that I can update instructions or code.
Karl
On Wed, Feb 20, 2019 at 11:31 AM
OK, you are basically having trouble with the JDBC connector, not the basic
functioning of ManifoldCF. That was not clear.
The JDBC driver class name for MSSQL has likely been updated and we'll need
to figure out what it got changed to.
Karl
On Wed, Feb 20, 2019 at 11:20 AM Bisonti Mario
Yes, infact:
administrator@sengvivv01:/opt/manifoldcf/multiprocess-zk-example-proprietary$
more options.env.unix
-Xms3048m
-Xmx3048m
-Dorg.apache.manifoldcf.configfile=./properties.xml
-cp
You should be doing the following to run initialize.sh:
cd dist/multiprocess-zk-example-proprietary
./initialize.sh
The class path is pulled in from options.env.unix, which should include
your jar:
>>
C:\wip\mcf\trunk\dist\multiprocess-zk-example-proprietary>more
options.env.unix
-Xms512m
The question is: how are you *starting* the processes? and what process
are you seeing the error from? You should *not* need to make any changes
to the configuration if you put the jar file in place before building.
Karl
On Wed, Feb 20, 2019 at 9:47 AM Bisonti Mario
wrote:
> Thanks, Karl
Thanks, Karl but I didn’t download manually the .jar files.
I compiled MCF 2.12 and I found the jar in the lib-proprietary folder.
I added in properties.xml the :
I tried to :
initialize.sh the db but I have the same error.
Da: Karl Wright
Inviato: mercoledì 20 febbraio 2019 15:15
A:
Hi Mario,
You can't just plop down a jar in a directory and have this work, because
ManifoldCF requires all JDBC drivers to be in the root classpath. They are
therefore built into the classpath, which should happen if you use the
startup scripts. Please review the "how-to-build-and-deploy"
10 matches
Mail list logo