Hi All,
Importing the page-article dump into the MySQL database is unsuccessful.
First I received this exception:
Caused by: javax.xml.stream.XMLStreamException: ParseError at
[row,col]:[1,249]
Message: expected mediawiki with namespace [
http://www.mediawiki.org/xml/export-0.6/], found [
Hi Jona,
I have only replaced 0.8 with 0.6 (marked in bold). Also, this is not the
entire English file but a fraction of it, available at the DBpedia dumps. It
is only 144 MB. This file is available at
http://dumps.wikimedia.org/enwiki/20130204/ named
On Mar 28, 2013 4:14 PM, gaurav pant golup...@gmail.com wrote:
Hi Jona,
I have only replaced 0.8 with 0.6 (marked in bold). Also, this is not the
entire English file but a fraction of it, available at the DBpedia dumps. It
is only 144 MB. This file is available at
Hi Jona,
You said that the importer only imports templates. Then how does the page
data get imported into the database?
If the page data is not imported into the database, then how do the
abstracts get extracted?
On Thu, Mar 28, 2013 at 8:55 PM, Jona Christopher Sahnwaldt j...@sahnwaldt.de
wrote:
On Mar
The abstract extractor reads the article text from the dump file and sends
it to the modified MediaWiki. That's faster than letting MW load the
article text from the database. MW only has to load the templates from the
database, and there are so few of them (a few thousand) that they can
probably
Hi Gaurav,
Are you sure you are using the latest version of the source code?
In particular in line 74 of file
core/org/dbpedia/extraction/sources/WikipediaDumpParser.java
you should have the following
private final String _namespace = null; // "http://www.mediawiki.org/xml/export-0.8/";
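To illustrate why a null namespace helps: in the standard StAX API, XMLStreamReader.require skips the namespace comparison when the namespace argument is null, so a dump that declares export-0.8 passes a check that a hard-coded export-0.6 would fail. The sketch below only demonstrates that behaviour. It is not the actual WikipediaDumpParser code, and the class name DumpNamespaceCheck, the method rootMatches and the one-line sample document are made up for the example.

```java
import java.io.StringReader;
import javax.xml.stream.XMLInputFactory;
import javax.xml.stream.XMLStreamConstants;
import javax.xml.stream.XMLStreamException;
import javax.xml.stream.XMLStreamReader;

// Hypothetical helper, not part of the extraction framework.
final class DumpNamespaceCheck {

    /**
     * Returns true if the root element is <mediawiki> in the expected namespace.
     * A null expectedNamespace means "do not check the namespace at all".
     */
    static boolean rootMatches(String xml, String expectedNamespace) {
        try {
            XMLStreamReader reader = XMLInputFactory.newInstance()
                    .createXMLStreamReader(new StringReader(xml));
            try {
                reader.nextTag(); // advance to the root start element
                reader.require(XMLStreamConstants.START_ELEMENT, expectedNamespace, "mediawiki");
                return true;
            } finally {
                reader.close();
            }
        } catch (XMLStreamException e) {
            // require() reports a mismatch this way; in this sketch malformed XML also ends up here.
            return false;
        }
    }

    public static void main(String[] args) {
        // Made-up sample: an empty dump that declares the 0.8 export namespace.
        String dump = "<mediawiki xmlns=\"http://www.mediawiki.org/xml/export-0.8/\"/>";
        System.out.println("expect 0.6:  " + rootMatches(dump, "http://www.mediawiki.org/xml/export-0.6/"));
        System.out.println("expect 0.8:  " + rootMatches(dump, "http://www.mediawiki.org/xml/export-0.8/"));
        System.out.println("expect null: " + rootMatches(dump, null));
    }
}
```

Editing the namespace string inside the dump file changes only what the file claims, so updating the code to a version that does not hard-code the export version is the more robust route.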
Hi Jona/All,
I have changed pom.xml accordingly, as shown below, but I am getting an error.
<launcher>
<id>import</id>
<mainClass>org.dbpedia.extraction.dump.sql.Import</mainClass>
<jvmArgs>
<jvmArg>-server</jvmArg>
Did you update the code to the latest version? git pull, update, etc.?
On Mar 25, 2013 8:33 AM, gaurav pant golup...@gmail.com wrote:
Hi Jona/All,
I have changed pom.xml accordingly, as shown below, but I am getting an error.
<launcher>
<id>import</id>
On 25 March 2013 19:35, gaurav pant golup...@gmail.com wrote:
Hi All/Jona,
With the updated dump I am able to import data into MySQL. Now the issue is
with abstract extraction. Thanks, Jona, for all the help.
I am getting the errors below for many files during ../clean-install-run
extraction
Hi Gaurav,
I just changed the import stuff: in dump/pom.xml, you now use not
just the host name but a whole MySQL JDBC URL, as specified on
http://dev.mysql.com/doc/refman/5.1/en/connector-j-reference-configuration-properties.html,
which can include the username and password.
WARNING: The whole
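For reference, a minimal sketch of what such a JDBC URL can look like when it carries the credentials. The user and password property names are the ones listed on the Connector/J page linked above. Everything else is an assumption for illustration only: the host, port, database name, user and password are placeholders, and URL-encoding the values assumes that Connector/J decodes them again, which is worth verifying against that page. The sketch does not show where exactly the URL goes in dump/pom.xml.

```java
import java.net.URLEncoder;
import java.nio.charset.StandardCharsets;

// Hypothetical helper, not part of the extraction framework.
final class JdbcUrlSketch {

    /** Builds jdbc:mysql://host:port/database?user=...&password=... */
    static String build(String host, int port, String database, String user, String password) {
        return "jdbc:mysql://" + host + ":" + port + "/" + database
                + "?user=" + encode(user)
                + "&password=" + encode(password);
    }

    // Encodes characters such as '&' that would otherwise split the query string.
    // URLEncoder.encode(String, Charset) needs Java 10 or later.
    private static String encode(String value) {
        return URLEncoder.encode(value, StandardCharsets.UTF_8);
    }

    public static void main(String[] args) {
        // All values below are placeholders.
        System.out.println(build("localhost", 3306, "dewiki", "dbpedia", "change&me"));
    }
}
```

Keep in mind that a URL built this way contains the password in clear text, so it will show up wherever the URL is stored or logged.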
On 23 March 2013 07:05, gaurav pant golup...@gmail.com wrote:
Hi All,
I am now using the new code, but I am facing the problems below with the new repository.
@Jona: Yes, I am using Amazon Elastic Block Storage. Is there any issue with
this?
Not really, but... I don't know much about EBS, but I assume
On 23 March 2013 07:05, gaurav pant golup...@gmail.com wrote:
Hi All,
I am now using the new code, but I am facing the problems below with the new repository.
@Jona: Yes, I am using Amazon Elastic Block Storage. Is there any issue with
this?
@All: I could not understand where I should
On 20 March 2013 15:38, Mohamed Morsey mor...@informatik.uni-leipzig.de wrote:
Hi Jona and all,
On 03/20/2013 03:25 PM, Jona Christopher Sahnwaldt wrote:
On Wed, Mar 20, 2013 at 3:01 PM, gaurav pant golup...@gmail.com wrote:
Hi Morsy/All,
While running Import.sh I am getting below
Hi All,
I am now using the new code, but I am facing the problems below with the new repository.
@Jona: Yes, I am using Amazon Elastic Block Storage. Is there any issue
with this?
@All: I could not understand where I should give the MySQL
username and password. Also, I want it to use
Hi Morsey/All,
Please find the pom.xml file contents below.
<?xml version="1.0" encoding="UTF-8"?>
<project xmlns="http://maven.apache.org/POM/4.0.0">
<modelVersion>4.0.0</modelVersion>
<parent>
<groupId>org.dbpedia</groupId>
<artifactId>extraction</artifactId>
<version>3.8</version>
Hi Gaurav,
What are the permissions on this folder?
/mnt/ebs/perl/framework/dbpedia/target/classes
Maybe not related, but the error message seems to suggest Maven cannot
clean that directory.
Also, it might be useful to rerun the Maven command with the -eX switches.
Regards
Andrea
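As a quick way to check this from the JVM's point of view, here is a minimal sketch using java.nio.file.Files. The class name DirAccessCheck and its describe method are made up for the example. It only reports what the current user may do with the directory itself, so a failing clean can still be caused by the parent directory or by individual files inside it.

```java
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;

// Hypothetical helper, not part of the extraction framework.
final class DirAccessCheck {

    /** Describes whether the current user can read, write and enter the directory. */
    static String describe(Path dir) {
        if (!Files.exists(dir)) {
            return dir + ": does not exist";
        }
        if (!Files.isDirectory(dir)) {
            return dir + ": not a directory";
        }
        return dir + ": readable=" + Files.isReadable(dir)
                + " writable=" + Files.isWritable(dir)
                + " executable=" + Files.isExecutable(dir);
    }

    public static void main(String[] args) {
        // Defaults to the folder from the message above; pass another path as the first argument.
        String target = args.length > 0 ? args[0] : "/mnt/ebs/perl/framework/dbpedia/target/classes";
        System.out.println(describe(Paths.get(target)));
    }
}
```

Run it as the same user that runs Maven, otherwise the result says little about the failing build.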
2013/3/20 gaurav
Hi Gaurav,
I gave it a compile here using your , and it was successful.
Furthermore, regarding your further question, I would recommend that you
read the README file [1].
On 03/20/2013 11:18 AM, gaurav pant wrote:
Hi,
Please find below.
<?xml version="1.0" encoding="UTF-8"?>
<project
Hi Morsy/All,
While running Import.sh, I am getting the error below.
missing
/home/gaurav/other_lang_extraction/extraction-framework-master/dump/wiki_dump/dewiki/tables-no-indexes.sql
missing
Hi Gaurav,
If you use the old script, you'll have to use the old instructions :)
clean.sh is not necessary in the Scala-based import that Jona created.
On Wed, Mar 20, 2013 at 4:01 PM, gaurav pant golup...@gmail.com wrote:
Hi Morsy/All,
While running Import.sh, I am getting the error below.
On Wed, Mar 20, 2013 at 3:01 PM, gaurav pant golup...@gmail.com wrote:
Hi Morsy/All,
While running Import.sh, I am getting the error below.
Don't use import.sh.
missing
/home/gaurav/other_lang_extraction/extraction-framework-master/dump/wiki_dump/dewiki/tables-no-indexes.sql
missing
Yes, this is almost certainly a problem with file/folder permissions.
From looking at your path /mnt/ebs/, I assume you're using Amazon
Elastic Block Storage. Is that correct?
On Wed, Mar 20, 2013 at 11:48 AM, Andrea Di Menna ninn...@gmail.com wrote:
Hi Gaurav,
What are the permissions on this
Hi Jona and all,
On 03/20/2013 03:25 PM, Jona Christopher Sahnwaldt wrote:
On Wed, Mar 20, 2013 at 3:01 PM, gaurav pant golup...@gmail.com wrote:
Hi Morsy/All,
While running Import.sh, I am getting the error below.
Don't use import.sh.
missing
I also think that this is a Maven / folder permissions problem, as JC and
Andrea suggested.
The script breaks at $mvn clean,
so you should check your system.
Dimitris
On Wed, Mar 20, 2013 at 4:32 PM, Jona Christopher Sahnwaldt j...@sahnwaldt.de
wrote:
Yes, this is almost certainly a problem with