Author: ogrisel
Date: Thu Jun 30 20:06:36 2011
New Revision: 1141693

URL: http://svn.apache.org/viewvc?rev=1141693&view=rev
Log:
STANBOL-92: download a prebuilt solr index of Wikipedia to be included in the 
defaultdata artifact

Modified:
    incubator/stanbol/trunk/defaultdata/README.md
    incubator/stanbol/trunk/defaultdata/download_models.sh
    
incubator/stanbol/trunk/defaultdata/src/main/resources/org/apache/stanbol/defaultdata/site/dbpedia/index/
   (props changed)

Modified: incubator/stanbol/trunk/defaultdata/README.md
URL: 
http://svn.apache.org/viewvc/incubator/stanbol/trunk/defaultdata/README.md?rev=1141693&r1=1141692&r2=1141693&view=diff
==============================================================================
--- incubator/stanbol/trunk/defaultdata/README.md (original)
+++ incubator/stanbol/trunk/defaultdata/README.md Thu Jun 30 20:06:36 2011
@@ -8,12 +8,14 @@ To avoid loading subversion repository w
 to be build and deployed manually to retrieve precomputed models from other
 sites.
 
-## Downloading the OpenNLP statistical model files
 
-Use the `download_models.sh` script.
+## Downloading the OpenNLP statistical model files and pre-built Solr Index
 
-## Building Entity Hub indices
+Under Unix, use the `download_models.sh` script and then run `mvn install`.
+
+Under windows, read the script content and do the same operations manually :)
 
-TODO
 
+## Building Entity Hub indices
 
+See the online documentation: (TODO: put the URL here when no longer staging)

Modified: incubator/stanbol/trunk/defaultdata/download_models.sh
URL: 
http://svn.apache.org/viewvc/incubator/stanbol/trunk/defaultdata/download_models.sh?rev=1141693&r1=1141692&r2=1141693&view=diff
==============================================================================
--- incubator/stanbol/trunk/defaultdata/download_models.sh (original)
+++ incubator/stanbol/trunk/defaultdata/download_models.sh Thu Jun 30 20:06:36 
2011
@@ -2,12 +2,19 @@
 
 OPENNLP_DATA=src/main/resources/org/apache/stanbol/defaultdata/opennlp
 MODELS_URL="http://opennlp.sourceforge.net/models-1.5";
-
+DBPEDIDA_SOLR_DATA=src/main/resources/org/apache/stanbol/defaultdata/site/dbpedia/index
+DBPEDIA_SOLR_URL="http://dl.dropbox.com/u/5743203/IKS/dbpedia/3.6/dbpedia_43k.solrindex.zip";
 
 rm -rf $OPENNLP_DATA/*.bin
 
 (cd $OPENNLP_DATA && wget $MODELS_URL/en-sent.bin)
+(cd $OPENNLP_DATA && wget $MODELS_URL/en-pos-perceptron.bin)
+(cd $OPENNLP_DATA && wget $MODELS_URL/en-chunker.bin)
 (cd $OPENNLP_DATA && wget $MODELS_URL/en-ner-person.bin)
 (cd $OPENNLP_DATA && wget $MODELS_URL/en-ner-location.bin)
 (cd $OPENNLP_DATA && wget $MODELS_URL/en-ner-organization.bin)
 
+
+rm -rf $DBPEDIDA_SOLR_DATA/*.zip
+
+(cd $DBPEDIDA_SOLR_DATA && wget $DBPEDIA_SOLR_URL)

Propchange: 
incubator/stanbol/trunk/defaultdata/src/main/resources/org/apache/stanbol/defaultdata/site/dbpedia/index/
------------------------------------------------------------------------------
--- svn:ignore (added)
+++ svn:ignore Thu Jun 30 20:06:36 2011
@@ -0,0 +1 @@
+*.zip


Reply via email to