Hi there,

Sorry for the length; most of it really is log output. The basic issue is 
reflected in the subject: DIH runs fine, but even with an extra optimize on top 
(which should not be necessary given my DIH config) the index remains empty.

(I switched from 3.6.1 to 4.0-ALPHA because of Hoss' answer to my question 
"NumberFormatException while indexing TextField with LengthFilter" (on this 
same list). Earlier today I had an index set up with 4.0-ALPHA and could verify 
that Hoss' suggestion works. But now I cannot get that index filled again.
SOLR runs inside Jetty, which is started via "mvn jetty:run-war". SOLR_HOME is 
set to a subdirectory of Maven's target dir. I have been using this setup 
successfully with SOLR 3.* for some time now. While configuring the index, I 
often do a "mvn clean; mvn jetty:run-war" so that SOLR_HOME, including the 
index, is completely removed and recreated from scratch.)


After running a full import of DIH on core "issues" using:
http://localhost:9090/solr/issues/dataimport?command=full-import&importfile=/absolute/path/to/issues.xml

I get the response:

<response>
<lst name="responseHeader">
<int name="status">0</int>
<int name="QTime">1</int>
</lst>
<lst name="initArgs">
<lst name="defaults">
<str name="update.chain">emptyFieldChain</str>
<str name="config">data-config.xml</str>
<str name="clean">true</str>
<str name="commit">true</str>
<str name="optimize">true</str>
</lst>
</lst>
<str name="status">idle</str>
<str name="importResponse"/>
<lst name="statusMessages">
<str name="Total Requests made to DataSource">0</str>
<str name="Total Rows Fetched">294</str>
<str name="Total Documents Skipped">0</str>
<str name="Full Dump Started">2012-07-24 15:46:27</str>
<str name="">
Indexing completed. Added/Updated: 294 documents. Deleted 0 documents.
</str>
<str name="Committed">2012-07-24 15:46:28</str>
<str name="Optimized">2012-07-24 15:46:28</str>
<str name="Total Documents Processed">294</str>
<str name="Time taken">0:0:0.605</str>
</lst>
<str name="WARNING">
This response format is experimental. It is likely to change in the future.
</str>
</response>

This suggests that everything went fine, including commit and optimize, and 
that the index should now contain 294 documents. Well, it doesn't.
Trying to get it working again, I have replaced large parts of my 
solrconfig.xml with the corresponding parts from the current 4.0-ALPHA 
(https://builds.apache.org/job/Solr-trunk/ws/checkout/), but this doesn't 
change a thing. The schema version is set to 1.5.
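
One way to cross-check the reported count against the index is sketched below 
in Python. It is only a sketch under assumptions: it presumes the core exposes 
the usual /select handler (not shown in this mail), and the helper names 
dih_status_counts and count_query_url are made up for illustration.

```python
import xml.etree.ElementTree as ET
from urllib.parse import urlencode


def dih_status_counts(xml_text):
    """Return the <str> entries under statusMessages of a DIH response as a dict."""
    root = ET.fromstring(xml_text)
    messages = root.find("lst[@name='statusMessages']")
    if messages is None:
        return {}
    return {el.get("name"): (el.text or "").strip()
            for el in messages.findall("str")}


def count_query_url(base, core):
    """Build a match-all query URL that asks for the hit count only (rows=0).

    Assumes a /select handler is registered on the core.
    """
    return f"{base}/{core}/select?" + urlencode({"q": "*:*", "rows": 0})
```

Fetching the URL from count_query_url("http://localhost:9090/solr", "issues") 
and comparing its numFound with "Total Documents Processed" from the DIH 
response would show the mismatch described above.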



When starting the server it outputs:

24.07.2012 16:00:16 org.apache.solr.core.SolrCore <init>
INFO: [issues] Opening new SolrCore at target/classes/core_issues/, 
dataDir=target/classes/core_issues/data/
…
24.07.2012 16:00:16 org.apache.solr.core.SolrCore getNewIndexDir
WARNUNG: New index directory detected: old=null 
new=target/classes/core_issues/data/index/
24.07.2012 16:00:16 org.apache.solr.core.SolrCore initIndex
WARNUNG: [issues] Solr index directory 'target/classes/core_issues/data/index' 
doesn't exist. Creating new index...
24.07.2012 16:00:16 org.apache.solr.core.SolrDeletionPolicy onCommit
INFO: SolrDeletionPolicy.onCommit: commits:num=1
        
commit{dir=/path/to/maven-project/target/classes/core_issues/data/index,segFN=segments_1,generation=1,filenames=[segments_1]
24.07.2012 16:00:16 org.apache.solr.core.SolrDeletionPolicy updateCommits
INFO: newest commit = 1
…
24.07.2012 16:00:16 org.apache.solr.search.SolrIndexSearcher <init>
INFO: Opening Searcher@920ab60 main
24.07.2012 16:00:16 org.apache.solr.core.SolrCore registerSearcher
INFO: [issues] Registered new searcher Searcher@920ab60 
main{StandardDirectoryReader(segments_1:1)}
24.07.2012 16:00:16 org.apache.solr.update.CommitTracker <init>
INFO: Hard AutoCommit: if uncommited for 15000ms; 
24.07.2012 16:00:16 org.apache.solr.update.CommitTracker <init>
INFO: Soft AutoCommit: disabled
24.07.2012 16:00:16 org.apache.solr.handler.dataimport.DataImportHandler 
processConfiguration
INFO: Processing configuration from solrconfig.xml: 
{update.chain=emptyFieldChain,config=data-config.xml,clean=true,commit=true,optimize=true}
24.07.2012 16:00:16 org.apache.solr.handler.dataimport.DataImporter 
loadDataConfig
INFO: Data Configuration loaded successfully
24.07.2012 16:00:16 org.apache.solr.core.QuerySenderListener newSearcher
INFO: QuerySenderListener sending requests to Searcher@920ab60 
main{StandardDirectoryReader(segments_1:1)}
24.07.2012 16:00:16 org.apache.solr.core.CoreContainer register
INFO: registering core: issues



When running the DIH full import, the log output is:

24.07.2012 16:00:31 org.apache.solr.handler.dataimport.DataImporter doFullImport
INFO: Starting Full Import
24.07.2012 16:00:31 org.apache.solr.core.SolrCore execute
INFO: [issues] webapp=/solr path=/dataimport 
params={command=full-import&importfile=/path/to/maven-project/src/test/resources/issues.xml}
 status=0 QTime=4 
24.07.2012 16:00:31 org.apache.solr.handler.dataimport.SimplePropertiesWriter 
readIndexerProperties
WARNUNG: Unable to read: dataimport.properties
24.07.2012 16:00:32 org.apache.solr.handler.dataimport.DocBuilder finish
INFO: Import completed successfully
24.07.2012 16:00:32 org.apache.solr.handler.dataimport.SimplePropertiesWriter 
readIndexerProperties
WARNUNG: Unable to read: dataimport.properties
24.07.2012 16:00:32 org.apache.solr.handler.dataimport.SimplePropertiesWriter 
persist
INFO: Wrote last indexed time to dataimport.properties
24.07.2012 16:00:32 org.apache.solr.handler.dataimport.DocBuilder execute
INFO: Time taken = 0:0:0.566



I'm a bit confused that it does not output the commit logs, but it didn't do 
that before either, if I remember correctly. When I issue an "Optimize" via 
the new WebGUI admin, I get the following log output:


24.07.2012 16:01:28 org.apache.solr.client.solrj.impl.HttpClientUtil 
createClient
INFO: Creating new http client, 
config:maxConnections=200&maxConnectionsPerHost=8
24.07.2012 16:01:28 org.apache.solr.core.SolrDeletionPolicy onInit
INFO: SolrDeletionPolicy.onInit: commits:num=1
        
commit{dir=/path/to/maven-project/target/classes/core_issues/data/index,segFN=segments_1,generation=1,filenames=[segments_1]
24.07.2012 16:01:28 org.apache.solr.core.SolrDeletionPolicy updateCommits
INFO: newest commit = 1
24.07.2012 16:01:28 org.apache.solr.update.DirectUpdateHandler2 commit
INFO: start 
commit{flags=0,version=0,optimize=true,openSearcher=true,waitSearcher=true,expungeDeletes=false,softCommit=false}
24.07.2012 16:01:28 org.apache.solr.search.SolrIndexSearcher <init>
INFO: Opening Searcher@471ed915 main
24.07.2012 16:01:28 org.apache.solr.core.QuerySenderListener newSearcher
INFO: QuerySenderListener sending requests to Searcher@471ed915 
main{StandardDirectoryReader(segments_1:1)}
24.07.2012 16:01:28 org.apache.solr.update.DirectUpdateHandler2 commit
INFO: end_commit_flush
24.07.2012 16:01:28 org.apache.solr.core.SolrCore execute
INFO: [issues] webapp=null path=null 
params={event=newSearcher&q=Issue&distrib=false&qt=dismax} hits=0 status=0 
QTime=1 
24.07.2012 16:01:28 org.apache.solr.core.QuerySenderListener newSearcher
INFO: QuerySenderListener done.
24.07.2012 16:01:28 org.apache.solr.core.SolrCore registerSearcher
INFO: [issues] Registered new searcher Searcher@471ed915 
main{StandardDirectoryReader(segments_1:1)}
24.07.2012 16:01:28 org.apache.solr.update.processor.LogUpdateProcessor finish
INFO: [issues] webapp=/solr path=/update 
params={optimize=true&waitFlush=true&wt=json} {optimize=} 0 28


And the admin UI, as well as any query, still returns no documents at all. The 
index directory looks like this:

$ ls -l /path/to/maven-project/target/classes/core_issues/data/index/
total 16
-rw-r--r--  1 chantal  staff  20 24 Jul 16:00 segments.gen
-rw-r--r--  1 chantal  staff  45 24 Jul 16:00 segments_1
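
The listing above can also be checked mechanically. The Python sketch below 
assumes the usual Lucene layout, in which segments_N and segments.gen only 
describe commit points and the documents themselves live in other files (for 
example _0.*). The function name has_segment_data is hypothetical.

```python
import re

# Files that describe commit points or locking but hold no document data.
# Assumption: Lucene naming of this era (segments_N, segments.gen, write.lock).
_NON_DATA_FILE = re.compile(r"^(segments(_[0-9a-z]+|\.gen)|write\.lock)$")


def has_segment_data(filenames):
    """Return True if any file other than commit-point/lock files is present."""
    return any(not _NON_DATA_FILE.match(name) for name in filenames)
```

For the two files shown above, has_segment_data(["segments.gen", "segments_1"]) 
returns False, which would be consistent with nothing having been written to 
this directory.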


Any suggestions very much appreciated!

Chantal
