Gora Mohanty-3 wrote:
> 
> On Wed, Sep 15, 2010 at 2:31 PM, yklxmas <yklx...@gmail.com> wrote:
>>
>> [...]
>> Basically, we have a file system that has many XML files to be indexed by
>> Solr. However, users might make changes to the files by using another
>> editorial system that will export XML to the file system. After the XML is
>> exported, a call is made to Solr to trigger a full import.
> 
>>>What do you mean by "full import" in this context? Are you simply POSTing
>>>the XML file to Solr, or do you have a custom data import handler for which
>>>you trigger a full-import?
> 
> I'm using the standard data import handler with a file data source and XPath
> processor, so my script will be calling
> http://host:8983/solr/dataimport?command=full-import
> 
>>
>> I've got a few questions:
>>
>> If a full import is triggered while Solr is still busy importing from the
>> previous call, what will happen?
> 
>>>If a full-import is still running, the standard DataImportHandler interface:
>>>solr/dataimport?command=full-import will inform you that an import is still
>>>running, and a fresh import will not be started.
> 
> Does the fresh import go into a queue, or does it simply not start at all?
> If it won't start, that means I need to find out the status and start again
> if necessary.
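
One way to handle the case where the import does not start is to poll the
handler's status command and only issue full-import once it reports idle.
The following is a minimal Python sketch, not a tested deployment script. It
assumes the status response is the usual Solr XML with a top-level
<str name="status"> element whose value is "idle" or "busy". The function
names, the poll interval, and the "host" placeholder are illustrative and do
not come from this thread.

```python
import time
import urllib.request
import xml.etree.ElementTree as ET

# Assumed handler URL, based on the path quoted in this thread; adjust as needed.
DIH_URL = "http://host:8983/solr/dataimport"


def parse_status(xml_text):
    """Return the text of the top-level <str name="status"> element, or None."""
    root = ET.fromstring(xml_text)
    node = root.find("./str[@name='status']")
    return node.text if node is not None else None


def dih_command(command, base_url=DIH_URL):
    """Send one command (e.g. "status", "full-import") and return the body."""
    with urllib.request.urlopen(f"{base_url}?command={command}") as resp:
        return resp.read().decode("utf-8")


def import_when_idle(poll_seconds=5, max_polls=120,
                     fetch=dih_command, sleep=time.sleep):
    """Poll the status; trigger full-import once the handler reports idle.

    Returns True if an import was triggered, False if the handler was
    still busy after max_polls checks.
    """
    for _ in range(max_polls):
        if parse_status(fetch("status")) == "idle":
            fetch("full-import")
            return True
        sleep(poll_seconds)
    return False
```

Note that another caller could still start an import between the status check
and the full-import request, so this narrows the window rather than closing
it.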
> 
>> Shall I use delta-import instead of full-import, as most of the XML files
>> may remain unchanged?
> [...]
> 
>>>How about having a script that runs periodically (say, at off-peak hours where
>>>files are not being edited), checks the last-modified time of each file, and if
>>>the file has been modified since the last import, POSTs the file to Solr? Seems
>>>like this would be a simple solution.
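
The periodic-check idea could be sketched roughly as below. This is only an
illustration under stated assumptions: it assumes each exported file is
already in Solr's XML update format (<add><doc>...</doc></add>), which may
not hold for files that currently go through the XPath processor. The
/solr/update URL, the stamp file used to remember the last run, and all
function names are placeholders, not something given in this thread.

```python
import os
import time
import urllib.request

# Assumed update URL; "host" is a placeholder as elsewhere in this thread.
UPDATE_URL = "http://host:8983/solr/update"


def read_stamp(stamp_file):
    """Return the time of the last run as a float, or 0.0 if none is recorded."""
    try:
        with open(stamp_file) as fh:
            return float(fh.read().strip())
    except (FileNotFoundError, ValueError):
        return 0.0


def modified_since(root_dir, last_run):
    """List .xml files under root_dir whose mtime is later than last_run."""
    changed = []
    for dirpath, _dirs, files in os.walk(root_dir):
        for name in files:
            path = os.path.join(dirpath, name)
            if name.endswith(".xml") and os.path.getmtime(path) > last_run:
                changed.append(path)
    return sorted(changed)


def post_xml(payload, url=UPDATE_URL):
    """POST raw XML bytes to the update handler and return the HTTP status."""
    req = urllib.request.Request(
        url, data=payload,
        headers={"Content-Type": "text/xml; charset=utf-8"})
    with urllib.request.urlopen(req) as resp:
        return resp.status


def push_changes(root_dir, stamp_file, post=post_xml):
    """POST every file changed since the last run, commit, record the run time."""
    last_run = read_stamp(stamp_file)
    started = time.time()  # taken before the scan so later edits are not missed
    changed = modified_since(root_dir, last_run)
    for path in changed:
        with open(path, "rb") as fh:
            post(fh.read())
    if changed:
        post(b"<commit/>")
    with open(stamp_file, "w") as fh:
        fh.write(repr(started))
    return changed
```

A cron job (or the editorial system's export hook, if changes must go out
immediately) could call push_changes with the export directory and a stamp
file path.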
> 
> We need to push the changes out immediately. I need to rethink the process
> and suggest to my team a new approach more along the lines of what you
> suggested. Currently we haven't got that much data, but constant full
> importing will cause problems for sure.
> 
> Regards,
> Gora
> 
> 
On Wed, Sep 15, 2010 at 2:31 PM, yklxmas <yklx...@gmail.com> wrote:
>
> [...]
> Basically, we have a file system that has many XML files to be indexed by
> Solr. However, users might make changes to the files by using another
> editorial system that will export XML to the file system. After the XML is
> exported, a call is made to Solr to trigger a full import.

What do you mean by "full import" in this context? Are you simply POSTing
the XML file to Solr, or do you have a custom data import handler for which
you trigger a full-import?

>
> I've got a few questions:
>
> If a full import is triggered while Solr is still busy importing from the
> previous call, what will happen?

If a full-import is still running, the standard DataImportHandler interface:
solr/dataimport?command=full-import will inform you that an import is still
running, and a fresh import will not be started.

> Shall I use delta-import instead of full-import, as most of the XML files
> may remain unchanged?
[...]

How about having a script that runs periodically (say, at off-peak hours where
files are not being edited), checks the last-modified time of each file, and if
the file has been modified since the last import, POSTs the file to Solr? Seems
like this would be a simple solution.

Regards,
Gora


-- 
View this message in context: 
http://lucene.472066.n3.nabble.com/How-will-solr-behave-if-data-importing-is-called-while-another-importing-operation-is-still-ongoing-tp1478544p1479011.html
Sent from the Solr - User mailing list archive at Nabble.com.
