oh ok, it may be the silent timeout then. It has reindexed over a million records that mostly have stored fields. It stalled after that, the source collection has 20m records so it may be a large collection and silent timeout. The endpoint created a checkpoint collection that has one document in it, thinking a manual update to it may allow it to reset to where it can restart from? It does not allow writing to an existing collection otherwise large collection reindex with q set query by date could have been an alternative.
On Thu, May 25, 2023 at 2:55 PM ufuk yılmaz <[email protected]> wrote: > Afaik reindexcollection command uses streaming expressions and export > handler under the hood, so the fields need to have docValues. I need to > check the source code to see if it also works with just stored fields. > > If you have copyField s in your schema, it may also interfere because they > would be sent two times to the target field > > I don’t know, reindexcollection only works for me if I exclude copyFields > and only use fields with docValues. Also if your index is very large (100s > gb) it also times out silently. > > -ufuk yilmaz > — > > > On 25 May 2023, at 21:27, Rajani Maski <[email protected]> wrote: > > > > no, whatever the default is, assuming it is fl=* Would it help if it > lists > > field names? It will be a large list, about 50-60 fields so leaving it as > > default. All the fields are stored. > > > > On Thu, May 25, 2023 at 11:09 AM ufuk yılmaz <[email protected] > > > > wrote: > > > >> Are you specifying a list of field names when calling the api? > >> > >> -ufuk yilmaz > >> > >> — > >> > >>> On 25 May 2023, at 17:47, Ed K <[email protected]> wrote: > >>> > >>> it has happened to me in the past and I've had to restart it from > >> beginning. > >>> > >>> On Thursday, May 25, 2023 at 06:46:04 AM PDT, Rajani Maski < > >> [email protected]> wrote: > >>> > >>> Hi Solr Users, > >>> > >>> The reindex collection api stops importing after it imports around 20% > >>> of total docs from source collection, how to restart it? I could find > >>> "start", "abort" and "status" commands on the documentation page, but > >> there > >>> is no command to restart from where it left. The status is running > >> however > >>> it is stalled. Following is the api I used to start it. Any > suggestions > >> on > >>> how to restart from where it left, maybe update the checkpoint > >> collection? > >>> > >>> > >>> > >>> > >> > http://host:8983/solr/admin/collections?action=REINDEXCOLLECTION&name=v9&numShards=12&collection.configName=v9-schema&replicationFactor=1 > >> > >> > >
