Hi all,

I'm using the Java API on ES 1.0.1 to bulk index medium-sized docs.
The documents come from a 150 MB XML file.
The average JSON document is about 500 bytes across 10 properties; I'm 
currently testing on 275,000 documents. Only some key properties are 
indexed; the rest are only stored in _source.
Bulk indexing is done in blocks of 5,000 documents.

While indexing continuously, the indexing speed degrades roughly linearly: 
at around the 100,000th item, a chunk takes 5 times longer than the first 
chunk did.
But when I close the TransportClient after each successful bulk index, the 
performance stays constant and indexing is excellent.
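
To make the per-chunk slowdown measurable, here is a minimal sketch of a 
batching loop with per-batch timing. It is not the actual indexing code: 
BulkBatcher, BulkSender and indexInBatches are hypothetical names, and 
BulkSender only stands in for whatever call sends one bulk request through 
the client.

```java
import java.util.ArrayList;
import java.util.List;

/** Splits documents into fixed-size batches and times each bulk call. */
class BulkBatcher {

    /** Hypothetical stand-in for the real call that sends one bulk request. */
    interface BulkSender {
        void send(List<String> jsonDocs);
    }

    /**
     * Sends docs in batches of batchSize and returns the elapsed time
     * of each batch in milliseconds, in sending order.
     */
    static List<Long> indexInBatches(List<String> docs, int batchSize, BulkSender sender) {
        if (batchSize <= 0) {
            throw new IllegalArgumentException("batchSize must be positive");
        }
        List<Long> timingsMs = new ArrayList<>();
        for (int start = 0; start < docs.size(); start += batchSize) {
            int end = Math.min(start + batchSize, docs.size());
            // Copy the slice so every call gets a fresh batch that holds
            // only documents not sent before.
            List<String> batch = new ArrayList<>(docs.subList(start, end));
            long t0 = System.nanoTime();
            sender.send(batch);
            timingsMs.add((System.nanoTime() - t0) / 1_000_000);
        }
        return timingsMs;
    }
}
```

Calling indexInBatches(docs, 5000, sender) with a sender that wraps the 
client's bulk call returns one timing per chunk, so the timings with and 
without reopening the client can be compared chunk by chunk.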

What causes this slowdown? Is it correct to close the TransportClient 
connection after each bulk request?
It seems to work: the index is ready and functional.

Setup: 1 GB given to Java, bootstrap.mlockall: true, ES_HEAP_SIZE = 
ES_MIN_MEM = ES_MAX_MEM = 1GB.
Windows 8, i7, 8 GB RAM, SSD.

Thanks in advance

Ondra
