so far i understood, you need both columns in your database table. Otherwise the river wont be able to do the checks. The river compares the updated_at date with own date. If that date equal or in the future than the one from the river, the river try to update/insert your record to elasticsearch.
take a look to the code snippet: { *"strategy": "column"*, "type": "jdbc", "jdbc": { "url": "db server connect string", "user": "username", "schedule": "0 20/30 * * * ?", "password": "password", "index": "transactions_test", "type": "transaction_test", "sql": "SELECT * from my_transaction_table" } } Am Dienstag, 25. November 2014 13:34:08 UTC+1 schrieb Erlendur Hákonarson: > > Thanks Ramy > > but how does that strategy work > is there any doc on strategies I can view? > the only one I found was on the jprante github wiki and that only > describes the simple strategy > > and if I am using tables from a system that I have no control over and > those columns created_at and updated_at are not in those tables? > am I maybe misunderstanding this column strategy? > > best regards > Erlendur > > On Tuesday, November 25, 2014 11:00:18 AM UTC, Ramy wrote: >> >> Sorry... >> - created_at >> - updated_at >> >> Am Dienstag, 25. November 2014 11:55:18 UTC+1 schrieb Ramy: >>> >>> Hi Erlendur, >>> In your case, you should use the column strategy instead of simple one. >>> The column strategy requires two columns in the SQL DB. >>> - cerated_at >>> - update_at >>> Cheers, Ramy >>> >>> Am Dienstag, 25. November 2014 11:04:17 UTC+1 schrieb Erlendur >>> Hákonarson: >>>> >>>> Hi I am new to ES but my company is starting to use it >>>> >>>> When I set up an river I have scheduled it to check for data changes at >>>> an 30 min interval, my largest index on dev includes 230k documents but in >>>> production is expected to grow to 300million docs >>>> this 230k index is a heavy load on the server when it checks for data >>>> changes, puts the core in 100% for approx. 5 minutes. >>>> It looks like it is reindexing the index every time, I am using simple >>>> strategy, can someone show me where I can find documentation on the >>>> different strategies? >>>> here is a sample of my river statement: >>>> >>>> { >>>> "_index" : "_river", >>>> "_type" : "transactions_test", >>>> "_id" : "_meta", >>>> "_score" : 1, >>>> "_source" : { >>>> "type" : "jdbc", >>>> "jdbc" : { >>>> "strategy" : "simple", >>>> "url" : "db server connect string", >>>> "user" : "username", >>>> "schedule" : "0 20/30 * * * ?", >>>> "password" : "password", >>>> "index" : "transactions_test", >>>> "type" : "transaction_test", >>>> "sql" : "SELECT * from my_transaction_table" >>>> } >>>> } >>>> }, >>>> >>>> best regards >>>> Erlendur >>>> >>>> >>>> -- You received this message because you are subscribed to the Google Groups "elasticsearch" group. To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscr...@googlegroups.com. To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/8e681d42-b3bd-4e76-9254-712ada901a91%40googlegroups.com. For more options, visit https://groups.google.com/d/optout.