Hi Jörg,

A bit off topic: I wonder whether you are indexing blobs as base64-encoded fields in the JDBC river? (I did not look at the docs.)
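For context on the question above: the mapper attachments plugin expects the attachment field to carry base64-encoded content. Below is a minimal Python sketch of what that encoding looks like for an HTML string. The field name `content` and the helper `build_attachment_doc` are hypothetical, chosen only for illustration, and the sketch says nothing about how the JDBC river itself would produce such a field.

```python
import base64
import json


def build_attachment_doc(html: str, field: str = "content") -> dict:
    """Return a document whose `field` holds the HTML as base64 text.

    `field` is a hypothetical name; it would have to match whatever
    field is mapped with type "attachment" in the index.
    """
    # Encode the HTML to bytes first, then to base64, then back to an
    # ASCII string so the value can be embedded in a JSON document.
    encoded = base64.b64encode(html.encode("utf-8")).decode("ascii")
    return {field: encoded}


doc = build_attachment_doc("<p>Hello <b>world</b></p>")
print(json.dumps(doc))
```

Decoding the field value with `base64.b64decode` gives back the original HTML bytes, which is a quick way to check that a row was encoded as intended.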
--
David ;-)
Twitter: @dadoonet / @elasticsearchfr / @scrutmydocs

> On 22 Feb 2015, at 18:11, "joergpra...@gmail.com" <joergpra...@gmail.com>
> wrote:
>
> Can you give some information about the mapper attachment setup you used
> successfully?
>
> There is no good reason why this should not be possible with JDBC river.
>
> Jörg
>
>> On Sun, Feb 22, 2015 at 5:20 PM, Jiri Pik <jiri....@googlemail.com> wrote:
>> I need to index an HTML column (nvarchar(MAX)) in a MS SQL Server database. I
>> have set up a JDBC river https://github.com/jprante/elasticsearch-river-jdbc
>> and the database is indexed.
>>
>> Using
>>
>> "settings":{
>>   "analysis":{
>>     "analyzer":{
>>       "default":{
>>         "type":"custom",
>>         "tokenizer":"standard",
>>         "filter":[ "standard", "lowercase" ],
>>         "char_filter" : ["html_strip"]
>>       }
>>     }
>>   }
>> }
>>
>> is good for searching but not for the highlighter, as that sometimes returns
>> truncated, unpaired HTML tags.
>>
>> I have played with the Mapper Attachments with HTML attachments, and then the
>> highlighter works well - all original HTML tags are gone - but I am unable
>> to get the river to push the column directly to the Mapper Attachments.
>>
>> Questions:
>>
>> 1. What is the best practice for indexing HTML columns? I am aware of the
>> possibility of manually removing HTML tags using Agility Pack, but I do not
>> like that as it's too much extra maintenance.
>>
>> 2. Is there any better highlighter for HTML data which doesn't cut off any
>> original HTML tags?
>>
>> 3. How do I plug the JDBC river into Mapper Attachments?
>>
>> 4. Any better ideas on how to achieve my goals?
>>
>> Thanks!
>>
>> --
>> You received this message because you are subscribed to the Google Groups
>> "elasticsearch" group.
>> To unsubscribe from this group and stop receiving emails from it, send an
>> email to elasticsearch+unsubscr...@googlegroups.com.
>> To view this discussion on the web visit
>> https://groups.google.com/d/msgid/elasticsearch/f175734b-0889-40a9-96d1-d46702e56666%40googlegroups.com.
>> For more options, visit https://groups.google.com/d/optout.