from:"German Carrillo"

Stop words and Keyword tokenizer

2014-08-28 Thread German Carrillo

Hi all, 


I'm looking for a way to remove stop words from tokens returned by a 
keyword tokenizer, i.e., I'd like to obtain the original text without stop 
words after the analysis process. 

Sample data looks like:
    El corregimiento de Mulaló, jurisdicción del municipio de Yumbo (Valle del Cauca)
After the lowercase token filter:
    el corregimiento de mulaló, jurisdicción del municipio de yumbo (valle del cauca)
After the ascii folding token filter:
    el corregimiento de mulalo, jurisdiccion del municipio de yumbo (valle del cauca)
After removing stop words:
    corregimiento mulalo, municipio yumbo (valle cauca)

The stop words (currently) are: [la, el, de, del, los, las, jurisdiccion]
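
The transformation described above can be reproduced outside Elasticsearch to check the expected output. The following is only a sketch in plain Python, not Elasticsearch code: the function names are made up for illustration, and the Unicode-decomposition folding is an approximation of what the ascii folding token filter does, sufficient for this sample.

```python
import re
import unicodedata

# Stop words taken from the message above.
STOP_WORDS = ["la", "el", "de", "del", "los", "las", "jurisdiccion"]


def fold_ascii(text):
    """Approximate ASCII folding: decompose accents, drop combining marks."""
    decomposed = unicodedata.normalize("NFKD", text)
    return "".join(ch for ch in decomposed if not unicodedata.combining(ch))


def stop_words_pattern(words):
    """Build one whole-word regex from a plain list of stop words."""
    alternation = "|".join(re.escape(w) for w in words)
    # \b keeps "de" from matching inside "del" or "la" inside "mulalo".
    return re.compile(r"\b(?:" + alternation + r")\b\s*")


def analyze(text, words=STOP_WORDS):
    """Treat the whole text as a single token, as a keyword tokenizer would."""
    token = fold_ascii(text.lower())
    token = stop_words_pattern(words).sub("", token)
    # Collapse any leftover runs of whitespace.
    return re.sub(r"\s+", " ", token).strip()
```

Calling `analyze()` on the sample text yields the string shown under "After removing stop words", and the regular expression is generated from the list rather than written by hand.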

Is the pattern replace token filter the only (or best) way to go for such a 
task? 

I'd much rather specify a stop words list than write custom regular 
expressions. A stop words list works perfectly fine with other tokenizers.
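
One possible compromise, sketched here in Python, is to keep maintaining a plain stop words list and generate the pattern for a pattern_replace token filter from it. The analyzer and filter names (`keyword_no_stop_words`, `strip_stop_words`) and the helper function are hypothetical, and the resulting settings have not been tested against a cluster.

```python
import re


def build_analysis_settings(stop_words):
    """Return index analysis settings as a dict (a sketch, not verified on ES)."""
    # re.escape is Python's escaping; for plain ASCII words like these it
    # leaves the text unchanged, so the pattern is also valid Java regex.
    alternation = "|".join(re.escape(w) for w in stop_words)
    return {
        "analysis": {
            "filter": {
                # Hypothetical filter name; removes whole-word stop words.
                "strip_stop_words": {
                    "type": "pattern_replace",
                    "pattern": r"\b(" + alternation + r")\b\s*",
                    "replacement": "",
                }
            },
            "analyzer": {
                # Hypothetical analyzer name; order matters, since the stop
                # words are listed in their lowercased, folded form.
                "keyword_no_stop_words": {
                    "type": "custom",
                    "tokenizer": "keyword",
                    "filter": ["lowercase", "asciifolding", "strip_stop_words"],
                }
            },
        }
    }
```

The dict can be serialized with `json.dumps` and sent as the `settings` section when creating an index. A stop word at the very end of the text would leave a trailing space behind, which may or may not matter for the use case.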


Regards, 

Germán

-- 
You received this message because you are subscribed to the Google Groups 
elasticsearch group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to elasticsearch+unsubscr...@googlegroups.com.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/elasticsearch/038ff037-ccf3-4aca-b0c0-bb421531c495%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

Issue (Bug?) with field_value_factor and Search Template

2014-07-31 Thread German Carrillo

Hi, 

I'm getting the following error message while attempting to use the Search 
Template (I use ES 1.3.1):

nested: ElasticsearchException[Unable to find a field mapper for field [weight]


I'm storing the template in the .script index as stated here [1]. The 
field 'weight' is used inside a field_value_factor (function_score). 

The query runs correctly when I don't use a Search Template. 
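
To make the comparison concrete, here is a rough Python sketch of the two request shapes. It is not the actual query or template from [2]: the `name` field, the `log1p` modifier, the `query_string` parameter, and the naive placeholder substitution (a stand-in for Mustache rendering) are all assumptions made for illustration. Only the `weight` field comes from the message.

```python
import json


def direct_query(text):
    """function_score query that boosts by the numeric 'weight' field."""
    return {
        "query": {
            "function_score": {
                # 'name' is a hypothetical text field.
                "query": {"match": {"name": text}},
                "field_value_factor": {"field": "weight", "modifier": "log1p"},
            }
        }
    }


# The same query with a placeholder where the search text goes.
TEMPLATE = json.dumps(direct_query("{{query_string}}"))


def render(template, params):
    """Naive stand-in for Mustache: substitute {{name}} with escaped values."""
    for key, value in params.items():
        escaped = json.dumps(value)[1:-1]  # JSON-escape, drop outer quotes
        template = template.replace("{{" + key + "}}", escaped)
    return json.loads(template)


def template_request(text):
    """Body for a search template request with an inline template."""
    return {"template": json.loads(TEMPLATE), "params": {"query_string": text}}
```

Under these assumptions the rendered template is identical to the direct query, so one would expect both requests to behave the same way.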


You can find a mapping, sample data, a working query, a sample template, and a 
non-working template query at [2].
Is this a bug? 


I'd appreciate any help,

Germán


[1] 
http://www.elasticsearch.org/guide/en/elasticsearch/reference/current/search-template.html#_filling_in_a_query_string_with_a_single_value
[2] https://titanpad.com/es-fvf-issue

To view this discussion on the web visit 
https://groups.google.com/d/msgid/elasticsearch/65d2ceb8-37cb-4f61-92e6-e85c0c4f0b6e%40googlegroups.com.
