I think this is probably better to discuss on solr-user, or maybe solr-dev, since it is dismax parser you are talking about, which really lives in Solr. However, my 2c - this seems somewhat dubious. Maybe people want to include those in their terms? Also, it leads to a kind of slippery slope: would you also want to convert all the various white space characters (no-break space, thin space, em space, etc) as vanilla ascii 32? How about all the other "operator" characters like brackets?
On Mon, Jan 21, 2019 at 9:50 AM John Ryan <[email protected]> wrote: > I'm looking to create an issue to add support for Unicode Double Quotes to > the dismax parser. > > I want to replace all types of double quotes with standard ones before > they get stripped > > i.e. > “ ” „ “ „ « » ‟ ❝ ❞ ⹂ " > > With > " > I presume this has been discussed before? > > I have a POC here: > https://github.com/apache/lucene-solr/compare/branch_7x...jnyryan:branch_7x > > Thanks, > > John > --------------------------------------------------------------------- > To unsubscribe, e-mail: [email protected] > For additional commands, e-mail: [email protected] > >
