Maybe storing the protocol info in a separate field would be cleaner. Otis Solr & ElasticSearch Support http://sematext.com/ On Oct 3, 2013 3:20 AM, "Furkan KAMACI" <furkankam...@gmail.com> wrote:
> I have two questions: > * > * > *First one:* > > I have a url field at my index. I have some supported protocols. i.e. http > and https. How can I list the urls at my index that has is not a supported > url? (which query parser do you suggest for such kind of purposes)? > > http://www.google.com/sfdsd sfsdf sfdsf/asfsf - sdfsfdsf/ -> *supported* > https://www.google.com/sfdsd sfsdf sfdsf/asfsf - sdfsfdsf/-> *supported* > *ftp*://www.google.com/sfdsd sfsdf sfdsf/asfsf - sdfsfdsf/ -> *not > supported > * > > *Second one:* > > My next aim will be detecting urls that does not have protocol definition: > > http://www.google.com/sfdsd sfsdf sfdsf/asfsf - sdfsfdsf/ -> *supported* > www.google.com/sfdsd sfsdf sfdsf/asfsf - sdfsfdsf/-> *not supported* >