Re: [Dbpedia-discussion] Querying for keywords while discarding accents

2014-06-27 Thread Tom Morris
For any type of search application, you not only want to do case and accent folding, but also Unicode normalization (you could have both precomposed and combining accent versions of the è in Isère). Typically a search engine could be directed to normalize both th

Re: [Dbpedia-discussion] Querying for keywords while discarding accents

2014-06-27 Thread Andrea Di Menna
Hi, there is no magic in that. It only happens that wikipedia has got a page Isere ( http://en.wikipedia.org/wiki/Isere) which is actually a mere redirect to Isère (http://en.wikipedia.org/wiki/Is%C3%A8re). Hence the framework links the two DBpedia entities together in a triple - dbpedia:Isere

Re: [Dbpedia-discussion] Querying for keywords while discarding accents

2014-06-27 Thread Romain Beaumont
Did you consider using the keyword search (http://wiki.dbpedia.org/Lookup) ? for example http://lookup.dbpedia.org/api/search.asmx/KeywordSearch?QueryString=Is%C3%A8re 2014-06-27 13:46 GMT+02:00 Mohammad Ghufran : > Hello, > > Thank you for your reply. Yes, I tried doing that. If i try to remov

Re: [Dbpedia-discussion] Querying for keywords while discarding accents

2014-06-27 Thread Mohammad Ghufran
Hello, Thank you for your reply. Yes, I tried doing that. If i try to remove the accents, i normally get a redirection page in the search results. I can then get the resource uri for this result and get the actual resource page. However, this only happens sometimes. For example, a region in France

Re: [Dbpedia-discussion] Querying for keywords while discarding accents

2014-06-27 Thread Andrea Di Menna
Hi, I also think you should do some preprocessing using ASCII Folding techniques. You could fold labels and add them as additional surface forms for the entity. The same process would apply for labels coming from different languages. I have successfully used this approach in a project where Solr wa

Re: [Dbpedia-discussion] Querying for keywords while discarding accents

2014-06-27 Thread Romain Beaumont
Hello, I think you are going to do some preprocessing. For example to handle accents, you can just remove them (in your program/script/...) before transforming it to sparql. Some labels are present in different languages in DBpedia, maybe you could use that ? 2014-06-27 10:57 GMT+02:00 Mohammad

[Dbpedia-discussion] Querying for keywords while discarding accents

2014-06-27 Thread Mohammad Ghufran
Hello, I am using dbpedia to work with locations in order to compare them and determine if two locations are same / similar and to what extent. Since my data source can be user input, the data normally does not match the exact resource / label defined in dbpedia. I am using the sparql endpoint fo