On 5. März 2014 00:25:40 MEZ, Chris Peterson <[email protected]> wrote:
>On 3/4/14, 2:59 AM, André Fiedler wrote:
>> And if someone feeds real world data, but does:
>>
>> latitude + random()
>> longitude + random()
>>
>> ?
>>
>> Wouldn't it mess up the geolocation service? Maybe 1 out of 100 won't
>> mess it up, but 1 out of 2 will.
>
>This is a tough problem. The server can't really trust the data 
>submitted from stumbler clients because malicious people could modify 
>their stumbler code or write bots to upload bogus data. We could
>require 
>stumblers to register a user account, but storing user data in 
>connection with location data could be a slippery slope for privacy.
>
>Fortunately, even if the database is flooded with bogus data, someone 
>using (say) a Firefox OS phone to geolocate their position with query 
>the server with the nearby Wi-Fi BSSIDs. The bogus BSSIDs are unlikely 
>to match real-world BSSIDs, so most would never be queried. But even if
>
>someone uploaded enough bogus BSSIDs that are matches, the server is 
>still going to compare multiple Wi-Fi AP positions and if one is 
>suspiciously far away from the others, we can ignore it.
>
>Real-world data is messy, so I'm definitely interested in hearing other
>
>ideas people have for cleaning up the data on the client, server, or 
>database side. :)
>
>
>chris
>_______________________________________________
>dev-geolocation mailing list
>[email protected]
>https://lists.mozilla.org/listinfo/dev-geolocation

We could make a query for the server database to see what the "damage" is.
This query would return all locations that have no other locations around them 
in a radius of 50-100m (we could use different values here).
Then we could take looks at the uploaded data of such locations -> is there a 
scheme which the bot makes use of which we are able to detect and prevent (at 
least until the bot developer changes the behaviour)
Are there other entries in the database which follow a similar scheme but have 
a location next to a regular/valid upload? 
Do these entries have completely different cell towers compared to the 
locations next to them? (do the providers of the faked cell towers even exist 
in this country?)

Maybe there is no way to do it automatically in a safe way but we could filter 
the database manually at least to filter those lonely plots in Poland out, 
because, no matter how important this data is to us, nobody could detect his 
position successfully with just a few plots in the landscape while driving -> 
later on  somebody will decide to drive there anyways and stumble the area 
correctly if he is really interested in seeing this area added to the Mozilla 
location service.
_______________________________________________
dev-geolocation mailing list
[email protected]
https://lists.mozilla.org/listinfo/dev-geolocation

Reply via email to