On 06/22/2010 10:58 AM, Bert Huijben wrote:
>> -----Original Message-----
>> From: Geoff Worboys [mailto:[email protected]]
>> Sent: Tuesday, June 22, 2010 16:37
>> To: [email protected]
>> Subject: Generating a dump file using a powershell script
>>
> 
> <snip>
> 
>> Q2:  When writing the code to try and identify text versus
>> binary files I decided to look at what subversion did ... but
>> now I am confused.  In libsvn_subr\io.c function
>> svn_io_detect_mimetype2 a comment says:
>>      going to examine the first block of data, and make sure that 85%
>>      of the bytes are such that their value is in the ranges 0x07-0x0D
>>      or 0x20-0x7F, and that 100% of those bytes is not 0x00.
>> but my reading of this code
>>       if (((binary_count * 1000) / amt_read) > 850)
>>         {
>>           *mimetype = generic_binary;
>>           return SVN_NO_ERROR;
>>         }
>> suggests that it is actually setting the type to binary only
>> if it finds more than 85% are binary bytes (in earlier code a
>> file is forced to binary if any null byte is found).
>>
>> Can anyone explain this?  A bug or am I missing something?
> 
> Looking at the code, this looks like a bug to me. But it's not a bug
> that I'd like to fix without further review, because the current code
> might work better than the intended behavior for users of different
> character sets.
> 
> So it might be safer to just fix the documentation.

I've fixed the documentation on trunk.
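
For anyone following along, the behaviour that the quoted code appears to
implement can be sketched roughly as below. This is a minimal standalone
illustration, not the code in libsvn_subr/io.c: the function name
looks_binary is hypothetical, the byte ranges are taken from the quoted
comment, the threshold expression is taken from the quoted snippet, and
treating an empty block as text is an assumption made here to avoid a
division by zero.

```c
#include <stdbool.h>
#include <stddef.h>

/* Hypothetical helper (not part of the Subversion API): returns true
   when the first block of a file would be classified as binary under
   the rule discussed in this thread. */
bool looks_binary(const unsigned char *block, size_t amt_read)
{
  size_t i;
  size_t binary_count = 0;

  /* Assumption: an empty block gives no evidence of binary content. */
  if (amt_read == 0)
    return false;

  for (i = 0; i < amt_read; i++)
    {
      /* A single NUL byte is enough to force the binary verdict. */
      if (block[i] == 0)
        return true;

      /* Count bytes outside the "text" ranges 0x07-0x0D and 0x20-0x7F. */
      if (block[i] < 0x07
          || (block[i] > 0x0D && block[i] < 0x20)
          || block[i] > 0x7F)
        binary_count++;
    }

  /* Binary only when MORE than 85% of the bytes are non-text bytes. */
  return ((binary_count * 1000) / amt_read) > 850;
}
```

Under this reading, a NUL-free block in which half of the bytes are above
0x7F (for example, text in an 8-bit or multi-byte encoding) is still
treated as text, which is presumably why the current behaviour may suit
users of other character sets better than a strict "85% must be text"
rule would.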

-- 
C. Michael Pilato <[email protected]>
CollabNet   <>   www.collab.net   <>   Distributed Development On Demand

