[ 
https://issues.apache.org/jira/browse/CODEC-263?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17043716#comment-17043716
 ] 

Tim Allison commented on CODEC-263:
-----------------------------------

This is hitting us on Apache Tika on a file in our test suite created by 
MSWord: 
https://github.com/apache/tika/blob/master/tika-parsers/src/test/resources/test-documents/testWORD2003.xml

Java's Base64.decode also complains about this.  This is the shortest entry in 
that file.  A "lenient" option would be great.  Thank you!
{noformat}
/9j/4AAQSkZJRgABAQEASwBLAAD/4wMOTVNPIFBhbGV0dGUg27Vw37164cGF4sWO48B648SF5MmU
5ciL5cyX5saJ5sqM58V/58+e6MuU6NGi6cmH6c2W6c6V6dCW6syL6tWo69Ce7M+S7NKg7NSf7dGV
7dWn7diu79Wc79ik8Nut8d+206ll2LFs2LV12bl+27yD3Lhz3Lx53rZr3r2C3sCG3sGN37l13714
379737+C38OO4MB84MGD4blq4bxy4bx54cOH4cSN4caN4caU4r594r974sF84sGB4sGH4sKE4sOB
4smT4smX48OG48SI48SM48WD48WF48WN48aH48eP48iO48mY5Lxx5MOA5MmS5Myd5b9z5cF95cKE
5ciG5ciK5ciN5ciT5cmN5cmT5cqS5cqX5cuU5cuY5cyU5cyZ5c6c5c6i5sF15sN75sSB5sSH5sWH
5saA5saE5saK5seG5seN5siO5smL5suS5syV5s6Y5tCl58iH58mJ58mO58mT58qN58uN58uU58yT
58yY582T582a586Y586c58+h59Ce59Km6MR56MSA6MeH6MiL6M+c6M+g6NCX6NGe6NSr6cZ86caC
6cqM6cqT6cuJ6cuO6cyO6cyV6c2T6c2Y6c6R6c6U6c6Y6c6c6c+V6c+Y6dCe6dKf6dKm6dOg6dSl
6dWt6siA6smB6smG6smL6sqM6syK6syR6s6O6s+R6s+e6tCW6tGb6tGf6tGj6tSn6tWm68yO682S
686W68+b69CW69CX69Ge69Kd69Kg69Kl69Oe69am69as69ev7MyF7MyK7M6Q7M+R7NGT7NGY7NGc
7NKU7NKZ7NKc7NOc7NSi7NSk7NWp7Nan7Nas7Nmu7c6L7c+M7c+U7dCW7dKd7dKh7dSm7dWd7dWi
7det7du17tKS7tKX7tKc7tSf7tag7tan7tei7tii7tin7tiv7tuv79GO79Oa79SV79Sf79am79iq
79mx79qp79258NWY8Naf8Nim8Nmn8Nmr8Nup8Nu08N2v8def8dmg8duv8d638eC78tuo8t2t8t+y
8+G49ObB/9sAQwALCAgKCAcLCgkKDQwLDREcEhEPDxEiGRoUHCkkKyooJCcnLTJANy0wPTAnJzhM
OT1DRUhJSCs2T1VORlRAR0hF/9sAQwEMDQ0RDxEhEhIhRS4nLkVFRUVFRUVFRUVFRUVFRUVFRUVF
RUVFRUVFRUVFRUVFRUVFRUVFRUVFRUVFRUVFRUVF/8AAEQgAgACAAwEiAAIRAQMRAf/EABkAAQEB
AQEBAAAAAAAAAAAAAAIDAQAEBv/EAC8QAQACAQMDAwMDAwUBAAAAAAECEQASITEDQVEiYXETgZEy
obEjQlIzYsHR8EP/xAAZAQEBAQADAAAAAAAAAAAAAAABAAIDBAX/xAAbEQEBAAMBAQEAAAAAAAAA
AAAAARExQSECUf/aAAwDAQACEQMRAD8A+8Pqt78rndQ6ifpL4wkIsf1FrnTj6b1G93xnlcdzrano
BjHnOIzIyEi1ztmfTjpiqfes6MYsWk4rkvJKpJ3Svtk97C9s0jt6Zd/OdSf/AETK7UY/r88dvfAb
zHfnh7PjKsZEg1UKcrtkyMjqJdll+2ZqMJBVdvOZPUEtuP8AvN0FUSe2DqRNFEr3XN3SjPVHSv8A
l/xhNX1dIU00udMPpFSovz7ZOMa64a9qUrM8L0xjOFlHJbWcHUZth2rOjCOqRKRW18eMBA1yqZtR
yY3gMj1Gq0mEj1BaziMD+8ur5HDKENT6/HjCzww9IhcLLvgwyjcS+jz7GWjIqq9ucEmEgt2++V0G
RjXSHRtXgzo6NP8ApD59OU6bDQl+fPnFGUQ7fhx/EkEar6QGIrVsF17eMTTdxKXw+M4rVtGnf+Mu
gLNQtc+clf05vqEe18Z6NNTj6e/t4ycDTI9IW77mZpjZJZscnDmdRjo4FsMqrZsfn2wdRl4Oe7mr
oRGaPRr6d2949sIRjMqIsr9WnjLK/TdrHkXBUzrdO6pu7eMMtGvqlUO5TpwQok3063s9JlYjrbrZ
O/tnQV6m5H8vlx3QHFf0+22xtuYWRrfTYHYrL2VvQVzv5wqa62XbzlVAgSq6CnjCkqNhlV8mUjF3
t4fbDpY8Sr3zN0W9KLDpBQN8XmixL2+M6OxWrvnUPE78GIFoXd+2cUzLZbbv4xsK31Fe2cRL54vD
HpyFj1Ipr57vs4IJwDtZxlkGcd6uTX4yXpZbPA4IyW+wsfnMm1EuN7734ykT13f7ZjEh/dStqnbN
YuAmS1G0QjdbmTGZODpGhNv4ysj+n6ZI3zXthAi9H1J2KAw4SjqZJE9Kg9qzIapdV52x2nUdMpKV
w4YsibfUaqucU1J6omnat9/fAxnqHTb4ytblyarbM0GtR/bKwZGMGUWv8vGc9NR9HbfOJTYT3CR7
4ZrpNy/nDzBOHTSNvPvmdOOuda275zIySD6o8/OHoErbkVfOnLzMSug/y3rjNK1gLXNX2rBplUkk
fePxmkpRmUdnfHoa7MWk575OJUqWzGSlRt2e2DpkhbgHnb/3tmdk2abkRfGdJ0xGNbt75xNY6qN/
wYJzaFj6j4v5MeJspX0rIcF0/wAZLp1fSa+x5ysJKSGMXcfTk1kPSmRtEvt/7nLiW1XOTdcbeMl0
2UpSWxqqymuet2uVG/asEZzeqnIHOV4lCKjS8fJh9UktbP5xMmjcfv7uYS6nCg/OKdGXLRz4zmTt
6oi+TM1+naSe2Z9RsCTw5Bt1Dat3jfNhzbdr2HCzdLvJ3zhb3Xa/4wLYxkDS2n+LmyZ1uScy/Lgm
7nqOTEHo/wBqe+ZENW8X84EUfUd/OMaa2djAqQ2v07X5zEgAvT/fNHc9MPPHfMkxA9MeeKzV0HG8
P0Fd8kxr6dRjRI7/ABjJum6jV+N1ycajGAABI7fGZ4T3+s2FbFYYp9eW3TCue+ayTqSuSWH9nDmR
alJXtXG/Bh1LOn1atFHf3twKa7uL4xy6gCernivdyTMreTy/zmqI4V6ZsG3bCS4NPGVOksDftXOH
6X6d3jKypmpY+d+M4u7os/6x6NMaHv59s4ikZU+McLIyWr227YJE7olf2y0orsXxgYuo3vfzhVE5
E4wVX8ZopPvb5Mcj0JW4PLnLH1WN+LwvhPfbn7Zkt4xu/vlN6Kr7uCXFNV85usibxDSu/nJG8YwI
y24yq1E3OebyUb1BYbuY40re6kTZrnDGzqS9P9r3xSXVLfb5zIjqWziqvHobNlZUN1ydSVWPOU6s
VjC3bVTbmaHVzt234y+lHRI/Ttu+G8LtA33+crq9F/PfCqGNQko1+rvX7Yo6WxVzHi0efPtmxiU7
8dryiJCRd705lx1G211xjdo/q4yYusCbs+cbQ6VJw0l5OQnV23POUudfqarmvfD1LZMRkRfB75j6
piqcHjjb5wIcbu3b4yjRP9XHO2TaItyb+PbOSiCjopvd98nVTicANGVpYg7F7lZyaeoew1mOFiH1
Hm9i/fNjEepMvan/AJxFs1ONn9sI6ZytQ749TesH0Dnzs5kipulS+32ykmum9/8AnbCqeU2NvjL6
UdFvpgjhZekU2rvmnDbuWZjwV2OMLpOuNbnL377YoUXQVhCKc73i6ahuu2KNYkW+nbXjJjFS4G3t
lENNVfbA+mRQp7GNEZcFDRxT++YpLiKF4jQVVjt/OSa+oSiS3lxWYrS7It/p398EtBx0nnesdlcK
Vvtxthk3ezz+M3dMwSUfT6Et7/GIl/Vj6dt998DcGFDvKijCy1SjEJ1TzHCXwneqUjShtd9zFBPq
UjX57uGDz6ZUVeznRX6m8Xb/AG5IpSJQXftf4wSnHVRdAUV7Y5SKjZTZtXzgbWq7HnD60o//2Y==
{noformat}

> Base64.decodeBase64 throw exception
> -----------------------------------
>
>                 Key: CODEC-263
>                 URL: https://issues.apache.org/jira/browse/CODEC-263
>             Project: Commons Codec
>          Issue Type: Bug
>    Affects Versions: 1.13
>         Environment: JDK 7/JDK 8 
> commons-codec 1.13
>            Reporter: xie tao
>            Priority: Critical
>         Attachments: image-jpg-01-big.base64.txt
>
>
> Codec upgrade to 1.13, code  throw exception as follows:
> {code:java}
>   @Test
>   public  void test(){
>     Base64.decodeBase64("publishMessage");
>   }
> {code}
> exception like:
> {code:java}
> java.lang.IllegalArgumentException: Last encoded character (before the 
> paddings if any) is a valid base 64 alphabet but not a possible value
>       at 
> org.apache.commons.codec.binary.Base64.validateCharacter(Base64.java:798)
>       at org.apache.commons.codec.binary.Base64.decode(Base64.java:472)
>       at 
> org.apache.commons.codec.binary.BaseNCodec.decode(BaseNCodec.java:412)
>       at 
> org.apache.commons.codec.binary.BaseNCodec.decode(BaseNCodec.java:395)
>       at org.apache.commons.codec.binary.Base64.decodeBase64(Base64.java:694)
> {code}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to