Michael:
You're right. Copy / paste for the loose; I read right over it. :(
corrected code:
package s;
import java.io.FileWriter;
import java.io.InputStream;
import java.io.IOException;
import java.io.FileOutputStream;
import java.net.URL;
import java.net.URLConnection;
import java.net.Malforme
From the JDK docs:
FileWriter is meant for writing streams of characters. For writing streams of
raw bytes, consider using a FileOutputStream.
You get characters replaced depending on your platforms character encoding. You
must ensure you're writing bytes and not characters!
Michael
On 5. Nov
First difference (on second line, first line is for reference point):
bad:
<>stream
xÚ?U LSW >?Û O)]Wä!Ô>?"CATl?4PkADy ? ?RjgÊ??< õ A
Start of second line in hex: 78 DA 3F 55 0B 4C 53 57
good:
<>stream
xÚ”U LSW >—Û O)]Wä!Ô>˜"CATl”4PkADy ‹ –Rjgʈˆ< õ A
Start of second line in hex: 78 DA 94 5
Thanks Grant.
But I have thousands of PDF URLs like this. I have tried around 12 so far.
Can all of them be corrupt?
What can I do about this?
- Yogesh
On 5 November 2010 18:53, Grant Overby wrote:
> I ran the code [2]. The pdf is corrupted by the code as MD5s are different.
> File sizes a
Also, My.pdf has EOF token "%%EOF"
--
Grant Overby
Senior Developer
FloorSoft, Inc.
Often people, especially computer engineers, focus on the machines. They
think, "By doing this, the machine will run faster. By doing this, the
machine will run more effectively. By doing this, the machine will so
I ran the code [2]. The pdf is corrupted by the code as MD5s are different.
File sizes are identical [1];
1:
11/05/2010 06:47 PM 2,371,050 msb201055.pdf
11/05/2010 06:46 PM 2,371,050 My.pdf
2:
package s;
import java.io.FileWriter;
import java.io.InputStream;
import java.io.IO
Following is the code and the URL.
URL url = new URL ("
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC2947364/pdf/msb201055.pdf?tool=pmcentrez
");
con = url.openConnection();
InputStream in = con.getInputStream();
FileWriter out = new FileWriter("C:/My.pdf
Yogesh,
Compare the file size and hash (SHA1, MD5, etc.) of the file you download
from your browser with the file that Java downloads. The end of the file
may be missing when you download it via Java. I know you said the file
size is correct, but is it the *exact* same number of bytes? If so
Hrm, That's odd.
Can you post the code you tried? An perhaps the url?
--
Grant Overby
Senior Developer
FloorSoft, Inc.
Often people, especially computer engineers, focus on the machines. They
think, "By doing this, the machine will run faster. By doing this, the
machine will run more effectively
Yes. I can download the file through the browser. It works perfectly fine.
- Yogesh
On 5 November 2010 18:25, Grant Overby wrote:
> If you download the file through a browser? Does it work then?
>
>
> --
> Grant Overby
> Senior Developer
> FloorSoft, Inc.
>
> Often people, especially computer
If you download the file through a browser? Does it work then?
--
Grant Overby
Senior Developer
FloorSoft, Inc.
Often people, especially computer engineers, focus on the machines. They
think, "By doing this, the machine will run faster. By doing this, the
machine will run more effectively. By doi
I tried with that, it writes a blank PDF. Though, the file size and the
number of pages is correct (for the new written file)
- Yogesh
On 5 November 2010 18:09, Grant Overby wrote:
> You don't need pdfBox to do this. Below is some rough code that allows you
> to download a file and save it.
You don't need pdfBox to do this. Below is some rough code that allows you
to download a file and save it.
URLConnection urlConnection = new URL("http://...";);
InputStream in = urlConnection.getInputStream();
FileWriter out = new FileWriter("my.pdf");
int next = 0;
while ( ( next = in.read
Hi,
I have PDFs which I can access through URLs. I want to download and save it
to files. How can I go about it?
Thanks
-Yogesh
14 matches
Mail list logo