Re: Save URLs to PDFs?

2010-11-05 Thread Grant Overby
Michael: You're right. Copy / paste for the loose; I read right over it. :( corrected code: package s; import java.io.FileWriter; import java.io.InputStream; import java.io.IOException; import java.io.FileOutputStream; import java.net.URL; import java.net.URLConnection; import java.net.Malforme

Re: Save URLs to PDFs?

2010-11-05 Thread Michael Baehr
From the JDK docs: FileWriter is meant for writing streams of characters. For writing streams of raw bytes, consider using a FileOutputStream. You get characters replaced depending on your platforms character encoding. You must ensure you're writing bytes and not characters! Michael On 5. Nov

Re: Save URLs to PDFs?

2010-11-05 Thread Grant Overby
First difference (on second line, first line is for reference point): bad: <>stream xÚ?U LSW >?Û O)]Wä!Ô>?"CATl?4PkADy ? ?RjgÊ??< õ A Start of second line in hex: 78 DA 3F 55 0B 4C 53 57 good: <>stream xÚ”U LSW >—Û O)]Wä!Ô>˜"CATl”4PkADy ‹ –Rjgʈˆ< õ A Start of second line in hex: 78 DA 94 5

Re: Save URLs to PDFs?

2010-11-05 Thread Yogesh
Thanks Grant. But I have thousands of PDF URLs like this. I have tried around 12 so far. Can all of them be corrupt? What can I do about this? - Yogesh On 5 November 2010 18:53, Grant Overby wrote: > I ran the code [2]. The pdf is corrupted by the code as MD5s are different. > File sizes a

Re: Save URLs to PDFs?

2010-11-05 Thread Grant Overby
Also, My.pdf has EOF token "%%EOF" -- Grant Overby Senior Developer FloorSoft, Inc. Often people, especially computer engineers, focus on the machines. They think, "By doing this, the machine will run faster. By doing this, the machine will run more effectively. By doing this, the machine will so

Re: Save URLs to PDFs?

2010-11-05 Thread Grant Overby
I ran the code [2]. The pdf is corrupted by the code as MD5s are different. File sizes are identical [1]; 1: 11/05/2010 06:47 PM 2,371,050 msb201055.pdf 11/05/2010 06:46 PM 2,371,050 My.pdf 2: package s; import java.io.FileWriter; import java.io.InputStream; import java.io.IO

Re: Save URLs to PDFs?

2010-11-05 Thread Yogesh
Following is the code and the URL. URL url = new URL (" http://www.ncbi.nlm.nih.gov/pmc/articles/PMC2947364/pdf/msb201055.pdf?tool=pmcentrez "); con = url.openConnection(); InputStream in = con.getInputStream(); FileWriter out = new FileWriter("C:/My.pdf

Re: Save URLs to PDFs?

2010-11-05 Thread Adam
Yogesh, Compare the file size and hash (SHA1, MD5, etc.) of the file you download from your browser with the file that Java downloads. The end of the file may be missing when you download it via Java. I know you said the file size is correct, but is it the *exact* same number of bytes? If so

Re: Save URLs to PDFs?

2010-11-05 Thread Grant Overby
Hrm, That's odd. Can you post the code you tried? An perhaps the url? -- Grant Overby Senior Developer FloorSoft, Inc. Often people, especially computer engineers, focus on the machines. They think, "By doing this, the machine will run faster. By doing this, the machine will run more effectively

Re: Save URLs to PDFs?

2010-11-05 Thread Yogesh
Yes. I can download the file through the browser. It works perfectly fine. - Yogesh On 5 November 2010 18:25, Grant Overby wrote: > If you download the file through a browser? Does it work then? > > > -- > Grant Overby > Senior Developer > FloorSoft, Inc. > > Often people, especially computer

Re: Save URLs to PDFs?

2010-11-05 Thread Grant Overby
If you download the file through a browser? Does it work then? -- Grant Overby Senior Developer FloorSoft, Inc. Often people, especially computer engineers, focus on the machines. They think, "By doing this, the machine will run faster. By doing this, the machine will run more effectively. By doi

Re: Save URLs to PDFs?

2010-11-05 Thread Yogesh
I tried with that, it writes a blank PDF. Though, the file size and the number of pages is correct (for the new written file) - Yogesh On 5 November 2010 18:09, Grant Overby wrote: > You don't need pdfBox to do this. Below is some rough code that allows you > to download a file and save it.

Re: Save URLs to PDFs?

2010-11-05 Thread Grant Overby
You don't need pdfBox to do this. Below is some rough code that allows you to download a file and save it. URLConnection urlConnection = new URL("http://...";); InputStream in = urlConnection.getInputStream(); FileWriter out = new FileWriter("my.pdf"); int next = 0; while ( ( next = in.read

Save URLs to PDFs?

2010-11-05 Thread Yogesh
Hi, I have PDFs which I can access through URLs. I want to download and save it to files. How can I go about it? Thanks -Yogesh