NNTPClient.retrieveArticleBody returns MalformedServerReplyException

Thufir Sat, 24 Mar 2012 03:45:29 -0700

What's the correct way to get an article body?

I'm using java.util.logging.Logger to catchorg.apache.commons.net.MalformedServerReplyException to a log file:


    15  <record>
    16    <date>2012-03-24T03:09:35</date>
    17    <millis>1332583775299</millis>
    18    <sequence>1</sequence>
    19    <logger>gwene.LogUtils</logger>
    20    <level>INFO</level>
    21    <class>gwene.LogUtils</class>
    22    <method>logArticles</method>
    23    <thread>1</thread>
    24    <message>Could not parse response code.

25 Server Reply: <p>Alex &#8220;Hurricane&#8221;Higgins, transformer of snooker, died on July 24th, aged

...text snipped...

mercilessly, one by one. ...</p><divclass="feedflare"></message>

    26  </record>

The server reply is *exactly* what I'm missing, the content of thearticle. code and full output:


https://gist.github.com/2180843

I'm guessing that the HTML is throwing things off? What doesNNTPClient.retrieveArticleBody expect? After all, anything can be in anNNTP post.

Now, what I'm really after, I suppose, is the server reply because thathas the body of the NNTP article. However, surely, that's not the wayto use org.apache.commons.net.nntp.NNTPClient, only I can't find thecorrect way. Hence this kludge to grab the MalformedServerReply insteadof parsing it.

I suppose it's possible to log everything, and then parse the log file,but that seems like a very complex way of doing a simple thing.

The API documentation for NNTPClient assumes a knowledge of NNTP which,unfortunately, I don't have. I've looked through the example code anddon't see any samples where article bodies are parsed. The closest Isee is NNTPClient.retrieveArticleBody:


https://commons.apache.org/net/api-3.1/org/apache/commons/net/nntp/NNTPClient.html#retrieveArticleBody%28java.lang.String%29

however, that's just malformed content. Presumably, since Pan canconnect with gmane fine, that's not the problem. Also, by looking inthe Pan newsreader, NNTPClient.retrieveArticleBody results match withwhat I'm after -- namely, the body of the article.

What is the correct way to grab the article body? I've looked throughthe API quite thoroughly.

Surely there must be an example for parsing the article body, not justthe header. Or, at least, using BufferedReader to get the article bodyand assign it to a String. If so, I don't see a better method availablethrough the API.




thanks,

Thufir

---------------------------------------------------------------------
To unsubscribe, e-mail: user-unsubscr...@commons.apache.org
For additional commands, e-mail: user-h...@commons.apache.org

NNTPClient.retrieveArticleBody returns MalformedServerReplyException

Reply via email to