On 19Sep2015 21:46, chandan kumar <chandankumar.093...@gmail.com> wrote:
I am looking for a python module which i can use to parse mailing thread
and extract some information from it.
Any pointer regarding that would be helpful.
You should describe where the email messages are stored. I'll presume you have
obtained the messages.
Construct a Message object from each message text. See the email.message
module:
https://docs.python.org/3/library/email.message.html#module-email.message
Every message has a Message-ID: header which uniquely identifies it. Replies to
that message have that id in the In_Reply-To: header. (If you're parsing usenet
newsgroup messages, you want the References: header - personally I consult
both.)
The complete specification of an email message is here:
http://tools.ietf.org/html/rfc2822
and the email.message module (and the other email.* modules) makes most of it
easily available. If you need to parse email addresses import the
"getaddresses" function from the "email.utils" module.
Constuct a graph connecting messages with the replies. You're done!
Cheers,
Cameron Simpson <c...@zip.com.au>
_______________________________________________
Tutor maillist - Tutor@python.org
To unsubscribe or change subscription options:
https://mail.python.org/mailman/listinfo/tutor