You can use this Groovy code; the header named zzBody will then contain the
plain text.
But watch out: if the mail is multipart, you have to do more.


Hope this helps!



       <setHeader headerName="zzBody">
       <groovy>
           def MimeUtility = new javax.mail.internet.MimeUtility();
           def Jsoup       = new org.jsoup.Jsoup();
           if (request.getOriginalMessage().getContentType().toLowerCase().contains('html')) {
               zresult = MimeUtility.decodeText(request.getOriginalMessage().getContent());
               result  = Jsoup.parse(zresult).wholeText(); // preserve CRLF
           } else {
               result = MimeUtility.decodeText(request.getOriginalMessage().getContent());
           }
       </groovy>
       </setHeader>
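
For the multipart case mentioned above, a minimal sketch could look like the following. It is not part of the original route: `extractText` is a hypothetical helper name, and the sketch assumes that text parts return a String from `getContent()` and that jsoup is on the classpath. It walks the parts recursively, prefers `text/plain`, falls back to HTML converted with jsoup, and skips attachments.

```groovy
import javax.mail.Multipart
import javax.mail.Part
import org.jsoup.Jsoup

// Hypothetical helper (an assumption, not from the original post):
// returns the readable text of a mail part, or null if there is none.
String extractText(Part part) {
    // Skip parts explicitly marked as attachments.
    if (Part.ATTACHMENT.equalsIgnoreCase(part.disposition)) {
        return null
    }
    if (part.isMimeType('text/plain')) {
        return part.content as String
    }
    if (part.isMimeType('text/html')) {
        // Same jsoup call as in the snippet above; wholeText() keeps line breaks.
        return Jsoup.parse(part.content as String).wholeText()
    }
    if (part.isMimeType('multipart/*')) {
        Multipart mp = (Multipart) part.content
        String fallback = null
        for (int i = 0; i < mp.count; i++) {
            def child = mp.getBodyPart(i)
            String text = extractText(child)
            if (text == null) {
                continue
            }
            // A direct text/plain child wins immediately.
            if (child.isMimeType('text/plain')) {
                return text
            }
            // Otherwise remember the first usable text (for example HTML).
            fallback = fallback ?: text
        }
        return fallback
    }
    return null // images, other binary content, etc.
}
```

Inside the groovy element, the last statement would then be something like `result = extractText(request.getOriginalMessage())`. Because the loop contains a `<` character, the script would probably need to be wrapped in a CDATA section when embedded in the XML route.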


Daniel Langevin


>>> 
From:    <christian.ja...@innogy.com>
To:      <users@camel.apache.org>
Date:    2019-10-07 07:49
Subject: Re: parse HTML Email

I don't know if it helps you with HTML mails, but for parsing HTML sites I used 
a library called JSoup.

HTH
Christian

-----Original Message-----
From: thomas.thi...@telekom.de [mailto:thomas.thi...@telekom.de]
Sent: Monday, 7 October 2019 13:01
To: users@camel.apache.org 
Subject: parse HTML Email

Hi,

Is there a built-in way to parse HTML and RTF emails? I simply need the plain
text.
The only thing I found was to unmarshal the message into its different parts.

Thomas
