Hi experts, I need to strip HTML out of the code BUT with some filter, i.e. keep certain tags in. I got as far as this below where it really strips HTML and keeps the tags I want (<a> and <i>) but it does not keep ending tags (</a> and </i>). Please see clear example below:
String in = "<i>Italic here</i> and <a href='http://www.x.com'>link here</a> and the <span>rest of removed HTML </span>"; regex = "<[^((\\s*I)|(\\s*A))].*?>"; System.out.println(regex); System.out.println( java.util.regex.Pattern.compile(regex, java.util.regex.Pattern.CASE_INSENSITIVE).matcher(in).replaceAll ("") ); Result: <i>Italic here and <a href='http://www.x.com'>link here and the rest of removed HTML Desired result: <i>Italic here</i> and <a href='http://www.x.com'>link here</a> and the rest of removed HTML Please recommend the solution if possible. Thank you. --~--~---------~--~----~------------~-------~--~----~ You received this message because you are subscribed to the Google Groups "Java EE (J2EE) Programming with Passion!" group. To post to this group, send email to java-ee-j2ee-programming-with-passion@googlegroups.com To unsubscribe from this group, send email to java-ee-j2ee-programming-with-passion-unsubscr...@googlegroups.com For more options, visit this group at http://groups.google.com/group/java-ee-j2ee-programming-with-passion?hl=en?hl=en -~----------~----~----~----~------~----~------~--~---