Thank you very much. It is almost perfect except that all tags starting with I and A letters are being kept. The rest works just fine. Is there any improvement on that? Thanks.
On Jul 23, 7:07 pm, Aaron Carnes <kymn...@gmail.com> wrote: > String regex = "<\\s*?(/)??[b-hj-z].*?>"; > > On Thu, Jul 23, 2009 at 11:04 AM, vunet <vunet...@gmail.com> wrote: > > > Hi experts, > > > I need to strip HTML out of the code BUT with some filter, i.e. keep > > certain tags in. I got as far as this below where it really strips > > HTML and keeps the tags I want (<a> and <i>) but it does not keep > > ending tags (</a> and </i>). Please see clear example below: > > > String in = "<i>Italic here</i> and <a href='http://www.x.com'>link > > here</a> and the <span>rest of removed HTML </span>"; > > regex = "<[^((\\s*I)|(\\s*A))].*?>"; > > System.out.println(regex); > > System.out.println( java.util.regex.Pattern.compile(regex, > > java.util.regex.Pattern.CASE_INSENSITIVE).matcher(in).replaceAll > > ("") ); > > > Result: <i>Italic here and <a href='http://www.x.com'>link here and > > the rest of removed HTML > > Desired result: <i>Italic here</i> and <a href='http://www.x.com'>link > > here</a> and the rest of removed HTML > > > Please recommend the solution if possible. > > Thank you. --~--~---------~--~----~------------~-------~--~----~ You received this message because you are subscribed to the Google Groups "Java EE (J2EE) Programming with Passion!" group. To post to this group, send email to java-ee-j2ee-programming-with-passion@googlegroups.com To unsubscribe from this group, send email to java-ee-j2ee-programming-with-passion-unsubscr...@googlegroups.com For more options, visit this group at http://groups.google.com/group/java-ee-j2ee-programming-with-passion?hl=en?hl=en -~----------~----~----~----~------~----~------~--~---