Author: tpalsulich
Date: Wed Dec 24 07:28:05 2014
New Revision: 1647741

URL: http://svn.apache.org/r1647741
Log:
TIKA-1500. Strip tags from content in FeedParser.

Modified:
    tika/trunk/CHANGES.txt
    
tika/trunk/tika-parsers/src/main/java/org/apache/tika/parser/feed/FeedParser.java

Modified: tika/trunk/CHANGES.txt
URL: 
http://svn.apache.org/viewvc/tika/trunk/CHANGES.txt?rev=1647741&r1=1647740&r2=1647741&view=diff
==============================================================================
--- tika/trunk/CHANGES.txt (original)
+++ tika/trunk/CHANGES.txt Wed Dec 24 07:28:05 2014
@@ -1,5 +1,8 @@
 Release 1.7 - Current Development
 
+  * HTML tags are properly stripped from content by FeedParser
+    (TIKA-1500).
+
   * Tika Server support for selecting a single metadata key;
     wrapped MetadataEP into MetadataResource (TIKA-1499).
 

Modified: 
tika/trunk/tika-parsers/src/main/java/org/apache/tika/parser/feed/FeedParser.java
URL: 
http://svn.apache.org/viewvc/tika/trunk/tika-parsers/src/main/java/org/apache/tika/parser/feed/FeedParser.java?rev=1647741&r1=1647740&r2=1647741&view=diff
==============================================================================
--- 
tika/trunk/tika-parsers/src/main/java/org/apache/tika/parser/feed/FeedParser.java
 (original)
+++ 
tika/trunk/tika-parsers/src/main/java/org/apache/tika/parser/feed/FeedParser.java
 Wed Dec 24 07:28:05 2014
@@ -96,7 +96,7 @@ public class FeedParser extends Abstract
                     SyndContent content = entry.getDescription();
                     if (content != null) {
                         xhtml.newline();
-                        xhtml.characters(content.getValue());
+                        xhtml.characters(stripTags(content));
                     }
                     xhtml.endElement("li");
                 }


Reply via email to