Author: tpalsulich
Date: Wed Dec 24 07:28:05 2014
New Revision: 1647741
URL: http://svn.apache.org/r1647741
Log:
TIKA-1500. Strip tags from content in FeedParser.
Modified:
    tika/trunk/CHANGES.txt
    tika/trunk/tika-parsers/src/main/java/org/apache/tika/parser/feed/FeedParser.java
Modified: tika/trunk/CHANGES.txt
URL: http://svn.apache.org/viewvc/tika/trunk/CHANGES.txt?rev=1647741&r1=1647740&r2=1647741&view=diff
==============================================================================
--- tika/trunk/CHANGES.txt (original)
+++ tika/trunk/CHANGES.txt Wed Dec 24 07:28:05 2014
@@ -1,5 +1,8 @@
Release 1.7 - Current Development
+ * HTML tags are properly stripped from content by FeedParser
+ (TIKA-1500).
+
* Tika Server support for selecting a single metadata key;
wrapped MetadataEP into MetadataResource (TIKA-1499).
Modified: tika/trunk/tika-parsers/src/main/java/org/apache/tika/parser/feed/FeedParser.java
URL: http://svn.apache.org/viewvc/tika/trunk/tika-parsers/src/main/java/org/apache/tika/parser/feed/FeedParser.java?rev=1647741&r1=1647740&r2=1647741&view=diff
==============================================================================
--- tika/trunk/tika-parsers/src/main/java/org/apache/tika/parser/feed/FeedParser.java (original)
+++ tika/trunk/tika-parsers/src/main/java/org/apache/tika/parser/feed/FeedParser.java Wed Dec 24 07:28:05 2014
@@ -96,7 +96,7 @@ public class FeedParser extends Abstract
                 SyndContent content = entry.getDescription();
                 if (content != null) {
                     xhtml.newline();
-                    xhtml.characters(content.getValue());
+                    xhtml.characters(stripTags(content));
                 }
                 xhtml.endElement("li");
             }
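
Note: the hunk above only shows the call site, not the body of stripTags. As an illustration of what the change does to an entry description, here is a minimal sketch of a regex-based tag stripper. The class name TagStripSketch and the String-based signature are assumptions made to keep the example self-contained (the real method takes a SyndContent); this is not the FeedParser implementation.

```java
import java.util.regex.Pattern;

// Hypothetical helper for illustration only; not the actual FeedParser code.
final class TagStripSketch {
    // Matches anything shaped like a tag: '<', a run of non-'>' characters, '>'.
    private static final Pattern TAG = Pattern.compile("<[^>]*>");

    // Returns the input with tag-like spans removed and outer whitespace trimmed.
    // A null input yields an empty string.
    static String stripTags(String value) {
        if (value == null) {
            return "";
        }
        return TAG.matcher(value).replaceAll("").trim();
    }

    public static void main(String[] args) {
        // prints "Hello feed reader"
        System.out.println(stripTags("<p>Hello <b>feed</b> reader</p>"));
    }
}
```

A regex of this kind does not decode character entities such as &amp;amp; and can misfire on a '>' inside an attribute value, so it should be read as an approximation of tag stripping rather than a full HTML parse.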