Add post-extraction inclusions and exclusions into the web connector --------------------------------------------------------------------
Key: CONNECTORS-214 URL: https://issues.apache.org/jira/browse/CONNECTORS-214 Project: ManifoldCF Issue Type: Improvement Components: Web connector Affects Versions: ManifoldCF 0.2, ManifoldCF 0.1 Reporter: Erlend GarĂ¥sen Assignee: Erlend GarĂ¥sen Fix For: ManifoldCF next If html files are excluded for a job, links in these files will not be followed. If we add inclusion and exclusion filters based on post-extraction, it will be possible to fetch only certain types of documents, such as PDFs. -- This message is automatically generated by JIRA. For more information on JIRA, see: http://www.atlassian.com/software/jira