Extract rel attr with LinkContentHandler
----------------------------------------

                 Key: TIKA-825
                 URL: https://issues.apache.org/jira/browse/TIKA-825
             Project: Tika
          Issue Type: Improvement
          Components: parser
            Reporter: Markus Jelsma
            Priority: Minor


For Nutch we need to extract URL's but need the rel attribute to check for the 
nofollow value. I've patched the code to return this information in the Link 
object. It's been tested and i can read the rel in Nutch now.

Thoughts?

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

Reply via email to