[ 
https://issues.apache.org/jira/browse/SOLR-10934?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16226061#comment-16226061
 ] 

Hoss Man commented on SOLR-10934:
---------------------------------

I think we'll have to abandon any plans of running a robust link checker on the 
PDF -- or any "single page" output format using the master 
{{SolrRefGuide-all.adoc}} file.  The metadata we need to be confident that 
links/anchors are going where we expect just doesn't exist at that point.

What we _might_ want to consider, is refactoring our build.xml, so that the 
same {{<asciidoctor:convert/>}} task options use to generate the PDF, could 
also be used to generate a bare bones version of the html-site -- ie: not using 
jekyll, just using raw asciidoctor with the "html5" output option.  Then we 
could (in theory) run the same HTML link checking code we currently have 
against that output dir -- just for the purpose of checking the links, not with 
any plan to ever publish it.

That way, we could have something like {{ant precommit}} fail if there are any 
broken links in the ref-guide -- using entirely ivy managed dependencies, w/o 
requiring a local install of ruby/jekyll.

> create a link+anchor checker for the ref-guide PDF using PDFBox
> ---------------------------------------------------------------
>
>                 Key: SOLR-10934
>                 URL: https://issues.apache.org/jira/browse/SOLR-10934
>             Project: Solr
>          Issue Type: Sub-task
>      Security Level: Public(Default Security Level. Issues are Public) 
>          Components: documentation
>            Reporter: Hoss Man
>         Attachments: SOLR-10934.patch
>
>
> We currently have CheckLinksAndAnchors.java which is automatically run 
> against the ref-guide HTML as part of the build to use JSoup to find bad 
> links/anchors that asciidoctor doesn't complain about -- but not everyone 
> does/can build the HTML version of the ref-guide sincif we can e it requires 
> manually installing jekyll.
> The PDF build only requires things installed by ivy (via JRuby) and we 
> already have some PDFBox based code in ReducePDFSize.java that operates on 
> this PDF every time it's run -- so if we can find a way to do similar checks 
> using the PDFBox API we could catch these broken links faster.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

Reply via email to