For me, it is valuable to be able to visit a PDF online and annotate and
tag it as if it is an HTML page.


On Fri, Mar 27, 2020 at 1:07 PM Benjamin Young <[email protected]>
wrote:

> Hi all,
>
> We've got the (very promising!) beginnings of a core framework for
> building HTML annotation tools that work well with the W3C Web Annotation
> format. What I'm beginning to wonder is if we might also need/want to
> consider bringing in (or building or integrating with) other code that may
> get more content into HTML for annotating.
>
> There are of course projects like PDF.js (Mozilla) and EPUB.js
> (FuturePress.org) which would be great to see integrations and/or demos on
> top of (at some point).
>
> There's another project I'm connected with (dedocx) that does Microsoft
> Word .docx file (OOXML) conversion into "ugly" HTML and then has a plugins
> system for post-processing that into more meaningful content (i.e. adding
> Linked Data, etc.): https://github.com/science-periodicals/dedocx
>
> Would this sort of project be something we might be able to find a home
> for here under the Apache Annotator banner? Or, barring that, maybe
> consider sending through the incubator process on its own--if others are
> interested?
>
> Just musings at this point, but thought I'd reach out to see if y'all had
> thoughts. :)
>
> Cheers!
> Benjamin
>
>
> --
>
> http://bigbluehat.com/
>
> http://linkedin.com/in/benjaminyoung
>

Reply via email to