[
https://issues.apache.org/jira/browse/TIKA-3464?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17376633#comment-17376633
]
Tim Allison commented on TIKA-3464:
-----------------------------------
Can you use the xhtml output? That marks page breaks with <div/> elements.
> Is it possible to extract individual pdf pages using Tika Server?
> -----------------------------------------------------------------
>
> Key: TIKA-3464
> URL: https://issues.apache.org/jira/browse/TIKA-3464
> Project: Tika
> Issue Type: Wish
> Components: server
> Reporter: Sal
> Priority: Trivial
>
> I was wondering if there exists the ability to call the Tika Server and get
> back the text as individual pages, instead of all grouped together in a
> single text file. I just need to know where each pdf page begins and ends in
> the output and it's not obvious from the text output.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)