> On Jun 22, 2024, at 6:59 AM, Marcus <[email protected]> wrote:
>
> On 22.06.24 at 14:53, Bidouille wrote:
>>> I remember from the old days that the QA team at Sun/Oracle really had a
>>> lot of documents for general and special testing.
>>>
>>> These were not part of the code repository and were loaded from their
>>> own test software. Maybe this is the link to the storage outside of
>>> the project.
>> If you have a URL, you can try to retrieve them with the Wayback Machine
>> https://wayback-api.archive.org/
>
> They were stored on an internal server.
The Apache Tika and Apache POI projects make use of Common Crawl to create a
large corpus for regression tests.
https://commoncrawl.org
Perhaps we can start to do the same? We can ask for help from Tika at
[email protected] or POI at [email protected]
Best,
Dave
>
> Marcus
>
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: [email protected]
> For additional commands, e-mail: [email protected]
>