This is an automated email from the ASF dual-hosted git repository.
rzo1 pushed a change to branch genai-text-extractor
in repository https://gitbox.apache.org/repos/asf/stormcrawler.git
      at 47da87d3 Adds a first draft for a llm based text extractor. Still needs some more love in terms of testing
This branch includes the following new commits:
     new 4734256c Introduces TextExtractor as interface. Renames previous TextExtractor to JSoupTextExtractor. Allow configuration of text extractor used via config properties.
     new 47da87d3 Adds a first draft for a llm based text extractor. Still needs some more love in terms of testing
The 2 revisions listed above as "new" are entirely new to this
repository and will be described in separate emails. The revisions
listed as "add" were already present in the repository and have only
been added to this reference.