[jira] [Commented] (NUTCH-3057) Arbitrary indexer "leaks" previous value into a field processed after an exception

2024-05-18 Thread Joe Gilvary (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-3057?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17847521#comment-17847521 ] Joe Gilvary commented on NUTCH-3057: Happy Saturday, [~lewi...@apache.org], I worked on the plugin

[jira] [Commented] (NUTCH-3057) Arbitrary indexer "leaks" previous value into a field processed after an exception

2024-05-17 Thread Joe Gilvary (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-3057?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17847453#comment-17847453 ] Joe Gilvary commented on NUTCH-3057: The arbitrary indexer plug-in can add multiple new fields

[jira] [Created] (NUTCH-3057) Arbitrary indexer "leaks" previous value into a field processed after an exception

2024-05-17 Thread Joe Gilvary (Jira)
Joe Gilvary created NUTCH-3057: Summary: Arbitrary indexer "leaks" previous value into a field processed after an exception Key: NUTCH-3057 URL: https://issues.apache.org/jira/browse/NUTCH-3057

[jira] [Commented] (NUTCH-585) [PARSE-HTML plugin] Block certain parts of HTML code from being indexed

2024-04-30 Thread Joe Gilvary (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-585?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17842526#comment-17842526 ] Joe Gilvary commented on NUTCH-585: [~dbeckstrom] I'm not sure which patch you were asking about. I

Re: [VOTE] Apache Nutch 1.20 Release

2024-04-20 Thread Joe Gilvary
the build being successful and unit tests running, additionally ran a small crawl in local hadoop and verified the same. _Regards_ Shashanka Balakuntala Srinivasa On Fri, 19 Apr 2024 at 4:47 AM, Joe Gilvary wrote: Just catching up now after the eclipse road trip, kicking the tires

Re: [VOTE] Apache Nutch 1.20 Release

2024-04-18 Thread Joe Gilvary
Just catching up now after the eclipse road trip. Kicking the tires on the bin RC, it's looking good to me. Maybe it's my imagination, or maybe it's just old PDFs that I pointed Nutch at, but it seems that Tika complains more often. I'll try to get a more thorough run-through this weekend, but

[jira] [Resolved] (NUTCH-3032) Indexing plugin as an adapter for end user's own POJO instances

2024-03-31 Thread Joe Gilvary (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-3032?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joe Gilvary resolved NUTCH-3032. Resolution: Fixed I believe this meets all the goals in the discussions now. > Indexing plu

[jira] [Work started] (NUTCH-3032) Indexing plugin as an adapter for end user's own POJO instances

2024-03-30 Thread Joe Gilvary (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-3032?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Work on NUTCH-3032 started by Joe Gilvary. > Indexing plugin as an adapter for end user's own POJO instan

[jira] [Updated] (NUTCH-3032) Indexing plugin as an adapter for end user's own POJO instances

2024-03-14 Thread Joe Gilvary (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-3032?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joe Gilvary updated NUTCH-3032: Patch Info: Patch Available > Indexing plugin as an adapter for end user's own POJO instan

[jira] [Comment Edited] (NUTCH-3032) Indexing plugin as an adapter for end user's own POJO instances

2024-03-14 Thread Joe Gilvary (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-3032?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17825873#comment-17825873 ] Joe Gilvary edited comment on NUTCH-3032 at 3/14/24 11:05 PM: -Done

[jira] [Updated] (NUTCH-3032) Indexing plugin as an adapter for end user's own POJO instances

2024-03-14 Thread Joe Gilvary (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-3032?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joe Gilvary updated NUTCH-3032: Attachment: NUTCH-3032.patch > Indexing plugin as an adapter for end user's own POJO instan

[jira] [Updated] (NUTCH-3032) Indexing plugin as an adapter for end user's own POJO instances

2024-03-14 Thread Joe Gilvary (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-3032?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joe Gilvary updated NUTCH-3032: Attachment: (was: NUTCH-3032.patch) > Indexing plugin as an adapter for end user's own P

[jira] [Commented] (NUTCH-3032) Indexing plugin as an adapter for end user's own POJO instances

2024-03-12 Thread Joe Gilvary (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-3032?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17825873#comment-17825873 ] Joe Gilvary commented on NUTCH-3032: Done! > Indexing plugin as an adapter for end user's own P

[jira] [Updated] (NUTCH-3032) Indexing plugin as an adapter for end user's own POJO instances

2024-03-12 Thread Joe Gilvary (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-3032?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joe Gilvary updated NUTCH-3032: Attachment: NUTCH-3032.patch > Indexing plugin as an adapter for end user's own POJO instan

[jira] [Comment Edited] (NUTCH-3032) Indexing plugin as an adapter for end user's own POJO instances

2024-03-12 Thread Joe Gilvary (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-3032?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17825855#comment-17825855 ] Joe Gilvary edited comment on NUTCH-3032 at 3/12/24 11:06 PM: I have

[jira] [Commented] (NUTCH-3032) Indexing plugin as an adapter for end user's own POJO instances

2024-03-12 Thread Joe Gilvary (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-3032?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17825855#comment-17825855 ] Joe Gilvary commented on NUTCH-3032: I have the code cleaned up and a few Junit tests. When I follow

[jira] [Created] (NUTCH-3032) Indexing plugin as an adapter for end user's own POJO instances

2024-03-10 Thread Joe Gilvary (Jira)
Joe Gilvary created NUTCH-3032: Summary: Indexing plugin as an adapter for end user's own POJO instances Key: NUTCH-3032 URL: https://issues.apache.org/jira/browse/NUTCH-3032 Project: Nutch

Indexing arbitrary fields

2024-03-07 Thread Joe Gilvary
Good day, all, I wanted to index some values that I had to derive from fields in the NutchDocument. I started on an indexing plugin. Then I realized I would need more than one, or I could generalize the plugin. I went with the generalizing and wrote a plugin that will use custom POJOs to

[jira] [Commented] (NUTCH-2900) Integrate Nutch with Kerberized Solr Cloud

2022-03-31 Thread Joe Gilvary (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-2900?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17515351#comment-17515351 ] Joe Gilvary commented on NUTCH-2900: I see a similar error at injection when Solr uses the MultiAuth

[jira] [Updated] (NUTCH-2823) IllegalStateException in IndexWriters.describe() when validating url param for SolrIndexer

2020-08-13 Thread Joe Gilvary (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-2823?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joe Gilvary updated NUTCH-2823: Description: The string validation for the IndexWriters.describe() fails when the value in index

[jira] [Created] (NUTCH-2823) IllegalStateException in IndexWriters.describe() when validating url param for SolrIndexer

2020-08-13 Thread Joe Gilvary (Jira)
Joe Gilvary created NUTCH-2823: Summary: IllegalStateException in IndexWriters.describe() when validating url param for SolrIndexer Key: NUTCH-2823 URL: https://issues.apache.org/jira/browse/NUTCH-2823