Re: Removing words like "FONT-SIZE: 9pt; FONT-FAMILY: arial" from content

2018-12-31 Thread Hasan Diwan
Perhaps https://royvanrijn.com/blog/2016/03/java-mail-message-as-download/ may be helpful? Though I see the date on it and am now unsure. -- H On Mon, 31 Dec 2018 at 17:51, Zheng Lin Edwin Yeo wrote: > Hi Alex, > > I have tried with a file that is HTML formatted, with those tags like > , , ,

Re: Removing words like "FONT-SIZE: 9pt; FONT-FAMILY: arial" from content

2018-12-31 Thread Zheng Lin Edwin Yeo
Hi Alex, I have tried with a file that is HTML formatted, with those tags like , , , etc, and those gets removed during indexing. For tags like "*FONT-SIZE: 9pt; FONT-FAMILY: arial*", I found that in the EML file, there are two different content type, text/html and text/plain. Could it be due to

Re: How to access the Solr Admin GUI

2018-12-31 Thread Jörn Franke
Reverse proxy? > Am 31.12.2018 um 22:48 schrieb s...@cid.is: > > Hi all, > > is there a way, better a solution, to access the Solr Admin GUI from outside > the server (via public web) while the Solr port 8983 is closed by a firewall > and only available inside the server via localhost? > >

How to access the Solr Admin GUI

2018-12-31 Thread solr
Hi all, is there a way, better a solution, to access the Solr Admin GUI from outside the server (via public web) while the Solr port 8983 is closed by a firewall and only available inside the server via localhost? Thanks in advance Walter Claassen Alexandraweg 32 D 64287 Darmstadt Fon

Re: SOLR Cloud - Full index replication

2018-12-31 Thread Erick Erickson
No particular downside to increasing numRecordsToKeep except there is some additional disk space required and a bit of bookkeeping. Frankly, though, that's a bandaid at best. There should be more information in the logs about _why_ they go into recovery. If you're indexing while nodes are down

Re: Facing issue while transforming and indexing custom JSON

2018-12-31 Thread Alexandre Rafalovitch
Do you have _src_ field declared in schema? It is just a non-indexed string: https://github.com/apache/lucene-solr/blob/releases/lucene-solr/7.5.0/solr/server/solr/configsets/sample_techproducts_configs/conf/managed-schema#L169 Regards, Alex. On Mon, 31 Dec 2018 at 04:35, Shubhangi Shinde

Re: Removing words like "FONT-SIZE: 9pt; FONT-FAMILY: arial" from content

2018-12-31 Thread Alexandre Rafalovitch
EML is for emails, so there are probably some HTML-formatted emails that you are getting. Probably with the alternative text-part. Outlook would render HTML and/or use text part. I think you can just open EML in an editor to check it out. As to URP, are you absolutely sure it is being used? It is

Resolved Authorization Issue

2018-12-31 Thread Terry Steichen
Thanks, Dominique.  This appears to explain a LOT of past confusion. Terry On 12/31/18 5:26 AM, Dominique Bejean wrote: > So in Solr standalone mode, only authentication is fully functional, not > authorization !

Re: How to archive Solr cloud and delete the data?

2018-12-31 Thread Steve Rowe
Hi Rekha, Do you know about Solr's Time Routed Aliases feature[1]? Steve [1] https://lucene.apache.org/solr/guide/7_6/time-routed-aliases.html > On Dec 30, 2018, at 11:48 AM, Rekha > wrote: > > Hi Solr Team, I want to archive my Solr data. Is there any api available to > archive data? I

Re: RuleBasedAuthorizationPlugin configuration

2018-12-31 Thread Dominique Bejean
Hi, In debugging mode, I discovered that only in SolrCloud mode the collection name is extract from the request path in the init() method of HttpSolrCall.java if (cores.isZooKeeperAware()) { // init collectionList (usually one name but not when there are aliases) ... } So in

Re: Facing issue while transforming and indexing custom JSON

2018-12-31 Thread Shubhangi Shinde
Hi Team, I am waiting for your feedback. Any update on this issue? On Fri, Dec 28, 2018 at 12:11 PM Shubhangi Shinde < shubhangi.shi...@iauro.com> wrote: > Hi Team, > > I am using Apache Solr. I went through the below link 'https://lucene. > apache.org/solr >