Since the data already exists and the need is only to remove unwanted fields, a custom update processor looks less useful here. Erick's recommendation to re-index into a new collection, if at all possible, looks simple and safe.
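A rough sketch of what that re-index could look like, in case it helps. It reads stored documents from the old collection with cursorMark paging, drops the unwanted fields, and posts the rest to the new collection. Everything named here is an assumption on my side: the Solr URL, the field names in UNWANTED, the collection names, and "id" as the unique key. It also assumes every field you want to keep is stored (otherwise it cannot be read back from Solr and you would need the original source data), and that the Solr version supports cursorMark (4.7 or later).

```python
import json
import urllib.parse
import urllib.request

# Hypothetical values, adjust to the real setup.
SOLR = "http://localhost:8983/solr"
UNWANTED = {"old_field_a", "old_field_b"}


def strip_fields(doc, unwanted=UNWANTED):
    """Return a copy of doc without the unwanted fields.

    _version_ is dropped as well: a positive _version_ asks Solr to
    check that the document already exists with that exact version,
    which would fail in a fresh collection.
    """
    return {k: v for k, v in doc.items()
            if k not in unwanted and k != "_version_"}


def fetch_pages(collection, rows=1000, unique_key="id"):
    """Yield lists of stored documents, paging with cursorMark."""
    cursor = "*"
    while True:
        params = urllib.parse.urlencode({
            "q": "*:*", "wt": "json", "rows": rows,
            "sort": unique_key + " asc", "cursorMark": cursor,
        })
        url = f"{SOLR}/{collection}/select?{params}"
        with urllib.request.urlopen(url) as resp:
            body = json.loads(resp.read().decode("utf-8"))
        docs = body["response"]["docs"]
        if docs:
            yield docs
        next_cursor = body["nextCursorMark"]
        if next_cursor == cursor:  # same cursor back means no more results
            break
        cursor = next_cursor


def post_docs(collection, docs):
    """Send a batch of documents to the JSON update handler."""
    req = urllib.request.Request(
        f"{SOLR}/{collection}/update",
        data=json.dumps(docs).encode("utf-8"),
        headers={"Content-Type": "application/json"},
    )
    urllib.request.urlopen(req).close()


def reindex(source, target):
    """Copy every document from source to target without UNWANTED fields."""
    for page in fetch_pages(source):
        post_docs(target, [strip_fields(d) for d in page])
    # Single commit at the end instead of one per batch.
    urllib.request.urlopen(f"{SOLR}/{target}/update?commit=true").close()
```

Calling reindex("old_collection", "new_collection") would run the copy. The new collection needs a schema.xml without the unwanted fields; once it is verified, the old collection can be dropped and the disk space comes back.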

On Sat, Nov 8, 2014 at 12:44 AM, Erick Erickson <erickerick...@gmail.com> wrote:

> bq: My question is if I can delete the field definition from the
> schema.xml and do an optimize and the fields “magically” disappears
>
> no. schema.xml is really just about regularizing how Lucene indexes
> things. Lucene (where this would have to take place) doesn't have any
> understanding of schema.xml, so changing it then optimizing (and
> optimizing is also a Lucene function) won't have any effect.
>
> If you
> 1> change the schema
> and
> 2> update documents
> the data will be purged as background merges happen.
>
> But really, I'd recommend re-indexing into a new collection if at all
> possible.
>
> Best,
> Erick
>
> On Fri, Nov 7, 2014 at 4:26 AM, Yago Riveiro <yago.rive...@gmail.com>
> wrote:
> > Jack,
> >
> > I have some data indexed that I don’t need any more. My question is if I
> > can delete the field definition from the schema.xml and do an optimize and
> > the fields “magically” disappears (and free space from disk).
> >
> > Re-index data to delete fields is to expensive in collections with
> > hundreds of millions of documents.
> >
> > Optimize operation seems to be a good place to shrink to documents ...
> >
> > --
> > /Yago Riveiro
> >
> > On Fri, Nov 7, 2014 at 12:19 PM, Jack Krupansky <j...@basetechnology.com>
> > wrote:
> >
> >> Could you clarify exactly what you are trying to do, like with an example? I
> >> mean, how exactly are you determining what fields are "unwanted"? Are you
> >> simply asking whether fields can be deleted from the index (and schema)?
> >> -- Jack Krupansky
> >> -----Original Message-----
> >> From: yriveiro
> >> Sent: Thursday, November 6, 2014 9:19 AM
> >> To: solr-user@lucene.apache.org
> >> Subject: Delete data from stored documents
> >> Hi,
> >> It's possible remove store data of an index deleting the unwanted fields
> >> from schema.xml and after do an optimize over the index?
> >> Thanks,
> >> /yago
> >> -----
> >> Best regards
> >> --
> >> View this message in context:
> >> http://lucene.472066.n3.nabble.com/Delete-data-from-stored-documents-tp4167990.html
> >> Sent from the Solr - User mailing list archive at Nabble.com.