[jira] [Resolved] (NIFI-7611) NiFi fails to index provenance events

David Handermann (Jira) Fri, 10 Feb 2023 11:29:05 -0800


     [ 
https://issues.apache.org/jira/browse/NIFI-7611?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]


David Handermann resolved NIFI-7611.
------------------------------------
    Resolution: Cannot Reproduce

Thanks the comment [~Kappelmann], it makes sense that an anti-virus scanner 
could introduce this behavior if it locks access to the Provenance repository 
files. Closing based on those comments, but this could be re-opened if there 
other environmental factors could be described and ruled out as the source of 
the problem.

> NiFi fails to index provenance events
> -------------------------------------
>
>                 Key: NIFI-7611
>                 URL: https://issues.apache.org/jira/browse/NIFI-7611
>             Project: Apache NiFi
>          Issue Type: Bug
>    Affects Versions: 1.11.4
>         Environment: Microsoft Windows Server 2016 Standard - Intel Xeon Gold 
> 6140 CPU @ 2,30 GHz 8 processors, 32 GB RAM, total disk space 877 GB
>            Reporter: Michal W
>            Priority: Critical
>              Labels: index, java.nio.file.FileSystemException, provenance, 
> repository
>
> Getting error "failed to index provenance events". Nifi.app log displays 
> following information:
> 2020-07-08 09:00:00,406 ERROR [Index Provenance Events-4] 
> o.a.n.p.index.lucene.EventIndexTask Failed to index Provenance Events
> org.apache.lucene.store.AlreadyClosedException: this IndexWriter is closed
>                 at 
> org.apache.lucene.index.IndexWriter.ensureOpen(IndexWriter.java:681)
>                 at 
> org.apache.lucene.index.IndexWriter.ensureOpen(IndexWriter.java:695)
>                 at 
> org.apache.lucene.index.IndexWriter.updateDocuments(IndexWriter.java:1281)
>                 at 
> org.apache.lucene.index.IndexWriter.addDocuments(IndexWriter.java:1257)
>                 at 
> org.apache.nifi.provenance.lucene.LuceneEventIndexWriter.index(LuceneEventIndexWriter.java:70)
>                 at 
> org.apache.nifi.provenance.index.lucene.EventIndexTask.index(EventIndexTask.java:202)
>                 at 
> org.apache.nifi.provenance.index.lucene.EventIndexTask.run(EventIndexTask.java:113)
>                 at 
> java.base/java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:515)
>                 at 
> java.base/java.util.concurrent.FutureTask.run(FutureTask.java:264)
>                 at 
> java.base/java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1128)
>                 at 
> java.base/java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:628)
>                 at java.base/java.lang.Thread.run(Thread.java:834)
> Caused by: java.nio.file.FileSystemException: 
> E:\nifi-storage\provenance_repository\lucene-8-index-1593163985970\_11r.cfe: 
> The process cannot access the file because it is being used by another 
> process.
>  
>                 at 
> java.base/sun.nio.fs.WindowsException.translateToIOException(WindowsException.java:92)
>                 at 
> java.base/sun.nio.fs.WindowsException.rethrowAsIOException(WindowsException.java:103)
>                 at 
> java.base/sun.nio.fs.WindowsException.rethrowAsIOException(WindowsException.java:108)
>                 at 
> java.base/sun.nio.fs.WindowsFileSystemProvider.newFileChannel(WindowsFileSystemProvider.java:120)
>                 at 
> java.base/java.nio.channels.FileChannel.open(FileChannel.java:292)
>                 at 
> java.base/java.nio.channels.FileChannel.open(FileChannel.java:345)
>                 at 
> org.apache.lucene.store.MMapDirectory.openInput(MMapDirectory.java:238)
>                 at 
> org.apache.lucene.store.Directory.openChecksumInput(Directory.java:157)
>                 at 
> org.apache.lucene.codecs.lucene50.Lucene50CompoundReader.readEntries(Lucene50CompoundReader.java:105)
>                 at 
> org.apache.lucene.codecs.lucene50.Lucene50CompoundReader.<init>(Lucene50CompoundReader.java:69)
>                 at 
> org.apache.lucene.codecs.lucene50.Lucene50CompoundFormat.getCompoundReader(Lucene50CompoundFormat.java:70)
>                 at 
> org.apache.lucene.index.SegmentCoreReaders.<init>(SegmentCoreReaders.java:100)
>                 at 
> org.apache.lucene.index.SegmentReader.<init>(SegmentReader.java:83)
>                 at 
> org.apache.lucene.index.ReadersAndUpdates.getReader(ReadersAndUpdates.java:172)
>                 at 
> org.apache.lucene.index.ReadersAndUpdates.getReaderForMerge(ReadersAndUpdates.java:709)
>                 at 
> org.apache.lucene.index.IndexWriter.mergeMiddle(IndexWriter.java:4396)
>                 at 
> org.apache.lucene.index.IndexWriter.merge(IndexWriter.java:4054)
>                 at 
> org.apache.lucene.index.ConcurrentMergeScheduler.doMerge(ConcurrentMergeScheduler.java:625)
>                 at 
> org.apache.lucene.index.ConcurrentMergeScheduler$MergeThread.run(ConcurrentMergeScheduler.java:662)
>  
> Logs eventually grow over time and fill up the partition.
>  
> Configuration related to provenance repository:
>  
> # Provenance Repository Properties
> nifi.provenance.repository.implementation=org.apache.nifi.provenance.WriteAheadProvenanceRepository
> nifi.provenance.repository.debug.frequency=1_000_000
> nifi.provenance.repository.encryption.key.provider.implementation=
> nifi.provenance.repository.encryption.key.provider.location=
> nifi.provenance.repository.encryption.key.id=
> nifi.provenance.repository.encryption.key=
>  
> # Persistent Provenance Repository Properties
> nifi.provenance.repository.directory.default=E:\\nifi-storage\\provenance_repository
> nifi.provenance.repository.directory.content1=F:\\nifi-storage\\provenance_repository
> nifi.provenance.repository.max.storage.time=24 hours
> # nifi.provenance.repository.max.storage.size=1 GB
> nifi.provenance.repository.max.storage.size=8 GB
> nifi.provenance.repository.rollover.time=30 secs
> # nifi.provenance.repository.rollover.size=100 MB
> nifi.provenance.repository.rollover.size=1 GB
> nifi.provenance.repository.query.threads=2
> nifi.provenance.repository.index.threads=4
> #default: nifi.provenance.repository.compress.on.rollover=true
> nifi.provenance.repository.compress.on.rollover=false
> nifi.provenance.repository.always.sync=false
> # Comma-separated list of fields. Fields that are not indexed will not be 
> searchable. Valid fields are:
> # EventType, FlowFileUUID, Filename, TransitURI, ProcessorID, 
> AlternateIdentifierURI, Relationship, Details
> nifi.provenance.repository.indexed.fields=EventType, FlowFileUUID, Filename, 
> ProcessorID, Relationship
> # FlowFile Attributes that should be indexed and made searchable.  Some 
> examples to consider are filename, uuid, mime.type
> nifi.provenance.repository.indexed.attributes=
> # Large values for the shard size will result in more Java heap usage when 
> searching the Provenance Repository
> # but should provide better performance
> # nifi.provenance.repository.index.shard.size=500 MB
> nifi.provenance.repository.index.shard.size=4 GB
>  
> # Indicates the maximum length that a FlowFile attribute can be when 
> retrieving a Provenance Event from
> # the repository. If the length of any attribute exceeds this value, it will 
> be truncated when the event is retrieved.
> nifi.provenance.repository.max.attribute.length=65536
> nifi.provenance.repository.concurrent.merge.threads=2
>  
> # Volatile Provenance Respository Properties
> nifi.provenance.repository.buffer.size=100000



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

[jira] [Resolved] (NIFI-7611) NiFi fails to index provenance events

Reply via email to