subject:"Index File"

[jira] [Commented] (LUCENE-5050) CompressingStoredFieldsReader should close the index file as soon as it has been read

2013-06-11 Thread Commit Tag Bot (JIRA)


[ 
https://issues.apache.org/jira/browse/LUCENE-5050?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13680560#comment-13680560
 ] 

Commit Tag Bot commented on LUCENE-5050:


[branch_4x commit] jpountz
http://svn.apache.org/viewvc?view=revisionrevision=1491909

LUCENE-5050: Close the stored fields and term vectors index files eagerly 
(merged from r1491889).

 CompressingStoredFieldsReader should close the index file as soon as it has 
 been read
 -

 Key: LUCENE-5050
 URL: https://issues.apache.org/jira/browse/LUCENE-5050
 Project: Lucene - Core
  Issue Type: Improvement
Affects Versions: 4.3.1
Reporter: Adrien Grand
Assignee: Adrien Grand
Priority: Minor
 Fix For: 5.0, 4.4

 Attachments: LUCENE-5050.patch


 Although CompressingStoredFieldsReader loads the stored fields index into 
 memory, it only closes the index file in {{close()}}. Closing at the end of 
 the constructor should help save file descriptors.
 The same idea applies to CompressingTermVectorsReader.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

[jira] [Resolved] (LUCENE-5050) CompressingStoredFieldsReader should close the index file as soon as it has been read

2013-06-11 Thread Adrien Grand (JIRA)


 [ 
https://issues.apache.org/jira/browse/LUCENE-5050?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Adrien Grand resolved LUCENE-5050.
--

Resolution: Fixed

Thanks Robert.

 CompressingStoredFieldsReader should close the index file as soon as it has 
 been read
 -

 Key: LUCENE-5050
 URL: https://issues.apache.org/jira/browse/LUCENE-5050
 Project: Lucene - Core
  Issue Type: Improvement
Affects Versions: 4.3.1
Reporter: Adrien Grand
Assignee: Adrien Grand
Priority: Minor
 Fix For: 5.0, 4.4

 Attachments: LUCENE-5050.patch


 Although CompressingStoredFieldsReader loads the stored fields index into 
 memory, it only closes the index file in {{close()}}. Closing at the end of 
 the constructor should help save file descriptors.
 The same idea applies to CompressingTermVectorsReader.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

[jira] [Created] (LUCENE-5050) CompressingStoredFieldsReader should close the index file as soon as it has been read

2013-06-10 Thread Adrien Grand (JIRA)

Adrien Grand created LUCENE-5050:


 Summary: CompressingStoredFieldsReader should close the index file 
as soon as it has been read
 Key: LUCENE-5050
 URL: https://issues.apache.org/jira/browse/LUCENE-5050
 Project: Lucene - Core
  Issue Type: Improvement
Affects Versions: 4.3.1
Reporter: Adrien Grand
Assignee: Adrien Grand
Priority: Minor
 Fix For: 5.0, 4.4


Although CompressingStoredFieldsReader loads the stored fields index into 
memory, it only closes the index file in {{close()}}. Closing at the end of the 
constructor should help save file descriptors.

The same idea applies to CompressingTermVectorsReader.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

[jira] [Updated] (LUCENE-5050) CompressingStoredFieldsReader should close the index file as soon as it has been read

2013-06-10 Thread Adrien Grand (JIRA)


 [ 
https://issues.apache.org/jira/browse/LUCENE-5050?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Adrien Grand updated LUCENE-5050:
-

Attachment: LUCENE-5050.patch

Here is a patch which closes the index file in the constructor for both 
CompressingStoredFieldsReader and CompressingTermVectorsReader.

 CompressingStoredFieldsReader should close the index file as soon as it has 
 been read
 -

 Key: LUCENE-5050
 URL: https://issues.apache.org/jira/browse/LUCENE-5050
 Project: Lucene - Core
  Issue Type: Improvement
Affects Versions: 4.3.1
Reporter: Adrien Grand
Assignee: Adrien Grand
Priority: Minor
 Fix For: 5.0, 4.4

 Attachments: LUCENE-5050.patch


 Although CompressingStoredFieldsReader loads the stored fields index into 
 memory, it only closes the index file in {{close()}}. Closing at the end of 
 the constructor should help save file descriptors.
 The same idea applies to CompressingTermVectorsReader.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

[jira] [Commented] (LUCENE-5050) CompressingStoredFieldsReader should close the index file as soon as it has been read

2013-06-10 Thread Robert Muir (JIRA)


[ 
https://issues.apache.org/jira/browse/LUCENE-5050?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13679785#comment-13679785
 ] 

Robert Muir commented on LUCENE-5050:
-

+1

 CompressingStoredFieldsReader should close the index file as soon as it has 
 been read
 -

 Key: LUCENE-5050
 URL: https://issues.apache.org/jira/browse/LUCENE-5050
 Project: Lucene - Core
  Issue Type: Improvement
Affects Versions: 4.3.1
Reporter: Adrien Grand
Assignee: Adrien Grand
Priority: Minor
 Fix For: 5.0, 4.4

 Attachments: LUCENE-5050.patch


 Although CompressingStoredFieldsReader loads the stored fields index into 
 memory, it only closes the index file in {{close()}}. Closing at the end of 
 the constructor should help save file descriptors.
 The same idea applies to CompressingTermVectorsReader.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

Re: [Heads-up] Index file format change on trunk

2012-11-19 Thread Adrien Grand

Hi,

I just modified the Lucene 4.1 stored fields format again[1]. You should
reindex.

 [1] https://issues.apache.org/jira/browse/LUCENE-4558

-- 
Adrien

[Heads-up] Index file format change on trunk

2012-11-13 Thread Adrien Grand

Hi,

I just committed LUCENE-4509 [1] which changes the default
StoredFieldsFormat on trunk (backport to come on branch 4.x in a few
minutes). You should reindex.

 [1] https://issues.apache.org/jira/browse/LUCENE-4509

-- 
Adrien

Re: [Heads-up] Index file format change on trunk

2012-11-13 Thread Jack Krupansky

Hmmm... does this mean that 4.1 would REQUIRE reindexing?!?! If so, that would 
strongly argue for there to be a 4.0.1 release that has both bug fixes and the 
incremental feature additions of the 4x branch but DOESN’T require a reindex.

Who made this decision that it was A-OK for 4.1 to require a reindex? I don’t 
recall seeing any discussion here about it.

-- Jack Krupansky

From: Adrien Grand 
Sent: Tuesday, November 13, 2012 7:00 AM
To: dev@lucene.apache.org 
Subject: [Heads-up] Index file format change on trunk

Hi,

I just committed LUCENE-4509 [1] which changes the default StoredFieldsFormat 
on trunk (backport to come on branch 4.x in a few minutes). You should reindex.


[1] https://issues.apache.org/jira/browse/LUCENE-4509

-- 
Adrien

Re: [Heads-up] Index file format change on trunk

2012-11-13 Thread Adrien Grand

Hi Jack,

On Tue, Nov 13, 2012 at 4:06 PM, Jack Krupansky j...@basetechnology.comwrote:

   Hmmm... does this mean that 4.1 would REQUIRE reindexing?


No it doesn't. This is a notice for people who would already use the
(unreleased) Lucene41Codec for some test indexes to tell them that they
should reindex if they want their code to keep working with the latest
trunk revision.

-- 
Adrien

Re: [Heads-up] Index file format change on trunk

2012-11-13 Thread Jack Krupansky

Thank you for that clarification!

-- Jack Krupansky

From: Adrien Grand 
Sent: Tuesday, November 13, 2012 7:10 AM
To: dev@lucene.apache.org 
Subject: Re: [Heads-up] Index file format change on trunk

Hi Jack, 

On Tue, Nov 13, 2012 at 4:06 PM, Jack Krupansky j...@basetechnology.com wrote:

  Hmmm... does this mean that 4.1 would REQUIRE reindexing?

No it doesn't. This is a notice for people who would already use the 
(unreleased) Lucene41Codec for some test indexes to tell them that they should 
reindex if they want their code to keep working with the latest trunk revision.

-- 
Adrien

[jira] [Commented] (SOLR-1822) SEVERE: Unable to move index file from: tempfile to: indexfile

2012-07-31 Thread Faizan Aziz (JIRA)


[ 
https://issues.apache.org/jira/browse/SOLR-1822?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13425593#comment-13425593
 ] 

Faizan Aziz commented on SOLR-1822:
---

I am getting the same error for solr 3.5.0. On close analysis of 
SnapPuller.java I notice that the fix mentioned in the attachment is not done

 SEVERE: Unable to move index file from: tempfile to: indexfile
 --

 Key: SOLR-1822
 URL: https://issues.apache.org/jira/browse/SOLR-1822
 Project: Solr
  Issue Type: Bug
  Components: replication (java)
Affects Versions: 1.4
 Environment: Linux, JDK6,SOLR 1.4
Reporter: wyhw whon
Assignee: Mark Miller
Priority: Critical
 Fix For: 3.1, 4.0-ALPHA

 Attachments: SnapPuller.patch


 SOLR index directory remvoed,but do not know what the reasons for this.
 I add some codes on SnapPuller.java 577 line can reslove this bug.
 line 576   
 File indexFileInIndex = new File(indexDir, fname);
 +
 if (!indexDir.exists()) indexDir.mkdir();
 boolean success = indexFileInTmpDir.renameTo(indexFileInIndex);

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira



-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

[jira] [Commented] (SOLR-1822) SEVERE: Unable to move index file from: tempfile to: indexfile

2012-07-31 Thread Faizan Aziz (JIRA)


[ 
https://issues.apache.org/jira/browse/SOLR-1822?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13425600#comment-13425600
 ] 

Faizan Aziz commented on SOLR-1822:
---

ok The fix by kafka0102 is there so I guess the patch is not needed 
However I am still getting the error

SEVERE: Unable to copy index file from: 
/opt/solr/multicore/1011/data/index.20120731065949/_9x.tis to: 
/mnt/multicore/1011/data/index.20120731050740/_9x.tis

Note: /opt/solr/multicore is a symlink to /mnt/multicore

 SEVERE: Unable to move index file from: tempfile to: indexfile
 --

 Key: SOLR-1822
 URL: https://issues.apache.org/jira/browse/SOLR-1822
 Project: Solr
  Issue Type: Bug
  Components: replication (java)
Affects Versions: 1.4
 Environment: Linux, JDK6,SOLR 1.4
Reporter: wyhw whon
Assignee: Mark Miller
Priority: Critical
 Fix For: 3.1, 4.0-ALPHA

 Attachments: SnapPuller.patch


 SOLR index directory remvoed,but do not know what the reasons for this.
 I add some codes on SnapPuller.java 577 line can reslove this bug.
 line 576   
 File indexFileInIndex = new File(indexDir, fname);
 +
 if (!indexDir.exists()) indexDir.mkdir();
 boolean success = indexFileInTmpDir.renameTo(indexFileInIndex);

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira



-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

[jira] [Commented] (SOLR-1822) SEVERE: Unable to move index file from: tempfile to: indexfile

2012-07-31 Thread Faizan Aziz (JIRA)


[ 
https://issues.apache.org/jira/browse/SOLR-1822?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13425605#comment-13425605
 ] 

Faizan Aziz commented on SOLR-1822:
---

Complete logs:
Jul 31, 2012 6:59:49 AM org.apache.solr.handler.SnapPuller copyAFile
SEVERE: Unable to move index file from: 
/opt/solr/multicore/1011/data/index.20120731065949/_9x.tis to: 
/mnt/multicore/1011/data/index.20120731050740/_9x.tisTrying to do a copy
Jul 31, 2012 6:59:49 AM org.apache.solr.handler.SnapPuller copyAFile
SEVERE: Unable to copy index file from: 
/opt/solr/multicore/1011/data/index.20120731065949/_9x.tis to: 
/mnt/multicore/1011/data/index.20120731050740/_9x.tis
java.io.FileNotFoundException: 
/mnt/multicore/1011/data/index.20120731050740/_9x.tis (No such file or 
directory)
at java.io.FileOutputStream.open(Native Method)
at java.io.FileOutputStream.init(FileOutputStream.java:209)
at java.io.FileOutputStream.init(FileOutputStream.java:160)
at org.apache.solr.common.util.FileUtils.copyFile(FileUtils.java:48)
at org.apache.solr.handler.SnapPuller.copyAFile(SnapPuller.java:585)
at 
org.apache.solr.handler.SnapPuller.copyIndexFiles(SnapPuller.java:621)
at 
org.apache.solr.handler.SnapPuller.fetchLatestIndex(SnapPuller.java:317)
at 
org.apache.solr.handler.ReplicationHandler.doFetch(ReplicationHandler.java:268)
at 
org.apache.solr.handler.ReplicationHandler$1.run(ReplicationHandler.java:149)

 SEVERE: Unable to move index file from: tempfile to: indexfile
 --

 Key: SOLR-1822
 URL: https://issues.apache.org/jira/browse/SOLR-1822
 Project: Solr
  Issue Type: Bug
  Components: replication (java)
Affects Versions: 1.4
 Environment: Linux, JDK6,SOLR 1.4
Reporter: wyhw whon
Assignee: Mark Miller
Priority: Critical
 Fix For: 3.1, 4.0-ALPHA

 Attachments: SnapPuller.patch


 SOLR index directory remvoed,but do not know what the reasons for this.
 I add some codes on SnapPuller.java 577 line can reslove this bug.
 line 576   
 File indexFileInIndex = new File(indexDir, fname);
 +
 if (!indexDir.exists()) indexDir.mkdir();
 boolean success = indexFileInTmpDir.renameTo(indexFileInIndex);

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira



-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

[HEADS-UP]: Index File Format Change on Trunk

2012-05-23 Thread Simon Willnauer

Hey folks,

I just committed LUCENE-4051 [1] (Revision 1341768) which changes the
file format of DocValues, Norms (DocValues), StoredFields 
TermVectors incompatible to previous revisions. If you are using trunk
indices you must re-index before updating  to the latest trunk
sources.
If you are using Lucene 3.x or below you can safely ignore this message.

happy indexing,

simon

[1] https://issues.apache.org/jira/browse/LUCENE-4051

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

Heads Up - Index File Format Change on Trunk

2012-01-05 Thread Simon Willnauer

Folks,

I just committed LUCENE-3628 [1] which cuts over Norms to DocVaues.
This is an index file format change and if you are using trunk you
need to reindex before updating.

happy indexing :)

simon

[1] https://issues.apache.org/jira/browse/LUCENE-3628

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

how to customize lucene index file

2011-07-21 Thread Ari . Ko


Hi, good morning everyone.

I would like to ask one question about lucene index file.

I want to know whether I could customize this index file.

I know this index file is created by lucene in the backgroud from documents
directory.
And all of words and their frequence is output in the index file.

In fact I want to calculate the weight of each word and just output some
words which weight value is high in the index file.

According to the index creation method as below

***
...
IndexWriter writer = new IndexWriter(index, new JapaneseAnalyzer(), create);
...
Document doc = new Document();
doc.add(Field.Text(contents,reader));

...
writer.addDocument(doc);
***

It seems impossible. But I realy want to is there some method to realize it
in the lucene ?

In fact I want to use lucene not for retrival but for text clustering.

Best regards.

Yali Hu




-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

Re: Heads Up - Index File Format Change on Trunk

2011-06-10 Thread Michael McCandless

Simon can you also email java-user@ and solr-user@?  Seems good to
over-communicate when trunk index format changes...

Mike McCandless

http://blog.mikemccandless.com

On Fri, Jun 10, 2011 at 10:01 AM, Simon Willnauer
simon.willna...@googlemail.com wrote:
 Hey folks,

 I just committed LUCENE-3108 (Landing DocValues on Trunk) which adds a
 byte to FieldInfo.
 If you are running on trunk you must / should re-index any trunk
 indexes once you update to the latest trunk.

 its likely if you open up old trunk (4.0) indexes, you will get an
 exception related to Read Past EOF.

 Simon

 -
 To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
 For additional commands, e-mail: dev-h...@lucene.apache.org



-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

Re: Heads Up - Index File Format Change on Trunk

2011-06-10 Thread Simon Willnauer

On Fri, Jun 10, 2011 at 6:22 PM, Michael McCandless
luc...@mikemccandless.com wrote:
 Simon can you also email java-user@ and solr-user@?  Seems good to
 over-communicate when trunk index format changes...

good point, done!



 Mike McCandless

 http://blog.mikemccandless.com

 On Fri, Jun 10, 2011 at 10:01 AM, Simon Willnauer
 simon.willna...@googlemail.com wrote:
 Hey folks,

 I just committed LUCENE-3108 (Landing DocValues on Trunk) which adds a
 byte to FieldInfo.
 If you are running on trunk you must / should re-index any trunk
 indexes once you update to the latest trunk.

 its likely if you open up old trunk (4.0) indexes, you will get an
 exception related to Read Past EOF.

 Simon

 -
 To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
 For additional commands, e-mail: dev-h...@lucene.apache.org




-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

Index File

2011-03-21 Thread soheila dehghanzadeh

 Hi All,
I have created Index folder, and i tried to open .CFS,.CFX,.GEN, segments
file with notpad . but they are unreadable. i want to see their structure
for my sample directory which i have passed to Indexfiles . ihave red this
http://lucene.apache.org/java/3_0_3/fileformats.html and i know what Index
should has. is there any way to see created index?
thanks in advance .
Peace.
-Soheila D.

Re: Index File

2011-03-21 Thread Sanne Grinovero

Hi,

2011/3/21 soheila dehghanzadeh sally...@gmail.com:
 Hi All,
 I have created Index folder, and i tried to open .CFS,.CFX,.GEN, segments
 file with notpad . but they are unreadable. i want to see their structure
 for my sample directory which i have passed to Indexfiles . ihave red this
 http://lucene.apache.org/java/3_0_3/fileformats.html and i know what Index
 should has. is there any way to see created index?

yes, use the force :)
http://code.google.com/p/luke/

Regards,
Sanne

 thanks in advance .
 Peace.
 -Soheila D.


-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

[jira] Resolved: (SOLR-1822) SEVERE: Unable to move index file from: tempfile to: indexfile

2011-03-16 Thread Mark Miller (JIRA)


 [ 
https://issues.apache.org/jira/browse/SOLR-1822?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Mark Miller resolved SOLR-1822.
---

   Resolution: Duplicate
Fix Version/s: (was: Next)
   4.0
   3.1
 Assignee: Mark Miller

 SEVERE: Unable to move index file from: tempfile to: indexfile
 --

 Key: SOLR-1822
 URL: https://issues.apache.org/jira/browse/SOLR-1822
 Project: Solr
  Issue Type: Bug
  Components: replication (java)
Affects Versions: 1.4
 Environment: Linux, JDK6,SOLR 1.4
Reporter: wyhw whon
Assignee: Mark Miller
Priority: Critical
 Fix For: 3.1, 4.0

 Attachments: SnapPuller.patch


 SOLR index directory remvoed,but do not know what the reasons for this.
 I add some codes on SnapPuller.java 577 line can reslove this bug.
 line 576   
 File indexFileInIndex = new File(indexDir, fname);
 +
 if (!indexDir.exists()) indexDir.mkdir();
 boolean success = indexFileInTmpDir.renameTo(indexFileInIndex);

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

Index File Format Change - heads up (late)

2011-02-26 Thread Simon Willnauer

hey there,

a couple of days ago the trunk index format changed with LUCENE-2881
you should re-index any trunk indexes.

its likely if you open up old trunk indexes, you will get an
exception related to Read Past EOF.

Sorry for the delay!

simon

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

[jira] Commented: (SOLR-1822) SEVERE: Unable to move index file from: tempfile to: indexfile

2010-11-22 Thread kafka0102 (JIRA)


[ 
https://issues.apache.org/jira/browse/SOLR-1822?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12934715#action_12934715
 ] 

kafka0102 commented on SOLR-1822:
-

I found the problem's reason.It's tmpDir's name same to indexDir's name. 
tmpIdxDirName's format is SnapShooter.DATE_FMT = MMddhhmmss. If I have a 
indexDirName index.20101122031000 because of a full index repl at 2010-11-22 
03:10:00. And slave's repl duration is 5 minitues. So at 2010-11-22 15:10:00, 
the slave made indexDirName and indexDirName both index.20101122031000, and 
finnaly will drop the index.After that,replication would generate the above 
error again and again.

 SEVERE: Unable to move index file from: tempfile to: indexfile
 --

 Key: SOLR-1822
 URL: https://issues.apache.org/jira/browse/SOLR-1822
 Project: Solr
  Issue Type: Bug
  Components: replication (java)
Affects Versions: 1.4
 Environment: Linux, JDK6,SOLR 1.4
Reporter: wyhw whon
Priority: Critical
 Fix For: Next

 Attachments: SnapPuller.patch


 SOLR index directory remvoed,but do not know what the reasons for this.
 I add some codes on SnapPuller.java 577 line can reslove this bug.
 line 576   
 File indexFileInIndex = new File(indexDir, fname);
 +
 if (!indexDir.exists()) indexDir.mkdir();
 boolean success = indexFileInTmpDir.renameTo(indexFileInIndex);

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

[jira] Issue Comment Edited: (SOLR-1822) SEVERE: Unable to move index file from: tempfile to: indexfile

2010-11-22 Thread kafka0102 (JIRA)


[ 
https://issues.apache.org/jira/browse/SOLR-1822?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12934715#action_12934715
 ] 

kafka0102 edited comment on SOLR-1822 at 11/22/10 9:45 PM:
---

I found the problem's reason.It's tmpDir's name same to indexDir's name. 
tmpIdxDirName's format is SnapShooter.DATE_FMT = MMddhhmmss(hh is between 
1-12). If I have a indexDirName index.20101122031000 because of a full index 
repl at 2010-11-22 03:10:00. And slave's repl duration is 5 minitues. So at 
2010-11-22 15:10:00, the slave made indexDirName and indexDirName both 
index.20101122031000, and finnaly will drop the index.After that,replication 
would generate the above error again and again.

So,just let  SnapShooter.DATE_FMT = MMddHHmmss would be fine.

  was (Author: kafka0102):
I found the problem's reason.It's tmpDir's name same to indexDir's name. 
tmpIdxDirName's format is SnapShooter.DATE_FMT = MMddhhmmss. If I have a 
indexDirName index.20101122031000 because of a full index repl at 2010-11-22 
03:10:00. And slave's repl duration is 5 minitues. So at 2010-11-22 15:10:00, 
the slave made indexDirName and indexDirName both index.20101122031000, and 
finnaly will drop the index.After that,replication would generate the above 
error again and again.
  
 SEVERE: Unable to move index file from: tempfile to: indexfile
 --

 Key: SOLR-1822
 URL: https://issues.apache.org/jira/browse/SOLR-1822
 Project: Solr
  Issue Type: Bug
  Components: replication (java)
Affects Versions: 1.4
 Environment: Linux, JDK6,SOLR 1.4
Reporter: wyhw whon
Priority: Critical
 Fix For: Next

 Attachments: SnapPuller.patch


 SOLR index directory remvoed,but do not know what the reasons for this.
 I add some codes on SnapPuller.java 577 line can reslove this bug.
 line 576   
 File indexFileInIndex = new File(indexDir, fname);
 +
 if (!indexDir.exists()) indexDir.mkdir();
 boolean success = indexFileInTmpDir.renameTo(indexFileInIndex);

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

Index File Format Change - LUCENE-2741

2010-11-15 Thread Simon Willnauer

Hey folks | devs,

I just wanted to give you a heads up that the just committed
LUCENE-2741 contains a index file format change.
Codec Files will now have a codec ID in their filenames to
support codecs that use the same file extensions within a single segment.

Indexes build with Lucene TRUNK will not work anymore and need to be
reindexed. If you
are using trunk you should reindex after you update to latest trunk.
Indexes build with previous version of Lucene will work just fine and
no action is required.

Simon

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

Re: Index File Format change on trunk

2010-11-11 Thread Yonik Seeley

Thanks for the heads-up... I tried not rebuilding the index on Solr
just to see what type of exception I'd get:

Caused by: java.lang.IllegalArgumentException: required codec
'tandardos   Windows 7
os.version6.1sourceflushlucene.version34.0-SNAPSHOT 10' not found

-Yonik
http://www.lucidimagination.com


On Wed, Nov 10, 2010 at 11:40 AM, Simon Willnauer
simon.willna...@googlemail.com wrote:
 Hey folks | devs,

 I just wanted to give you a heads up that the just committed
 LUCENE-2742 contains a index file format change. Indexes build with
 Lucene TRUNK will not work anymore and need to be reindexed. If you
 are using trunk you should reindex after you update to latest trunk.
 Indexes build with previous version of Lucene will work just fine and
 no action is required.

 simon

 -
 To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
 For additional commands, e-mail: dev-h...@lucene.apache.org



-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

Re: Index File Format change on trunk

2010-11-11 Thread Simon Willnauer

On Thu, Nov 11, 2010 at 2:57 PM, Yonik Seeley
yo...@lucidimagination.com wrote:
 Thanks for the heads-up... I tried not rebuilding the index on Solr
 just to see what type of exception I'd get:

 Caused by: java.lang.IllegalArgumentException: required codec
 'tandardos       Windows 7
 os.version6.1sourceflushlucene.version34.0-SNAPSHOT 10' not found
heh, it tried to read the first string in the stringstring map for the
segment and get the codec for it :D
yeah we didn't add a check for it since it is only a change in trunk though.

simon

 -Yonik
 http://www.lucidimagination.com


 On Wed, Nov 10, 2010 at 11:40 AM, Simon Willnauer
 simon.willna...@googlemail.com wrote:
 Hey folks | devs,

 I just wanted to give you a heads up that the just committed
 LUCENE-2742 contains a index file format change. Indexes build with
 Lucene TRUNK will not work anymore and need to be reindexed. If you
 are using trunk you should reindex after you update to latest trunk.
 Indexes build with previous version of Lucene will work just fine and
 no action is required.

 simon

 -
 To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
 For additional commands, e-mail: dev-h...@lucene.apache.org



 -
 To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
 For additional commands, e-mail: dev-h...@lucene.apache.org



-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

Index File Format change on trunk

2010-11-10 Thread Simon Willnauer

Hey folks | devs,

I just wanted to give you a heads up that the just committed
LUCENE-2742 contains a index file format change. Indexes build with
Lucene TRUNK will not work anymore and need to be reindexed. If you
are using trunk you should reindex after you update to latest trunk.
Indexes build with previous version of Lucene will work just fine and
no action is required.

simon

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

[jira] Updated: (LUCENE-2528) CFSFileDirectory: Allow a Compound Index file to be deployed as a complete index without segment files

2010-09-23 Thread Robert Muir (JIRA)


 [ 
https://issues.apache.org/jira/browse/LUCENE-2528?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Robert Muir updated LUCENE-2528:


Component/s: Store

 CFSFileDirectory: Allow a Compound Index file to be deployed as a complete 
 index without segment files
 --

 Key: LUCENE-2528
 URL: https://issues.apache.org/jira/browse/LUCENE-2528
 Project: Lucene - Java
  Issue Type: New Feature
  Components: Store
Reporter: Lance Norskog
Priority: Minor
 Attachments: LUCENE-2528.patch


 This patch presents a compound index file as a Lucene Directory class. This 
 allows you to deploy one file to a query server instead of deploying a 
 directory with the compound file and two segment files.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

Index file format documentation

2010-07-18 Thread Shai Erera

Hi

I've read the index file format documentation (
http://lucene.apache.org/java/3_0_2/fileformats.html) trying to understand
something, and I spent quite a lot of time on the skip data documentation.
The definition that puzzled me is this:

SkipData -- SkipLevelLength, SkipLevel NumSkipLevels-1, SkipLevel
SkipDatum

First, can someone please tell me how to read this? :)

Second, I think it's incorrect (unless I misunderstood how to read it).
After reading the code, it feels to me like the definition should have been:

SkipData -- SkipLevelLength, SkipDatum NumSkipLevels-1 SkipDatum

That is, all levels  0, record their skip level data length followed by the
SkipDatum entries while level = 0 records the same besides the skip level
length ('cause it's never cached, therefore there's no need to record the
length).

Am I correct in my understanding?

Shai

Re: Index file format documentation

2010-07-18 Thread Michael McCandless

Actually I think the docs are [close to] correct?

SkipLevel is a sequence of SkipDatum.

Though I'm not sure why that last SkipDatum is attached; that seems odd.

Mike

On Sun, Jul 18, 2010 at 6:07 AM, Shai Erera ser...@gmail.com wrote:
 Hi

 I've read the index file format documentation
 (http://lucene.apache.org/java/3_0_2/fileformats.html) trying to understand
 something, and I spent quite a lot of time on the skip data documentation.
 The definition that puzzled me is this:

 SkipData -- SkipLevelLength, SkipLevel NumSkipLevels-1, SkipLevel
 SkipDatum

 First, can someone please tell me how to read this? :)

 Second, I think it's incorrect (unless I misunderstood how to read it).
 After reading the code, it feels to me like the definition should have been:

 SkipData -- SkipLevelLength, SkipDatum NumSkipLevels-1 SkipDatum

 That is, all levels  0, record their skip level data length followed by the
 SkipDatum entries while level = 0 records the same besides the skip level
 length ('cause it's never cached, therefore there's no need to record the
 length).

 Am I correct in my understanding?

 Shai


-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

Re: Index file format documentation

2010-07-18 Thread Shai Erera

Ahh I've missed SkipLevel below it! Now it makes more sense.

I agree then the SkipDatum in the end is redundant … the last
SkipLevel in the definition refers to level 0.

Thanks !

Shai

On Sunday, July 18, 2010, Michael McCandless luc...@mikemccandless.com wrote:
 Actually I think the docs are [close to] correct?

 SkipLevel is a sequence of SkipDatum.

 Though I'm not sure why that last SkipDatum is attached; that seems odd.

 Mike

 On Sun, Jul 18, 2010 at 6:07 AM, Shai Erera ser...@gmail.com wrote:
 Hi

 I've read the index file format documentation
 (http://lucene.apache.org/java/3_0_2/fileformats.html) trying to understand
 something, and I spent quite a lot of time on the skip data documentation.
 The definition that puzzled me is this:

 SkipData -- SkipLevelLength, SkipLevel NumSkipLevels-1, SkipLevel
 SkipDatum

 First, can someone please tell me how to read this? :)

 Second, I think it's incorrect (unless I misunderstood how to read it).
 After reading the code, it feels to me like the definition should have been:

 SkipData -- SkipLevelLength, SkipDatum NumSkipLevels-1 SkipDatum

 That is, all levels  0, record their skip level data length followed by the
 SkipDatum entries while level = 0 records the same besides the skip level
 length ('cause it's never cached, therefore there's no need to record the
 length).

 Am I correct in my understanding?

 Shai


 -
 To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
 For additional commands, e-mail: dev-h...@lucene.apache.org



-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

[jira] Updated: (LUCENE-2528) CFSFileDirectory: Allow a Compound Index file to be deployed as a complete index without segment files

2010-07-05 Thread Lance Norskog (JIRA)


 [ 
https://issues.apache.org/jira/browse/LUCENE-2528?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Lance Norskog updated LUCENE-2528:
--

Attachment: LUCENE-2528.patch

 CFSFileDirectory: Allow a Compound Index file to be deployed as a complete 
 index without segment files
 --

 Key: LUCENE-2528
 URL: https://issues.apache.org/jira/browse/LUCENE-2528
 Project: Lucene - Java
  Issue Type: New Feature
Reporter: Lance Norskog
Priority: Minor
 Attachments: LUCENE-2528.patch


 This patch presents a compound index file as a Lucene Directory class. This 
 allows you to deploy one file to a query server instead of deploying a 
 directory with the compound file and two segment files.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

[jira] Created: (LUCENE-2528) CFSFileDirectory: Allow a Compound Index file to be deployed as a complete index without segment files

2010-07-05 Thread Lance Norskog (JIRA)

CFSFileDirectory: Allow a Compound Index file to be deployed as a complete 
index without segment files
--

 Key: LUCENE-2528
 URL: https://issues.apache.org/jira/browse/LUCENE-2528
 Project: Lucene - Java
  Issue Type: New Feature
Reporter: Lance Norskog
Priority: Minor


This patch presents a compound index file as a Lucene Directory class. This 
allows you to deploy one file to a query server instead of deploying a 
directory with the compound file and two segment files.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

[jira] Commented: (LUCENE-2528) CFSFileDirectory: Allow a Compound Index file to be deployed as a complete index without segment files

2010-07-05 Thread Lance Norskog (JIRA)


[ 
https://issues.apache.org/jira/browse/LUCENE-2528?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12885427#action_12885427
 ] 

Lance Norskog commented on LUCENE-2528:
---

This patch has two problems:
1) CFSFileDirectory is in o.a.l.index instead of o.a.l.store because it uses 
SegmenInfo.commit(), which is not public. 
2) It uses a hacky way to get the number of documents in the index.
3) It does not know how to get the codec, and so defaults to StandardCodec.

The right way to fix #2 and #3 is to add the information to the compound file 
header.  



 CFSFileDirectory: Allow a Compound Index file to be deployed as a complete 
 index without segment files
 --

 Key: LUCENE-2528
 URL: https://issues.apache.org/jira/browse/LUCENE-2528
 Project: Lucene - Java
  Issue Type: New Feature
Reporter: Lance Norskog
Priority: Minor
 Attachments: LUCENE-2528.patch


 This patch presents a compound index file as a Lucene Directory class. This 
 allows you to deploy one file to a query server instead of deploying a 
 directory with the compound file and two segment files.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

[jira] Updated: (LUCENE-2528) CFSFileDirectory: Allow a Compound Index file to be deployed as a complete index without segment files

2010-07-05 Thread Lance Norskog (JIRA)


 [ 
https://issues.apache.org/jira/browse/LUCENE-2528?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Lance Norskog updated LUCENE-2528:
--

Attachment: LUCENE-2528.patch

 CFSFileDirectory: Allow a Compound Index file to be deployed as a complete 
 index without segment files
 --

 Key: LUCENE-2528
 URL: https://issues.apache.org/jira/browse/LUCENE-2528
 Project: Lucene - Java
  Issue Type: New Feature
Reporter: Lance Norskog
Priority: Minor
 Attachments: LUCENE-2528.patch


 This patch presents a compound index file as a Lucene Directory class. This 
 allows you to deploy one file to a query server instead of deploying a 
 directory with the compound file and two segment files.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

[jira] Commented: (SOLR-1822) SEVERE: Unable to move index file from: tempfile to: indexfile

2010-03-16 Thread Otis Gospodnetic (JIRA)


[ 
https://issues.apache.org/jira/browse/SOLR-1822?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12846099#action_12846099
 ] 

Otis Gospodnetic commented on SOLR-1822:


When Solr starts, doesn't it create the index directory?  If so, this patch is 
not needed, unless we want to make sure replication succeeds even if 
someone/something removes the whole index directory on a slave after the slave 
had already started.

Is this rally needed?

 SEVERE: Unable to move index file from: tempfile to: indexfile
 --

 Key: SOLR-1822
 URL: https://issues.apache.org/jira/browse/SOLR-1822
 Project: Solr
  Issue Type: Bug
  Components: replication (java)
Affects Versions: 1.4
 Environment: Linux, JDK6,SOLR 1.4
Reporter: wyhw whon
Priority: Critical
 Fix For: 1.5

 Attachments: SnapPuller.patch


 SOLR index directory remvoed,but do not know what the reasons for this.
 I add some codes on SnapPuller.java 577 line can reslove this bug.
 line 576   
 File indexFileInIndex = new File(indexDir, fname);
 +
 if (!indexDir.exists()) indexDir.mkdir();
 boolean success = indexFileInTmpDir.renameTo(indexFileInIndex);

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Created: (SOLR-1822) SEVERE: Unable to move index file from: tempfile to: indexfile

2010-03-14 Thread wyhw whon (JIRA)

SEVERE: Unable to move index file from: tempfile to: indexfile
--

 Key: SOLR-1822
 URL: https://issues.apache.org/jira/browse/SOLR-1822
 Project: Solr
  Issue Type: Bug
  Components: replication (java)
Affects Versions: 1.4
 Environment: Linux, JDK6,SOLR 1.4
Reporter: wyhw whon
Priority: Critical
 Fix For: 1.5


SOLR index directory remvoed,but do not know what the reasons for this.

I add some codes on SnapPuller.java 577 line can reslove this bug.
line 576  File indexFileInIndex = new File(indexDir, fname);
+  if (!indexDir.exists()) indexDir.mkdir();
boolean success = 
indexFileInTmpDir.renameTo(indexFileInIndex);

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Updated: (SOLR-1822) SEVERE: Unable to move index file from: tempfile to: indexfile

2010-03-14 Thread wyhw whon (JIRA)


 [ 
https://issues.apache.org/jira/browse/SOLR-1822?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

wyhw whon updated SOLR-1822:


Description: 
SOLR index directory remvoed,but do not know what the reasons for this.

I add some codes on SnapPuller.java 577 line can reslove this bug.
line 576   File indexFileInIndex = new File(indexDir, 
fname);
+ if (!indexDir.exists()) 
indexDir.mkdir();
 boolean success = 
indexFileInTmpDir.renameTo(indexFileInIndex);

  was:
SOLR index directory remvoed,but do not know what the reasons for this.

I add some codes on SnapPuller.java 577 line can reslove this bug.
line 576  File indexFileInIndex = new File(indexDir, fname);
+  if (!indexDir.exists()) indexDir.mkdir();
boolean success = 
indexFileInTmpDir.renameTo(indexFileInIndex);


 SEVERE: Unable to move index file from: tempfile to: indexfile
 --

 Key: SOLR-1822
 URL: https://issues.apache.org/jira/browse/SOLR-1822
 Project: Solr
  Issue Type: Bug
  Components: replication (java)
Affects Versions: 1.4
 Environment: Linux, JDK6,SOLR 1.4
Reporter: wyhw whon
Priority: Critical
 Fix For: 1.5


 SOLR index directory remvoed,but do not know what the reasons for this.
 I add some codes on SnapPuller.java 577 line can reslove this bug.
 line 576   File indexFileInIndex = new File(indexDir, 
 fname);
 + if (!indexDir.exists()) 
 indexDir.mkdir();
  boolean success = 
 indexFileInTmpDir.renameTo(indexFileInIndex);

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Updated: (SOLR-1822) SEVERE: Unable to move index file from: tempfile to: indexfile

2010-03-14 Thread wyhw whon (JIRA)


 [ 
https://issues.apache.org/jira/browse/SOLR-1822?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

wyhw whon updated SOLR-1822:


Description: 
SOLR index directory remvoed,but do not know what the reasons for this.

I add some codes on SnapPuller.java 577 line can reslove this bug.
line 576   
File indexFileInIndex = new File(indexDir, fname);
+
if (!indexDir.exists()) indexDir.mkdir();
boolean success = indexFileInTmpDir.renameTo(indexFileInIndex);

  was:
SOLR index directory remvoed,but do not know what the reasons for this.

I add some codes on SnapPuller.java 577 line can reslove this bug.
line 576   File indexFileInIndex = new File(indexDir, 
fname);
+ if (!indexDir.exists()) 
indexDir.mkdir();
 boolean success = 
indexFileInTmpDir.renameTo(indexFileInIndex);


 SEVERE: Unable to move index file from: tempfile to: indexfile
 --

 Key: SOLR-1822
 URL: https://issues.apache.org/jira/browse/SOLR-1822
 Project: Solr
  Issue Type: Bug
  Components: replication (java)
Affects Versions: 1.4
 Environment: Linux, JDK6,SOLR 1.4
Reporter: wyhw whon
Priority: Critical
 Fix For: 1.5


 SOLR index directory remvoed,but do not know what the reasons for this.
 I add some codes on SnapPuller.java 577 line can reslove this bug.
 line 576   
 File indexFileInIndex = new File(indexDir, fname);
 +
 if (!indexDir.exists()) indexDir.mkdir();
 boolean success = indexFileInTmpDir.renameTo(indexFileInIndex);

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Updated: (SOLR-1822) SEVERE: Unable to move index file from: tempfile to: indexfile

2010-03-14 Thread wyhw whon (JIRA)


 [ 
https://issues.apache.org/jira/browse/SOLR-1822?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

wyhw whon updated SOLR-1822:


Attachment: SnapPuller.patch

Add SnapPuller.patch

 SEVERE: Unable to move index file from: tempfile to: indexfile
 --

 Key: SOLR-1822
 URL: https://issues.apache.org/jira/browse/SOLR-1822
 Project: Solr
  Issue Type: Bug
  Components: replication (java)
Affects Versions: 1.4
 Environment: Linux, JDK6,SOLR 1.4
Reporter: wyhw whon
Priority: Critical
 Fix For: 1.5

 Attachments: SnapPuller.patch


 SOLR index directory remvoed,but do not know what the reasons for this.
 I add some codes on SnapPuller.java 577 line can reslove this bug.
 line 576   
 File indexFileInIndex = new File(indexDir, fname);
 +
 if (!indexDir.exists()) indexDir.mkdir();
 boolean success = indexFileInTmpDir.renameTo(indexFileInIndex);

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Created: (LUCY-66) Lucy::Util::IndexFileNames -- utilities for processing index file paths

2009-11-18 Thread Marvin Humphrey (JIRA)

Lucy::Util::IndexFileNames -- utilities for processing index file paths
---

 Key: LUCY-66
 URL: https://issues.apache.org/jira/browse/LUCY-66
 Project: Lucy
  Issue Type: Improvement
  Components: Core
Reporter: Marvin Humphrey
Assignee: Marvin Humphrey



This implementation of IndexFileNames only supplies a handful of minor utility
functions.  It has has less in it than the corresponding Lucene class, because
Lucy will delegate much more to the individual index components than Lucene.
For instance, the top level library will remain ignorant of what DocWriter
chooses to name its files.  We choose never to rely on filename conventions so
that replacement components can be swapped in without requiring that they
write certain files or adhere to specific naming schemes.

The only important function provided right now is IxFileNames_extract_gen:

{code:none}
/** Skip past the first instance of an underscore in the CharBuf, then
 * attempt to decode a base 36 number.  For example, snapshot_5.json
 * yields 5, and seg_a1 yields 27.
 *
 * @return a generation number, or 0 if no number can be extracted.
 */
inert i32_t
extract_gen(const CharBuf *name);
{code}

Later, after Folder is added, we'll add IxFileNames_latest_snapshot, which
will scan the root level of a Folder for the snapshot file with the highest
generation.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Updated: (LUCY-66) Lucy::Util::IndexFileNames -- utilities for processing index file paths

2009-11-18 Thread Marvin Humphrey (JIRA)


 [ 
https://issues.apache.org/jira/browse/LUCY-66?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Marvin Humphrey updated LUCY-66:


Attachment: IndexFileNames.pm
IndexFileNames.c
IndexFileNames.bp

 Lucy::Util::IndexFileNames -- utilities for processing index file paths
 ---

 Key: LUCY-66
 URL: https://issues.apache.org/jira/browse/LUCY-66
 Project: Lucy
  Issue Type: Improvement
  Components: Core
Reporter: Marvin Humphrey
Assignee: Marvin Humphrey
 Attachments: IndexFileNames.bp, IndexFileNames.c, IndexFileNames.pm


 This implementation of IndexFileNames only supplies a handful of minor utility
 functions.  It has has less in it than the corresponding Lucene class, because
 Lucy will delegate much more to the individual index components than Lucene.
 For instance, the top level library will remain ignorant of what DocWriter
 chooses to name its files.  We choose never to rely on filename conventions so
 that replacement components can be swapped in without requiring that they
 write certain files or adhere to specific naming schemes.
 The only important function provided right now is IxFileNames_extract_gen:
 {code:none}
 /** Skip past the first instance of an underscore in the CharBuf, then
  * attempt to decode a base 36 number.  For example, snapshot_5.json
  * yields 5, and seg_a1 yields 27.
  *
  * @return a generation number, or 0 if no number can be extracted.
  */
 inert i32_t
 extract_gen(const CharBuf *name);
 {code}
 Later, after Folder is added, we'll add IxFileNames_latest_snapshot, which
 will scan the root level of a Folder for the snapshot file with the highest
 generation.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Updated: (LUCY-66) Lucy::Util::IndexFileNames -- utilities for processing index file paths

2009-11-18 Thread Marvin Humphrey (JIRA)


 [ 
https://issues.apache.org/jira/browse/LUCY-66?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Marvin Humphrey updated LUCY-66:


Attachment: TestIndexFileNames.c
TestIndexFileNames.bp
033-index_file_names.t

 Lucy::Util::IndexFileNames -- utilities for processing index file paths
 ---

 Key: LUCY-66
 URL: https://issues.apache.org/jira/browse/LUCY-66
 Project: Lucy
  Issue Type: Improvement
  Components: Core
Reporter: Marvin Humphrey
Assignee: Marvin Humphrey
 Attachments: 033-index_file_names.t, index_file_names.diff, 
 IndexFileNames.bp, IndexFileNames.c, IndexFileNames.pm, 
 TestIndexFileNames.bp, TestIndexFileNames.c


 This implementation of IndexFileNames only supplies a handful of minor utility
 functions.  It has has less in it than the corresponding Lucene class, because
 Lucy will delegate much more to the individual index components than Lucene.
 For instance, the top level library will remain ignorant of what DocWriter
 chooses to name its files.  We choose never to rely on filename conventions so
 that replacement components can be swapped in without requiring that they
 write certain files or adhere to specific naming schemes.
 The only important function provided right now is IxFileNames_extract_gen:
 {code:none}
 /** Skip past the first instance of an underscore in the CharBuf, then
  * attempt to decode a base 36 number.  For example, snapshot_5.json
  * yields 5, and seg_a1 yields 27.
  *
  * @return a generation number, or 0 if no number can be extracted.
  */
 inert i32_t
 extract_gen(const CharBuf *name);
 {code}
 Later, after Folder is added, we'll add IxFileNames_latest_snapshot, which
 will scan the root level of a Folder for the snapshot file with the highest
 generation.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Created: (LUCENE-877) 2.1 Locking documentation in Apache Lucene - Index File Formats section 6.2 Lock File out dated

2007-05-09 Thread Andreas Guther (JIRA)

2.1 Locking documentation in Apache Lucene - Index File Formats section 6.2 
Lock File out dated
---

 Key: LUCENE-877
 URL: https://issues.apache.org/jira/browse/LUCENE-877
 Project: Lucene - Java
  Issue Type: Bug
  Components: Other
Affects Versions: 2.1
 Environment: n/a
Reporter: Andreas Guther
Priority: Minor


I am in the process to migrate from Lucene 2.0 to Lucene 2.1.

From reading the Changes document I understand that the write locks are now 
written into the index folder instead of the java.io.tmpdir. 

In the Apache Lucene - Index File Formats document in section 6.2 Lock File 
I read that there is a write lock used to indicate that another process is 
writing into the index and that this file is stored in the java.io.tempdir.

This is confusing to me.  I had the impression all lock files go into the index 
folder now.  And using the the java.io.tempdir is only local and does not 
support access to shared index folders.

Do I miss something here or is the documentation not updated?


-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

[jira] Assigned: (LUCENE-877) 2.1 Locking documentation in Apache Lucene - Index File Formats section 6.2 Lock File out dated

2007-05-09 Thread Michael McCandless (JIRA)


 [ 
https://issues.apache.org/jira/browse/LUCENE-877?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Michael McCandless reassigned LUCENE-877:
-

Assignee: Michael McCandless

 2.1 Locking documentation in Apache Lucene - Index File Formats section 
 6.2 Lock File out dated
 ---

 Key: LUCENE-877
 URL: https://issues.apache.org/jira/browse/LUCENE-877
 Project: Lucene - Java
  Issue Type: Bug
  Components: Other
Affects Versions: 2.1
 Environment: n/a
Reporter: Andreas Guther
 Assigned To: Michael McCandless
Priority: Minor

 I am in the process to migrate from Lucene 2.0 to Lucene 2.1.
 From reading the Changes document I understand that the write locks are now 
 written into the index folder instead of the java.io.tmpdir. 
 In the Apache Lucene - Index File Formats document in section 6.2 Lock 
 File I read that there is a write lock used to indicate that another process 
 is writing into the index and that this file is stored in the java.io.tempdir.
 This is confusing to me.  I had the impression all lock files go into the 
 index folder now.  And using the the java.io.tempdir is only local and does 
 not support access to shared index folders.
 Do I miss something here or is the documentation not updated?

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

[jira] Resolved: (LUCENE-877) 2.1 Locking documentation in Apache Lucene - Index File Formats section 6.2 Lock File out dated

2007-05-09 Thread Michael McCandless (JIRA)


 [ 
https://issues.apache.org/jira/browse/LUCENE-877?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Michael McCandless resolved LUCENE-877.
---

   Resolution: Fixed
Fix Version/s: 2.2

 2.1 Locking documentation in Apache Lucene - Index File Formats section 
 6.2 Lock File out dated
 ---

 Key: LUCENE-877
 URL: https://issues.apache.org/jira/browse/LUCENE-877
 Project: Lucene - Java
  Issue Type: Bug
  Components: Other
Affects Versions: 2.1
 Environment: n/a
Reporter: Andreas Guther
 Assigned To: Michael McCandless
Priority: Minor
 Fix For: 2.2


 I am in the process to migrate from Lucene 2.0 to Lucene 2.1.
 From reading the Changes document I understand that the write locks are now 
 written into the index folder instead of the java.io.tmpdir. 
 In the Apache Lucene - Index File Formats document in section 6.2 Lock 
 File I read that there is a write lock used to indicate that another process 
 is writing into the index and that this file is stored in the java.io.tempdir.
 This is confusing to me.  I had the impression all lock files go into the 
 index folder now.  And using the the java.io.tempdir is only local and does 
 not support access to shared index folders.
 Do I miss something here or is the documentation not updated?

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

[jira] Updated: (HADOOP-984) Exception while retrieving a map output index file is not reported to JobTracker

2007-02-07 Thread Doug Cutting (JIRA)


 [ 
https://issues.apache.org/jira/browse/HADOOP-984?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Doug Cutting updated HADOOP-984:


Resolution: Fixed
Status: Resolved  (was: Patch Available)

I just committed this.  Thanks, Arun!

 Exception while retrieving a map output index file is not reported to 
 JobTracker
 

 Key: HADOOP-984
 URL: https://issues.apache.org/jira/browse/HADOOP-984
 Project: Hadoop
  Issue Type: Bug
  Components: mapred
Affects Versions: 0.11.0
Reporter: Nigel Daley
 Assigned To: Arun C Murthy
Priority: Critical
 Fix For: 0.11.1

 Attachments: HADOOP-984_20070207_1.patch


 The index file of a map output reported a checksum error and was moved to 
 bad_file dir.  Reduces, however, continued to attempt retrieval of the file 
 forever.  It seem the checksum exception did *NOT* result in the map's 
 TaskTracker reporting to the JobTracker that the map output was lost.  This 
 was likely introduced by HADOOP-331.
 TaskTracker log:
 2007-02-06 09:06:19,200 WARN org.apache.hadoop.dfs.DistributedFileSystem: 
 Moving bad file 
 /foo/testbase/tmp/mapred/local2/task_0002_m_011745_0/file.out.index to 
 /foo/bad_files/file.out.index.-48633003
 2007-02-06 09:06:19,203 INFO org.apache.hadoop.fs.DataInputStream: Found 
 checksum error: org.apache.hadoop.fs.ChecksumException: Checksum error: 
 /foo/testbase/tmp/mapred/local2/task_0002_m_011745_0/file.out.index at 363792
 at 
 org.apache.hadoop.fs.FSDataInputStream$Checker.verifySum(FSDataInputStream.java:167)
 at 
 org.apache.hadoop.fs.FSDataInputStream$Checker.read(FSDataInputStream.java:125)
 at 
 org.apache.hadoop.fs.FSDataInputStream$PositionCache.read(FSDataInputStream.java:218)
 at java.io.BufferedInputStream.fill(BufferedInputStream.java:218)
 at java.io.BufferedInputStream.read(BufferedInputStream.java:235)
 at 
 org.apache.hadoop.fs.FSDataInputStream$Buffer.read(FSDataInputStream.java:281)
 at 
 org.apache.hadoop.fs.FSDataInputStream$Buffer.seek(FSDataInputStream.java:268)
 at 
 org.apache.hadoop.fs.FSDataInputStream.seek(FSDataInputStream.java:331)
 at 
 org.apache.hadoop.mapred.TaskTracker$MapOutputServlet.doGet(TaskTracker.java:1533)
 at javax.servlet.http.HttpServlet.service(HttpServlet.java:689)
 at javax.servlet.http.HttpServlet.service(HttpServlet.java:802)
 at 
 org.mortbay.jetty.servlet.ServletHolder.handle(ServletHolder.java:427)
 at 
 org.mortbay.jetty.servlet.WebApplicationHandler.dispatch(WebApplicationHandler.java:475)
 at 
 org.mortbay.jetty.servlet.ServletHandler.handle(ServletHandler.java:567)
 at org.mortbay.http.HttpContext.handle(HttpContext.java:1565)
 at 
 org.mortbay.jetty.servlet.WebApplicationContext.handle(WebApplicationContext.java:635)
 at org.mortbay.http.HttpContext.handle(HttpContext.java:1517)
 at org.mortbay.http.HttpServer.service(HttpServer.java:954)
 at org.mortbay.http.HttpConnection.service(HttpConnection.java:814)
 at org.mortbay.http.HttpConnection.handleNext(HttpConnection.java:981)
 at org.mortbay.http.HttpConnection.handle(HttpConnection.java:831)
 at 
 org.mortbay.http.SocketListener.handleConnection(SocketListener.java:244)
 at org.mortbay.util.ThreadedServer.handle(ThreadedServer.java:357)
 at org.mortbay.util.ThreadPool$PoolThread.run(ThreadPool.java:534)

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Created: (HADOOP-984) Exception while retrieving a map output index file is not reported to JobTracker

2007-02-06 Thread Nigel Daley (JIRA)

Exception while retrieving a map output index file is not reported to JobTracker


 Key: HADOOP-984
 URL: https://issues.apache.org/jira/browse/HADOOP-984
 Project: Hadoop
  Issue Type: Bug
  Components: mapred
Affects Versions: 0.11.0
Reporter: Nigel Daley
Priority: Critical


The index file of a map output reported a checksum error and was moved to 
bad_file dir.  Reduces, however, continued to attempt retrieval of the file 
forever.  It seem the checksum exception did *NOT* result in the map's 
TaskTracker reporting to the JobTracker that the map output was lost.  This was 
likely introduced by HADOOP-331.

TaskTracker log:

2007-02-06 09:06:19,200 WARN org.apache.hadoop.dfs.DistributedFileSystem: 
Moving bad file 
/foo/testbase/tmp/mapred/local2/task_0002_m_011745_0/file.out.index to 
/foo/bad_files/file.out.index.-48633003
2007-02-06 09:06:19,203 INFO org.apache.hadoop.fs.DataInputStream: Found 
checksum error: org.apache.hadoop.fs.ChecksumException: Checksum error: 
/foo/testbase/tmp/mapred/local2/task_0002_m_011745_0/file.out.index at 363792
at 
org.apache.hadoop.fs.FSDataInputStream$Checker.verifySum(FSDataInputStream.java:167)
at 
org.apache.hadoop.fs.FSDataInputStream$Checker.read(FSDataInputStream.java:125)
at 
org.apache.hadoop.fs.FSDataInputStream$PositionCache.read(FSDataInputStream.java:218)
at java.io.BufferedInputStream.fill(BufferedInputStream.java:218)
at java.io.BufferedInputStream.read(BufferedInputStream.java:235)
at 
org.apache.hadoop.fs.FSDataInputStream$Buffer.read(FSDataInputStream.java:281)
at 
org.apache.hadoop.fs.FSDataInputStream$Buffer.seek(FSDataInputStream.java:268)
at 
org.apache.hadoop.fs.FSDataInputStream.seek(FSDataInputStream.java:331)
at 
org.apache.hadoop.mapred.TaskTracker$MapOutputServlet.doGet(TaskTracker.java:1533)
at javax.servlet.http.HttpServlet.service(HttpServlet.java:689)
at javax.servlet.http.HttpServlet.service(HttpServlet.java:802)
at 
org.mortbay.jetty.servlet.ServletHolder.handle(ServletHolder.java:427)
at 
org.mortbay.jetty.servlet.WebApplicationHandler.dispatch(WebApplicationHandler.java:475)
at 
org.mortbay.jetty.servlet.ServletHandler.handle(ServletHandler.java:567)
at org.mortbay.http.HttpContext.handle(HttpContext.java:1565)
at 
org.mortbay.jetty.servlet.WebApplicationContext.handle(WebApplicationContext.java:635)
at org.mortbay.http.HttpContext.handle(HttpContext.java:1517)
at org.mortbay.http.HttpServer.service(HttpServer.java:954)
at org.mortbay.http.HttpConnection.service(HttpConnection.java:814)
at org.mortbay.http.HttpConnection.handleNext(HttpConnection.java:981)
at org.mortbay.http.HttpConnection.handle(HttpConnection.java:831)
at 
org.mortbay.http.SocketListener.handleConnection(SocketListener.java:244)
at org.mortbay.util.ThreadedServer.handle(ThreadedServer.java:357)
at org.mortbay.util.ThreadPool$PoolThread.run(ThreadPool.java:534)


-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Updated: (HADOOP-984) Exception while retrieving a map output index file is not reported to JobTracker

2007-02-06 Thread Owen O'Malley (JIRA)


 [ 
https://issues.apache.org/jira/browse/HADOOP-984?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Owen O'Malley updated HADOOP-984:
-

Fix Version/s: 0.11.1
 Assignee: Arun C Murthy

 Exception while retrieving a map output index file is not reported to 
 JobTracker
 

 Key: HADOOP-984
 URL: https://issues.apache.org/jira/browse/HADOOP-984
 Project: Hadoop
  Issue Type: Bug
  Components: mapred
Affects Versions: 0.11.0
Reporter: Nigel Daley
 Assigned To: Arun C Murthy
Priority: Critical
 Fix For: 0.11.1


 The index file of a map output reported a checksum error and was moved to 
 bad_file dir.  Reduces, however, continued to attempt retrieval of the file 
 forever.  It seem the checksum exception did *NOT* result in the map's 
 TaskTracker reporting to the JobTracker that the map output was lost.  This 
 was likely introduced by HADOOP-331.
 TaskTracker log:
 2007-02-06 09:06:19,200 WARN org.apache.hadoop.dfs.DistributedFileSystem: 
 Moving bad file 
 /foo/testbase/tmp/mapred/local2/task_0002_m_011745_0/file.out.index to 
 /foo/bad_files/file.out.index.-48633003
 2007-02-06 09:06:19,203 INFO org.apache.hadoop.fs.DataInputStream: Found 
 checksum error: org.apache.hadoop.fs.ChecksumException: Checksum error: 
 /foo/testbase/tmp/mapred/local2/task_0002_m_011745_0/file.out.index at 363792
 at 
 org.apache.hadoop.fs.FSDataInputStream$Checker.verifySum(FSDataInputStream.java:167)
 at 
 org.apache.hadoop.fs.FSDataInputStream$Checker.read(FSDataInputStream.java:125)
 at 
 org.apache.hadoop.fs.FSDataInputStream$PositionCache.read(FSDataInputStream.java:218)
 at java.io.BufferedInputStream.fill(BufferedInputStream.java:218)
 at java.io.BufferedInputStream.read(BufferedInputStream.java:235)
 at 
 org.apache.hadoop.fs.FSDataInputStream$Buffer.read(FSDataInputStream.java:281)
 at 
 org.apache.hadoop.fs.FSDataInputStream$Buffer.seek(FSDataInputStream.java:268)
 at 
 org.apache.hadoop.fs.FSDataInputStream.seek(FSDataInputStream.java:331)
 at 
 org.apache.hadoop.mapred.TaskTracker$MapOutputServlet.doGet(TaskTracker.java:1533)
 at javax.servlet.http.HttpServlet.service(HttpServlet.java:689)
 at javax.servlet.http.HttpServlet.service(HttpServlet.java:802)
 at 
 org.mortbay.jetty.servlet.ServletHolder.handle(ServletHolder.java:427)
 at 
 org.mortbay.jetty.servlet.WebApplicationHandler.dispatch(WebApplicationHandler.java:475)
 at 
 org.mortbay.jetty.servlet.ServletHandler.handle(ServletHandler.java:567)
 at org.mortbay.http.HttpContext.handle(HttpContext.java:1565)
 at 
 org.mortbay.jetty.servlet.WebApplicationContext.handle(WebApplicationContext.java:635)
 at org.mortbay.http.HttpContext.handle(HttpContext.java:1517)
 at org.mortbay.http.HttpServer.service(HttpServer.java:954)
 at org.mortbay.http.HttpConnection.service(HttpConnection.java:814)
 at org.mortbay.http.HttpConnection.handleNext(HttpConnection.java:981)
 at org.mortbay.http.HttpConnection.handle(HttpConnection.java:831)
 at 
 org.mortbay.http.SocketListener.handleConnection(SocketListener.java:244)
 at org.mortbay.util.ThreadedServer.handle(ThreadedServer.java:357)
 at org.mortbay.util.ThreadPool$PoolThread.run(ThreadPool.java:534)

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Updated: (HADOOP-984) Exception while retrieving a map output index file is not reported to JobTracker

2007-02-06 Thread Arun C Murthy (JIRA)


 [ 
https://issues.apache.org/jira/browse/HADOOP-984?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Arun C Murthy updated HADOOP-984:
-

Attachment: HADOOP-984_20070207_1.patch

Simple fix - moved the offending code into the appropriate try-block.

 Exception while retrieving a map output index file is not reported to 
 JobTracker
 

 Key: HADOOP-984
 URL: https://issues.apache.org/jira/browse/HADOOP-984
 Project: Hadoop
  Issue Type: Bug
  Components: mapred
Affects Versions: 0.11.0
Reporter: Nigel Daley
 Assigned To: Arun C Murthy
Priority: Critical
 Fix For: 0.11.1

 Attachments: HADOOP-984_20070207_1.patch


 The index file of a map output reported a checksum error and was moved to 
 bad_file dir.  Reduces, however, continued to attempt retrieval of the file 
 forever.  It seem the checksum exception did *NOT* result in the map's 
 TaskTracker reporting to the JobTracker that the map output was lost.  This 
 was likely introduced by HADOOP-331.
 TaskTracker log:
 2007-02-06 09:06:19,200 WARN org.apache.hadoop.dfs.DistributedFileSystem: 
 Moving bad file 
 /foo/testbase/tmp/mapred/local2/task_0002_m_011745_0/file.out.index to 
 /foo/bad_files/file.out.index.-48633003
 2007-02-06 09:06:19,203 INFO org.apache.hadoop.fs.DataInputStream: Found 
 checksum error: org.apache.hadoop.fs.ChecksumException: Checksum error: 
 /foo/testbase/tmp/mapred/local2/task_0002_m_011745_0/file.out.index at 363792
 at 
 org.apache.hadoop.fs.FSDataInputStream$Checker.verifySum(FSDataInputStream.java:167)
 at 
 org.apache.hadoop.fs.FSDataInputStream$Checker.read(FSDataInputStream.java:125)
 at 
 org.apache.hadoop.fs.FSDataInputStream$PositionCache.read(FSDataInputStream.java:218)
 at java.io.BufferedInputStream.fill(BufferedInputStream.java:218)
 at java.io.BufferedInputStream.read(BufferedInputStream.java:235)
 at 
 org.apache.hadoop.fs.FSDataInputStream$Buffer.read(FSDataInputStream.java:281)
 at 
 org.apache.hadoop.fs.FSDataInputStream$Buffer.seek(FSDataInputStream.java:268)
 at 
 org.apache.hadoop.fs.FSDataInputStream.seek(FSDataInputStream.java:331)
 at 
 org.apache.hadoop.mapred.TaskTracker$MapOutputServlet.doGet(TaskTracker.java:1533)
 at javax.servlet.http.HttpServlet.service(HttpServlet.java:689)
 at javax.servlet.http.HttpServlet.service(HttpServlet.java:802)
 at 
 org.mortbay.jetty.servlet.ServletHolder.handle(ServletHolder.java:427)
 at 
 org.mortbay.jetty.servlet.WebApplicationHandler.dispatch(WebApplicationHandler.java:475)
 at 
 org.mortbay.jetty.servlet.ServletHandler.handle(ServletHandler.java:567)
 at org.mortbay.http.HttpContext.handle(HttpContext.java:1565)
 at 
 org.mortbay.jetty.servlet.WebApplicationContext.handle(WebApplicationContext.java:635)
 at org.mortbay.http.HttpContext.handle(HttpContext.java:1517)
 at org.mortbay.http.HttpServer.service(HttpServer.java:954)
 at org.mortbay.http.HttpConnection.service(HttpConnection.java:814)
 at org.mortbay.http.HttpConnection.handleNext(HttpConnection.java:981)
 at org.mortbay.http.HttpConnection.handle(HttpConnection.java:831)
 at 
 org.mortbay.http.SocketListener.handleConnection(SocketListener.java:244)
 at org.mortbay.util.ThreadedServer.handle(ThreadedServer.java:357)
 at org.mortbay.util.ThreadPool$PoolThread.run(ThreadPool.java:534)

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Updated: (HADOOP-984) Exception while retrieving a map output index file is not reported to JobTracker

2007-02-06 Thread Arun C Murthy (JIRA)


 [ 
https://issues.apache.org/jira/browse/HADOOP-984?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Arun C Murthy updated HADOOP-984:
-

Status: Patch Available  (was: Open)

 Exception while retrieving a map output index file is not reported to 
 JobTracker
 

 Key: HADOOP-984
 URL: https://issues.apache.org/jira/browse/HADOOP-984
 Project: Hadoop
  Issue Type: Bug
  Components: mapred
Affects Versions: 0.11.0
Reporter: Nigel Daley
 Assigned To: Arun C Murthy
Priority: Critical
 Fix For: 0.11.1

 Attachments: HADOOP-984_20070207_1.patch


 The index file of a map output reported a checksum error and was moved to 
 bad_file dir.  Reduces, however, continued to attempt retrieval of the file 
 forever.  It seem the checksum exception did *NOT* result in the map's 
 TaskTracker reporting to the JobTracker that the map output was lost.  This 
 was likely introduced by HADOOP-331.
 TaskTracker log:
 2007-02-06 09:06:19,200 WARN org.apache.hadoop.dfs.DistributedFileSystem: 
 Moving bad file 
 /foo/testbase/tmp/mapred/local2/task_0002_m_011745_0/file.out.index to 
 /foo/bad_files/file.out.index.-48633003
 2007-02-06 09:06:19,203 INFO org.apache.hadoop.fs.DataInputStream: Found 
 checksum error: org.apache.hadoop.fs.ChecksumException: Checksum error: 
 /foo/testbase/tmp/mapred/local2/task_0002_m_011745_0/file.out.index at 363792
 at 
 org.apache.hadoop.fs.FSDataInputStream$Checker.verifySum(FSDataInputStream.java:167)
 at 
 org.apache.hadoop.fs.FSDataInputStream$Checker.read(FSDataInputStream.java:125)
 at 
 org.apache.hadoop.fs.FSDataInputStream$PositionCache.read(FSDataInputStream.java:218)
 at java.io.BufferedInputStream.fill(BufferedInputStream.java:218)
 at java.io.BufferedInputStream.read(BufferedInputStream.java:235)
 at 
 org.apache.hadoop.fs.FSDataInputStream$Buffer.read(FSDataInputStream.java:281)
 at 
 org.apache.hadoop.fs.FSDataInputStream$Buffer.seek(FSDataInputStream.java:268)
 at 
 org.apache.hadoop.fs.FSDataInputStream.seek(FSDataInputStream.java:331)
 at 
 org.apache.hadoop.mapred.TaskTracker$MapOutputServlet.doGet(TaskTracker.java:1533)
 at javax.servlet.http.HttpServlet.service(HttpServlet.java:689)
 at javax.servlet.http.HttpServlet.service(HttpServlet.java:802)
 at 
 org.mortbay.jetty.servlet.ServletHolder.handle(ServletHolder.java:427)
 at 
 org.mortbay.jetty.servlet.WebApplicationHandler.dispatch(WebApplicationHandler.java:475)
 at 
 org.mortbay.jetty.servlet.ServletHandler.handle(ServletHandler.java:567)
 at org.mortbay.http.HttpContext.handle(HttpContext.java:1565)
 at 
 org.mortbay.jetty.servlet.WebApplicationContext.handle(WebApplicationContext.java:635)
 at org.mortbay.http.HttpContext.handle(HttpContext.java:1517)
 at org.mortbay.http.HttpServer.service(HttpServer.java:954)
 at org.mortbay.http.HttpConnection.service(HttpConnection.java:814)
 at org.mortbay.http.HttpConnection.handleNext(HttpConnection.java:981)
 at org.mortbay.http.HttpConnection.handle(HttpConnection.java:831)
 at 
 org.mortbay.http.SocketListener.handleConnection(SocketListener.java:244)
 at org.mortbay.util.ThreadedServer.handle(ThreadedServer.java:357)
 at org.mortbay.util.ThreadPool$PoolThread.run(ThreadPool.java:534)

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Commented: (HADOOP-984) Exception while retrieving a map output index file is not reported to JobTracker

2007-02-06 Thread Hadoop QA (JIRA)


[ 
https://issues.apache.org/jira/browse/HADOOP-984?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12470816
 ] 

Hadoop QA commented on HADOOP-984:
--

+1, because 
http://issues.apache.org/jira/secure/attachment/12350518/HADOOP-984_20070207_1.patch
 applied and successfully tested against trunk revision r504381.

 Exception while retrieving a map output index file is not reported to 
 JobTracker
 

 Key: HADOOP-984
 URL: https://issues.apache.org/jira/browse/HADOOP-984
 Project: Hadoop
  Issue Type: Bug
  Components: mapred
Affects Versions: 0.11.0
Reporter: Nigel Daley
 Assigned To: Arun C Murthy
Priority: Critical
 Fix For: 0.11.1

 Attachments: HADOOP-984_20070207_1.patch


 The index file of a map output reported a checksum error and was moved to 
 bad_file dir.  Reduces, however, continued to attempt retrieval of the file 
 forever.  It seem the checksum exception did *NOT* result in the map's 
 TaskTracker reporting to the JobTracker that the map output was lost.  This 
 was likely introduced by HADOOP-331.
 TaskTracker log:
 2007-02-06 09:06:19,200 WARN org.apache.hadoop.dfs.DistributedFileSystem: 
 Moving bad file 
 /foo/testbase/tmp/mapred/local2/task_0002_m_011745_0/file.out.index to 
 /foo/bad_files/file.out.index.-48633003
 2007-02-06 09:06:19,203 INFO org.apache.hadoop.fs.DataInputStream: Found 
 checksum error: org.apache.hadoop.fs.ChecksumException: Checksum error: 
 /foo/testbase/tmp/mapred/local2/task_0002_m_011745_0/file.out.index at 363792
 at 
 org.apache.hadoop.fs.FSDataInputStream$Checker.verifySum(FSDataInputStream.java:167)
 at 
 org.apache.hadoop.fs.FSDataInputStream$Checker.read(FSDataInputStream.java:125)
 at 
 org.apache.hadoop.fs.FSDataInputStream$PositionCache.read(FSDataInputStream.java:218)
 at java.io.BufferedInputStream.fill(BufferedInputStream.java:218)
 at java.io.BufferedInputStream.read(BufferedInputStream.java:235)
 at 
 org.apache.hadoop.fs.FSDataInputStream$Buffer.read(FSDataInputStream.java:281)
 at 
 org.apache.hadoop.fs.FSDataInputStream$Buffer.seek(FSDataInputStream.java:268)
 at 
 org.apache.hadoop.fs.FSDataInputStream.seek(FSDataInputStream.java:331)
 at 
 org.apache.hadoop.mapred.TaskTracker$MapOutputServlet.doGet(TaskTracker.java:1533)
 at javax.servlet.http.HttpServlet.service(HttpServlet.java:689)
 at javax.servlet.http.HttpServlet.service(HttpServlet.java:802)
 at 
 org.mortbay.jetty.servlet.ServletHolder.handle(ServletHolder.java:427)
 at 
 org.mortbay.jetty.servlet.WebApplicationHandler.dispatch(WebApplicationHandler.java:475)
 at 
 org.mortbay.jetty.servlet.ServletHandler.handle(ServletHandler.java:567)
 at org.mortbay.http.HttpContext.handle(HttpContext.java:1565)
 at 
 org.mortbay.jetty.servlet.WebApplicationContext.handle(WebApplicationContext.java:635)
 at org.mortbay.http.HttpContext.handle(HttpContext.java:1517)
 at org.mortbay.http.HttpServer.service(HttpServer.java:954)
 at org.mortbay.http.HttpConnection.service(HttpConnection.java:814)
 at org.mortbay.http.HttpConnection.handleNext(HttpConnection.java:981)
 at org.mortbay.http.HttpConnection.handle(HttpConnection.java:831)
 at 
 org.mortbay.http.SocketListener.handleConnection(SocketListener.java:244)
 at org.mortbay.util.ThreadedServer.handle(ThreadedServer.java:357)
 at org.mortbay.util.ThreadPool$PoolThread.run(ThreadPool.java:534)

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Commented: (HADOOP-984) Exception while retrieving a map output index file is not reported to JobTracker

2007-02-06 Thread Owen O'Malley (JIRA)


[ 
https://issues.apache.org/jira/browse/HADOOP-984?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12470825
 ] 

Owen O'Malley commented on HADOOP-984:
--

+1

 Exception while retrieving a map output index file is not reported to 
 JobTracker
 

 Key: HADOOP-984
 URL: https://issues.apache.org/jira/browse/HADOOP-984
 Project: Hadoop
  Issue Type: Bug
  Components: mapred
Affects Versions: 0.11.0
Reporter: Nigel Daley
 Assigned To: Arun C Murthy
Priority: Critical
 Fix For: 0.11.1

 Attachments: HADOOP-984_20070207_1.patch


 The index file of a map output reported a checksum error and was moved to 
 bad_file dir.  Reduces, however, continued to attempt retrieval of the file 
 forever.  It seem the checksum exception did *NOT* result in the map's 
 TaskTracker reporting to the JobTracker that the map output was lost.  This 
 was likely introduced by HADOOP-331.
 TaskTracker log:
 2007-02-06 09:06:19,200 WARN org.apache.hadoop.dfs.DistributedFileSystem: 
 Moving bad file 
 /foo/testbase/tmp/mapred/local2/task_0002_m_011745_0/file.out.index to 
 /foo/bad_files/file.out.index.-48633003
 2007-02-06 09:06:19,203 INFO org.apache.hadoop.fs.DataInputStream: Found 
 checksum error: org.apache.hadoop.fs.ChecksumException: Checksum error: 
 /foo/testbase/tmp/mapred/local2/task_0002_m_011745_0/file.out.index at 363792
 at 
 org.apache.hadoop.fs.FSDataInputStream$Checker.verifySum(FSDataInputStream.java:167)
 at 
 org.apache.hadoop.fs.FSDataInputStream$Checker.read(FSDataInputStream.java:125)
 at 
 org.apache.hadoop.fs.FSDataInputStream$PositionCache.read(FSDataInputStream.java:218)
 at java.io.BufferedInputStream.fill(BufferedInputStream.java:218)
 at java.io.BufferedInputStream.read(BufferedInputStream.java:235)
 at 
 org.apache.hadoop.fs.FSDataInputStream$Buffer.read(FSDataInputStream.java:281)
 at 
 org.apache.hadoop.fs.FSDataInputStream$Buffer.seek(FSDataInputStream.java:268)
 at 
 org.apache.hadoop.fs.FSDataInputStream.seek(FSDataInputStream.java:331)
 at 
 org.apache.hadoop.mapred.TaskTracker$MapOutputServlet.doGet(TaskTracker.java:1533)
 at javax.servlet.http.HttpServlet.service(HttpServlet.java:689)
 at javax.servlet.http.HttpServlet.service(HttpServlet.java:802)
 at 
 org.mortbay.jetty.servlet.ServletHolder.handle(ServletHolder.java:427)
 at 
 org.mortbay.jetty.servlet.WebApplicationHandler.dispatch(WebApplicationHandler.java:475)
 at 
 org.mortbay.jetty.servlet.ServletHandler.handle(ServletHandler.java:567)
 at org.mortbay.http.HttpContext.handle(HttpContext.java:1565)
 at 
 org.mortbay.jetty.servlet.WebApplicationContext.handle(WebApplicationContext.java:635)
 at org.mortbay.http.HttpContext.handle(HttpContext.java:1517)
 at org.mortbay.http.HttpServer.service(HttpServer.java:954)
 at org.mortbay.http.HttpConnection.service(HttpConnection.java:814)
 at org.mortbay.http.HttpConnection.handleNext(HttpConnection.java:981)
 at org.mortbay.http.HttpConnection.handle(HttpConnection.java:831)
 at 
 org.mortbay.http.SocketListener.handleConnection(SocketListener.java:244)
 at org.mortbay.util.ThreadedServer.handle(ThreadedServer.java:357)
 at org.mortbay.util.ThreadPool$PoolThread.run(ThreadPool.java:534)

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Resolved: (LUCENE-706) Index File Format - Example for frequency file .frq is wrong

2006-11-05 Thread Grant Ingersoll (JIRA)

 [ http://issues.apache.org/jira/browse/LUCENE-706?page=all ]

Grant Ingersoll resolved LUCENE-706.


Resolution: Fixed

Applied.  Thanks Johan and Doron.

 Index File Format - Example for frequency file .frq is wrong
 

 Key: LUCENE-706
 URL: http://issues.apache.org/jira/browse/LUCENE-706
 Project: Lucene - Java
  Issue Type: Improvement
  Components: Website
 Environment: not applicable
Reporter: Doron Cohen
 Assigned To: Grant Ingersoll
Priority: Trivial
 Attachments: file-format-frq-example.patch


 Reported by Johan Stuyts - 
 http://www.nabble.com/Possible-documentation-error--p7012445.html - 
 Frequency file example says: 
  For example, the TermFreqs for a term which occurs once in document 
 seven and three times in document eleven would be the following sequence of 
 VInts: 
  15, 22, 3 
 It should be: 
  For example, the TermFreqs for a term which occurs once in document 
 seven and three times in document eleven would be the following sequence of 
 VInts: 
  15, 8, 3 

-- 
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators: 
http://issues.apache.org/jira/secure/Administrators.jspa
-
For more information on JIRA, see: http://www.atlassian.com/software/jira



-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

[jira] Assigned: (LUCENE-706) Index File Format - Example for frequency file .frq is wrong

2006-11-03 Thread Grant Ingersoll (JIRA)

 [ http://issues.apache.org/jira/browse/LUCENE-706?page=all ]

Grant Ingersoll reassigned LUCENE-706:
--

Assignee: Grant Ingersoll  (was: Doron Cohen)

 Index File Format - Example for frequency file .frq is wrong
 

 Key: LUCENE-706
 URL: http://issues.apache.org/jira/browse/LUCENE-706
 Project: Lucene - Java
  Issue Type: Improvement
  Components: Website
 Environment: not applicable
Reporter: Doron Cohen
 Assigned To: Grant Ingersoll
Priority: Trivial
 Attachments: file-format-frq-example.patch


 Reported by Johan Stuyts - 
 http://www.nabble.com/Possible-documentation-error--p7012445.html - 
 Frequency file example says: 
  For example, the TermFreqs for a term which occurs once in document 
 seven and three times in document eleven would be the following sequence of 
 VInts: 
  15, 22, 3 
 It should be: 
  For example, the TermFreqs for a term which occurs once in document 
 seven and three times in document eleven would be the following sequence of 
 VInts: 
  15, 8, 3 

-- 
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators: 
http://issues.apache.org/jira/secure/Administrators.jspa
-
For more information on JIRA, see: http://www.atlassian.com/software/jira



-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

[jira] Commented: (LUCENE-706) Index File Format - Example for frequency file .frq is wrong

2006-11-03 Thread Steven Rowe (JIRA)

[ 
http://issues.apache.org/jira/browse/LUCENE-706?page=comments#action_12447044 ] 

Steven Rowe commented on LUCENE-706:


Hex: 0D is NOT the same as decimal 15.  0Dh = 13d.  15d = 0Fh.

 Index File Format - Example for frequency file .frq is wrong
 

 Key: LUCENE-706
 URL: http://issues.apache.org/jira/browse/LUCENE-706
 Project: Lucene - Java
  Issue Type: Improvement
  Components: Website
 Environment: not applicable
Reporter: Doron Cohen
 Assigned To: Grant Ingersoll
Priority: Trivial
 Attachments: file-format-frq-example.patch


 Reported by Johan Stuyts - 
 http://www.nabble.com/Possible-documentation-error--p7012445.html - 
 Frequency file example says: 
  For example, the TermFreqs for a term which occurs once in document 
 seven and three times in document eleven would be the following sequence of 
 VInts: 
  15, 22, 3 
 It should be: 
  For example, the TermFreqs for a term which occurs once in document 
 seven and three times in document eleven would be the following sequence of 
 VInts: 
  15, 8, 3 

-- 
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators: 
http://issues.apache.org/jira/secure/Administrators.jspa
-
For more information on JIRA, see: http://www.atlassian.com/software/jira



-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

[jira] Commented: (LUCENE-706) Index File Format - Example for frequency file .frq is wrong

2006-11-03 Thread Doron Cohen (JIRA)

[ 
http://issues.apache.org/jira/browse/LUCENE-706?page=comments#action_12447049 ] 

Doron Cohen commented on LUCENE-706:


Right, sorry, copied that hex data from an .frq of an index with a different 
example, where the frequencies were 1 in doc 6 and 3 in doc 10, so there you 
would get 2 * 6 + 1 = 13.

For the correct example of freq 1 in doc 7 and 3 in doc 11 the .frq content is  
0F 08 03  as it should be. 

(Meaning that the documentatin should still be fixed...;-)


 Index File Format - Example for frequency file .frq is wrong
 

 Key: LUCENE-706
 URL: http://issues.apache.org/jira/browse/LUCENE-706
 Project: Lucene - Java
  Issue Type: Improvement
  Components: Website
 Environment: not applicable
Reporter: Doron Cohen
 Assigned To: Grant Ingersoll
Priority: Trivial
 Attachments: file-format-frq-example.patch


 Reported by Johan Stuyts - 
 http://www.nabble.com/Possible-documentation-error--p7012445.html - 
 Frequency file example says: 
  For example, the TermFreqs for a term which occurs once in document 
 seven and three times in document eleven would be the following sequence of 
 VInts: 
  15, 22, 3 
 It should be: 
  For example, the TermFreqs for a term which occurs once in document 
 seven and three times in document eleven would be the following sequence of 
 VInts: 
  15, 8, 3 

-- 
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators: 
http://issues.apache.org/jira/secure/Administrators.jspa
-
For more information on JIRA, see: http://www.atlassian.com/software/jira



-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

[jira] Created: (LUCENE-706) Index File Format - Example for frequency file .frq is wrong

2006-11-02 Thread Doron Cohen (JIRA)

Index File Format - Example for frequency file .frq is wrong


 Key: LUCENE-706
 URL: http://issues.apache.org/jira/browse/LUCENE-706
 Project: Lucene - Java
  Issue Type: Improvement
  Components: Website
 Environment: not applicable
Reporter: Doron Cohen
 Assigned To: Doron Cohen
Priority: Trivial


Reported by Johan Stuyts - 
http://www.nabble.com/Possible-documentation-error--p7012445.html - 

Frequency file example says: 

 For example, the TermFreqs for a term which occurs once in document seven 
and three times in document eleven would be the following sequence of VInts: 
 15, 22, 3 

It should be: 

 For example, the TermFreqs for a term which occurs once in document seven 
and three times in document eleven would be the following sequence of VInts: 
 15, 8, 3 




-- 
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators: 
http://issues.apache.org/jira/secure/Administrators.jspa
-
For more information on JIRA, see: http://www.atlassian.com/software/jira



-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

[jira] Updated: (LUCENE-706) Index File Format - Example for frequency file .frq is wrong

2006-11-02 Thread Doron Cohen (JIRA)

 [ http://issues.apache.org/jira/browse/LUCENE-706?page=all ]

Doron Cohen updated LUCENE-706:
---

Lucene Fields: [New, Patch Available]  (was: [New])

 Index File Format - Example for frequency file .frq is wrong
 

 Key: LUCENE-706
 URL: http://issues.apache.org/jira/browse/LUCENE-706
 Project: Lucene - Java
  Issue Type: Improvement
  Components: Website
 Environment: not applicable
Reporter: Doron Cohen
 Assigned To: Doron Cohen
Priority: Trivial
 Attachments: file-format-frq-example.patch


 Reported by Johan Stuyts - 
 http://www.nabble.com/Possible-documentation-error--p7012445.html - 
 Frequency file example says: 
  For example, the TermFreqs for a term which occurs once in document 
 seven and three times in document eleven would be the following sequence of 
 VInts: 
  15, 22, 3 
 It should be: 
  For example, the TermFreqs for a term which occurs once in document 
 seven and three times in document eleven would be the following sequence of 
 VInts: 
  15, 8, 3 

-- 
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators: 
http://issues.apache.org/jira/secure/Administrators.jspa
-
For more information on JIRA, see: http://www.atlassian.com/software/jira



-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

60 matches

Mail list logo