[jira] Updated: (LUCENE-1016) TermVectorAccessor, transparent vector space access

2008-08-24 Thread Karl Wettin (JIRA)

 [ 
https://issues.apache.org/jira/browse/LUCENE-1016?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Karl Wettin updated LUCENE-1016:


Attachment: LUCENE-1016.txt

Documentation

> TermVectorAccessor, transparent vector space access 
> 
>
> Key: LUCENE-1016
> URL: https://issues.apache.org/jira/browse/LUCENE-1016
> Project: Lucene - Java
>  Issue Type: New Feature
>  Components: Term Vectors
>Affects Versions: 2.2
>Reporter: Karl Wettin
>Priority: Minor
> Fix For: 2.4
>
> Attachments: LUCENE-1016.txt, LUCENE-1016.txt
>
>
> This class visits a TermVectorMapper and populates it with information 
> transparently, either by passing it down to the default terms cache (documents 
> indexed with Field.TermVector) or by resolving the inverted index.
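
The fallback described above can be illustrated with a small, self-contained sketch. It is not the code from the attached patch and does not use the Lucene API: TermSink, storedVectors and postings are hypothetical stand-ins for a mapper callback, the stored term vectors and the inverted index.

```java
import java.util.Map;

// Illustrative sketch only; all names here are hypothetical, not Lucene classes.
class AccessorSketch {

    // Stand-in for a mapper that receives one (term, frequency) pair at a time.
    interface TermSink {
        void map(String term, int frequency);
    }

    // storedVectors: docId -> (term -> frequency), present only for documents
    //                that were indexed with term vectors.
    // postings:      term -> (docId -> frequency), i.e. the inverted index.
    static void accept(int docId,
                       Map<Integer, Map<String, Integer>> storedVectors,
                       Map<String, Map<Integer, Integer>> postings,
                       TermSink sink) {
        Map<String, Integer> stored = storedVectors.get(docId);
        if (stored != null) {
            // Cheap path: the vector was stored at index time, pass it through.
            for (Map.Entry<String, Integer> e : stored.entrySet()) {
                sink.map(e.getKey(), e.getValue());
            }
            return;
        }
        // Fallback: walk every term's postings and keep those containing docId.
        for (Map.Entry<String, Map<Integer, Integer>> e : postings.entrySet()) {
            Integer frequency = e.getValue().get(docId);
            if (frequency != null) {
                sink.map(e.getKey(), frequency);
            }
        }
    }
}
```

The caller sees the same callbacks on either path; only the cost differs, since the fallback has to scan all terms.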

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]



[jira] Updated: (LUCENE-1016) TermVectorAccessor, transparent vector space access

2008-08-24 Thread Karl Wettin (JIRA)

 [ 
https://issues.apache.org/jira/browse/LUCENE-1016?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Karl Wettin updated LUCENE-1016:


Fix Version/s: 2.4

I'll commit this soon.

> TermVectorAccessor, transparent vector space access 
> 
>
> Key: LUCENE-1016
> URL: https://issues.apache.org/jira/browse/LUCENE-1016
> Project: Lucene - Java
>  Issue Type: New Feature
>  Components: Term Vectors
>Affects Versions: 2.2
>Reporter: Karl Wettin
>Priority: Minor
> Fix For: 2.4
>
> Attachments: LUCENE-1016.txt, LUCENE-1016.txt
>
>
> This class visits a TermVectorMapper and populates it with information 
> transparently, either by passing it down to the default terms cache (documents 
> indexed with Field.TermVector) or by resolving the inverted index.




[jira] Updated: (LUCENE-1016) TermVectorAccessor, transparent vector space access

2007-10-29 Thread Karl Wettin (JIRA)

 [ 
https://issues.apache.org/jira/browse/LUCENE-1016?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Karl Wettin updated LUCENE-1016:


Attachment: (was: LUCENE-1016-Tanimoto.txt)

> TermVectorAccessor, transparent vector space access 
> 
>
> Key: LUCENE-1016
> URL: https://issues.apache.org/jira/browse/LUCENE-1016
> Project: Lucene - Java
>  Issue Type: New Feature
>  Components: Term Vectors
>Affects Versions: 2.2
>Reporter: Karl Wettin
>Priority: Minor
> Attachments: LUCENE-1016.txt, LUCENE-1016.txt
>
>
> This class visits a TermVectorMapper and populates it with information 
> transparently, either by passing it down to the default terms cache (documents 
> indexed with Field.TermVector) or by resolving the inverted index.




[jira] Updated: (LUCENE-1016) TermVectorAccessor, transparent vector space access

2007-10-29 Thread Karl Wettin (JIRA)

 [ 
https://issues.apache.org/jira/browse/LUCENE-1016?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Karl Wettin updated LUCENE-1016:


Attachment: LUCENE-1016.txt

In this patch:

* Java 1.4 for real

And then I removed everything that had nothing to do with this patch.

> TermVectorAccessor, transparent vector space access 
> 
>
> Key: LUCENE-1016
> URL: https://issues.apache.org/jira/browse/LUCENE-1016
> Project: Lucene - Java
>  Issue Type: New Feature
>  Components: Term Vectors
>Affects Versions: 2.2
>Reporter: Karl Wettin
>Priority: Minor
> Attachments: LUCENE-1016.txt
>
>
> This class visits a TermVectorMapper and populates it with information 
> transparently, either by passing it down to the default terms cache (documents 
> indexed with Field.TermVector) or by resolving the inverted index.




[jira] Updated: (LUCENE-1016) TermVectorAccessor, transparent vector space access

2007-10-29 Thread Karl Wettin (JIRA)

 [ 
https://issues.apache.org/jira/browse/LUCENE-1016?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Karl Wettin updated LUCENE-1016:


Attachment: (was: LUCENE-1016.txt)

> TermVectorAccessor, transparent vector space access 
> 
>
> Key: LUCENE-1016
> URL: https://issues.apache.org/jira/browse/LUCENE-1016
> Project: Lucene - Java
>  Issue Type: New Feature
>  Components: Term Vectors
>Affects Versions: 2.2
>Reporter: Karl Wettin
>Priority: Minor
>
> This class visits a TermVectorMapper and populates it with information 
> transparently, either by passing it down to the default terms cache (documents 
> indexed with Field.TermVector) or by resolving the inverted index.




[jira] Updated: (LUCENE-1016) TermVectorAccessor, transparent vector space access

2007-10-29 Thread Karl Wettin (JIRA)

 [ 
https://issues.apache.org/jira/browse/LUCENE-1016?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Karl Wettin updated LUCENE-1016:


Attachment: (was: LUCENE-1016.txt)

> TermVectorAccessor, transparent vector space access 
> 
>
> Key: LUCENE-1016
> URL: https://issues.apache.org/jira/browse/LUCENE-1016
> Project: Lucene - Java
>  Issue Type: New Feature
>  Components: Term Vectors
>Affects Versions: 2.2
>Reporter: Karl Wettin
>Priority: Minor
>
> This class visits a TermVectorMapper and populates it with information 
> transparently, either by passing it down to the default terms cache (documents 
> indexed with Field.TermVector) or by resolving the inverted index.




[jira] Updated: (LUCENE-1016) TermVectorAccessor, transparent vector space access

2007-10-29 Thread Karl Wettin (JIRA)

 [ 
https://issues.apache.org/jira/browse/LUCENE-1016?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Karl Wettin updated LUCENE-1016:


Attachment: (was: out.png)

> TermVectorAccessor, transparent vector space access 
> 
>
> Key: LUCENE-1016
> URL: https://issues.apache.org/jira/browse/LUCENE-1016
> Project: Lucene - Java
>  Issue Type: New Feature
>  Components: Term Vectors
>Affects Versions: 2.2
>Reporter: Karl Wettin
>Priority: Minor
> Attachments: LUCENE-1016.txt, LUCENE-1016.txt
>
>
> This class visits a TermVectorMapper and populates it with information 
> transparently, either by passing it down to the default terms cache (documents 
> indexed with Field.TermVector) or by resolving the inverted index.




[jira] Updated: (LUCENE-1016) TermVectorAccessor, transparent vector space access

2007-10-29 Thread Karl Wettin (JIRA)

 [ 
https://issues.apache.org/jira/browse/LUCENE-1016?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Karl Wettin updated LUCENE-1016:


Attachment: (was: LUCENE-1016-clusterer.txt)

> TermVectorAccessor, transparent vector space access 
> 
>
> Key: LUCENE-1016
> URL: https://issues.apache.org/jira/browse/LUCENE-1016
> Project: Lucene - Java
>  Issue Type: New Feature
>  Components: Term Vectors
>Affects Versions: 2.2
>Reporter: Karl Wettin
>Priority: Minor
> Attachments: LUCENE-1016.txt, LUCENE-1016.txt
>
>
> This class visits a TermVectorMapper and populates it with information 
> transparently, either by passing it down to the default terms cache (documents 
> indexed with Field.TermVector) or by resolving the inverted index.




[jira] Updated: (LUCENE-1016) TermVectorAccessor, transparent vector space access

2007-10-08 Thread Karl Wettin (JIRA)

 [ 
https://issues.apache.org/jira/browse/LUCENE-1016?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Karl Wettin updated LUCENE-1016:


Attachment: LUCENE-1016.txt

This patch:

 * All Java 1.4
 * Bug fix: earlier versions could throw a NullPointerException in some cases

This patch is TermVectorAccessor code only, nothing else.

> TermVectorAccessor, transparent vector space access 
> 
>
> Key: LUCENE-1016
> URL: https://issues.apache.org/jira/browse/LUCENE-1016
> Project: Lucene - Java
>  Issue Type: New Feature
>  Components: Term Vectors
>Affects Versions: 2.2
>Reporter: Karl Wettin
>Priority: Minor
> Attachments: LUCENE-1016-clusterer.txt, LUCENE-1016-Tanimoto.txt, 
> LUCENE-1016.txt, LUCENE-1016.txt, out.png
>
>
> This class visits a TermVectorMapper and populates it with information 
> transparently, either by passing it down to the default terms cache (documents 
> indexed with Field.TermVector) or by resolving the inverted index.




[jira] Updated: (LUCENE-1016) TermVectorAccessor, transparent vector space access

2007-10-03 Thread Karl Wettin (JIRA)

 [ 
https://issues.apache.org/jira/browse/LUCENE-1016?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Karl Wettin updated LUCENE-1016:


Attachment: LUCENE-1016-clusterer.txt
out.png

Sorry for flooding. This JIRA issue is drifting further off topic with each 
post. I hope you don't mind.

LUCENE-1016-clusterer.txt now contains a refactored Tanimoto similarity. It 
does the same thing, but with cleaner code.

And as the filename hints, I thought it would be fun to demonstrate the 
similarity by adding a very simple two-dimensional decision tree clusterer.

For the test I fed it 17 news articles representing 3 news stories that I got 
from Google News. Attached is also a Graphviz diagram that shows the tree with 
the news stories clustered together. I have not yet looked at how to draw the 
line between the clusters, but I could probably come up with something simple 
enough. Legend: the floating-point numbers represent the distance between two 
children. The leaves are the actual articles, prefixed with the news story 
identity and suffixed with the news article identity.

(The clusterer sure needs optimization; use carrot instead. This is just me 
fooling around.)

Have fun!

> TermVectorAccessor, transparent vector space access 
> 
>
> Key: LUCENE-1016
> URL: https://issues.apache.org/jira/browse/LUCENE-1016
> Project: Lucene - Java
>  Issue Type: New Feature
>  Components: Term Vectors
>Affects Versions: 2.2
>Reporter: Karl Wettin
>Priority: Minor
> Attachments: LUCENE-1016-clusterer.txt, LUCENE-1016-Tanimoto.txt, 
> LUCENE-1016.txt, out.png
>
>
> This class visits a TermVectorMapper and populates it with information 
> transparently, either by passing it down to the default terms cache (documents 
> indexed with Field.TermVector) or by resolving the inverted index.




[jira] Updated: (LUCENE-1016) TermVectorAccessor, transparent vector space access

2007-10-02 Thread Karl Wettin (JIRA)

 [ 
https://issues.apache.org/jira/browse/LUCENE-1016?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Karl Wettin updated LUCENE-1016:


Attachment: LUCENE-1016-Tanimoto.txt

TanimotoDocumentSimilarity depends on TermVectorAccessor and is used to 
calculate the distance between the vector spaces of two documents.

My math skills are pretty lame, but I think I got it right.
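
For readers without the attachment, here is a minimal sketch of a Tanimoto distance over term-frequency vectors. It is not the code from LUCENE-1016-Tanimoto.txt: the class name TanimotoSketch, the map-based vectors and the handling of empty vectors are assumptions made for illustration. It uses the common definition T(a, b) = a.b / (|a|^2 + |b|^2 - a.b), with distance 1 - T.

```java
import java.util.Map;

// Illustrative sketch only; not the TanimotoDocumentSimilarity from the patch.
class TanimotoSketch {

    // Tanimoto coefficient of two sparse term-frequency vectors (term -> frequency).
    static double similarity(Map<String, Integer> a, Map<String, Integer> b) {
        double dot = 0.0;
        double normA = 0.0;
        double normB = 0.0;
        for (Map.Entry<String, Integer> e : a.entrySet()) {
            int fa = e.getValue();
            normA += (double) fa * fa;
            Integer fb = b.get(e.getKey());
            if (fb != null) {
                dot += (double) fa * fb;   // only shared terms contribute
            }
        }
        for (int fb : b.values()) {
            normB += (double) fb * fb;
        }
        double denominator = normA + normB - dot;
        // Assumption: two empty vectors are treated as having similarity 0.
        return denominator == 0.0 ? 0.0 : dot / denominator;
    }

    // Distance in [0, 1]: 0 for identical vectors, 1 for vectors sharing no terms.
    static double distance(Map<String, Integer> a, Map<String, Integer> b) {
        return 1.0 - similarity(a, b);
    }
}
```

In this sketch the term-frequency maps would be filled from whatever the accessor reports for each document.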

> TermVectorAccessor, transparent vector space access 
> 
>
> Key: LUCENE-1016
> URL: https://issues.apache.org/jira/browse/LUCENE-1016
> Project: Lucene - Java
>  Issue Type: New Feature
>  Components: Term Vectors
>Affects Versions: 2.2
>Reporter: Karl Wettin
>Priority: Minor
> Attachments: LUCENE-1016-Tanimoto.txt, LUCENE-1016.txt
>
>
> This class visits a TermVectorMapper and populates it with information 
> transparently, either by passing it down to the default terms cache (documents 
> indexed with Field.TermVector) or by resolving the inverted index.




[jira] Updated: (LUCENE-1016) TermVectorAccessor, transparent vector space access

2007-10-02 Thread Karl Wettin (JIRA)

 [ 
https://issues.apache.org/jira/browse/LUCENE-1016?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Karl Wettin updated LUCENE-1016:


Attachment: LUCENE-1016.txt

Oops, the prior patch contained some other stuff too by mistake.

> TermVectorAccessor, transparent vector space access 
> 
>
> Key: LUCENE-1016
> URL: https://issues.apache.org/jira/browse/LUCENE-1016
> Project: Lucene - Java
>  Issue Type: New Feature
>  Components: Term Vectors
>Affects Versions: 2.2
>Reporter: Karl Wettin
>Priority: Minor
> Attachments: LUCENE-1016.txt
>
>
> This class visits a TermVectorMapper and populates it with information 
> transparently, either by passing it down to the default terms cache (documents 
> indexed with Field.TermVector) or by resolving the inverted index.




[jira] Updated: (LUCENE-1016) TermVectorAccessor, transparent vector space access

2007-10-02 Thread Karl Wettin (JIRA)

 [ 
https://issues.apache.org/jira/browse/LUCENE-1016?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Karl Wettin updated LUCENE-1016:


Attachment: (was: LUCENE-1016.txt)

> TermVectorAccessor, transparent vector space access 
> 
>
> Key: LUCENE-1016
> URL: https://issues.apache.org/jira/browse/LUCENE-1016
> Project: Lucene - Java
>  Issue Type: New Feature
>  Components: Term Vectors
>Affects Versions: 2.2
>Reporter: Karl Wettin
>Priority: Minor
>
> This class visits a TermVectorMapper and populates it with information 
> transparently, either by passing it down to the default terms cache (documents 
> indexed with Field.TermVector) or by resolving the inverted index.




[jira] Updated: (LUCENE-1016) TermVectorAccessor, transparent vector space access

2007-10-02 Thread Karl Wettin (JIRA)

 [ 
https://issues.apache.org/jira/browse/LUCENE-1016?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Karl Wettin updated LUCENE-1016:


Attachment: LUCENE-1016.txt

> TermVectorAccessor, transparent vector space access 
> 
>
> Key: LUCENE-1016
> URL: https://issues.apache.org/jira/browse/LUCENE-1016
> Project: Lucene - Java
>  Issue Type: New Feature
>  Components: Term Vectors
>Affects Versions: 2.2
>Reporter: Karl Wettin
>Priority: Minor
> Attachments: LUCENE-1016.txt
>
>
> This class visits a TermVectorMapper and populates it with information 
> transparently, either by passing it down to the default terms cache (documents 
> indexed with Field.TermVector) or by resolving the inverted index.
