Re: [VOTE] CEP-30 ANN Vector Search

2023-06-16 Thread Andrew Cobley (Staff)
Thanks Jonathan,

That’s good to know.

Andy


From: Jonathan Ellis 
Date: Friday, 16 June 2023 at 18:04
To: dev@cassandra.apache.org 
Subject: Re: [VOTE] CEP-30 ANN Vector Search

CAUTION: This email originated from outside the University of Dundee. Do not 
click links or open attachments unless you recognise the sender's email address 
and know the content is safe.
Correct.  They will be ordered closest-first.

Unfortunately it's not possible for the near or medium future to do 
farthest-first.  HNSW index gets to log(n) time by only keeping a subset of the 
closest neighbors for each vector.  So you'd need a separate index with a 
inverse-cosine similarity metric, and it's not possible today to use a custom 
metric function.

(This has been GA for over a year in Elastic and Solr and so far nobody has 
needed farthest-first badly enough to add this as an option to the underlying 
Lucene library.)

You can get the distances back today, like this:

SELECT my_text, similarity_cosine(my_embedding, ?)
FROM my_table
ORDER BY my_embedding ANN OF ? LIMIT 2

Then just pass the query vector into both bind variables.

On Fri, Jun 16, 2023 at 7:09 AM Andrew Cobley (Staff) 
mailto:a.e.cob...@dundee.ac.uk>> wrote:
Hi,

I’ve got a question and a request about this CEP

In the example:


SELECT * FROM test.foo WHERE j ANN OF [3.4, 7.8, 9.1] limit 1;

I presume that limit n will return the nth nearest neighbours?

If that’s the case what order will they be in? Is it posssible to reverse the 
order ?

Secondly would it be possible to return the calculated distances?  This might 
be particular important if there are n returned neighbours?

Andy

From: Patrick McFadin mailto:pmcfa...@gmail.com>>
Sent: 15 June 2023 01:03
To: dev@cassandra.apache.org<mailto:dev@cassandra.apache.org> 
mailto:dev@cassandra.apache.org>>
Subject: Re: [VOTE] CEP-30 ANN Vector Search




CAUTION: This email originated from outside the University of Dundee. Do not 
click links or open attachments unless you recognise the sender's email address 
and know the content is safe.
Andy,

Good to see you on the ML again! CEP-30 is slated for release with 5.0 later in 
the year. Until then, you'll need to do a local build or try it out in a 
preview in Astra. A few of us have been talking about creating a preview docker 
image since there is some interest in having it run in k8ssandra. In any case, 
this is very alpha code and should be treated as such. Reporting errors or 
unusual results would be greatly appreciated!

Patrick



On Wed, Jun 14, 2023 at 7:10 AM Andrew Cobley (Staff) 
mailto:a.e.cob...@dundee.ac.uk>> wrote:

Hi All,



Great news this has gone through, I wondering if we have a timescale for this 
making it to Beta or release ?  I’m asking because we have a project that would 
benefit from this approach.



Andy





From: Jonathan Ellis mailto:jbel...@gmail.com>>
Date: Tuesday, 30 May 2023 at 14:44
To: dev mailto:dev@cassandra.apache.org>>
Subject: Re: [VOTE] CEP-30 ANN Vector Search



CAUTION: This email originated from outside the University of Dundee. Do not 
click links or open attachments unless you recognise the sender's email address 
and know the content is safe.

Thanks, all.  Closing the vote as accepted with 8 binding +1 (including me) and 
11 non-binding votes.



On Thu, May 25, 2023 at 10:45 AM Jonathan Ellis 
mailto:jbel...@gmail.com>> wrote:

Let's make this official.

CEP: 
https://cwiki.apache.org/confluence/display/CASSANDRA/CEP-30%3A+Approximate+Nearest+Neighbor%28ANN%29+Vector+Search+via+Storage-Attached+Indexes



POC that demonstrates all the big rocks, including distributed queries: 
https://github.com/datastax/cassandra/tree/cep-vsearch

--

Jonathan Ellis
co-founder, http://www.datastax.com
@spyced


--

Jonathan Ellis
co-founder, http://www.datastax.com
@spyced

The University of Dundee is a registered Scottish Charity, No: SC015096

The University of Dundee is a registered Scottish Charity, No: SC015096


--
Jonathan Ellis
co-founder, http://www.datastax.com
@spyced

The University of Dundee is a registered Scottish Charity, No: SC015096


Re: [VOTE] CEP-30 ANN Vector Search

2023-06-16 Thread Jonathan Ellis
Correct.  They will be ordered closest-first.

Unfortunately it's not possible for the near or medium future to do
farthest-first.  HNSW index gets to log(n) time by only keeping a subset of
the closest neighbors for each vector.  So you'd need a separate index with
a inverse-cosine similarity metric, and it's not possible today to use a
custom metric function.

(This has been GA for over a year in Elastic and Solr and so far nobody has
needed farthest-first badly enough to add this as an option to the
underlying Lucene library.)

You can get the distances back today, like this:

SELECT my_text, similarity_cosine(my_embedding, ?)
FROM my_table
ORDER BY my_embedding ANN OF ? LIMIT 2

Then just pass the query vector into both bind variables.

On Fri, Jun 16, 2023 at 7:09 AM Andrew Cobley (Staff) <
a.e.cob...@dundee.ac.uk> wrote:

> Hi,
>
> I’ve got a question and a request about this CEP
>
> In the example:
>
> SELECT * FROM test.foo WHERE j ANN OF [3.4, 7.8, 9.1] limit 1;
>
>
> I presume that limit n will return the nth nearest neighbours?
>
> If that’s the case what order will they be in? Is it posssible to reverse
> the order ?
>
> Secondly would it be possible to return the calculated distances?  This
> might be particular important if there are n returned neighbours?
>
> Andy
> --
> *From:* Patrick McFadin 
> *Sent:* 15 June 2023 01:03
> *To:* dev@cassandra.apache.org 
> *Subject:* Re: [VOTE] CEP-30 ANN Vector Search
>
>
>
>
> CAUTION: This email originated from outside the University of Dundee. Do
> not click links or open attachments unless you recognise the sender's email
> address and know the content is safe.
> Andy,
>
> Good to see you on the ML again! CEP-30 is slated for release with 5.0
> later in the year. Until then, you'll need to do a local build or try it
> out in a preview in Astra. A few of us have been talking about creating a
> preview docker image since there is some interest in having it run in
> k8ssandra. In any case, this is very alpha code and should be treated as
> such. Reporting errors or unusual results would be greatly appreciated!
>
> Patrick
>
>
>
> On Wed, Jun 14, 2023 at 7:10 AM Andrew Cobley (Staff) <
> a.e.cob...@dundee.ac.uk> wrote:
>
> Hi All,
>
>
>
> Great news this has gone through, I wondering if we have a timescale for
> this making it to Beta or release ?  I’m asking because we have a project
> that would benefit from this approach.
>
>
>
> Andy
>
>
>
>
>
> *From: *Jonathan Ellis 
> *Date: *Tuesday, 30 May 2023 at 14:44
> *To: *dev 
> *Subject: *Re: [VOTE] CEP-30 ANN Vector Search
>
>
>
> CAUTION: This email originated from outside the University of Dundee. Do
> not click links or open attachments unless you recognise the sender's email
> address and know the content is safe.
>
> Thanks, all.  Closing the vote as accepted with 8 binding +1 (including
> me) and 11 non-binding votes.
>
>
>
> On Thu, May 25, 2023 at 10:45 AM Jonathan Ellis  wrote:
>
> Let's make this official.
>
>
> CEP:
> https://cwiki.apache.org/confluence/display/CASSANDRA/CEP-30%3A+Approximate+Nearest+Neighbor%28ANN%29+Vector+Search+via+Storage-Attached+Indexes
>
>
>
> POC that demonstrates all the big rocks, including distributed queries:
> https://github.com/datastax/cassandra/tree/cep-vsearch
>
>
> --
>
> Jonathan Ellis
> co-founder, http://www.datastax.com
> @spyced
>
>
>
> --
>
> Jonathan Ellis
> co-founder, http://www.datastax.com
> @spyced
>
> The University of Dundee is a registered Scottish Charity, No: SC015096
>
>
> The University of Dundee is a registered Scottish Charity, No: SC015096
>


-- 
Jonathan Ellis
co-founder, http://www.datastax.com
@spyced


Re: [VOTE] CEP-30 ANN Vector Search

2023-06-16 Thread Andrew Cobley (Staff)
Hi,

I’ve got a question and a request about this CEP

In the example:


SELECT * FROM test.foo WHERE j ANN OF [3.4, 7.8, 9.1] limit 1;

I presume that limit n will return the nth nearest neighbours?

If that’s the case what order will they be in? Is it posssible to reverse the 
order ?

Secondly would it be possible to return the calculated distances?  This might 
be particular important if there are n returned neighbours?

Andy

From: Patrick McFadin 
Sent: 15 June 2023 01:03
To: dev@cassandra.apache.org 
Subject: Re: [VOTE] CEP-30 ANN Vector Search




CAUTION: This email originated from outside the University of Dundee. Do not 
click links or open attachments unless you recognise the sender's email address 
and know the content is safe.

Andy,

Good to see you on the ML again! CEP-30 is slated for release with 5.0 later in 
the year. Until then, you'll need to do a local build or try it out in a 
preview in Astra. A few of us have been talking about creating a preview docker 
image since there is some interest in having it run in k8ssandra. In any case, 
this is very alpha code and should be treated as such. Reporting errors or 
unusual results would be greatly appreciated!

Patrick



On Wed, Jun 14, 2023 at 7:10 AM Andrew Cobley (Staff) 
mailto:a.e.cob...@dundee.ac.uk>> wrote:

Hi All,



Great news this has gone through, I wondering if we have a timescale for this 
making it to Beta or release ?  I’m asking because we have a project that would 
benefit from this approach.



Andy





From: Jonathan Ellis mailto:jbel...@gmail.com>>
Date: Tuesday, 30 May 2023 at 14:44
To: dev mailto:dev@cassandra.apache.org>>
Subject: Re: [VOTE] CEP-30 ANN Vector Search



CAUTION: This email originated from outside the University of Dundee. Do not 
click links or open attachments unless you recognise the sender's email address 
and know the content is safe.

Thanks, all.  Closing the vote as accepted with 8 binding +1 (including me) and 
11 non-binding votes.



On Thu, May 25, 2023 at 10:45 AM Jonathan Ellis 
mailto:jbel...@gmail.com>> wrote:

Let's make this official.

CEP: 
https://cwiki.apache.org/confluence/display/CASSANDRA/CEP-30%3A+Approximate+Nearest+Neighbor%28ANN%29+Vector+Search+via+Storage-Attached+Indexes



POC that demonstrates all the big rocks, including distributed queries: 
https://github.com/datastax/cassandra/tree/cep-vsearch

--

Jonathan Ellis
co-founder, http://www.datastax.com
@spyced


--

Jonathan Ellis
co-founder, http://www.datastax.com
@spyced

The University of Dundee is a registered Scottish Charity, No: SC015096

The University of Dundee is a registered Scottish Charity, No: SC015096


Re: [VOTE] CEP-30 ANN Vector Search

2023-06-14 Thread Patrick McFadin
Andy,

Good to see you on the ML again! CEP-30 is slated for release with 5.0
later in the year. Until then, you'll need to do a local build or try it
out in a preview in Astra. A few of us have been talking about creating a
preview docker image since there is some interest in having it run in
k8ssandra. In any case, this is very alpha code and should be treated as
such. Reporting errors or unusual results would be greatly appreciated!

Patrick



On Wed, Jun 14, 2023 at 7:10 AM Andrew Cobley (Staff) <
a.e.cob...@dundee.ac.uk> wrote:

> Hi All,
>
>
>
> Great news this has gone through, I wondering if we have a timescale for
> this making it to Beta or release ?  I’m asking because we have a project
> that would benefit from this approach.
>
>
>
> Andy
>
>
>
>
>
> *From: *Jonathan Ellis 
> *Date: *Tuesday, 30 May 2023 at 14:44
> *To: *dev 
> *Subject: *Re: [VOTE] CEP-30 ANN Vector Search
>
>
>
> CAUTION: This email originated from outside the University of Dundee. Do
> not click links or open attachments unless you recognise the sender's email
> address and know the content is safe.
>
> Thanks, all.  Closing the vote as accepted with 8 binding +1 (including
> me) and 11 non-binding votes.
>
>
>
> On Thu, May 25, 2023 at 10:45 AM Jonathan Ellis  wrote:
>
> Let's make this official.
>
>
> CEP:
> https://cwiki.apache.org/confluence/display/CASSANDRA/CEP-30%3A+Approximate+Nearest+Neighbor%28ANN%29+Vector+Search+via+Storage-Attached+Indexes
>
>
>
> POC that demonstrates all the big rocks, including distributed queries:
> https://github.com/datastax/cassandra/tree/cep-vsearch
>
>
> --
>
> Jonathan Ellis
> co-founder, http://www.datastax.com
> @spyced
>
>
>
> --
>
> Jonathan Ellis
> co-founder, http://www.datastax.com
> @spyced
>
> The University of Dundee is a registered Scottish Charity, No: SC015096
>


Re: [VOTE] CEP-30 ANN Vector Search

2023-06-14 Thread Andrew Cobley (Staff)
Hi All,

Great news this has gone through, I wondering if we have a timescale for this 
making it to Beta or release ?  I’m asking because we have a project that would 
benefit from this approach.

Andy


From: Jonathan Ellis 
Date: Tuesday, 30 May 2023 at 14:44
To: dev 
Subject: Re: [VOTE] CEP-30 ANN Vector Search

CAUTION: This email originated from outside the University of Dundee. Do not 
click links or open attachments unless you recognise the sender's email address 
and know the content is safe.
Thanks, all.  Closing the vote as accepted with 8 binding +1 (including me) and 
11 non-binding votes.

On Thu, May 25, 2023 at 10:45 AM Jonathan Ellis 
mailto:jbel...@gmail.com>> wrote:
Let's make this official.

CEP: 
https://cwiki.apache.org/confluence/display/CASSANDRA/CEP-30%3A+Approximate+Nearest+Neighbor%28ANN%29+Vector+Search+via+Storage-Attached+Indexes

POC that demonstrates all the big rocks, including distributed queries: 
https://github.com/datastax/cassandra/tree/cep-vsearch

--
Jonathan Ellis
co-founder, http://www.datastax.com
@spyced


--
Jonathan Ellis
co-founder, http://www.datastax.com
@spyced

The University of Dundee is a registered Scottish Charity, No: SC015096


Re: [VOTE] CEP-30 ANN Vector Search

2023-05-30 Thread Jonathan Ellis
Thanks to Benjamin for pointing out to me that committer votes count as
binding for CEPs.

That makes the updated tally 15 binding and 4 non-binding.

On Tue, May 30, 2023 at 8:44 AM Jonathan Ellis  wrote:

> Thanks, all.  Closing the vote as accepted with 8 binding +1 (including
> me) and 11 non-binding votes.
>
> On Thu, May 25, 2023 at 10:45 AM Jonathan Ellis  wrote:
>
>> Let's make this official.
>>
>> CEP:
>> https://cwiki.apache.org/confluence/display/CASSANDRA/CEP-30%3A+Approximate+Nearest+Neighbor%28ANN%29+Vector+Search+via+Storage-Attached+Indexes
>>
>> POC that demonstrates all the big rocks, including distributed queries:
>> https://github.com/datastax/cassandra/tree/cep-vsearch
>>
>> --
>> Jonathan Ellis
>> co-founder, http://www.datastax.com
>> @spyced
>>
>
>
> --
> Jonathan Ellis
> co-founder, http://www.datastax.com
> @spyced
>


-- 
Jonathan Ellis
co-founder, http://www.datastax.com
@spyced


Re: [VOTE] CEP-30 ANN Vector Search

2023-05-30 Thread Jonathan Ellis
Thanks, all.  Closing the vote as accepted with 8 binding +1 (including me)
and 11 non-binding votes.

On Thu, May 25, 2023 at 10:45 AM Jonathan Ellis  wrote:

> Let's make this official.
>
> CEP:
> https://cwiki.apache.org/confluence/display/CASSANDRA/CEP-30%3A+Approximate+Nearest+Neighbor%28ANN%29+Vector+Search+via+Storage-Attached+Indexes
>
> POC that demonstrates all the big rocks, including distributed queries:
> https://github.com/datastax/cassandra/tree/cep-vsearch
>
> --
> Jonathan Ellis
> co-founder, http://www.datastax.com
> @spyced
>


-- 
Jonathan Ellis
co-founder, http://www.datastax.com
@spyced


Re: [VOTE] CEP-30 ANN Vector Search

2023-05-26 Thread Michael Shuler

+1, this is cool.

On 5/25/23 10:45, Jonathan Ellis wrote:

Let's make this official.

CEP: 
https://cwiki.apache.org/confluence/display/CASSANDRA/CEP-30%3A+Approximate+Nearest+Neighbor%28ANN%29+Vector+Search+via+Storage-Attached+Indexes 


POC that demonstrates all the big rocks, including distributed queries: 
https://github.com/datastax/cassandra/tree/cep-vsearch 



--
Jonathan Ellis
co-founder, http://www.datastax.com 
@spyced


Re: [VOTE] CEP-30 ANN Vector Search

2023-05-26 Thread Andrés de la Peña
+1

On Fri, 26 May 2023 at 12:59, Mike Adamson  wrote:

> +1 (nb)
>
> On Fri, 26 May 2023 at 12:50, Stefania Alborghetti 
> wrote:
>
>> +1
>>
>> On Fri, May 26, 2023 at 7:31 AM Aleksey Yeshchenko 
>> wrote:
>>
>>> +1
>>>
>>> On 26 May 2023, at 07:19, Berenguer Blasi 
>>> wrote:
>>>
>>> +1
>>> On 26/5/23 6:07, guo Maxwell wrote:
>>>
>>> +1
>>>
>>> Dinesh Joshi 于2023年5月26日 周五上午11:08写道:
>>>
 +1


 On May 25, 2023, at 8:45 AM, Jonathan Ellis  wrote:

 

 Let's make this official.

 CEP:
 https://cwiki.apache.org/confluence/display/CASSANDRA/CEP-30%3A+Approximate+Nearest+Neighbor%28ANN%29+Vector+Search+via+Storage-Attached+Indexes

 POC that demonstrates all the big rocks, including distributed queries:
 https://github.com/datastax/cassandra/tree/cep-vsearch

 --
 Jonathan Ellis
 co-founder, http://www.datastax.com
 @spyced

 --
>>> you are the apple of my eye !
>>>
>>>
>>>
>
> --
> [image: DataStax Logo Square]  *Mike Adamson*
> Engineering
>
> +1 650 389 6000 <16503896000> | datastax.com 
> Find DataStax Online: [image: LinkedIn Logo]
> 
>[image: Facebook Logo]
> 
>[image: Twitter Logo]    [image: RSS
> Feed]    [image: Github Logo]
> 
>
>


Re: [VOTE] CEP-30 ANN Vector Search

2023-05-26 Thread Mike Adamson
+1 (nb)

On Fri, 26 May 2023 at 12:50, Stefania Alborghetti 
wrote:

> +1
>
> On Fri, May 26, 2023 at 7:31 AM Aleksey Yeshchenko 
> wrote:
>
>> +1
>>
>> On 26 May 2023, at 07:19, Berenguer Blasi 
>> wrote:
>>
>> +1
>> On 26/5/23 6:07, guo Maxwell wrote:
>>
>> +1
>>
>> Dinesh Joshi 于2023年5月26日 周五上午11:08写道:
>>
>>> +1
>>>
>>>
>>> On May 25, 2023, at 8:45 AM, Jonathan Ellis  wrote:
>>>
>>> 
>>>
>>> Let's make this official.
>>>
>>> CEP:
>>> https://cwiki.apache.org/confluence/display/CASSANDRA/CEP-30%3A+Approximate+Nearest+Neighbor%28ANN%29+Vector+Search+via+Storage-Attached+Indexes
>>>
>>> POC that demonstrates all the big rocks, including distributed queries:
>>> https://github.com/datastax/cassandra/tree/cep-vsearch
>>>
>>> --
>>> Jonathan Ellis
>>> co-founder, http://www.datastax.com
>>> @spyced
>>>
>>> --
>> you are the apple of my eye !
>>
>>
>>

-- 
[image: DataStax Logo Square]  *Mike Adamson*
Engineering

+1 650 389 6000 <16503896000> | datastax.com 
Find DataStax Online: [image: LinkedIn Logo]

   [image: Facebook Logo]

   [image: Twitter Logo]    [image: RSS Feed]
   [image: Github Logo]



Re: [VOTE] CEP-30 ANN Vector Search

2023-05-26 Thread Stefania Alborghetti
+1

On Fri, May 26, 2023 at 7:31 AM Aleksey Yeshchenko 
wrote:

> +1
>
> On 26 May 2023, at 07:19, Berenguer Blasi 
> wrote:
>
> +1
> On 26/5/23 6:07, guo Maxwell wrote:
>
> +1
>
> Dinesh Joshi 于2023年5月26日 周五上午11:08写道:
>
>> +1
>>
>>
>> On May 25, 2023, at 8:45 AM, Jonathan Ellis  wrote:
>>
>> 
>>
>> Let's make this official.
>>
>> CEP:
>> https://cwiki.apache.org/confluence/display/CASSANDRA/CEP-30%3A+Approximate+Nearest+Neighbor%28ANN%29+Vector+Search+via+Storage-Attached+Indexes
>>
>> POC that demonstrates all the big rocks, including distributed queries:
>> https://github.com/datastax/cassandra/tree/cep-vsearch
>>
>> --
>> Jonathan Ellis
>> co-founder, http://www.datastax.com
>> @spyced
>>
>> --
> you are the apple of my eye !
>
>
>


Re: [VOTE] CEP-30 ANN Vector Search

2023-05-26 Thread Aleksey Yeshchenko
+1

> On 26 May 2023, at 07:19, Berenguer Blasi  wrote:
> 
> +1
> 
> On 26/5/23 6:07, guo Maxwell wrote:
>> +1
>> 
>> Dinesh Joshi mailto:djo...@apache.org>>于2023年5月26日 
>> 周五上午11:08写道:
>>> +1
>>> 
 
 On May 25, 2023, at 8:45 AM, Jonathan Ellis >>> > wrote:
 
 
>>> 
 Let's make this official.
 
 CEP: 
 https://cwiki.apache.org/confluence/display/CASSANDRA/CEP-30%3A+Approximate+Nearest+Neighbor%28ANN%29+Vector+Search+via+Storage-Attached+Indexes
 
 POC that demonstrates all the big rocks, including distributed queries: 
 https://github.com/datastax/cassandra/tree/cep-vsearch
 
 -- 
 Jonathan Ellis
 co-founder, http://www.datastax.com 
 @spyced
>> -- 
>> you are the apple of my eye !



Re: [VOTE] CEP-30 ANN Vector Search

2023-05-25 Thread Berenguer Blasi

+1

On 26/5/23 6:07, guo Maxwell wrote:

+1

Dinesh Joshi 于2023年5月26日 周五上午11:08写道:

+1



On May 25, 2023, at 8:45 AM, Jonathan Ellis 
wrote:


Let's make this official.

CEP:

https://cwiki.apache.org/confluence/display/CASSANDRA/CEP-30%3A+Approximate+Nearest+Neighbor%28ANN%29+Vector+Search+via+Storage-Attached+Indexes

POC that demonstrates all the big rocks, including distributed
queries: https://github.com/datastax/cassandra/tree/cep-vsearch

-- 
Jonathan Ellis

co-founder, http://www.datastax.com
@spyced


--
you are the apple of my eye !

Re: [VOTE] CEP-30 ANN Vector Search

2023-05-25 Thread guo Maxwell
+1

Dinesh Joshi 于2023年5月26日 周五上午11:08写道:

> +1
>
>
> On May 25, 2023, at 8:45 AM, Jonathan Ellis  wrote:
>
> 
>
> Let's make this official.
>
> CEP:
> https://cwiki.apache.org/confluence/display/CASSANDRA/CEP-30%3A+Approximate+Nearest+Neighbor%28ANN%29+Vector+Search+via+Storage-Attached+Indexes
>
> POC that demonstrates all the big rocks, including distributed queries:
> https://github.com/datastax/cassandra/tree/cep-vsearch
>
> --
> Jonathan Ellis
> co-founder, http://www.datastax.com
> @spyced
>
> --
you are the apple of my eye !


Re: [VOTE] CEP-30 ANN Vector Search

2023-05-25 Thread Dinesh Joshi
+1On May 25, 2023, at 8:45 AM, Jonathan Ellis  wrote:Let's make this official.CEP: https://cwiki.apache.org/confluence/display/CASSANDRA/CEP-30%3A+Approximate+Nearest+Neighbor%28ANN%29+Vector+Search+via+Storage-Attached+IndexesPOC that demonstrates all the big rocks, including distributed queries: https://github.com/datastax/cassandra/tree/cep-vsearch-- Jonathan Ellisco-founder, http://www.datastax.com@spyced


Re: [VOTE] CEP-30 ANN Vector Search

2023-05-25 Thread J. D. Jordan
+1 nbOn May 25, 2023, at 7:47 PM, Jasonstack Zhao Yang  wrote:+1On Fri, 26 May 2023 at 8:44 AM, Yifan Cai <yc25c...@gmail.com> wrote:






+1






From: Josh McKenzie <jmcken...@apache.org>
Sent: Thursday, May 25, 2023 5:37:02 PM
To: dev <dev@cassandra.apache.org>
Subject: Re: [VOTE] CEP-30 ANN Vector Search
 



+1


On Thu, May 25, 2023, at 8:33 PM, Jake Luciani wrote:


+1





On Thu, May 25, 2023 at 11:45 AM Jonathan Ellis <jbel...@gmail.com> wrote:



Let's make this official.




CEP: 
https://cwiki.apache.org/confluence/display/CASSANDRA/CEP-30%3A+Approximate+Nearest+Neighbor%28ANN%29+Vector+Search+via+Storage-Attached+Indexes




POC that demonstrates all the big rocks, including distributed queries: 
https://github.com/datastax/cassandra/tree/cep-vsearch





--




Jonathan Ellis

co-founder, http://www.datastax.com

@spyced








--

http://twitter.com/tjake








Re: [VOTE] CEP-30 ANN Vector Search

2023-05-25 Thread Jasonstack Zhao Yang
+1

On Fri, 26 May 2023 at 8:44 AM, Yifan Cai  wrote:

> +1
> --
> *From:* Josh McKenzie 
> *Sent:* Thursday, May 25, 2023 5:37:02 PM
> *To:* dev 
> *Subject:* Re: [VOTE] CEP-30 ANN Vector Search
>
> +1
>
> On Thu, May 25, 2023, at 8:33 PM, Jake Luciani wrote:
>
> +1
>
> On Thu, May 25, 2023 at 11:45 AM Jonathan Ellis  wrote:
>
> Let's make this official.
>
> CEP:
> https://cwiki.apache.org/confluence/display/CASSANDRA/CEP-30%3A+Approximate+Nearest+Neighbor%28ANN%29+Vector+Search+via+Storage-Attached+Indexes
>
> POC that demonstrates all the big rocks, including distributed queries:
> https://github.com/datastax/cassandra/tree/cep-vsearch
>
> --
> Jonathan Ellis
> co-founder, http://www.datastax.com
> @spyced
>
> --
> http://twitter.com/tjake
>
>


Re: [VOTE] CEP-30 ANN Vector Search

2023-05-25 Thread Yifan Cai
+1

From: Josh McKenzie 
Sent: Thursday, May 25, 2023 5:37:02 PM
To: dev 
Subject: Re: [VOTE] CEP-30 ANN Vector Search

+1

On Thu, May 25, 2023, at 8:33 PM, Jake Luciani wrote:
+1

On Thu, May 25, 2023 at 11:45 AM Jonathan Ellis 
mailto:jbel...@gmail.com>> wrote:
Let's make this official.

CEP: 
https://cwiki.apache.org/confluence/display/CASSANDRA/CEP-30%3A+Approximate+Nearest+Neighbor%28ANN%29+Vector+Search+via+Storage-Attached+Indexes

POC that demonstrates all the big rocks, including distributed queries: 
https://github.com/datastax/cassandra/tree/cep-vsearch

--
Jonathan Ellis
co-founder, http://www.datastax.com
@spyced
--
http://twitter.com/tjake


Re: [VOTE] CEP-30 ANN Vector Search

2023-05-25 Thread Josh McKenzie
+1

On Thu, May 25, 2023, at 8:33 PM, Jake Luciani wrote:
> +1
> 
> On Thu, May 25, 2023 at 11:45 AM Jonathan Ellis  wrote:
>> Let's make this official.
>> 
>> CEP: 
>> https://cwiki.apache.org/confluence/display/CASSANDRA/CEP-30%3A+Approximate+Nearest+Neighbor%28ANN%29+Vector+Search+via+Storage-Attached+Indexes
>> 
>> POC that demonstrates all the big rocks, including distributed queries: 
>> https://github.com/datastax/cassandra/tree/cep-vsearch
>> 
>> --
>> Jonathan Ellis
>> co-founder, http://www.datastax.com
>> @spyced
> --
> http://twitter.com/tjake

Re: [VOTE] CEP-30 ANN Vector Search

2023-05-25 Thread Jake Luciani
+1

On Thu, May 25, 2023 at 11:45 AM Jonathan Ellis  wrote:

> Let's make this official.
>
> CEP:
> https://cwiki.apache.org/confluence/display/CASSANDRA/CEP-30%3A+Approximate+Nearest+Neighbor%28ANN%29+Vector+Search+via+Storage-Attached+Indexes
>
> POC that demonstrates all the big rocks, including distributed queries:
> https://github.com/datastax/cassandra/tree/cep-vsearch
>
> --
> Jonathan Ellis
> co-founder, http://www.datastax.com
> @spyced
>
-- 
http://twitter.com/tjake


Re: [VOTE] CEP-30 ANN Vector Search

2023-05-25 Thread David Capwell
+1

> On May 25, 2023, at 1:53 PM, Ekaterina Dimitrova  
> wrote:
> 
> +1
> 
> On Thu, 25 May 2023 at 16:46, Brandon Williams  > wrote:
>> +1
>> 
>> Kind Regards,
>> Brandon
>> 
>> On Thu, May 25, 2023 at 10:45 AM Jonathan Ellis > > wrote:
>> >
>> > Let's make this official.
>> >
>> > CEP: 
>> > https://cwiki.apache.org/confluence/display/CASSANDRA/CEP-30%3A+Approximate+Nearest+Neighbor%28ANN%29+Vector+Search+via+Storage-Attached+Indexes
>> >
>> > POC that demonstrates all the big rocks, including distributed queries: 
>> > https://github.com/datastax/cassandra/tree/cep-vsearch
>> >
>> > --
>> > Jonathan Ellis
>> > co-founder, http://www.datastax.com 
>> > @spyced



Re: [VOTE] CEP-30 ANN Vector Search

2023-05-25 Thread Ekaterina Dimitrova
+1

On Thu, 25 May 2023 at 16:46, Brandon Williams  wrote:

> +1
>
> Kind Regards,
> Brandon
>
> On Thu, May 25, 2023 at 10:45 AM Jonathan Ellis  wrote:
> >
> > Let's make this official.
> >
> > CEP:
> https://cwiki.apache.org/confluence/display/CASSANDRA/CEP-30%3A+Approximate+Nearest+Neighbor%28ANN%29+Vector+Search+via+Storage-Attached+Indexes
> >
> > POC that demonstrates all the big rocks, including distributed queries:
> https://github.com/datastax/cassandra/tree/cep-vsearch
> >
> > --
> > Jonathan Ellis
> > co-founder, http://www.datastax.com
> > @spyced
>


Re: [VOTE] CEP-30 ANN Vector Search

2023-05-25 Thread Brandon Williams
+1

Kind Regards,
Brandon

On Thu, May 25, 2023 at 10:45 AM Jonathan Ellis  wrote:
>
> Let's make this official.
>
> CEP: 
> https://cwiki.apache.org/confluence/display/CASSANDRA/CEP-30%3A+Approximate+Nearest+Neighbor%28ANN%29+Vector+Search+via+Storage-Attached+Indexes
>
> POC that demonstrates all the big rocks, including distributed queries: 
> https://github.com/datastax/cassandra/tree/cep-vsearch
>
> --
> Jonathan Ellis
> co-founder, http://www.datastax.com
> @spyced


Re: Re: [VOTE] CEP-30 ANN Vector Search

2023-05-25 Thread German Eichberger via dev
+ 1

I am seeing ANN Vector Search pop up in every database...

From: Patrick McFadin 
Sent: Thursday, May 25, 2023 11:29 AM
To: dev@cassandra.apache.org 
Subject: [EXTERNAL] Re: [VOTE] CEP-30 ANN Vector Search

+1
Love the buzz this creating with new users. Thanks for the work on this 
Jonathan.

On Thu, May 25, 2023 at 8:45 AM Jonathan Ellis 
mailto:jbel...@gmail.com>> wrote:
Let's make this official.

CEP: 
https://cwiki.apache.org/confluence/display/CASSANDRA/CEP-30%3A+Approximate+Nearest+Neighbor%28ANN%29+Vector+Search+via+Storage-Attached+Indexes

POC that demonstrates all the big rocks, including distributed queries: 
https://github.com/datastax/cassandra/tree/cep-vsearch

--
Jonathan Ellis
co-founder, http://www.datastax.com<http://www.datastax.com/>
@spyced


Re: [VOTE] CEP-30 ANN Vector Search

2023-05-25 Thread Patrick McFadin
+1
Love the buzz this creating with new users. Thanks for the work on this
Jonathan.

On Thu, May 25, 2023 at 8:45 AM Jonathan Ellis  wrote:

> Let's make this official.
>
> CEP:
> https://cwiki.apache.org/confluence/display/CASSANDRA/CEP-30%3A+Approximate+Nearest+Neighbor%28ANN%29+Vector+Search+via+Storage-Attached+Indexes
>
> POC that demonstrates all the big rocks, including distributed queries:
> https://github.com/datastax/cassandra/tree/cep-vsearch
>
> --
> Jonathan Ellis
> co-founder, http://www.datastax.com
> @spyced
>


[VOTE] CEP-30 ANN Vector Search

2023-05-25 Thread Jonathan Ellis
Let's make this official.

CEP:
https://cwiki.apache.org/confluence/display/CASSANDRA/CEP-30%3A+Approximate+Nearest+Neighbor%28ANN%29+Vector+Search+via+Storage-Attached+Indexes

POC that demonstrates all the big rocks, including distributed queries:
https://github.com/datastax/cassandra/tree/cep-vsearch

-- 
Jonathan Ellis
co-founder, http://www.datastax.com
@spyced