Re: GitHub issues vs PRs vs Lucene's CHANGES.txt

2023-11-30 Thread Dongyu Xu
Hopefully this is relevant. There are useful tools like git-cliff​ for automating changelog generation. https://github.com/orhun/git-cliff Tony X From: Michael McCandless Sent: Thursday, November 30, 2023 4:30 AM To: dev@lucene.apache.org Subject: Re: GitHub

Re: GDPR compliance

2023-11-28 Thread Dongyu Xu
What is the expected grace time for the data-deletion request to take place? I'm not expert about the policy but I think something like "I need my data to be gone in next 2 second" is unreasonable. Tony X From: Robert Muir Sent: Tuesday, November 28, 2023

Re: Test framework can't find SPI implementations from module sandbox

2023-11-23 Thread Dongyu Xu
t, but you didn't say which test it was. Is the test also in the sandbox module? On Mon, Nov 20, 2023 at 6:56 PM Dongyu Xu mailto:dongyu...@hotmail.com>> wrote: Hi devs, I tried to plug in my experimental PostingsFormat implementation to all the existing unit tests. I've registered it under M

Test framework can't find SPI implementations from module sandbox

2023-11-20 Thread Dongyu Xu
Hi devs, I tried to plug in my experimental PostingsFormat implementation to all the existing unit tests. I've registered it under META-INF.services as well as in the moudle-info.java. However, the test still fails like the following. java.lang.IllegalArgumentException: An SPI class of type

Re: PackedInts functionalities

2023-10-17 Thread Dongyu Xu
M Mikhail Khludnev mailto:m...@apache.org>> wrote: Hello Tony Is it possible to write a block of docfreqs and then a block of postingoffsets? Or why not write them as 10-bit integers and then split to quad and sextet in the posting format code? On Mon, Oct 16, 2023 at 11:50 PM Dongyu Xu mailto

PackedInts functionalities

2023-10-16 Thread Dongyu Xu
Hi devs, As I was working on https://github.com/apache/lucene/issues/12513 I needed to compress positive integers which are used to locate postings etc. To put it concretely, I will need to pack a few values per term contiguously and those values can have different bit-width. For example,

Time to write an open-source book?

2021-11-16 Thread Dongyu Xu
Hi Devs, I'm finally motivated enough to start this thread as I believe this is a great thing to do for the Lucene community to continuously thrive as the library has become so feature-rich but as the same time much more complex. "What do you recommend to read for learning more Lucene?" -- A