[Numpy-discussion] Re: Curious performance different with np.unique on arrays of characters

2023-09-14 Thread Charles R Harris
On Thu, Sep 14, 2023 at 11:34 AM Devulapalli, Raghuveer < raghuveer.devulapa...@intel.com> wrote: > What processor you are running this on? np.sort uses AVX-512 accelerated > sorting for np.int32, so just wondering if you that is the reason for this > difference. > > Raghuveer We also have radix

[Numpy-discussion] Re: Curious performance different with np.unique on arrays of characters

2023-09-14 Thread Devulapalli, Raghuveer
What processor you are running this on? np.sort uses AVX-512 accelerated sorting for np.int32, so just wondering if you that is the reason for this difference. Raghuveer > -Original Message- > From: sal...@caltech.edu > Sent: Wednesday, September 13, 2023 6:14 PM > To: numpy-discussio

[Numpy-discussion] Re: Curious performance different with np.unique on arrays of characters

2023-09-14 Thread Nathan
Looking at a py-spy profile of a slightly modified version of the code you shared, it seems the difference comes down to NumPy's sorting implementation simply being faster for ints than unicode strings. In particular, it looks like string_quicksort_ is two or three times slower than quicksort_ when

[Numpy-discussion] Curious performance different with np.unique on arrays of characters

2023-09-14 Thread saladi
Hello - In the course of some genomics simulations, I seem to have come across a curious (to me at least) performance difference in np.unique that I wanted to share. (If this is not the right forum for this, please let me know!) With a np.array of characters (U1), np.unique seems to be much fas