commit python-dask for openSUSE:Factory

Source-Sync Wed, 07 Jun 2023 14:07:26 -0700

Script 'mail_helper' called by obssrc
Hello community,

here is the log from the commit of package python-dask for openSUSE:Factory 
checked in at 2023-06-07 23:06:37
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Comparing /work/SRC/openSUSE:Factory/python-dask (Old)
 and      /work/SRC/openSUSE:Factory/.python-dask.new.15902 (New)
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++


Package is "python-dask"

Wed Jun  7 23:06:37 2023 rev:67 rq:1090990 version:2023.5.1

Changes:
--------
--- /work/SRC/openSUSE:Factory/python-dask/python-dask.changes  2023-04-05 
21:35:26.714496609 +0200
+++ /work/SRC/openSUSE:Factory/.python-dask.new.15902/python-dask.changes       
2023-06-07 23:07:06.995236946 +0200
@@ -1,0 +2,143 @@
+Mon Jun  5 23:42:44 UTC 2023 - Steve Kowalik <steven.kowa...@suse.com>
+
+- Tighten bokeh requirement to match distributed. 
+
+-------------------------------------------------------------------
+Fri May 26 19:59:37 UTC 2023 - Ben Greiner <c...@bnavigator.de>
+
+- Update to 2023.5.1
+  * This release drops support for Python 3.8. As of this release
+    Dask supports Python 3.9, 3.10, and 3.11.
+  ## Enhancements
+  * Drop Python 3.8 support (GH#10295) Thomas Grainger
+  * Change Dask Bag partitioning scheme to improve cluster
+    saturation (GH#10294) Jacob Tomlinson
+  * Generalize dd.to_datetime for GPU-backed collections, introduce
+    get_meta_library utility (GH#9881) Charles Blackmon-Luca
+  * Add na_action to DataFrame.map (GH#10305) Patrick Hoefler
+  * Raise TypeError in DataFrame.nsmallest and DataFrame.nlargest
+    when columns is not given (GH#10301) Patrick Hoefler
+  * Improve sizeof for pd.MultiIndex (GH#10230) Patrick Hoefler
+  * Support duplicated columns in a bunch of DataFrame methods
+    (GH#10261) Patrick Hoefler
+  * Add numeric_only support to DataFrame.idxmin and
+    DataFrame.idxmax (GH#10253) Patrick Hoefler
+  * Implement numeric_only support for DataFrame.quantile
+    (GH#10259) Patrick Hoefler
+  * Add support for numeric_only=False in DataFrame.std (GH#10251)
+    Patrick Hoefler
+  * Implement numeric_only=False for GroupBy.cumprod and
+    GroupBy.cumsum (GH#10262) Patrick Hoefler
+  * Implement numeric_only for skew and kurtosis (GH#10258) Patrick
+    Hoefler
+  * mask and where should accept a callable (GH#10289) Irina Truong
+  * Fix conversion from Categorical to pa.dictionary in
+    read_parquet (GH#10285) Patrick Hoefler
+  ## Bug Fixes
+  * Spurious config on nested annotations (GH#10318) crusaderky
+  * Fix rechunking behavior for dimensions with known and unknown
+    chunk sizes (GH#10157) Hendrik Makait
+  * Enable drop to support mismatched partitions (GH#10300) James
+    Bourbeau
+  * Fix divisions construction for to_timestamp (GH#10304) Patrick
+    Hoefler
+  * pandas ExtensionDtype raising in Series reduction operations
+    (GH#10149) Patrick Hoefler
+  * Fix regression in da.random interface (GH#10247) Eray Aslan
+  * da.coarsen doesnât trim an empty chunk in meta (GH#10281) Irina
+    Truong
+  * Fix dtype inference for engine="pyarrow" in read_csv (GH#10280)
+    Patrick Hoefler
+- Release 2023.5.0
+  ## Enhancements
+  * Implement numeric_only=False for GroupBy.corr and GroupBy.cov
+    (GH#10264) Patrick Hoefler
+  * Add support for numeric_only=False in DataFrame.var (GH#10250)
+    Patrick Hoefler
+  * Add numeric_only support to DataFrame.mode (GH#10257) Patrick
+    Hoefler
+  * Add DataFrame.map to dask.DataFrame API (GH#10246) Patrick
+    Hoefler
+  * Adjust for DataFrame.applymap deprecation and all NA concat
+    behaviour change (GH#10245) Patrick Hoefler
+  * Enable numeric_only=False for DataFrame.count (GH#10234)
+    Patrick Hoefler
+  * Disallow array input in mask/where (GH#10163) Irina Truong
+  * Support numeric_only=True in GroupBy.corr and GroupBy.cov
+    (GH#10227) Patrick Hoefler
+  * Add numeric_only support to GroupBy.median (GH#10236) Patrick
+    Hoefler
+  * Support mimesis=9 in dask.datasets (GH#10241) James Bourbeau
+  * Add numeric_only support to min, max and prod (GH#10219)
+    Patrick Hoefler
+  * Add numeric_only=True support for GroupBy.cumsum and
+    GroupBy.cumprod (GH#10224) Patrick Hoefler
+  * Add helper to unpack numeric_only keyword (GH#10228) Patrick
+    Hoefler
+  ## Bug Fixes
+  * Fix clone + from_array failure (GH#10211) crusaderky
+  * Fix dataframe reductions for ea dtypes (GH#10150) Patrick
+    Hoefler
+  * Avoid scalar conversion deprecation warning in numpy=1.25
+    (GH#10248) James Bourbeau
+  * Make sure transform output has the same index as input
+    (GH#10184) Irina Truong
+  * Fix corr and cov on a single-row partition (GH#9756) Irina
+    Truong
+  * Fix test_groupby_numeric_only_supported and
+    test_groupby_aggregate_categorical_observed upstream errors
+    (GH#10243) Irina Truong
+- Release 2023.4.1
+  ## Enhancements
+  * Implement numeric_only support for DataFrame.sum (GH#10194)
+    Patrick Hoefler
+  * Add support for numeric_only=True in GroupBy operations
+    (GH#10222) Patrick Hoefler
+  * Avoid deep copy in DataFrame.__setitem__ for pandas 1.4 and up
+    (GH#10221) Patrick Hoefler
+  * Avoid calling Series.apply with _meta_nonempty (GH#10212)
+    Patrick Hoefler
+  * Unpin sqlalchemy and fix compatibility issues (GH#10140)
+    Patrick Hoefler
+  ## Bug Fixes
+  * Partially revert default client discovery (GH#10225) Florian
+    Jetter
+  * Support arrow dtypes in Index meta creation (GH#10170) Patrick
+    Hoefler
+  * Repartitioning raises with extension dtype when truncating
+    floats (GH#10169) Patrick Hoefler
+  * Adjust empty Index from fastparquet to object dtype (GH#10179)
+    Patrick Hoefler
+- Release 2023.4.0
+  ## Enhancements
+  * Override old default values in update_defaults (GH#10159) Gabe
+    Joseph
+  * Add a CLI command to list and get a value from dask config
+    (GH#9936) Irina Truong
+  * Handle string-based engine argument to read_json (GH#9947)
+    Richard (Rick) Zamora
+  * Avoid deprecated GroupBy.dtypes (GH#10111) Irina Truong
+  ## Bug Fixes
+  * Revert grouper-related changes (GH#10182) Irina Truong
+  * GroupBy.cov raising for non-numeric grouping column (GH#10171)
+    Patrick Hoefler
+  * Updates for Index supporting numpy numeric dtypes (GH#10154)
+    Irina Truong
+  * Preserve dtype for partitioning columns when read with pyarrow
+    (GH#10115) Patrick Hoefler
+  * Fix annotations for to_hdf (GH#10123) Hendrik Makait
+  * Handle None column name when checking if columns are all
+    numeric (GH#10128) Lawrence Mitchell
+  * Fix valid_divisions when passed a tuple (GH#10126) Brian
+    Phillips
+  * Maintain annotations in DataFrame.categorize (GH#10120) Hendrik
+    Makait
+  * Fix handling of missing min/max parquet statistics during
+    filtering (GH#10042) Richard (Rick) Zamora
+  ## Deprecations
+  * Deprecate use_nullable_dtypes= and add dtype_backend=
+    (GH#10076) Irina Truong
+  * Deprecate convert_dtype in Series.apply (GH#10133) Irina Truong
+- Drop dask-pr10042-parquetstats.patch
+
+-------------------------------------------------------------------

Old:
----
  dask-2023.3.2.tar.gz
  dask-pr10042-parquetstats.patch

New:
----
  dask-2023.5.1.tar.gz

++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++

Other differences:
------------------
++++++ python-dask.spec ++++++
--- /var/tmp/diff_new_pack.BzPyfA/_old  2023-06-07 23:07:07.711241103 +0200
+++ /var/tmp/diff_new_pack.BzPyfA/_new  2023-06-07 23:07:07.719241149 +0200
@@ -45,61 +45,72 @@
 
 Name:           python-dask%{psuffix}
 # ===> Note: python-dask MUST be updated in sync with python-distributed! <===
-Version:        2023.3.2
+Version:        2023.5.1
 Release:        0
 Summary:        Minimal task scheduling abstraction
 License:        BSD-3-Clause
 URL:            https://dask.org
 # SourceRepository: https://github.com/dask/dask
 Source0:        
https://files.pythonhosted.org/packages/source/d/dask/dask-%{version}.tar.gz
-# PATCH-FIX-UPSTREAM dask-pr10042-parquetstats.patch gh#dask/dask#10042
-Patch0:         dask-pr10042-parquetstats.patch
-BuildRequires:  %{python_module base >= 3.8}
+BuildRequires:  %{python_module base >= 3.9}
 BuildRequires:  %{python_module packaging >= 20.0}
 BuildRequires:  %{python_module pip}
 BuildRequires:  %{python_module setuptools}
+BuildRequires:  %{python_module versioneer-toml >= 0.28}
 BuildRequires:  %{python_module wheel}
 BuildRequires:  fdupes
 BuildRequires:  python-rpm-macros
 Requires:       python-PyYAML >= 5.3.1
-Requires:       python-click >= 7
-Requires:       python-cloudpickle >= 1.1.1
-Requires:       python-fsspec >= 0.6.0
+Requires:       python-click >= 8
+Requires:       python-cloudpickle >= 1.5
+Requires:       python-fsspec >= 2021.9
 Requires:       python-importlib-metadata >= 4.13.0
 Requires:       python-packaging >= 20.0
 Requires:       python-partd >= 1.2.0
-Requires:       python-toolz >= 0.8.2
+Requires:       python-toolz >= 0.10.0
 Requires(post): update-alternatives
 Requires(postun):update-alternatives
 Recommends:     %{name}-array = %{version}
-Recommends:     %{name}-bag = %{version}
 Recommends:     %{name}-dataframe = %{version}
-Recommends:     %{name}-delayed = %{version}
 Recommends:     %{name}-distributed = %{version}
-Recommends:     %{name}-dot = %{version}
-Recommends:     python-SQLAlchemy >= 1.4.0
-Recommends:     python-cityhash
-Recommends:     python-fastparquet
-Recommends:     python-gcsfs >= 0.4.0
-Recommends:     python-murmurhash
-Recommends:     python-psutil
-Recommends:     python-pyarrow >= 0.14.0
-Recommends:     python-s3fs >= 0.4.0
-Recommends:     python-xxhash
 Suggests:       %{name}-complete = %{version}
 Suggests:       %{name}-diagnostics = %{version}
+# SECTION https://docs.dask.org/en/stable/install.html#optional-dependencies
+Suggests:       python-SQLAlchemy >= 1.4.16
+Suggests:       python-cityhash >= 0.2.4
+Suggests:       python-fastparquet >= 0.8.2
+Suggests:       python-gcsfs >= 2021.9.0
+Suggests:       python-crick >= 0.0.3
+Suggests:       python-cytoolz >= 0.10.1
+Suggests:       python-dask-ml >= 1.4.0
+Suggests:       python-fastavro >= 0.22.6
+Suggests:       python-graphviz >= 0.8.4
+Suggests:       python-h5py >= 2.10.0
+Suggests:       python-psutil >= 0.5.7
+Suggests:       python-pyarrow >= 0.14.0
+Suggests:       python-matplotlib
+Suggests:       python-mimesis >= 5.3.0
+Suggests:       python-mmh3 >= 2.5.1
+Suggests:       python-sparse >= 0.12.0
+Suggests:       python-s3fs >= 0.4.0
+Suggests:       python-xxhash >= 1.4.1
+Suggests:       python-zarr >= 2.12.0
+# /SECTION
+Provides:       %{name}-bag = %{version}-%{release}
+Obsoletes:      %{name}-bag < %{version}-%{release}
+Provides:       %{name}-delayed = %{version}-%{release}
+Obsoletes:      %{name}-delayed < %{version}-%{release}
+Provides:       %{name}-dot = %{version}-%{release}
+Obsoletes:      %{name}-dot < %{version}-%{release}
 Provides:       %{name}-multiprocessing = %{version}-%{release}
 Obsoletes:      %{name}-multiprocessing < %{version}-%{release}
 BuildArch:      noarch
 %if %{with test}
 # test that we specified all requirements correctly in the core
-# and subpackages by only requiring dask-complete and optional extras
+# and subpackages by only requiring dask-test (= [complete] + pytest) and 
optional extras
 BuildRequires:  %{python_module dask-test = %{version}}
-BuildRequires:  %{python_module pytest-rerunfailures}
-BuildRequires:  %{python_module pytest-xdist}
-BuildRequires:  %{python_module pytest}
 # SECTION additional optionally tested (importorskip) packages
-BuildRequires:  %{python_module SQLAlchemy >= 1.4.0}
+BuildRequires:  %{python_module SQLAlchemy >= 1.4.16}
 BuildRequires:  %{python_module cachey}
 BuildRequires:  %{python_module fastparquet >= 0.8.0}
 # optional zarr increases fsspec miminum to 0.8.4 if present
@@ -110,15 +121,15 @@
 BuildRequires:  %{python_module matplotlib}
 BuildRequires:  %{python_module mimesis}
 BuildRequires:  %{python_module multipledispatch}
-BuildRequires:  %{python_module numba if %python-base < 3.11}
+BuildRequires:  %{python_module numba}
 # snappy required for using fastparquet
 BuildRequires:  %{python_module python-snappy}
 BuildRequires:  %{python_module requests}
 BuildRequires:  %{python_module scikit-image}
 BuildRequires:  %{python_module scipy}
-BuildRequires:  %{python_module sparse if %python-base < 3.11}
+BuildRequires:  %{python_module sparse}
 BuildRequires:  %{python_module tables}
-BuildRequires:  %{python_module xarray if %python-base >= 3.9}
+BuildRequires:  %{python_module xarray}
 BuildRequires:  %{python_module zarr}
 # /SECTION
 %endif
@@ -141,12 +152,9 @@
 Summary:        All dask components
 Requires:       %{name} = %{version}
 Requires:       %{name}-array = %{version}
-Requires:       %{name}-bag = %{version}
 Requires:       %{name}-dataframe = %{version}
-Requires:       %{name}-delayed = %{version}
 Requires:       %{name}-diagnostics = %{version}
 Requires:       %{name}-distributed = %{version}
-Requires:       %{name}-dot = %{version}
 Requires:       python-lz4 >= 4.3.2
 Requires:       python-pyarrow >= 7
 Provides:       %{name}-all = %{version}-%{release}
@@ -190,29 +198,6 @@
 Dask arrays implement a subset of the NumPy interface on large
 arrays using blocked algorithms and task scheduling.
 
-%package bag
-Summary:        Data structure generic python objects in dask
-Requires:       %{name} = %{version}
-
-%description bag
-A flexible library for parallel computing in Python.
-
-Dask is composed of two parts:
-- Dynamic task scheduling optimized for computation. This is similar to
-  Airflow, Luigi, Celery, or Make, but optimized for interactive
-  computational workloads.
-- âBig Dataâ collections like parallel arrays, dataframes, and lists that
-  extend common interfaces like NumPy, Pandas, or Python iterators to
-  larger-than-memory or distributed environments. These parallel collections
-  run on top of dynamic task schedulers.
-
-This package contains the dask bag class.
-
-Dask.Bag parallelizes computations across a large collection of
-generic Python objects. It is particularly useful when dealing
-with large quantities of semi-structured data like JSON blobs
-or log files.
-
 %package dataframe
 Summary:        Pandas-like DataFrame data structure for dask
 Requires:       %{name} = %{version}
@@ -264,8 +249,7 @@
 Summary:        Diagnostics for dask
 Requires:       %{name} = %{version}
 Requires:       python-Jinja2 >= 2.10.3
-# Not ready yet: https://github.com/dask/dask/pull/9659, we provide a legacy 
bokeh2 in Tumbleweed
-Requires:       (python-bokeh >= 2.4.2 with python-bokeh < 3)
+Requires:       python-bokeh >= 3.1
 
 %description diagnostics
 A flexible library for parallel computing in Python.
@@ -281,49 +265,18 @@
 
 This package contains the dask.diagnostics module
 
-%package delayed
-Summary:        Delayed module for dask
-Requires:       %{name} = %{version}
-
-%description delayed
-A flexible library for parallel computing in Python.
-
-Dask is composed of two parts:
-- Dynamic task scheduling optimized for computation. This is similar to
-  Airflow, Luigi, Celery, or Make, but optimized for interactive
-  computational workloads.
-- âBig Dataâ collections like parallel arrays, dataframes, and lists that
-  extend common interfaces like NumPy, Pandas, or Python iterators to
-  larger-than-memory or distributed environments. These parallel collections
-  run on top of dynamic task schedulers.
-
-This package contains the dask.delayed module
-
-%package dot
-Summary:        Display dask graphs using graphviz
-Requires:       %{name} = %{version}
-Requires:       graphviz
-Requires:       graphviz-gd
-Requires:       graphviz-gnome
-Requires:       python-graphviz
-
-%description dot
-A flexible library for parallel computing in Python.
-
-Dask is composed of two parts:
-- Dynamic task scheduling optimized for computation. This is similar to
-  Airflow, Luigi, Celery, or Make, but optimized for interactive
-  computational workloads.
-- âBig Dataâ collections like parallel arrays, dataframes, and lists that
-  extend common interfaces like NumPy, Pandas, or Python iterators to
-  larger-than-memory or distributed environments. These parallel collections
-  run on top of dynamic task schedulers.
-
-This package contains the graphviz dot rendering interface.
-
 %package test
 Summary:        The test submodules of the python-dask package
 Requires:       %{name}-complete = %{version}
+Requires:       python-pandas
+# SECTION pandas[test]
+Requires:       python-hypothesis
+Requires:       python-pytest-asyncio
+# /SECTION
+Requires:       python-pre-commit
+Requires:       python-pytest
+Requires:       python-pytest-rerunfailures
+Requires:       python-pytest-xdist
 
 %description test
 Dask is a flexible library for parallel computing in Python.
@@ -332,8 +285,7 @@
 
 %prep
 %autosetup -p1 -n dask-%{version}
-sed -i  '/addopts/d' setup.cfg
-chmod a-x dask/dataframe/io/orc/utils.py
+sed -i  '/addopts/d' pyproject.toml
 
 %build
 %pyproject_wheel
@@ -372,6 +324,8 @@
   # https://github.com/dask/dask/issues/8620
   donttest+=" or test_query_with_meta"
   donttest+=" or test_repartition_npartitions"
+  #
+  donttest+=" or test_pandas_multiindex"
 fi
 # (rarely) flaky on obs
 donttest+=" or test_local_scheduler"
@@ -397,12 +351,10 @@
 %{python_sitelib}/dask/
 %{python_sitelib}/dask-%{version}.dist-info
 %exclude %{python_sitelib}/dask/array/
-%exclude %{python_sitelib}/dask/bag/
 %exclude %{python_sitelib}/dask/dataframe/
 %exclude %{python_sitelib}/dask/diagnostics
-%exclude %{python_sitelib}/dask/delayed.py*
-%exclude %{python_sitelib}/dask/dot.py*
 %exclude %{python_sitelib}/dask/tests
+%exclude %{python_sitelib}/dask/bag/tests
 %exclude %{python_sitelib}/dask/bytes/tests
 %exclude %{python_sitelib}/dask/widgets/tests
 %pycache_only %exclude %{python_sitelib}/dask/__pycache__/delayed*.pyc
@@ -416,11 +368,6 @@
 %{python_sitelib}/dask/array/
 %exclude %{python_sitelib}/dask/array/tests
 
-%files %{python_files bag}
-%license LICENSE.txt
-%{python_sitelib}/dask/bag/
-%exclude %{python_sitelib}/dask/bag/tests
-
 %files %{python_files dataframe}
 %license LICENSE.txt
 %{python_sitelib}/dask/dataframe/
@@ -431,21 +378,11 @@
 %files %{python_files distributed}
 %license LICENSE.txt
 
-%files %{python_files dot}
-%license LICENSE.txt
-%{python_sitelib}/dask/dot.py*
-%pycache_only %{python_sitelib}/dask/__pycache__/dot.*
-
 %files %{python_files diagnostics}
 %license LICENSE.txt
 %{python_sitelib}/dask/diagnostics/
 %exclude %{python_sitelib}/dask/diagnostics/tests
 
-%files %{python_files delayed}
-%license LICENSE.txt
-%{python_sitelib}/dask/delayed.py*
-%pycache_only %{python_sitelib}/dask/__pycache__/delayed*.pyc
-
 %files %{python_files test}
 %license LICENSE.txt
 %{python_sitelib}/dask/tests

++++++ dask-2023.3.2.tar.gz -> dask-2023.5.1.tar.gz ++++++
/work/SRC/openSUSE:Factory/python-dask/dask-2023.3.2.tar.gz 
/work/SRC/openSUSE:Factory/.python-dask.new.15902/dask-2023.5.1.tar.gz differ: 
char 5, line 1

commit python-dask for openSUSE:Factory

Reply via email to