Script 'mail_helper' called by obssrc Hello community, here is the log from the commit of package python-dask for openSUSE:Factory checked in at 2023-06-07 23:06:37 ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++ Comparing /work/SRC/openSUSE:Factory/python-dask (Old) and /work/SRC/openSUSE:Factory/.python-dask.new.15902 (New) ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Package is "python-dask" Wed Jun 7 23:06:37 2023 rev:67 rq:1090990 version:2023.5.1 Changes: -------- --- /work/SRC/openSUSE:Factory/python-dask/python-dask.changes 2023-04-05 21:35:26.714496609 +0200 +++ /work/SRC/openSUSE:Factory/.python-dask.new.15902/python-dask.changes 2023-06-07 23:07:06.995236946 +0200 @@ -1,0 +2,143 @@ +Mon Jun 5 23:42:44 UTC 2023 - Steve Kowalik <steven.kowa...@suse.com> + +- Tighten bokeh requirement to match distributed. + +------------------------------------------------------------------- +Fri May 26 19:59:37 UTC 2023 - Ben Greiner <c...@bnavigator.de> + +- Update to 2023.5.1 + * This release drops support for Python 3.8. As of this release + Dask supports Python 3.9, 3.10, and 3.11. + ## Enhancements + * Drop Python 3.8 support (GH#10295) Thomas Grainger + * Change Dask Bag partitioning scheme to improve cluster + saturation (GH#10294) Jacob Tomlinson + * Generalize dd.to_datetime for GPU-backed collections, introduce + get_meta_library utility (GH#9881) Charles Blackmon-Luca + * Add na_action to DataFrame.map (GH#10305) Patrick Hoefler + * Raise TypeError in DataFrame.nsmallest and DataFrame.nlargest + when columns is not given (GH#10301) Patrick Hoefler + * Improve sizeof for pd.MultiIndex (GH#10230) Patrick Hoefler + * Support duplicated columns in a bunch of DataFrame methods + (GH#10261) Patrick Hoefler + * Add numeric_only support to DataFrame.idxmin and + DataFrame.idxmax (GH#10253) Patrick Hoefler + * Implement numeric_only support for DataFrame.quantile + (GH#10259) Patrick Hoefler + * Add support for numeric_only=False in DataFrame.std (GH#10251) + Patrick Hoefler + * Implement numeric_only=False for GroupBy.cumprod and + GroupBy.cumsum (GH#10262) Patrick Hoefler + * Implement numeric_only for skew and kurtosis (GH#10258) Patrick + Hoefler + * mask and where should accept a callable (GH#10289) Irina Truong + * Fix conversion from Categorical to pa.dictionary in + read_parquet (GH#10285) Patrick Hoefler + ## Bug Fixes + * Spurious config on nested annotations (GH#10318) crusaderky + * Fix rechunking behavior for dimensions with known and unknown + chunk sizes (GH#10157) Hendrik Makait + * Enable drop to support mismatched partitions (GH#10300) James + Bourbeau + * Fix divisions construction for to_timestamp (GH#10304) Patrick + Hoefler + * pandas ExtensionDtype raising in Series reduction operations + (GH#10149) Patrick Hoefler + * Fix regression in da.random interface (GH#10247) Eray Aslan + * da.coarsen doesnât trim an empty chunk in meta (GH#10281) Irina + Truong + * Fix dtype inference for engine="pyarrow" in read_csv (GH#10280) + Patrick Hoefler +- Release 2023.5.0 + ## Enhancements + * Implement numeric_only=False for GroupBy.corr and GroupBy.cov + (GH#10264) Patrick Hoefler + * Add support for numeric_only=False in DataFrame.var (GH#10250) + Patrick Hoefler + * Add numeric_only support to DataFrame.mode (GH#10257) Patrick + Hoefler + * Add DataFrame.map to dask.DataFrame API (GH#10246) Patrick + Hoefler + * Adjust for DataFrame.applymap deprecation and all NA concat + behaviour change (GH#10245) Patrick Hoefler + * Enable numeric_only=False for DataFrame.count (GH#10234) + Patrick Hoefler + * Disallow array input in mask/where (GH#10163) Irina Truong + * Support numeric_only=True in GroupBy.corr and GroupBy.cov + (GH#10227) Patrick Hoefler + * Add numeric_only support to GroupBy.median (GH#10236) Patrick + Hoefler + * Support mimesis=9 in dask.datasets (GH#10241) James Bourbeau + * Add numeric_only support to min, max and prod (GH#10219) + Patrick Hoefler + * Add numeric_only=True support for GroupBy.cumsum and + GroupBy.cumprod (GH#10224) Patrick Hoefler + * Add helper to unpack numeric_only keyword (GH#10228) Patrick + Hoefler + ## Bug Fixes + * Fix clone + from_array failure (GH#10211) crusaderky + * Fix dataframe reductions for ea dtypes (GH#10150) Patrick + Hoefler + * Avoid scalar conversion deprecation warning in numpy=1.25 + (GH#10248) James Bourbeau + * Make sure transform output has the same index as input + (GH#10184) Irina Truong + * Fix corr and cov on a single-row partition (GH#9756) Irina + Truong + * Fix test_groupby_numeric_only_supported and + test_groupby_aggregate_categorical_observed upstream errors + (GH#10243) Irina Truong +- Release 2023.4.1 + ## Enhancements + * Implement numeric_only support for DataFrame.sum (GH#10194) + Patrick Hoefler + * Add support for numeric_only=True in GroupBy operations + (GH#10222) Patrick Hoefler + * Avoid deep copy in DataFrame.__setitem__ for pandas 1.4 and up + (GH#10221) Patrick Hoefler + * Avoid calling Series.apply with _meta_nonempty (GH#10212) + Patrick Hoefler + * Unpin sqlalchemy and fix compatibility issues (GH#10140) + Patrick Hoefler + ## Bug Fixes + * Partially revert default client discovery (GH#10225) Florian + Jetter + * Support arrow dtypes in Index meta creation (GH#10170) Patrick + Hoefler + * Repartitioning raises with extension dtype when truncating + floats (GH#10169) Patrick Hoefler + * Adjust empty Index from fastparquet to object dtype (GH#10179) + Patrick Hoefler +- Release 2023.4.0 + ## Enhancements + * Override old default values in update_defaults (GH#10159) Gabe + Joseph + * Add a CLI command to list and get a value from dask config + (GH#9936) Irina Truong + * Handle string-based engine argument to read_json (GH#9947) + Richard (Rick) Zamora + * Avoid deprecated GroupBy.dtypes (GH#10111) Irina Truong + ## Bug Fixes + * Revert grouper-related changes (GH#10182) Irina Truong + * GroupBy.cov raising for non-numeric grouping column (GH#10171) + Patrick Hoefler + * Updates for Index supporting numpy numeric dtypes (GH#10154) + Irina Truong + * Preserve dtype for partitioning columns when read with pyarrow + (GH#10115) Patrick Hoefler + * Fix annotations for to_hdf (GH#10123) Hendrik Makait + * Handle None column name when checking if columns are all + numeric (GH#10128) Lawrence Mitchell + * Fix valid_divisions when passed a tuple (GH#10126) Brian + Phillips + * Maintain annotations in DataFrame.categorize (GH#10120) Hendrik + Makait + * Fix handling of missing min/max parquet statistics during + filtering (GH#10042) Richard (Rick) Zamora + ## Deprecations + * Deprecate use_nullable_dtypes= and add dtype_backend= + (GH#10076) Irina Truong + * Deprecate convert_dtype in Series.apply (GH#10133) Irina Truong +- Drop dask-pr10042-parquetstats.patch + +------------------------------------------------------------------- Old: ---- dask-2023.3.2.tar.gz dask-pr10042-parquetstats.patch New: ---- dask-2023.5.1.tar.gz ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++ Other differences: ------------------ ++++++ python-dask.spec ++++++ --- /var/tmp/diff_new_pack.BzPyfA/_old 2023-06-07 23:07:07.711241103 +0200 +++ /var/tmp/diff_new_pack.BzPyfA/_new 2023-06-07 23:07:07.719241149 +0200 @@ -45,61 +45,72 @@ Name: python-dask%{psuffix} # ===> Note: python-dask MUST be updated in sync with python-distributed! <=== -Version: 2023.3.2 +Version: 2023.5.1 Release: 0 Summary: Minimal task scheduling abstraction License: BSD-3-Clause URL: https://dask.org # SourceRepository: https://github.com/dask/dask Source0: https://files.pythonhosted.org/packages/source/d/dask/dask-%{version}.tar.gz -# PATCH-FIX-UPSTREAM dask-pr10042-parquetstats.patch gh#dask/dask#10042 -Patch0: dask-pr10042-parquetstats.patch -BuildRequires: %{python_module base >= 3.8} +BuildRequires: %{python_module base >= 3.9} BuildRequires: %{python_module packaging >= 20.0} BuildRequires: %{python_module pip} BuildRequires: %{python_module setuptools} +BuildRequires: %{python_module versioneer-toml >= 0.28} BuildRequires: %{python_module wheel} BuildRequires: fdupes BuildRequires: python-rpm-macros Requires: python-PyYAML >= 5.3.1 -Requires: python-click >= 7 -Requires: python-cloudpickle >= 1.1.1 -Requires: python-fsspec >= 0.6.0 +Requires: python-click >= 8 +Requires: python-cloudpickle >= 1.5 +Requires: python-fsspec >= 2021.9 Requires: python-importlib-metadata >= 4.13.0 Requires: python-packaging >= 20.0 Requires: python-partd >= 1.2.0 -Requires: python-toolz >= 0.8.2 +Requires: python-toolz >= 0.10.0 Requires(post): update-alternatives Requires(postun):update-alternatives Recommends: %{name}-array = %{version} -Recommends: %{name}-bag = %{version} Recommends: %{name}-dataframe = %{version} -Recommends: %{name}-delayed = %{version} Recommends: %{name}-distributed = %{version} -Recommends: %{name}-dot = %{version} -Recommends: python-SQLAlchemy >= 1.4.0 -Recommends: python-cityhash -Recommends: python-fastparquet -Recommends: python-gcsfs >= 0.4.0 -Recommends: python-murmurhash -Recommends: python-psutil -Recommends: python-pyarrow >= 0.14.0 -Recommends: python-s3fs >= 0.4.0 -Recommends: python-xxhash Suggests: %{name}-complete = %{version} Suggests: %{name}-diagnostics = %{version} +# SECTION https://docs.dask.org/en/stable/install.html#optional-dependencies +Suggests: python-SQLAlchemy >= 1.4.16 +Suggests: python-cityhash >= 0.2.4 +Suggests: python-fastparquet >= 0.8.2 +Suggests: python-gcsfs >= 2021.9.0 +Suggests: python-crick >= 0.0.3 +Suggests: python-cytoolz >= 0.10.1 +Suggests: python-dask-ml >= 1.4.0 +Suggests: python-fastavro >= 0.22.6 +Suggests: python-graphviz >= 0.8.4 +Suggests: python-h5py >= 2.10.0 +Suggests: python-psutil >= 0.5.7 +Suggests: python-pyarrow >= 0.14.0 +Suggests: python-matplotlib +Suggests: python-mimesis >= 5.3.0 +Suggests: python-mmh3 >= 2.5.1 +Suggests: python-sparse >= 0.12.0 +Suggests: python-s3fs >= 0.4.0 +Suggests: python-xxhash >= 1.4.1 +Suggests: python-zarr >= 2.12.0 +# /SECTION +Provides: %{name}-bag = %{version}-%{release} +Obsoletes: %{name}-bag < %{version}-%{release} +Provides: %{name}-delayed = %{version}-%{release} +Obsoletes: %{name}-delayed < %{version}-%{release} +Provides: %{name}-dot = %{version}-%{release} +Obsoletes: %{name}-dot < %{version}-%{release} Provides: %{name}-multiprocessing = %{version}-%{release} Obsoletes: %{name}-multiprocessing < %{version}-%{release} BuildArch: noarch %if %{with test} # test that we specified all requirements correctly in the core -# and subpackages by only requiring dask-complete and optional extras +# and subpackages by only requiring dask-test (= [complete] + pytest) and optional extras BuildRequires: %{python_module dask-test = %{version}} -BuildRequires: %{python_module pytest-rerunfailures} -BuildRequires: %{python_module pytest-xdist} -BuildRequires: %{python_module pytest} # SECTION additional optionally tested (importorskip) packages -BuildRequires: %{python_module SQLAlchemy >= 1.4.0} +BuildRequires: %{python_module SQLAlchemy >= 1.4.16} BuildRequires: %{python_module cachey} BuildRequires: %{python_module fastparquet >= 0.8.0} # optional zarr increases fsspec miminum to 0.8.4 if present @@ -110,15 +121,15 @@ BuildRequires: %{python_module matplotlib} BuildRequires: %{python_module mimesis} BuildRequires: %{python_module multipledispatch} -BuildRequires: %{python_module numba if %python-base < 3.11} +BuildRequires: %{python_module numba} # snappy required for using fastparquet BuildRequires: %{python_module python-snappy} BuildRequires: %{python_module requests} BuildRequires: %{python_module scikit-image} BuildRequires: %{python_module scipy} -BuildRequires: %{python_module sparse if %python-base < 3.11} +BuildRequires: %{python_module sparse} BuildRequires: %{python_module tables} -BuildRequires: %{python_module xarray if %python-base >= 3.9} +BuildRequires: %{python_module xarray} BuildRequires: %{python_module zarr} # /SECTION %endif @@ -141,12 +152,9 @@ Summary: All dask components Requires: %{name} = %{version} Requires: %{name}-array = %{version} -Requires: %{name}-bag = %{version} Requires: %{name}-dataframe = %{version} -Requires: %{name}-delayed = %{version} Requires: %{name}-diagnostics = %{version} Requires: %{name}-distributed = %{version} -Requires: %{name}-dot = %{version} Requires: python-lz4 >= 4.3.2 Requires: python-pyarrow >= 7 Provides: %{name}-all = %{version}-%{release} @@ -190,29 +198,6 @@ Dask arrays implement a subset of the NumPy interface on large arrays using blocked algorithms and task scheduling. -%package bag -Summary: Data structure generic python objects in dask -Requires: %{name} = %{version} - -%description bag -A flexible library for parallel computing in Python. - -Dask is composed of two parts: -- Dynamic task scheduling optimized for computation. This is similar to - Airflow, Luigi, Celery, or Make, but optimized for interactive - computational workloads. -- âBig Dataâ collections like parallel arrays, dataframes, and lists that - extend common interfaces like NumPy, Pandas, or Python iterators to - larger-than-memory or distributed environments. These parallel collections - run on top of dynamic task schedulers. - -This package contains the dask bag class. - -Dask.Bag parallelizes computations across a large collection of -generic Python objects. It is particularly useful when dealing -with large quantities of semi-structured data like JSON blobs -or log files. - %package dataframe Summary: Pandas-like DataFrame data structure for dask Requires: %{name} = %{version} @@ -264,8 +249,7 @@ Summary: Diagnostics for dask Requires: %{name} = %{version} Requires: python-Jinja2 >= 2.10.3 -# Not ready yet: https://github.com/dask/dask/pull/9659, we provide a legacy bokeh2 in Tumbleweed -Requires: (python-bokeh >= 2.4.2 with python-bokeh < 3) +Requires: python-bokeh >= 3.1 %description diagnostics A flexible library for parallel computing in Python. @@ -281,49 +265,18 @@ This package contains the dask.diagnostics module -%package delayed -Summary: Delayed module for dask -Requires: %{name} = %{version} - -%description delayed -A flexible library for parallel computing in Python. - -Dask is composed of two parts: -- Dynamic task scheduling optimized for computation. This is similar to - Airflow, Luigi, Celery, or Make, but optimized for interactive - computational workloads. -- âBig Dataâ collections like parallel arrays, dataframes, and lists that - extend common interfaces like NumPy, Pandas, or Python iterators to - larger-than-memory or distributed environments. These parallel collections - run on top of dynamic task schedulers. - -This package contains the dask.delayed module - -%package dot -Summary: Display dask graphs using graphviz -Requires: %{name} = %{version} -Requires: graphviz -Requires: graphviz-gd -Requires: graphviz-gnome -Requires: python-graphviz - -%description dot -A flexible library for parallel computing in Python. - -Dask is composed of two parts: -- Dynamic task scheduling optimized for computation. This is similar to - Airflow, Luigi, Celery, or Make, but optimized for interactive - computational workloads. -- âBig Dataâ collections like parallel arrays, dataframes, and lists that - extend common interfaces like NumPy, Pandas, or Python iterators to - larger-than-memory or distributed environments. These parallel collections - run on top of dynamic task schedulers. - -This package contains the graphviz dot rendering interface. - %package test Summary: The test submodules of the python-dask package Requires: %{name}-complete = %{version} +Requires: python-pandas +# SECTION pandas[test] +Requires: python-hypothesis +Requires: python-pytest-asyncio +# /SECTION +Requires: python-pre-commit +Requires: python-pytest +Requires: python-pytest-rerunfailures +Requires: python-pytest-xdist %description test Dask is a flexible library for parallel computing in Python. @@ -332,8 +285,7 @@ %prep %autosetup -p1 -n dask-%{version} -sed -i '/addopts/d' setup.cfg -chmod a-x dask/dataframe/io/orc/utils.py +sed -i '/addopts/d' pyproject.toml %build %pyproject_wheel @@ -372,6 +324,8 @@ # https://github.com/dask/dask/issues/8620 donttest+=" or test_query_with_meta" donttest+=" or test_repartition_npartitions" + # + donttest+=" or test_pandas_multiindex" fi # (rarely) flaky on obs donttest+=" or test_local_scheduler" @@ -397,12 +351,10 @@ %{python_sitelib}/dask/ %{python_sitelib}/dask-%{version}.dist-info %exclude %{python_sitelib}/dask/array/ -%exclude %{python_sitelib}/dask/bag/ %exclude %{python_sitelib}/dask/dataframe/ %exclude %{python_sitelib}/dask/diagnostics -%exclude %{python_sitelib}/dask/delayed.py* -%exclude %{python_sitelib}/dask/dot.py* %exclude %{python_sitelib}/dask/tests +%exclude %{python_sitelib}/dask/bag/tests %exclude %{python_sitelib}/dask/bytes/tests %exclude %{python_sitelib}/dask/widgets/tests %pycache_only %exclude %{python_sitelib}/dask/__pycache__/delayed*.pyc @@ -416,11 +368,6 @@ %{python_sitelib}/dask/array/ %exclude %{python_sitelib}/dask/array/tests -%files %{python_files bag} -%license LICENSE.txt -%{python_sitelib}/dask/bag/ -%exclude %{python_sitelib}/dask/bag/tests - %files %{python_files dataframe} %license LICENSE.txt %{python_sitelib}/dask/dataframe/ @@ -431,21 +378,11 @@ %files %{python_files distributed} %license LICENSE.txt -%files %{python_files dot} -%license LICENSE.txt -%{python_sitelib}/dask/dot.py* -%pycache_only %{python_sitelib}/dask/__pycache__/dot.* - %files %{python_files diagnostics} %license LICENSE.txt %{python_sitelib}/dask/diagnostics/ %exclude %{python_sitelib}/dask/diagnostics/tests -%files %{python_files delayed} -%license LICENSE.txt -%{python_sitelib}/dask/delayed.py* -%pycache_only %{python_sitelib}/dask/__pycache__/delayed*.pyc - %files %{python_files test} %license LICENSE.txt %{python_sitelib}/dask/tests ++++++ dask-2023.3.2.tar.gz -> dask-2023.5.1.tar.gz ++++++ /work/SRC/openSUSE:Factory/python-dask/dask-2023.3.2.tar.gz /work/SRC/openSUSE:Factory/.python-dask.new.15902/dask-2023.5.1.tar.gz differ: char 5, line 1