Hello community, here is the log from the commit of package python3-pandas for openSUSE:Factory checked in at 2015-03-25 10:01:34 ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++ Comparing /work/SRC/openSUSE:Factory/python3-pandas (Old) and /work/SRC/openSUSE:Factory/.python3-pandas.new (New) ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Package is "python3-pandas" Changes: -------- --- /work/SRC/openSUSE:Factory/python3-pandas/python3-pandas.changes 2015-02-16 22:13:29.000000000 +0100 +++ /work/SRC/openSUSE:Factory/.python3-pandas.new/python3-pandas.changes 2015-03-25 10:01:49.000000000 +0100 @@ -1,0 +2,244 @@ +Tue Mar 24 12:44:20 UTC 2015 - toddrme2...@gmail.com + +- update to version 0.16.0: + * Highlights: + - DataFrame.assign method + - Series.to_coo/from_coo methods to interact with scipy.sparse + - Backwards incompatible change to Timedelta to conform the .seconds + attribute with datetime.timedelta + - Changes to the .loc slicing API to conform with the behavior of .ix + - Changes to the default for ordering in the Categorical constructor + - Enhancement to the .str accessor to make string operations easier + - The pandas.tools.rplot, pandas.sandbox.qtpandas and pandas.rpy + modules are deprecated. We refer users to external packages like + seaborn, pandas-qt and rpy2 for similar or equivalent functionality + * New features + - Inspired by dplyr's mutate verb, DataFrame has a new assign method. + - Added SparseSeries.to_coo and SparseSeries.from_coo methods for + converting to and from scipy.sparse.coo_matrix instances. + - Following new methods are accesible via .str accessor to apply the + function to each values. This is intended to make it more consistent with + standard methods on strings: isalnum(), isalpha(), isdigit(), isdigit(), + isspace(), islower(), isupper(), istitle(), isnumeric(), isdecimal(), + find(), rfind(), ljust(), rjust(), zfill() + - Reindex now supports method='nearest' for frames or series with a + monotonic increasing or decreasing index. + - The read_excel() function's sheetname argument now accepts a list and + None, to get multiple or all sheets respectively. If more than one sheet + is specified, a dictionary is returned. + - Allow Stata files to be read incrementally with an iterator; support for + long strings in Stata files. + - Paths beginning with ~ will now be expanded to begin with the user's home + directory. + - Added time interval selection in get_data_yahoo. + - Added Timestamp.to_datetime64() to complement Timedelta.to_timedelta64(). + - tseries.frequencies.to_offset() now accepts Timedelta as input. + - Lag parameter was added to the autocorrelation method of Series, defaults + to lag-1 autocorrelation. + - Timedelta will now accept nanoseconds keyword in constructor. + - SQL code now safely escapes table and column names. + - Added auto-complete for Series.str.<tab>, Series.dt.<tab> and + Series.cat.<tab>. + - Index.get_indexer now supports method='pad' and method='backfill' even + for any target array, not just monotonic targets. + - Index.asof now works on all index types. + - A verbose argument has been augmented in io.read_excel(), defaults to + False. Set to True to print sheet names as they are parsed. + - Added days_in_month (compatibility alias daysinmonth) property to + Timestamp, DatetimeIndex, Period, PeriodIndex, and Series.dt. + - Added decimal option in to_csv to provide formatting for non-'.' decimal + separators + - Added normalize option for Timestamp to normalized to midnight + - Added example for DataFrame import to R using HDF5 file and rhdf5 + library. + * Backwards incompatible API changes + - In v0.16.0, we are restoring the API to match that of datetime.timedelta. + Further, the component values are still available through the .components + accessor. This affects the .seconds and .microseconds accessors, and + removes the .hours, .minutes, .milliseconds accessors. These changes + affect TimedeltaIndex and the Series .dt accessor as well. + - The behavior of a small sub-set of edge cases for using .loc have + changed. Furthermore we have improved the content of the error messages + that are raised: + + Slicing with .loc where the start and/or stop bound is not found in + the index is now allowed; this previously would raise a KeyError. This + makes the behavior the same as .ix in this case. This change is only + for slicing, not when indexing with a single label. + + Allow slicing with float-like values on an integer index for .ix. + Previously this was only enabled for .loc: + + Provide a useful exception for indexing with an invalid type for that + index when using .loc. For example trying to use .loc on an index of + type DatetimeIndex or PeriodIndex or TimedeltaIndex, with an integer + (or a float). + - In prior versions, Categoricals that had an unspecified ordering + (meaning no ordered keyword was passed) were defaulted as ordered + Categoricals. Going forward, the ordered keyword in the Categorical + constructor will default to False. Ordering must now be explicit. + Furthermore, previously you *could* change the ordered attribute of a + Categorical by just setting the attribute, e.g. cat.ordered=True; This is + now deprecated and you should use cat.as_ordered() or cat.as_unordered(). + These will by default return a **new** object and not modify the + existing object. + - Index.duplicated now returns np.array(dtype=bool) rather than + Index(dtype=object) containing bool values. + - DataFrame.to_json now returns accurate type serialisation for each column + for frames of mixed dtype + - DatetimeIndex, PeriodIndex and TimedeltaIndex.summary now output the same + format. + - TimedeltaIndex.freqstr now output the same string format as + DatetimeIndex. + - Bar and horizontal bar plots no longer add a dashed line along the info + axis. The prior style can be achieved with matplotlib's axhline or + axvline methods. + - Series accessors .dt, .cat and .str now raise AttributeError instead of + TypeError if the series does not contain the appropriate type of data. + This follows Python's built-in exception hierarchy more closely and + ensures that tests like hasattr(s, 'cat') are consistent on both Python + 2 and 3. + - Series now supports bitwise operation for integral types. Previously even + if the input dtypes were integral, the output dtype was coerced to bool. + - During division involving a Series or DataFrame, 0/0 and 0//0 now give + np.nan instead of np.inf. + - Series.values_counts and Series.describe for categorical data will now + put NaN entries at the end. + - Series.describe for categorical data will now give counts and frequencies + of 0, not NaN, for unused categories + - Due to a bug fix, looking up a partial string label with + DatetimeIndex.asof now includes values that match the string, even if + they are after the start of the partial string label. Old behavior: + * Deprecations + - The rplot trellis plotting interface is deprecated and will be removed + in a future version. We refer to external packages like + seaborn for similar but more refined functionality. + - The pandas.sandbox.qtpandas interface is deprecated and will be removed + in a future version. + We refer users to the external package pandas-qt. + - The pandas.rpy interface is deprecated and will be removed in a future + version. + Similar functionaility can be accessed thru the rpy2 project + - Adding DatetimeIndex/PeriodIndex to another DatetimeIndex/PeriodIndex is + being deprecated as a set-operation. This will be changed to a TypeError + in a future version. .union() should be used for the union set operation. + - Subtracting DatetimeIndex/PeriodIndex from another + DatetimeIndex/PeriodIndex is being deprecated as a set-operation. This + will be changed to an actual numeric subtraction yielding a + TimeDeltaIndex in a future version. .difference() should be used for + the differencing set operation. + * Removal of prior version deprecations/changes + - DataFrame.pivot_table and crosstab's rows and cols keyword arguments were + removed in favor + of index and columns + - DataFrame.to_excel and DataFrame.to_csv cols keyword argument was removed + in favor of columns + - Removed convert_dummies in favor of get_dummies + - Removed value_range in favor of describe + * Performance Improvements + - Fixed a performance regression for .loc indexing with an array or + list-like. + - DataFrame.to_json 30x performance improvement for mixed dtype frames. + - Performance improvements in MultiIndex.duplicated by working with labels + instead of values + - Improved the speed of nunique by calling unique instead of value_counts + - Performance improvement of up to 10x in DataFrame.count and + DataFrame.dropna by taking advantage of homogeneous/heterogeneous dtypes + appropriately + - Performance improvement of up to 20x in DataFrame.count when using a + MultiIndex and the level keyword argument + - Performance and memory usage improvements in merge when key space exceeds + int64 bounds + - Performance improvements in multi-key groupby + - Performance improvements in MultiIndex.sortlevel + - Performance and memory usage improvements in DataFrame.duplicated + - Cythonized Period + - Decreased memory usage on to_hdf + * Bug Fixes + - Changed .to_html to remove leading/trailing spaces in table body + - Fixed issue using read_csv on s3 with Python 3 + - Fixed compatibility issue in DatetimeIndex affecting architectures where + numpy.int_ defaults to numpy.int32 + - Bug in Panel indexing with an object-like + - Bug in the returned Series.dt.components index was reset to the default + index + - Bug in Categorical.__getitem__/__setitem__ with listlike input getting + incorrect results from indexer coercion + - Bug in partial setting with a DatetimeIndex + - Bug in groupby for integer and datetime64 columns when applying an + aggregator that caused the value to be + changed when the number was sufficiently large + - Fixed bug in to_sql when mapping a Timestamp object column (datetime + column with timezone info) to the appropriate sqlalchemy type. + - Fixed bug in to_sql dtype argument not accepting an instantiated + SQLAlchemy type. + - Bug in .loc partial setting with a np.datetime64 + - Incorrect dtypes inferred on datetimelike looking Series & on .xs slices + - Items in Categorical.unique() (and s.unique() if s is of dtype category) + now appear in the order in which they are originally found, not in sorted + order. This is now consistent with the behavior for other dtypes in pandas. + - Fixed bug on big endian platforms which produced incorrect results in + StataReader. + - Bug in MultiIndex.has_duplicates when having many levels causes an + indexer overflow + - Bug in pivot and unstack where nan values would break index alignment + - Bug in left join on multi-index with sort=True or null values. + - Bug in MultiIndex where inserting new keys would fail. + - Bug in groupby when key space exceeds int64 bounds. + - Bug in unstack with TimedeltaIndex or DatetimeIndex and nulls. + - Bug in rank where comparing floats with tolerance will cause inconsistent + behaviour. + - Fixed character encoding bug in read_stata and StataReader when loading + data from a URL. + - Bug in adding offsets.Nano to other offets raises TypeError + - Bug in DatetimeIndex iteration, related to, fixed in + - Bugs in resample around DST transitions. This required fixing offset + classes so they behave correctly on DST transitions. + - Bug in binary operator method (eg .mul()) alignment with integer levels. + - Bug in boxplot, scatter and hexbin plot may show an unnecessary warning + - Bug in subplot with layout kw may show unnecessary warning + - Bug in using grouper functions that need passed thru arguments (e.g. ++++ 47 more lines (skipped) ++++ between /work/SRC/openSUSE:Factory/python3-pandas/python3-pandas.changes ++++ and /work/SRC/openSUSE:Factory/.python3-pandas.new/python3-pandas.changes Old: ---- pandas-0.15.2.tar.gz New: ---- pandas-0.16.0.tar.gz ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++ Other differences: ------------------ ++++++ python3-pandas.spec ++++++ --- /var/tmp/diff_new_pack.c8Sx1Q/_old 2015-03-25 10:01:50.000000000 +0100 +++ /var/tmp/diff_new_pack.c8Sx1Q/_new 2015-03-25 10:01:50.000000000 +0100 @@ -17,7 +17,7 @@ Name: python3-pandas -Version: 0.15.2 +Version: 0.16.0 Release: 0 Summary: Make working with "relational" or "labeled" data both easy and intuitive License: BSD-3-Clause @@ -97,6 +97,10 @@ %install python3 setup.py install --root %{buildroot} --prefix=%{_prefix} +pushd doc +python3 make.py html +popd + rm -r %{buildroot}%{python3_sitearch}/pandas/tests %fdupes %{buildroot}%{python3_sitearch} @@ -116,6 +120,6 @@ %files doc %defattr(-,root,root) -%doc examples +%doc doc/build/html/ %changelog ++++++ pandas-0.15.2.tar.gz -> pandas-0.16.0.tar.gz ++++++ ++++ 281920 lines of diff (skipped) -- To unsubscribe, e-mail: opensuse-commit+unsubscr...@opensuse.org For additional commands, e-mail: opensuse-commit+h...@opensuse.org