Skip to content

Commit 5c848db

Browse files
authored
Merge branch 'main' into newfeature
2 parents 9ecaf66 + 2a1ca9d commit 5c848db

38 files changed

+345
-167
lines changed

.github/workflows/wheels.yml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -156,7 +156,7 @@ jobs:
156156
run: echo "sdist_name=$(cd ./dist && ls -d */)" >> "$GITHUB_ENV"
157157

158158
- name: Build wheels
159-
uses: pypa/[email protected].0
159+
uses: pypa/[email protected].3
160160
with:
161161
package-dir: ./dist/${{ startsWith(matrix.buildplat[1], 'macosx') && env.sdist_name || needs.build_sdist.outputs.sdist_file }}
162162
env:

ci/code_checks.sh

Lines changed: 0 additions & 9 deletions
Original file line numberDiff line numberDiff line change
@@ -85,8 +85,6 @@ if [[ -z "$CHECK" || "$CHECK" == "docstrings" ]]; then
8585
-i "pandas.Timestamp.resolution PR02" \
8686
-i "pandas.Timestamp.tzinfo GL08" \
8787
-i "pandas.Timestamp.year GL08" \
88-
-i "pandas.api.types.is_integer PR01,SA01" \
89-
-i "pandas.api.types.is_iterator PR07,SA01" \
9088
-i "pandas.api.types.is_re_compilable PR07,SA01" \
9189
-i "pandas.api.types.pandas_dtype PR07,RT03,SA01" \
9290
-i "pandas.arrays.ArrowExtensionArray PR07,SA01" \
@@ -123,11 +121,9 @@ if [[ -z "$CHECK" || "$CHECK" == "docstrings" ]]; then
123121
-i "pandas.core.resample.Resampler.quantile PR01,PR07" \
124122
-i "pandas.core.resample.Resampler.sem SA01" \
125123
-i "pandas.core.resample.Resampler.std SA01" \
126-
-i "pandas.core.resample.Resampler.sum SA01" \
127124
-i "pandas.core.resample.Resampler.transform PR01,RT03,SA01" \
128125
-i "pandas.core.resample.Resampler.var SA01" \
129126
-i "pandas.errors.AttributeConflictWarning SA01" \
130-
-i "pandas.errors.CSSWarning SA01" \
131127
-i "pandas.errors.ChainedAssignmentError SA01" \
132128
-i "pandas.errors.DataError SA01" \
133129
-i "pandas.errors.DuplicateLabelError SA01" \
@@ -136,21 +132,17 @@ if [[ -z "$CHECK" || "$CHECK" == "docstrings" ]]; then
136132
-i "pandas.errors.NullFrequencyError SA01" \
137133
-i "pandas.errors.NumExprClobberingError SA01" \
138134
-i "pandas.errors.NumbaUtilError SA01" \
139-
-i "pandas.errors.OptionError SA01" \
140135
-i "pandas.errors.OutOfBoundsTimedelta SA01" \
141136
-i "pandas.errors.PerformanceWarning SA01" \
142137
-i "pandas.errors.PossibleDataLossError SA01" \
143138
-i "pandas.errors.UndefinedVariableError PR01,SA01" \
144139
-i "pandas.errors.UnsortedIndexError SA01" \
145-
-i "pandas.errors.UnsupportedFunctionCall SA01" \
146140
-i "pandas.errors.ValueLabelTypeMismatch SA01" \
147141
-i "pandas.infer_freq SA01" \
148142
-i "pandas.io.json.build_table_schema PR07,RT03,SA01" \
149143
-i "pandas.io.stata.StataWriter.write_file SA01" \
150-
-i "pandas.json_normalize RT03,SA01" \
151144
-i "pandas.plotting.andrews_curves RT03,SA01" \
152145
-i "pandas.plotting.scatter_matrix PR07,SA01" \
153-
-i "pandas.set_eng_float_format RT03,SA01" \
154146
-i "pandas.tseries.offsets.BDay PR02,SA01" \
155147
-i "pandas.tseries.offsets.BQuarterBegin.is_on_offset GL08" \
156148
-i "pandas.tseries.offsets.BQuarterBegin.n GL08" \
@@ -296,7 +288,6 @@ if [[ -z "$CHECK" || "$CHECK" == "docstrings" ]]; then
296288
-i "pandas.tseries.offsets.Second.is_on_offset GL08" \
297289
-i "pandas.tseries.offsets.Second.n GL08" \
298290
-i "pandas.tseries.offsets.Second.normalize GL08" \
299-
-i "pandas.tseries.offsets.SemiMonthBegin SA01" \
300291
-i "pandas.tseries.offsets.SemiMonthBegin.day_of_month GL08" \
301292
-i "pandas.tseries.offsets.SemiMonthBegin.is_on_offset GL08" \
302293
-i "pandas.tseries.offsets.SemiMonthBegin.n GL08" \

doc/source/development/contributing_codebase.rst

Lines changed: 6 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -298,6 +298,12 @@ So, before actually writing any code, you should write your tests. Often the te
298298
taken from the original GitHub issue. However, it is always worth considering additional
299299
use cases and writing corresponding tests.
300300

301+
We use `code coverage <https://en.wikipedia.org/wiki/Code_coverage>`_ to help understand
302+
the amount of code which is covered by a test. We recommend striving to ensure code
303+
you add or change within Pandas is covered by a test. Please see our
304+
`code coverage dashboard through Codecov <https://app.codecov.io/github/pandas-dev/pandas>`_
305+
for more information.
306+
301307
Adding tests is one of the most common requests after code is pushed to pandas. Therefore,
302308
it is worth getting in the habit of writing tests ahead of time so this is never an issue.
303309

doc/source/user_guide/cookbook.rst

Lines changed: 3 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -35,7 +35,7 @@ These are some neat pandas ``idioms``
3535
)
3636
df
3737
38-
if-then...
38+
If-then...
3939
**********
4040

4141
An if-then on one column
@@ -176,7 +176,7 @@ One could hard code:
176176
Selection
177177
---------
178178

179-
Dataframes
179+
DataFrames
180180
**********
181181

182182
The :ref:`indexing <indexing>` docs.
@@ -1489,7 +1489,7 @@ of the data values:
14891489
)
14901490
df
14911491
1492-
Constant series
1492+
Constant Series
14931493
---------------
14941494

14951495
To assess if a series has a constant value, we can check if ``series.nunique() <= 1``.

doc/source/user_guide/gotchas.rst

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -121,7 +121,7 @@ Below is how to check if any of the values are ``True``:
121121
if pd.Series([False, True, False]).any():
122122
print("I am any")
123123
124-
Bitwise boolean
124+
Bitwise Boolean
125125
~~~~~~~~~~~~~~~
126126

127127
Bitwise boolean operators like ``==`` and ``!=`` return a boolean :class:`Series`

doc/source/user_guide/groupby.rst

Lines changed: 3 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -618,7 +618,7 @@ this will make an extra copy.
618618
619619
.. _groupby.aggregate.udf:
620620

621-
Aggregation with User-Defined Functions
621+
Aggregation with user-defined functions
622622
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
623623

624624
Users can also provide their own User-Defined Functions (UDFs) for custom aggregations.
@@ -1261,7 +1261,7 @@ with
12611261
df.groupby("A", group_keys=False).apply(lambda x: x, include_groups=False)
12621262
12631263
1264-
Numba Accelerated Routines
1264+
Numba accelerated routines
12651265
--------------------------
12661266

12671267
.. versionadded:: 1.1
@@ -1696,7 +1696,7 @@ introduction <categorical>` and the
16961696
16971697
dfg.groupby(["A", [0, 0, 0, 1, 1]]).ngroup()
16981698
1699-
Groupby by indexer to 'resample' data
1699+
GroupBy by indexer to 'resample' data
17001700
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
17011701

17021702
Resampling produces new hypothetical samples (resamples) from already existing observed data or from a model that generates data. These new samples are similar to the pre-existing samples.

doc/source/user_guide/integer_na.rst

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -147,7 +147,7 @@ Reduction and groupby operations such as :meth:`~DataFrame.sum` work as well.
147147
df.sum()
148148
df.groupby("B").A.sum()
149149
150-
Scalar NA Value
150+
Scalar NA value
151151
---------------
152152

153153
:class:`arrays.IntegerArray` uses :attr:`pandas.NA` as its scalar

doc/source/user_guide/io.rst

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -5996,7 +5996,7 @@ Full documentation can be found `here <https://pandas-gbq.readthedocs.io/en/late
59965996

59975997
.. _io.stata:
59985998

5999-
Stata format
5999+
STATA format
60006000
------------
60016001

60026002
.. _io.stata_writer:

doc/source/whatsnew/v1.0.2.rst

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -47,7 +47,7 @@ Fixed regressions
4747

4848
.. ---------------------------------------------------------------------------
4949
50-
Indexing with nullable boolean arrays
50+
Indexing with nullable Boolean arrays
5151
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
5252

5353
Previously indexing with a nullable Boolean array containing ``NA`` would raise a ``ValueError``, however this is now permitted with ``NA`` being treated as ``False``. (:issue:`31503`)

doc/source/whatsnew/v3.0.0.rst

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -592,6 +592,7 @@ Performance improvements
592592
- Performance improvement in :meth:`RangeIndex.take` returning a :class:`RangeIndex` instead of a :class:`Index` when possible. (:issue:`57445`, :issue:`57752`)
593593
- Performance improvement in :func:`merge` if hash-join can be used (:issue:`57970`)
594594
- Performance improvement in :meth:`CategoricalDtype.update_dtype` when ``dtype`` is a :class:`CategoricalDtype` with non ``None`` categories and ordered (:issue:`59647`)
595+
- Performance improvement in :meth:`DataFrame.astype` when converting to extension floating dtypes, e.g. "Float64" (:issue:`60066`)
595596
- Performance improvement in :meth:`to_hdf` avoid unnecessary reopenings of the HDF5 file to speedup data addition to files with a very large number of groups . (:issue:`58248`)
596597
- Performance improvement in ``DataFrameGroupBy.__len__`` and ``SeriesGroupBy.__len__`` (:issue:`57595`)
597598
- Performance improvement in indexing operations for string dtypes (:issue:`56997`)

0 commit comments

Comments
 (0)