BUG: GH10355 groupby std() doesnt sqrt grouping cols #11507

henrystokeley · 2015-11-03T08:17:46Z

New attempt at #10355. Hopefully this addresses the issues raised in #11300.

Previously, grouping columns were square-rooted when as_index=False.
We now test whether the grouping keys are in the columns and,
if so, don't take the square root of those columns.

Note that we squash the TypeError that occurs when self.keys is not
hashable, since in that case we can't check for existence in the columns.
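
To make the intended behaviour concrete, here is a minimal sketch that uses only the public pandas API. It is not the code from this PR: `std_keep_keys` is a hypothetical helper, and it assumes the grouping keys are column names (grouping by index levels or by arrays is not handled).

```python
import numpy as np
import pandas as pd


def std_keep_keys(df, keys, ddof=1):
    """Group-wise std that leaves the grouping columns untouched.

    Hypothetical helper for illustration only; assumes `keys` is a
    column name or a list of column names.
    """
    key_list = [keys] if isinstance(keys, str) else list(keys)
    # With as_index=False the grouping columns come back as ordinary columns.
    out = df.groupby(key_list, as_index=False).var(ddof=ddof)
    # Take the square root of the value columns only, never the keys.
    value_cols = [c for c in out.columns if c not in key_list]
    out[value_cols] = np.sqrt(out[value_cols])
    return out


df = pd.DataFrame({
    "a": [1, 1, 1, 4, 4, 4, 9, 9, 9],
    "b": [1, 2, 3, 4, 6, 8, 7, 10, 13],
})
result = std_keep_keys(df, "a")
print(result)
```

The keys 1, 4, 9 are chosen so that an accidental square root would be visible: a buggy result would show 1, 2, 3 in column `a`.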

jreback · 2015-11-04T13:47:02Z

pandas/core/groupby.py

+            return np.sqrt(self.var(ddof=ddof))
+        else:
+            df = self.var(ddof=ddof)
+


you don't need all of this logic, just use self._selected_obj

jreback · 2015-11-04T13:47:36Z

pls add a whatsnew note in 0.17.1 bug-fixes

henrystokeley · 2015-11-07T12:22:35Z

@jreback

My `if` test finds the columns used to form the groups, to ensure their values are not square-rooted.
How can I use self._selected_obj to discover which columns are being used in the grouping?

jreback · 2015-11-07T14:44:44Z

In [27]: df = pandas.DataFrame({
               'a' : [1,1,1,2,2,2,3,3,3],
               'b' : [1,2,3,4,5,6,7,8,9],
})

In [21]: g = df.groupby('a',as_index=False)

In [22]: g._set_selection_from_grouper()

In [23]: g._selected_obj
Out[23]: 
   a  b
0  1  1
1  1  2
2  1  3
3  2  4
4  2  5
5  2  6
6  3  7
7  3  8
8  3  9

In [24]: g = df.groupby('a',as_index=True)

In [25]: g._set_selection_from_grouper()

In [26]: g._selected_obj
Out[26]: 
   b
0  1
1  2
2  3
3  4
4  5
5  6
6  7
7  8
8  9

_set_selection_from_grouper() is called when a function is actually run (e.g. when you call .std()). So you can then use ._selected_obj to get the actual data (excluding the groupings).

henrystokeley · 2015-11-07T15:00:28Z

@jreback

Thanks for your quick reply.

As you can see in your output at Out[23], _selected_obj doesn't exclude the grouping column when as_index=False.
It only excludes it when as_index=True, and that isn't the case we're dealing with.

Sorry if I'm missing something.

jreback · 2015-11-07T15:05:11Z

Use this, though you may need something more sophisticated. You have to test using levels as well. Everything is there in the grouper objects; you just have to look for it. Don't reinvent the wheel.

In [36]: g.grouper.names 
Out[36]: ['a']

jreback · 2015-11-25T15:41:48Z

pls rebase / update according to comments

jreback · 2015-12-06T19:13:26Z

can you rebase / update according to comments

jreback · 2015-12-09T15:42:00Z

can you update?

jreback · 2015-12-16T14:27:18Z

can you update

jreback · 2016-01-06T17:20:51Z

closing. but pls reopen if you'd like to update

xieyuheng · 2019-02-14T05:24:07Z

#25315

jreback added Bug Groupby labels Nov 4, 2015

jreback reviewed Nov 4, 2015

jreback mentioned this pull request Nov 5, 2015

BUG: GH10355 groupby std() no longer sqrts grouping cols #11300

Closed

add whatsnew comment for GH11507

34c0daa

jreback closed this Jan 6, 2016

jorisvandenbossche added the Closed PR label Nov 1, 2016

ivaniadg mentioned this pull request Jun 29, 2017

BUG: apply std to groupby with as_index=False #16799

Closed
