read_csv arguments: can we have skipcols and userows? #15799

pseth · 2017-03-24T15:04:59Z

Is there a reason why read_csv has a usecols and skiprows as arguments, but not skipcols and userows? Is this to avoid parameter checks or something more fundamental than that?

It would be nice to have all four options to avoid clunky inversions of the type usecols = columns.remove(unwanted_col).

The text was updated successfully, but these errors were encountered:

jreback · 2017-03-24T17:15:01Z

this is essentially a duplicate of the now closed: #10882

usecols accepts a callable, to allow arbitrary evaluation of which columns to use.

in a similar vein, skiprows accepts a callable as well.

The defaults make the most sense here, e.g. generally you want to keep columns (out of a larger set) and skip (a small subset of rows).

I am not anti the counter parts, but this is just one more keyword and added complexity.

gfyoung · 2017-03-27T01:14:59Z

Yep, I'm in agreement with @jreback here. Especially since we can accept callables for both inputs you can emulate skipcols and userows as follows:

skipcols = [...]
userows = [...]
read_csv(..., usecols=lambda x: x not in skipcols,
              skiprows=lambda x: x not in userows])

I think this should resolve your concern about "clunkiness" as you put it, so if there are no other concerns, I think this is safe to close.

pseth · 2017-03-27T11:02:43Z

@gfyoung Ah, I did not realise that was possible, the online documentation for read_csv doesn't seem to be up to date. That is indeed a more elegant solution over possibly conflicting arguments.

jreback · 2017-03-27T11:58:49Z

This feature is in 0.20.0 which is not released yet, docs are in the dev-docs: http://pandas-docs.github.io/pandas-docs-travis/generated/pandas.read_csv.html?highlight=read_csv#pandas.read_csv

jreback added API Design Duplicate Report Duplicate issue or pull request IO CSV read_csv, to_csv labels Mar 24, 2017

pseth closed this as completed Mar 27, 2017

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

read_csv arguments: can we have skipcols and userows? #15799

read_csv arguments: can we have skipcols and userows? #15799

pseth commented Mar 24, 2017

jreback commented Mar 24, 2017

Uh oh!

gfyoung commented Mar 27, 2017 •

edited

Loading

Uh oh!

pseth commented Mar 27, 2017

Uh oh!

jreback commented Mar 27, 2017 •

edited

Loading

Uh oh!

Uh oh!

read_csv arguments: can we have skipcols and userows? #15799

read_csv arguments: can we have skipcols and userows? #15799

Comments

pseth commented Mar 24, 2017

jreback commented Mar 24, 2017

Uh oh!

gfyoung commented Mar 27, 2017 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

pseth commented Mar 27, 2017

Uh oh!

jreback commented Mar 27, 2017 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

gfyoung commented Mar 27, 2017 •

edited

Loading

jreback commented Mar 27, 2017 •

edited

Loading