Re-implemented parametrization of test_frame_from_json_to_json #28510

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

Sign up for GitHub

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Jump to bottom

Merged

jbrockmendel merged 1 commit into pandas-dev:master from WillAyd:json-parametrize2

Sep 20, 2019

Member

WillAyd commented Sep 18, 2019

This piece was broken off of #27838 as it made the diff much larger, so hopefully easier to digest on its own.

As is, parametrization here has brought up a lot of rough edges which are responsible for some of the complexity. These are noted with TODOs and summarized as follows (save Py35 issues, which aren't worth addressing at this point):

Frame order is not maintained when numpy=False (default) and orient="index"
On windows or 32 bit platforms it appears that np.int64 roundtrips as np.int32 (maybe not an issue?)
orient="split" does not preserve strings in the index if those strings are numeric, though it should be able to
convert_axes may have surprising behavior when dealing with empty DataFrames
DTI seem to roundtrip as strings when written with epoch format for all but `orient="split"

Not all of these are the same priority, but figure worth leaving as follow ups


          Reverted parametrization of test_frame_from_json_to_json

9a1deaf

WillAyd added IO JSON Testing labels

WillAyd changed the title ~~Reverted parametrization of test_frame_from_json_to_json~~ Re-implemented parametrization of test_frame_from_json_to_json

Member

jbrockmendel commented Sep 19, 2019

I'll take a look. Thanks for separating this out.

jbrockmendel reviewed

View reviewed changes

pandas/tests/io/json/test_pandas.py

+                      )
+                      expected = df.copy()
+                      expected = expected.assign(**expected.select_dtypes("number").astype(np.int64))

Member

jbrockmendel Sep 20, 2019

I dont use assign or select_dtypes very often. Is this just casting to int64 for columns A and B?

Member Author

WillAyd Sep 20, 2019

Yes that's correct. Happy to change to another way of casting if you prefer

Member

jbrockmendel Sep 20, 2019

I think df[["A", "B"]] = df[["A", "B"]].astype(np.int64) would be clearer. Might just be me.

jbrockmendel reviewed

View reviewed changes

pandas/tests/io/json/test_pandas.py

+                      if orient == "records" or orient == "values":
+                          expected = expected.reset_index(drop=True)
+                      if orient == "values":
+                          expected.columns = range(len(expected.columns))

Member

jbrockmendel Sep 20, 2019

re-iterating request to share code for ~344-360. As is I have to look at each version for "is this subtly different and if so why"

Member Author

WillAyd Sep 20, 2019

Hmm I don't know I fully understand what you are asking for. This particular test lines up pretty well to the left (lines 463 - 472) but is more explicit about the expected shape of the result. This idiom of resetting particular axes for values and records is used throughout the module (probably worth a dedicated function, but leaving to a follow up)

Member

jbrockmendel Sep 20, 2019

This idiom of resetting particular axes for values and records is used throughout the module

Yah, thats what I'm asking for. OK for follow-up.

jbrockmendel reviewed

View reviewed changes

pandas/tests/io/json/test_pandas.py Show resolved Hide resolved

jbrockmendel reviewed

View reviewed changes

pandas/tests/io/json/test_pandas.py Show resolved Hide resolved

jbrockmendel reviewed

View reviewed changes

pandas/tests/io/json/test_pandas.py

-                          self.empty_frame, check_index_type=False, check_column_type=False
+                  @pytest.mark.parametrize("convert_axes", [True, False])
+                  @pytest.mark.parametrize("numpy", [True, False])
+                  def test_roundtrip_timestamp(self, orient, convert_axes, numpy):

Member

jbrockmendel Sep 20, 2019

LGTM, corresponds to 452-453

jbrockmendel reviewed

View reviewed changes

pandas/tests/io/json/test_pandas.py

+                  @pytest.mark.parametrize("convert_axes", [True, False])
+                  @pytest.mark.parametrize("numpy", [True, False])
+                  def test_roundtrip_empty(self, orient, convert_axes, numpy):

Member

jbrockmendel Sep 20, 2019

LGTM, corresponds to 447-449

jbrockmendel reviewed

View reviewed changes

pandas/tests/io/json/test_pandas.py Show resolved Hide resolved

jbrockmendel reviewed

View reviewed changes

pandas/tests/io/json/test_pandas.py Show resolved Hide resolved

jbrockmendel reviewed

View reviewed changes

pandas/tests/io/json/test_pandas.py Show resolved Hide resolved

jbrockmendel reviewed

View reviewed changes

pandas/tests/io/json/test_pandas.py

+                  @pytest.mark.parametrize("dtype", [False, np.int64])
+                  @pytest.mark.parametrize("convert_axes", [True, False])
+                  @pytest.mark.parametrize("numpy", [True, False])
+                  def test_roundtrip_intframe(self, orient, convert_axes, numpy, dtype):

Member

jbrockmendel Sep 20, 2019

func LGTM. corresponds to 417-418

jbrockmendel reviewed

View reviewed changes

pandas/tests/io/json/test_pandas.py

+                  @pytest.mark.parametrize("dtype", [False, float])
+                  @pytest.mark.parametrize("convert_axes", [True, False])
+                  @pytest.mark.parametrize("numpy", [True, False])
+                  def test_roundtrip_simple(self, orient, convert_axes, numpy, dtype):

Member

jbrockmendel Sep 20, 2019

func LGTM. corresponds to 413-415

Member

jbrockmendel commented Sep 20, 2019

Thanks @WillAyd

jbrockmendel merged commit 3bd222d into pandas-dev:master

WillAyd deleted the json-parametrize2 branch

September 20, 2019 18:06

This was referenced Sep 21, 2019

pd.read_json May Not Maintain Numeric String Index #28556

Closed

pd.read_json With convert_axes Produces Different Index Type than Empty Frame #28558

Closed

read_json with orient="values" and numpy=True provides strange column #28559

Closed

read_json and numpy=True and dtype=False does not preserve int64 dtype roundtrip on Windows #28560

Closed

proost pushed a commit to proost/pandas that referenced this pull request


          Reverted parametrization of test_frame_from_json_to_json (pandas-dev#…

196678e

…28510)

proost pushed a commit to proost/pandas that referenced this pull request


          Reverted parametrization of test_frame_from_json_to_json (pandas-dev#…

f533603

…28510)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

IO JSON Testing