-
Notifications
You must be signed in to change notification settings - Fork 368
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Show group values when printing grouped dataframe #1632
Conversation
Now the printing looks like this (note the
|
I'd print this on the same line as the number of rows, maybe like this: You'll also need to update tests. |
Let me know if you like this behavior (though the code can change obviously) and I will add tests |
src/groupeddataframe/show.jl
Outdated
@@ -6,31 +6,55 @@ function Base.show(io::IO, gd::GroupedDataFrame; | |||
rowlabel::Symbol = :Row, | |||
summary::Bool = true) | |||
N = length(gd) | |||
keys = join(':' .* string.(names(gd.parent)[gd.cols]), ", ") | |||
grouped_names = names(gd.parent)[gd.cols] |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
grouped_names = names(gd.parent)[gd.cols] | |
keynames = names(gd.parent)[gd.cols] |
src/groupeddataframe/show.jl
Outdated
print(io, "\nGroup $i: $nrows $rows") | ||
|
||
identified_groups = [':' * string(parent_names[col], " = ", | ||
first(gd[i][col])) for col in gd.cols] |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Wrong indentation. Please also remove trailing spaces (here and elsewhere).
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I couldn't find any examples of the right indentation for constructors. So I added more line breaks? Let me know the course of action.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I'm not sure there's a clear rule for comprehensions, but here first
is inside string
, so it should definitely not be aligned with the bracket. I'd do this:
first(gd[i][col])) for col in gd.cols] | |
identified_groups = [':' * string(parent_names[col], " = ", first(gd[i][col])) | |
for col in gd.cols] |
|
||
print(io, "\nGroup $i ($nrows $rows): ") | ||
join(io, identified_groups, ", ", " and ") | ||
|
||
show(io, gd[i], summary=false, |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Maybe we shouldn't print the grouping columns, since they're listed above?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I think it reads well.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
As you like. But just to make sure we're talking about the same thing: I was suggesting we could drop the grouping columns from the data frame we print below, since all values are equal within a given group.
src/groupeddataframe/show.jl
Outdated
print(io, "\nGroup $i: $nrows $rows") | ||
|
||
identified_groups = [':' * string(parent_names[col], " = ", | ||
first(gd[i][col])) for col in gd.cols] |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I'm not sure there's a clear rule for comprehensions, but here first
is inside string
, so it should definitely not be aligned with the bracket. I'd do this:
first(gd[i][col])) for col in gd.cols] | |
identified_groups = [':' * string(parent_names[col], " = ", first(gd[i][col])) | |
for col in gd.cols] |
|
||
print(io, "\nGroup $i ($nrows $rows): ") | ||
join(io, identified_groups, ", ", " and ") | ||
|
||
show(io, gd[i], summary=false, |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
As you like. But just to make sure we're talking about the same thing: I was suggesting we could drop the grouping columns from the data frame we print below, since all values are equal within a given group.
docs/src/lib/functions.md
Outdated
@@ -1,54 +0,0 @@ | |||
```@meta |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This shouldn't be removed.
Sorry about deleting that file, i did it on accident and I thought my We should show the grouping columns, so that |
Okay I think this is ready to be merged. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks!
keystr = length(gd.cols) > 1 ? "keys" : "key" | ||
groupstr = N > 1 ? "groups" : "group" | ||
summary && print(io, "$(typeof(gd)) with $N $groupstr based on $keystr: $keys") | ||
if allgroups | ||
for i = 1:N | ||
nrows = size(gd[i], 1) | ||
rows = nrows > 1 ? "rows" : "row" | ||
print(io, "\nGroup $i: $nrows $rows") | ||
|
||
identified_groups = [':' * string(parent_names[col], " = ", first(gd[i][col])) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I've just realized that when the grouping column is a string or symbol, it's printed as e.g. :x = a
rather than :x = "a"
or :x = :a
. I guess we should change this (using repr
or ourshowcompact
)?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
It should be repr
. I will make a quick PR.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
In reference to #1539 this PR now prints the values of the grouping column whenever it prints a grouped dataframe.