Not including first rows of dataset, row shifting, incorrectly annotating row as left-hand labels, whereas labels on the right are correct #81

rmaarle · 2023-09-13T13:16:15Z

import forestplot as fp
import pandas as pd

df = pd.read_csv("review_example.csv",sep=";")  # companion example data


fp.forestplot(df,  # the dataframe with results data
              estimate='PCSA_Men_mean',  # col containing estimated effect size 
              ll= 'PCSA_Men_Lower', hl='PCSA_Men_Upper',  # columns containing conf. int. lower and higher limits
              varlabel='Abbreviation',  # column containing variable label
              capitalize="capitalize",  # Capitalize labels
              annote=["Source", "Image modality", 'Sample_size',"Method", 'Position'],   # columns to report on left of plot
              annoteheaders=["Ref", "Modality", 'N',"PCSA", 'Pose'],  # ^corresponding headers
              rightannote=['Age', 'Height', 'Weight', 'Fiber_length', 'Pennation', "Info"],  # columns to report on right of plot 
              right_annoteheaders=['Age[y]', 'Height[cm]', 'Weight[kg]', 'Fiber_length[cm]', 'Pennation[Deg]', "Note"],  #corresponding headers
              
              groupvar= "Agegroup",  # column containing group labels
              group_order=["Reference","Young Adults","Adults"], 
              xlabel="PCSA Ratio",  # x-label title
              xticks=[0,30,60],  # x-ticks to be printed
              table=True,  # Format as a table
              color_alt_rows=True,  # Gray alternate rows
              # Additional kwargs for customizations
              **{"marker": "D",  # set marker symbol as diamond
                 "markersize": 35,  # adjust marker size
                 "xtick_size": 12,  # adjust x-ticker fontsize
                })
# plt.savefig("plot.jpg", bbox_inches="tight")  # needs: import matplotlib.pyplot as plt

rmaarle · 2023-09-13T13:18:00Z

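The example code with the sleep dataset worked perfectly. However, when I used my own dataset, several problems appeared: the first rows are missing, rows are shifted, and the left-hand labels are wrong while the right-hand labels are correct. Does anyone have a solution for this?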

LSYS · 2023-12-19T03:48:37Z

Hi @rmaarle, thanks for raising this. I wasn't aware that duplicated variable labels (varlabel) would cause problems, but that is likely the source of the issue here. If you use a column with unique values as the label instead, things should work as expected.
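
To confirm that duplicated labels are the culprit before changing the plot call, one can inspect the label column with pandas. The sketch below is only an illustration: the small frame stands in for review_example.csv and its values are made up. Only the column names Abbreviation and PCSA_Men_mean come from the code above.

```python
import pandas as pd

# Hypothetical stand-in for review_example.csv; the values are invented,
# only the column names are taken from the original call.
df = pd.DataFrame(
    {
        "Abbreviation": ["GM", "GM", "VL", "RF", "VL"],
        "PCSA_Men_mean": [10.0, 12.5, 30.1, 22.4, 28.9],
    }
)

# True if any label occurs more than once
has_duplicates = df["Abbreviation"].duplicated().any()

# Every row that shares its label with another row
# (keep=False marks all copies, not just the later ones)
duplicated_rows = df[df["Abbreviation"].duplicated(keep=False)]

print(has_duplicates)
print(duplicated_rows)
```

If has_duplicates is True, the column is probably not safe to pass as varlabel as it stands.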

Minimal example:

import forestplot as fp
import pandas as pd

df = pd.read_csv("review_example.csv",sep=";")  # companion example data
df = df.reset_index().astype({"index": str})

fp.forestplot(df,  # the dataframe with results data
              estimate='PCSA_Men_mean',  # col containing estimated effect size 
              ll= 'PCSA_Men_Lower', hl='PCSA_Men_Upper',  # columns containing conf. int. lower and higher limits
              varlabel="index",
)

Your case (the main change is varlabel="index"):

import forestplot as fp
import pandas as pd

df = pd.read_csv("review_example.csv",sep=";")  # companion example data
df = df.reset_index().astype({"index": str})

fp.forestplot(df,  # the dataframe with results data
              estimate='PCSA_Men_mean',  # col containing estimated effect size 
              ll= 'PCSA_Men_Lower', hl='PCSA_Men_Upper',  # columns containing conf. int. lower and higher limits
              varlabel='index',  # column containing variable label
              capitalize="capitalize",  # Capitalize labels
              annote=["Source", "Image modality", 'Sample_size',"Method", 'Position'],   # columns to report on left of plot
              annoteheaders=["Ref", "Modality", 'N',"PCSA", 'Pose'],  # ^corresponding headers
              rightannote=['Age', 'Height', 'Weight', 'Fiber_length', 'Pennation',],  # columns to report on right of plot 
              right_annoteheaders=['Age[y]', 'Height[cm]', 'Weight[kg]', 'Fiber_length[cm]', 'Pennation[Deg]'],  #corresponding headers
              
              groupvar= "Agegroup",  # column containing group labels
              group_order=["Reference","Young Adults","Adults"], 
              xlabel="PCSA Ratio",  # x-label title
              xticks=[0,30,60],  # x-ticks to be printed
              table=True,  # Format as a table
              color_alt_rows=True,  # Gray alternate rows
              # Additional kwargs for customizations
              **{"marker": "D",  # set marker symbol as diamond
                 "markersize": 35,  # adjust marker size
                 "xtick_size": 12,  # adjust x-ticker fontsize
                }
)
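
Using the row index as the label fixes the plot but loses the readable abbreviation. One possible alternative, sketched below under the assumption that any unique string column works as varlabel, is to append an occurrence counter to each duplicated label. The frame and the column name label_unique are hypothetical, and the result has not been checked against forestplot itself.

```python
import pandas as pd

# Hypothetical data; only the column name "Abbreviation" comes from the issue.
df = pd.DataFrame({"Abbreviation": ["GM", "GM", "VL", "RF", "VL"]})

# Number each occurrence of a label: 1 for the first "GM", 2 for the second, ...
occurrence = df.groupby("Abbreviation").cumcount() + 1

# Append the counter so that every label becomes unique but stays readable
df["label_unique"] = df["Abbreviation"] + " (" + occurrence.astype(str) + ")"

print(df["label_unique"].tolist())
```

The new column would then be passed as varlabel="label_unique" in place of varlabel="index".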

LSYS · 2023-12-19T03:50:21Z

You may find the upcoming release (WIP) with grouped labels useful for your use case. The duplicated variable labels you were using were really groups. See #59 for an example.

LSYS · 2023-12-19T03:51:11Z

The next release will also warn about duplicated labels and list this under known issues in the readme.

@juancq

* From #73 by @juancq.
* Warn about duplicated `varlabel` (closes #76, closes #81).
* Add test that above warning works.
* Add known issues about duplicated `varlabel` (closes #76, closes #81) and PyCharm (closes #80).
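
For readers curious what such a warning and its test might look like, here is a minimal sketch using only the standard library. The helper name warn_if_duplicated and the message text are hypothetical; this is not the code that was added to forestplot.

```python
import warnings
from collections import Counter


def warn_if_duplicated(labels):
    """Hypothetical helper: warn when any label occurs more than once."""
    duplicated = sorted(label for label, n in Counter(labels).items() if n > 1)
    if duplicated:
        warnings.warn(f"Duplicated labels found: {duplicated}", UserWarning)
    return duplicated


# Test-style check that the warning is actually emitted
with warnings.catch_warnings(record=True) as caught:
    warnings.simplefilter("always")  # make sure no filter hides the warning
    result = warn_if_duplicated(["GM", "GM", "VL", "RF", "VL"])

assert result == ["GM", "VL"]
assert len(caught) == 1
assert issubclass(caught[0].category, UserWarning)
```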

LSYS added the Type: Investigate label Dec 16, 2023

LSYS added the Next Release (To work on for new version release) label Dec 19, 2023

LSYS added the Type: Documentation label Dec 19, 2023

LSYS mentioned this issue Dec 24, 2023

Duplicate values in separate groupings bug #76

Closed

LSYS added a commit that referenced this issue Dec 24, 2023

Update known issues and bump ver (closes #76, closes #80, closes #81)

3519d74

LSYS closed this as completed in a2ec12d Dec 24, 2023
