Keep IPCW results in the list column format predicted by the predict() methods #937

topepo · 2023-03-22T21:41:58Z

The current code unnests the data. This is a problem if we also are using static predictions (like the predicted event time).

This PR keeps the list format and adds the required columns to each tibble within the list.

…tered vector

R/ipcw.R

(could be `NULL` but not `FALSE`)

hfrick

I like that it just expects the output of predict(), that seems sensible. I fell down a rabbit hole a little bit while looking at this code so this review also contains elements that are likely out of scope for this PR. I wanted to jot them down though so we don't loose it.

Specifically on the unnesting/this PR:

The .filter_eval_time() functions doesn't get used anymore now which means infinite time points can get through to the prodlim function which returns the probabilities of being censored (which makes it error). I supposed add_graf_weights_vec() would be a good place to doctor with the eval_time values.

Generally:

We should look a consistent handling of "non-standard" values for eval_time, i.e. -Inf, generally < 0, Inf (and NA). Most predictions from censored are less restrictive than what's outlined in .filter_eval_time() which I think is nice. survival and prodLim work (at least in parts) with negative values for time so I don't think it's such a big philosophical stretch to get to infinite values. Regardless, if we decide to restrict possible values for eval_time rather than pad results appropriately, we should at least warn when we do that.

The other thing I noticed is that we are slightly modifying the censoring probabilities in two places:

predict.censoring_model_reverse_km() adds an epsilon to the probabilities, the amount is data-derived, not set via an argument

and then, after that:

trunc_probs() prevents very small probs, ie adds enough to make them at least trunc

So depending how the data is inside of predict.censoring_model_reverse_km() we might be changing it to more than trunc, so we might want to make the amount an argument to predict.censoring_model_reverse_km() to give us control of the final/total amount.

R/ipcw.R

Co-authored-by: Hannah Frick <hfrick@users.noreply.github.com>

github-actions · 2023-04-15T00:57:28Z

This pull request has been automatically locked. If you believe you have found a related problem, please file a new issue (with a reprex: https://reprex.tidyverse.org) and link to this issue.

topepo added 9 commits March 22, 2023 13:41

replace a few functions

3b2c39c

remove older code

bc561d5

update docs and pass tolerance args to computations

767f2a5

version bump

8268ca5

move to vctrs replacements for tidyr functions

2d44b59

comments

486a713

make sure that the original first eval_time is still first in the fil…

19b1362

…tered vector

re-doc

77345b4

vec chopin'

3106338

topepo commented Mar 22, 2023

View reviewed changes

R/ipcw.R Show resolved Hide resolved

topepo commented Mar 22, 2023

View reviewed changes

R/ipcw.R Show resolved Hide resolved

topepo commented Mar 22, 2023

View reviewed changes

R/ipcw.R Show resolved Hide resolved

topepo marked this pull request as ready for review March 22, 2023 21:59

topepo requested a review from hfrick March 22, 2023 21:59

topepo added 3 commits March 22, 2023 20:16

extra bump

5e83c9b

Merge branch 'main' into nested-ipcw

ce69e47

update snapshot for new pillar

d247725

topepo added a commit to tidymodels/extratests that referenced this pull request Mar 23, 2023

updated for tidymodels/parsnip#937

d21bd75

no need to fiddle with call

dfcca77

(could be `NULL` but not `FALSE`)

hfrick reviewed Mar 24, 2023

View reviewed changes

R/ipcw.R Show resolved Hide resolved

R/ipcw.R Show resolved Hide resolved

R/ipcw.R Show resolved Hide resolved

R/ipcw.R Outdated Show resolved Hide resolved

R/ipcw.R Outdated Show resolved Hide resolved

topepo and others added 6 commits March 24, 2023 20:00

Apply suggestions from code review

c98aa83

Co-authored-by: Hannah Frick <hfrick@users.noreply.github.com>

add back .filter_eval_time but at prediction time

6fa482e

more on truncation

1a5c442

warning for bad time points

81600f6

typo fix

ac41dd9

added more snapshots for warnings

abfeae7

topepo merged commit 2fa853e into main Apr 1, 2023

topepo deleted the nested-ipcw branch April 1, 2023 00:21

topepo added a commit to tidymodels/tune that referenced this pull request Apr 3, 2023

updates based on tidymodels/parsnip#937

972ccc0

topepo added a commit to tidymodels/tune that referenced this pull request Apr 3, 2023

updates based on tidymodels/parsnip#937

82cc19a

github-actions bot locked and limited conversation to collaborators Apr 15, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Keep IPCW results in the list column format predicted by the predict() methods #937

Keep IPCW results in the list column format predicted by the predict() methods #937

topepo commented Mar 22, 2023

hfrick left a comment

github-actions bot commented Apr 15, 2023

Keep IPCW results in the list column format predicted by the predict() methods #937

Keep IPCW results in the list column format predicted by the predict() methods #937

Conversation

topepo commented Mar 22, 2023

hfrick left a comment

Choose a reason for hiding this comment

github-actions bot commented Apr 15, 2023