Skip to content

Result validity despite low predictive performances #215

Answered by SvenKlaassen
DarioSimonato asked this question in Q&A
Discussion options

You must be logged in to vote

I am sorry for the late response.
I agree with most of the points.

  • the model is flexible enougth to approximate $P(D=1|X)$
  • you should ofc try to avoid overfitting, but the RMSE from the package should allow to obtain a good overview of the performance (or using evaluate_learners()) as this returns a cross-fitted value of the metric
  • For the last point:
    I think the important distinction is that usually the relevant confounding variables have to be included (in most cases variables affecting both $D$ and $Y$, but you can make the argument more complex, as blocking all backdoor paths, see e.g. Book of Why)
    This does not mean that the variables have to be important predictors of $D$ or $Y$, …

Replies: 1 comment 7 replies

Comment options

You must be logged in to vote
7 replies
@DarioSimonato
Comment options

@SvenKlaassen
Comment options

@DarioSimonato
Comment options

@SvenKlaassen
Comment options

Answer selected by DarioSimonato
@DarioSimonato
Comment options

@SvenKlaassen
Comment options

@DarioSimonato
Comment options

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
2 participants