Distribution-free conditional median inference

Download paper here


We consider the problem of constructing confidence intervals for the median of a response Y conditional on features X in a situation where we are not willing to make any assumption whatsoever on the underlying distribution of the data (X,Y). We propose a method based upon ideas from conformal prediction and establish a theoretical guarantee of coverage while also going over particular distributions where its performance is sharp. Additionally, we prove an equivalence between confidence intervals for the conditional median and confidence intervals for the response variable, resulting in a lower bound on the length of any possible conditional median confidence interval. This lower bound is independent of sample size and holds for all distributions with no point masses.

Code can be found here