Quantile Regression
Do you need help with Quantile Regression?
Quantile Regression
Sometimes, though, we want something else. Sometimes the dependent variable isn’t continuous and we turn to logistic regression or some form of count regression. Sometimes the dependent variable is censored, as a time to event, and we turn to survival analysis.
But sometimes even though the dependent variable is continuous, we are not interested in the mean, but in some other statistic about the population. One such situation is when we want to model some quantile (also known as percentile) of the population. That is, we might be interested not in what affects the mean, but in what affects (say) the 3rd quartile, or the 95th percentile, or some other percentile.
When might we want this?
Suppose our dependent variable is bimodal or multimodal – that is, it has multiple “humps”. If we knew what caused the bimodality, we could separate on that variable and do stratified analysis, but if we don’t know that, quantile regression might be good.
If our DV is highly skewed – as, for example, income is in many countries – we might be interested in what predicts the median (which is the 50th percentile) or some other quantile.
One more example is where our substantive interest is in people at the highest or lowest quantiles. For example, if studying the spread of sexually transmitted diseases, we might record number of sexual partners that a person had in a given time period. And we might be most interested in what predicts people with a great many partners, since they will be key parts of spreading the disease.
Featured Posts

The title of this post is a quote from baseball great Yogi Berra. Yogi was famous for saying things that...

In a recent article in Sociological Methodology entitled "How to impute interactions, squares, and other...

This is a talk that I will give at NESUG in the fall.

This is a talk developed by David Cassell and me, and given at NESUG and SGF and WUSS

PROC LOGISTIC can be used to run logistic regression on a dichotomous dependent variable. Often, these are...

Most generally, a dependent variable (DV) is something which we think depends on one or more independent...

In a previous post, I dealt with some SAS code for scatterplots. Various problems can arise when using...

Two terms that are frequently confused are moderation and mediation: Definitions...

Lately, across the statistical blogosphere, the repeating discussion of R vs. SAS has started up again. In...

I attended the 2015 meeting of the Southeast SAS Users' Group in Savannah, Georgia from September 27  29....

If you picture the data as a 2 x 2 crosstab, then quasicomplete separation occurs when one of the cells is...

After one measures central tendency or location of a variable, the next thing to measure is often spread or...