Some thoughts on observational studies

By , January 21, 2012 7:46 am

Note: This is a brief introduction to observational designs. For more on this type of study, see books by Paul Rosenbaum: Observational Studies and, for a less mathematical approach Design of Observational Studies

In statistics and research design, there are two types of study: Experiments and observational studies. Some people also use the term “quasi-experiment” but I do not like it.  In an experiment, the key thing is <em>randomization</em>. We assign subjects (e.g. people) to different conditions (e.g. drug and placebo) randomly.  Often, though, such assignment is not possible or not ethical. In social sciences, it is rarely possible. We cannot, for example, randomly assign people to different levels of education. We can only observe relationships between (say) education and political party.
Continue reading 'Some thoughts on observational studies'»

Quantile regression with PROC QUANTREG

By , June 7, 2011 3:02 pm

This is a talk that I will  give at NESUG in the fall.

SA04

Using the SG Procedures to create and enhance scatter plots

By , May 27, 2011 7:22 am

These are the slides for a talk at New York Area SAS Users’ Group on June 2, 2011.

NYASUG2011

Here is a poster that I will present at NESUG in September. It’s in two parts

Part 1

Part 2

Graphics for univariate data – Pie is delicious but not nutritious

By , February 8, 2011 11:00 am

When you have univariate data, that is, a single measure on a variety of units, the most common statistical graphic is a pie chart.  But pie charts should not be used.  Ever.  When there are a lot of units, pie charts are unreadable.  When there are only a few units, pie charts waste space.  And research shows that, even with a moderate number of units, pie charts can distort the data (for example, using different colors leads to different estimates of the size of the wedges).  Fortunately, there are better methods.

Continue reading 'Graphics for univariate data – Pie is delicious but not nutritious'»

Scatterplots – dealing with overplotting

By , January 24, 2011 5:20 pm

In a previous post, I dealt with some SAS code for scatterplots. Various problems can arise when using scatterplots. One of them is overplotting, where two or more data are the same point.

There are a variety of ways of dealing with this.
Continue reading 'Scatterplots – dealing with overplotting'»