Survey sampling

related topics
{rate, high, increase}
{theory, work, human}
{system, computer, user}
{company, market, business}
{math, number, function}
{work, book, publish}
{school, student, university}
{ship, engine, design}
{mi², represent, 1st}

In statistics, survey sampling describes the process of selecting a sample of elements from a target population in order to conduct a survey.

A survey may refer to many different types or techniques of observation, but in the context of survey sampling it most often refers to a questionnaire used to measure the characteristics and/or attitudes of people. The purpose of sampling is to reduce the cost and/or the amount of work that it would take to survey the entire target population. A survey that measures the entire target population is called a census.


Probability vs. Non-Probability Sampling

Survey samples can be broadly divided into two types: probability samples and non-probability samples. Only surveys based on a probability samples can be used to create mathematically sound statistical inferences about a larger target population. Inferences from probability-based surveys may still suffer from many types of bias.

Surveys that are not based on probability sampling have no way of measuring their bias or sampling error. Surveys based on non-probability samples are not externally valid. They can only be said to be representative of the people that have actually completed the survey.[1]

Put another way, if a probability-based survey of the United States household population finds that 59% of its respondents support a piece of legislation there is mathematical reason to believe that the proportion of the all persons living in households in the United States who support this piece of legislation is close to 59% (within the margin of error). If a non-probability survey conducted in the United States finds that 59% percent of its respondents support a piece of legislation that is the only conclusion that can be drawn, no statement about the target population can be made.

The main reason that non-probability samples are used is that probability samples cost much more to produce. Non-probability samples surveys are commonly used in market research, where cost can be more important than the projectability of findings to a larger population.

In academic and government survey research probability sampling is often regarded a standard procedure that must be employed regardless of the cost. The Office of Management and Budget's List of Standards for Statistical Surveys states that federally funded surveys must be performed,

selecting samples using generally accepted statistical methods (e.g., probabilistic methods that can provide estimates of sampling error). Any use of nonprobability sampling methods (e.g., cut-off or model-based samples) must be justified statistically and be able to measure estimation error.[2]

Many statisticians disagree with these views. For example, Valliant, Dorfman and Royall explain,

To claim that, in general, probabilistic inferences are not valid when the randomization distribution is not available is simply wrong. This is not to deny that randomization is valuable, but only to deny that it represents the basis for all valid, rigorous, probabilistic inference.[3]

Full article ▸

related documents
Interquartile range
Economic Recovery Tax Act of 1981
Pareto index
Statistical inference
Summary statistics
Bureau of Labor Statistics
Standard of living in the United States
ILR scale
Physical quality-of-life index
Robert Lucas, Jr.
Gross world product
Observational error
Statistical dispersion
Racial quota
Lucy tuning
Range (statistics)
List of countries by population
List of metropolitan areas by population
Interquartile mean
Multistage sampling
Demand factor
Precision agriculture
Bernoulli trial
Léon Walras
Parkinson's law