In SAS, how can I randomly select a certain number of observations from a data set?
You can use the
SURVEYSELECT procedure for random
sampling. The procedure supports various methods for selecting
probability-based random samples from the existing data set. The
SURVEYSELECT procedure can conduct simple
SRS), unrestricted (
SYS), and sequential (
SEQ) random sampling
methods. It also supports the probability-proportional-to-size
Suppose you want to randomly draw 100 observations from the data set
pop with 7,000 observations. Consider the following SAS code:
METHOD=SRS option specifies the simple random
sampling method. The
SEED option specifies the seed to be
used in the random number generation, allowing replication of the same
set of random numbers. The 100 observations drawn are stored in the
If you have questions about using statistical and mathematical software at Indiana University, email UITS Research Analytics (formerly known as the Stat/Math Center). Research Analytics is located on the IU Bloomington campus at Woodburn Hall 200, and is open for consultation by appointment Monday-Friday 9am-5pm. For more, visit Research Analytics on the web, or call 812-855-4724 (IUB) or 317-278-4740 (IUPUI).
Last modified on July 30, 2013.