ARCHIVED: In Stata, how do I get aggregate statistics and save them into a data set?

Stata's collapse command computes aggregate statistics such as the mean, sum, and standard deviation and saves them into a data set. When you execute the command, the data set currently in memory is replaced with a new one containing the aggregate data, so save any unsaved changes first.

Suppose you want the sum of a variable x1 and the mean of a variable x2, computed separately for males and females. Use the by() option to name the grouping variable:

  . collapse (sum) x1 (mean) x2, by(male)

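Here male is an indicator variable coded 1 for males and 0 for females. The new data set has two observations (one for each value of male) and three variables: male, x1, and x2. In it, x1 holds the group sums and x2 holds the group means.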
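To make the result concrete, the following is a minimal sketch on a small made-up data set. The five observations are invented for illustration only; the variable names match the example above.

```stata
* Hypothetical toy data: two males and three females
clear
input male x1 x2
1 10 2
1 20 4
0  5 1
0 15 3
0 20 8
end

* Sum of x1 and mean of x2 within each value of male
collapse (sum) x1 (mean) x2, by(male)

* The collapsed data has one row per group:
*   male = 0: x1 = 40 (5+15+20), x2 = 4 ((1+3+8)/3)
*   male = 1: x1 = 30 (10+20),   x2 = 3 ((2+4)/2)
list
```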

To store the results under different variable names, specify each one in the form newname=oldname, for example:

  . collapse (sum) new_x1=x1 (mean) new_x2=x2, by(male)
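Because collapse overwrites the data in memory, one common pattern is to wrap it in preserve and restore and write the aggregates to a separate file. The sketch below assumes the variables from the examples above; the file name aggregates.dta is a hypothetical placeholder.

```stata
* Take a temporary copy of the data currently in memory
preserve

* Replace the data in memory with the aggregates
collapse (sum) new_x1=x1 (mean) new_x2=x2, by(male)

* Write the aggregate data to disk (hypothetical file name);
* the replace option overwrites the file if it already exists
save aggregates.dta, replace

* Bring back the original, uncollapsed data
restore
```

After restore, the original observations are back in memory, and the aggregate data can be loaded later with use aggregates.dta.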

If you have questions about using statistical and mathematical software at Indiana University, contact Research Analytics. Research Analytics is located on the IU Bloomington campus at Woodburn Hall 200; staff are available for consultation Monday-Friday 9am-noon and by appointment.

This is document aqdz in the Knowledge Base.
Last modified on 2015-06-30 00:00:00.
