In Stata, how do I detect duplicate observations in a data set?

You can check for duplicate observations in Stata in the following ways:

  • The isid command can detect duplicate observations:
      . isid x1 x2 x3
  • The duplicates command can list and flag duplicate observations. The list subcommand lists the duplicate observations:
      . duplicates list x1 x2 x3
  • The tag subcommand and the generate() option flag duplicate observations by assigning 1 to duplicacy in the variable duple:
      . duplicates tag x1 x2 x3, generate(duple)

If you have questions about using statistical and mathematical software at Indiana University, contact Research Analytics. Research Analytics is located on the IU Bloomington campus at Woodburn Hall 200; staff are available for consultation Monday-Friday 9am-noon and by appointment.

This is document aqea in the Knowledge Base.
Last modified on 2015-06-23 00:00:00.

  • Fill out this form to submit your issue to the UITS Support Center.
  • Please note that you must be affiliated with Indiana University to receive support.
  • All fields are required.

Please provide your IU email address. If you currently have a problem receiving email at your IU account, enter an alternate email address.

  • Fill out this form to submit your comment to the IU Knowledge Base.
  • If you are affiliated with Indiana University and need help with a computing problem, please use the I need help with a computing problem section above, or contact your campus Support Center.

Please provide your IU email address. If you currently have a problem receiving email at your IU account, enter an alternate email address.