sample 10.5 You can save this smaller sample (e.g., for “Small Stata”).

Finding Variable Descriptions: after reading in a Stata dataset type: describe

If whoever built the dataset attached variable labels (descriptions), this command displays them.
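As a small illustration, the sketch below uses Stata's built-in example dataset "auto" (an assumption for demonstration; substitute your own data). It shows describe for the whole dataset and for selected variables, plus label variable for attaching a description when none exists. The label text is made up for the example.

```stata
* Load Stata's shipped example dataset (a stand-in for your own data)
sysuse auto, clear

* Show names, storage types, and variable labels for every variable
describe

* Limit the output to the variables of interest
describe price mpg

* If a variable lacks a description, attach one yourself
label variable mpg "Mileage in miles per gallon"
describe mpg
```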

Select Cases by Scores on a Variable: logit nafta avmich if divrk>25 (uses only cases where “divrk” is greater than 25; use >=25 for 25 or greater). To select on an exact value, such as a particular year, use two consecutive equal signs. Thus, to list the scores on variable “race” for 1990 type: list race if year==1990. To select on two variables at the same time use “&”: logit nafta avmich if party==1 & south==0. Missing data can cause trouble with this procedure: Stata treats a missing value as larger than any number, so a condition such as divrk>25 is also true for cases where “divrk” is missing. Add & !missing(divrk) to leave those cases out. The “if” condition goes after the last independent variable and before the comma, and all options follow a single comma. For example, to specify lower and upper censoring points in tobit and use only observations where variable “state” equals 1:

tobit ratio ada85 par85 if state==1, ll ul
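The same rules can be tried out on Stata's built-in "auto" dataset. This is only a sketch: the variables make, price, mpg, weight, foreign and rep78 belong to that example dataset rather than to the examples above, and the censoring limit of 17 is arbitrary.

```stata
sysuse auto, clear

* Two equal signs test equality; & combines conditions
list make price if foreign==1 & rep78==5

* rep78 has missing values, and a missing value counts as larger than any number,
* so the first count includes the missing cases and the second does not
count if rep78>3
count if rep78>3 & !missing(rep78)

* The if condition comes before the comma; options come after it
tobit mpg weight if foreign==0, ll(17)
```

Comparing the two counts is a quick way to see how many cases a condition picks up only because they are missing.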

You can also select cases by scores on a variable using the “keep” command prior to statistical analysis. For example, to select cases scoring “2” on variable “brown” (with possible scores of 1, 2 and 3) you could use the following command: keep if (brown==2). To use cases with scores of 1 and 2 on brown type: keep if (brown==1 | brown==2), where “|” means “or.” (Typing brown==1 & 2 does not work: Stata reads it as brown==1 alone.) To use cases with a score of less than 10,000 on a continuous variable “income” type:

keep if (income<10000)

or, for 10,000 or less, type:

keep if (income<=10000)

Make sure you don’t save over the original data file, because you’ll lose all the dropped observations. If you are using a “do” file, then to prevent permanent loss of data have the last command reload the original dataset.
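One way to guard against losing observations is Stata's preserve and restore pair, sketched here on the built-in "auto" dataset (an assumption for demonstration; rep78 and price are variables in that dataset). The inlist() function is a compact alternative to chaining several “|” conditions.

```stata
sysuse auto, clear

* Take a snapshot of the data currently in memory
preserve

* Keep only cases scoring 1 or 2 on rep78
keep if inlist(rep78, 1, 2)
summarize price

* Bring back the full dataset, including the cases that were dropped
restore
count
```

Inside a “do” file, Stata restores the preserved data automatically when the file finishes, so the explicit restore matters mainly when more commands follow.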

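Select Cases by Observation: In a data set with 100 observations, to use observations 1, 25-29 and 34-100 type:

drop in 30/33 (press “enter”)

drop in 2/24 (press “enter”) and then run regression. Drop the higher-numbered block first: Stata renumbers the remaining observations after each drop, so dropping 2/24 first would move the old observations 30-33 to positions 7-10.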
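The renumbering can be checked on a made-up dataset. In this sketch the variables id and x are hypothetical: id records each observation's original position and x is random filler. Option 1 avoids dropping altogether by selecting on the observation number _n.

```stata
clear
set obs 100
* Record each observation's original position before anything is dropped
generate id = _n
generate x = runiform()

* Option 1: analyze the subset without dropping anything
summarize x if _n==1 | inrange(_n, 25, 29) | inrange(_n, 34, 100)

* Option 2: drop from the bottom up so the earlier row numbers stay valid
drop in 30/33
drop in 2/24
list id in 1/3
```

After both drops, the first three rows should hold the original observations 1, 25 and 26.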

Deleting Observations from a Dataset: Read the dataset into Stata. I had an annual dataset running from 1880 to 2008 with “year” as a variable, and I wanted to keep only the years 1985, 1987, 1988, 1991, 1992, 1995, 1996, 2000 and 2005. I did the following:

drop if year<1985

drop if year==1986

drop if year==1989

drop if year==1990

You get the picture. Consecutive years (e.g., 1989 and 1990) can be dropped in one command with a range condition: drop if year>=1989 & year<=1990. When you’ve deleted all the years you don’t want, you will be left with those you do want. Then, using the data editor, you can cut and paste the new dataset into Excel. You might “google” both “drop” and “keep” in Stata. There may be easier ways to do this than the one described above.
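A possibly shorter route is to name the wanted years in a single keep command. The sketch below builds a stand-in annual series rather than using the original data, and the Excel file name is hypothetical. The export excel command is available in Stata 12 and later.

```stata
clear
set obs 129
* Build a stand-in annual series running from 1880 through 2008
generate year = 1879 + _n

* Keep the wanted years in a single command
keep if inlist(year, 1985, 1987, 1988, 1991, 1992, 1995, 1996, 2000, 2005)
count

* Write the result to an Excel file (the file name is hypothetical)
export excel using "selected_years.xlsx", firstrow(variables) replace
```

Here count should report 9 observations, one per year kept.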

Stacking Data – How to Change it: To change a dataset stacked by state (e.g., observations 1-20 are 20 consecutive annual observations on state #1 with observation 21 being the first observation on state #2) to a dataset