Example 1: Suppose that we are interested in the factors that influence whether a political candidate wins an election. The predictor variables of interest are the amount of money spent on the campaign, the amount of time spent campaigning negatively, and whether the candidate is an incumbent.

Clustered standard errors belong to this type of standard error. This involves a covariance estimator along the lines of White's "sandwich estimator". For calculating robust standard errors in R, both with more goodies and in (probably) a more efficient way, look at the sandwich package. I am basically looking for the equivalent in R for the Stata option cluster(crisno).

If you suspect heteroskedasticity or clustered errors, there really is no good reason to go with a test (classic Hausman) that is invalid in the presence of these problems.

Example: Probit Model for Marriage
Sample: March 2009 CPS
Population: U.S. Black women in Midwest (n=433)
Percent married: 37%
Probit for married as a function of age, age2, education, ...

    . probit mar age age2 education if bf, r

This calculates (robust) asymptotic standard errors.

In practice, heteroskedasticity-robust and clustered standard errors are usually larger than standard errors from regular OLS; however, this is not always the case.

An Introduction to Robust and Clustered Standard Errors
Linear Regression with Non-constant Variance
The variance of $\hat{\beta}$ depends on the errors:

$$\hat{\beta} = (X'X)^{-1}X'y = (X'X)^{-1}X'(X\beta + u) = \beta + (X'X)^{-1}X'u$$

clustervar1: a character value naming the first cluster on which to adjust the standard errors.

Computing cluster-robust standard errors is a fix for the latter issue.
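To make the sandwich idea concrete for the linear case, here is a minimal sketch in plain Python. It is not the implementation used by Stata or by the R sandwich package. It assumes a single regressor plus an intercept, uses made-up data, and applies no finite-sample correction, so its numbers will generally differ a little from what packaged routines report. The function names (`ols_with_se`, `sandwich_se`, and the small matrix helpers) are hypothetical. The "bread" is (X'X)^{-1}. The "meat" is the sum of squared per-observation scores x_i u_i for the heteroskedasticity-robust version, and the sum of squared per-cluster score totals for the clustered version.

```python
from collections import defaultdict
from math import sqrt


def inv2(m):
    """Invert a 2x2 matrix stored as nested lists."""
    (a, b), (c, d) = m
    det = a * d - b * c
    return [[d / det, -b / det], [-c / det, a / det]]


def matmul(a, b):
    """Multiply two small matrices stored as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))]
            for i in range(len(a))]


def add_outer(m, v):
    """Return m + v v' for a square matrix m and a vector v."""
    return [[m[i][j] + v[i] * v[j] for j in range(len(v))]
            for i in range(len(v))]


def sandwich_se(bread, meat):
    """Standard errors from the diagonal of bread * meat * bread."""
    v = matmul(matmul(bread, meat), bread)
    return [sqrt(v[j][j]) for j in range(len(v))]


def ols_with_se(x, y, cluster):
    """OLS of y on (1, x) with classical, HC0 and cluster-robust SEs."""
    rows = [[1.0, xi] for xi in x]          # design matrix with intercept
    n, k = len(rows), 2
    zero = [[0.0, 0.0], [0.0, 0.0]]

    xtx = zero
    for r in rows:
        xtx = add_outer(xtx, r)             # accumulate X'X
    bread = inv2(xtx)                       # (X'X)^{-1}
    xty = [[sum(r[j] * yi for r, yi in zip(rows, y))] for j in range(k)]
    beta = [b[0] for b in matmul(bread, xty)]
    resid = [yi - (beta[0] * r[0] + beta[1] * r[1])
             for r, yi in zip(rows, y)]

    # Classical: s^2 (X'X)^{-1}, valid only under homoskedastic, independent errors
    s2 = sum(u * u for u in resid) / (n - k)
    se_classical = [sqrt(s2 * bread[j][j]) for j in range(k)]

    # White / HC0 meat: sum over observations of (x_i u_i)(x_i u_i)'
    meat_hc0 = zero
    for r, u in zip(rows, resid):
        meat_hc0 = add_outer(meat_hc0, [rj * u for rj in r])

    # Cluster meat: add up x_i u_i within each cluster first, then take outer products
    sums = defaultdict(lambda: [0.0, 0.0])
    for r, u, g in zip(rows, resid, cluster):
        sums[g] = [s + rj * u for s, rj in zip(sums[g], r)]
    meat_cl = zero
    for s in sums.values():
        meat_cl = add_outer(meat_cl, s)

    return {
        "beta": beta,
        "classical": se_classical,
        "hc0": sandwich_se(bread, meat_hc0),
        "cluster": sandwich_se(bread, meat_cl),
    }


# Made-up illustration data: six observations in three clusters
x = [1, 2, 3, 4, 5, 6]
y = [1.1, 2.3, 2.9, 4.2, 4.8, 6.3]
cluster = ["a", "a", "b", "b", "c", "c"]

if __name__ == "__main__":
    for name, value in ols_with_se(x, y, cluster).items():
        print(name, value)
```

When every observation is its own cluster, the clustered meat reduces to the HC0 meat, which is a convenient sanity check on the code.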
Hence, obtaining the correct SE is critical. Cluster-robust standard errors are now widely used, popularized in part by Rogers (1993), who incorporated the method in Stata, and by Bertrand, Duflo and Mullainathan (2004), who pointed out that many differences-in-differences studies failed to control for clustered errors, and those that did often clustered at the wrong level.

Here's the Stata code that I will benchmark against. Fixed effects probit regression is limited in this case because it may ignore necessary random effects and/or non-independence in the data. Let's see an example with the Union Dataset:

    logit union age south year
    estimates store logit
    logit union age south year, cluster(id)
    estimates store cluster
    xtlogit union age south year, i(id) re
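For a maximum-likelihood model such as the logit above, the clustered variance has the same sandwich shape as in the linear case, with per-observation scores playing the role of x_i u_i. The following is a sketch in standard textbook notation. The symbols \Lambda_i, s_i, s_g, A and B are introduced here for illustration and do not come from the text; g indexes the G clusters (here, values of id).

```latex
\begin{aligned}
\Lambda_i &= \frac{1}{1 + e^{-x_i'\hat{\beta}}}, \qquad
s_i = (y_i - \Lambda_i)\, x_i, \qquad
A = \sum_{i=1}^{N} \Lambda_i (1 - \Lambda_i)\, x_i x_i', \\
s_g &= \sum_{i \in g} s_i, \qquad
B = \sum_{g=1}^{G} s_g s_g', \\
\widehat{V}_{\text{cluster}}(\hat{\beta}) &= A^{-1} B A^{-1}.
\end{aligned}
```

The point estimates are the same with or without the cluster option; only the estimated variance changes. Software typically also multiplies this expression by a small-sample factor, so reported standard errors may differ slightly from the raw formula.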