|
Concepts illustrated in the paper
(The list is expandable. Click on the concept to see
its description.) |
Structure:
- Introduction
(see example)
-
The Introduction should convey four things. First, what
is the question that the paper asks. Second, why is the question
important. Third, how is the paper going to answer the question.
Finally, how is the paper related to existing work. The introduction is
the most important part of any paper. No one will continue to read any
further if the introduction is confusing or poorly written.
- Data
(see example)
-
The Data section should accomplish three things: First,
state the sources of data. Second, discuss the variables used and how
they relate to the concepts that they are supposed to measure. Finally,
present the data’s descriptive statistics.
- Empirical Results
(see example)
-
The Empirical Results section should present and
discuss the empirical results. The presentation of results is
usually done with a table. The discussion of results typically
includes a statement of whether the results support or refute
the hypothesis, a statement of whether the results are
statistically significant, interpretation of the magnitude of
the coefficients and a comment on functional form.
- Conclusion
(see example)
-
The conclusion should accomplish three things:
summarize the results, explore the implications of the results, and point
to future research.
|
Writing Style:
- Citation Style
(see example)
- Use of acronyms
(see example)
-
The first time an acronym is used it should be written
out, followed by the acronym in parenthesis.
- Use of first person
(see example)
-
It is acceptable to use first person (I) in an economics
paper.
- Coherence
(see example)
- Make each sentence linked to the previous one.
- Tense
(see example)
-
It is appropriate to use past tense when describing the
construction of your variables. However, use present tense when
referring to tables or your results.
|
Conventions in
an Empirical Paper:
- Descriptive Statistics Table
(see example)
-
A
descriptive statistics table should include the list of
variables and the mean, median, standard deviation, minimum and
maximum. In cases where the number of observations varies from
variable to variable, a column specifying the number of
observations is necessary. The orientation of the table should
be such that the variables are in rows and the statistics in
columns. This way, even if a large number of variables are used,
the table will fit on one page.
- Discussing Descriptive Statistics
(see example)
-
Discussing the minimum and maximum and the
corresponding data points makes the data “come alive.” It also
reassures the reader that the data was put together correctly.
- Rounding numbers in the text
(see example)
-
When discussing quantities in the text, use
round numbers.
- Presentation of regression results
(see example)
-
Regression results are typically presented in
this compact form. The columns show results from 6 different
regressions. The rows show the intercept, independent variables
and the R-squared. The estimated coefficients and their
associated standard errors in parentheses appear inside the table.
Some authors prefer to show each coefficient’s t-statistics in
parentheses; therefore it is always necessary to specify this in the
table’s footnote. If the independent variable is not included in
a specification, the cell corresponding to that independent
variable and specification is left blank. If the number of
observations varies across specifications, it can be included as
the last row. The asterisks are for easy identification of the
significance level - the more asterisks, the higher the
significance.
- Converting variables to convenient
units (see example)
-
In order to be able to present regression results
in a compact and readable form, it is necessary to convert the variables to
appropriate units. For
example, the appropriate units for payroll are millions of dollars. This
is because if payroll were in dollars, the coefficient in specification
(3) would appear as 0.0000001 which is more difficult to fit in a table
and more difficult to read.
- Interpreting estimated coefficients
(see example)
-
It is very important to include the units of both
the independent and the dependent variables.
- Assessing economic significance
(see example)
-
Assessing economic significance
requires judgment. Unlike statistical significance, there is no
"official" benchmark for assessing economic significance.
|
Other:
- Title
(see example)
-
The title should concisely express what the paper is
about. It can also be used to capture the reader's attention.
- Searching for existing literature (see
example)
-
EconLit is the most commonly used database for searching
published papers in Economics. Working papers can be found via
IDEAS,
SSRN,
NBER or even
google.
- Effect vs. affect
(see example)
-
"Effect" is usually a noun (that is, it could be
preceeded by "the"). "Affect" is usually a verb.
- Appeal to authority
(see example)
-
It is appropriate to cite other studies when
justifying the use of a variable or technique. This also makes
the comparison to other work easier.
- Acknowledge shortcomings of data
(see example)
-
It is appropriate to acknowledge the shortcomings
of your data. The shortcomings could come from unreliability of
the source, lack of observations or, as in this case, lack of
time to properly adjust the data for inflation.
|
| |
Does pay inequality within
a team affect performance?
Tomas Dvorak* |
 |
 |
1. Introduction
The business of sports draws considerable attention from the media and
the general public. Fans and sports writers frequently speculate about
the effects of money on athletic performance. There is general agreement
that more financial resources usually lead to better athletic
performance. In team sports, higher pay can be used to lure better
players from other teams and therefore improve performance.
However, performance can also be affected by pay inequality among
players within a team. On the one hand, pay inequality could have a negative effect because it may hinder
cooperation among team members. In many sports, team cooperation is
critical for good performance. If pay
inequality creates tensions or animosity among team members, performance
is likely to suffer. On the other hand, inequality could have a positive
effect on performance by providing incentives. The prospect of a very
large salary could be a powerful drive behind an athlete’s performance.
Pay inequality might also enhance performance if low paid players
learn from high paid players. This would happen when pay inequality is
associated with skill inequality. For example, if a highly paid
superstar can teach other players, the overall performance of a team may
improve. Given that arguments can be made both ways, it is not
surprising that there is little agreement on the effects of pay
inequality on team performance. The purpose of this paper is to determine whether,
on balance, the effect of
pay inequality on performance is positive or negative.
|


|
| |
Understanding the effect of pay inequality on a team’s performance is
important for at least two reasons. First, team managers can use this
information to make decisions about which players to hire. For example,
should they hire one expensive superstar and two inexpensive players, or
three medium-priced players? If we find that pay inequality leads to
poor team performance, then the team may perform better with three
medium-priced players than one superstar and two low-priced players.
Second, because salaries are a large part of contract negotiations
between player associations and team owners, understanding the effects
of pay inequality on performance can help determine optimal policies.
For example, if pay inequality has a negative effect on performance, an
argument for a higher minimum salary could be made.
|
 |
 |
There
are a number of studies that look at the effects of pay inequality on
performance. DeBrock, Hendricks and Koenker (2004) study the effects of
pay inequality on performance in Major League Baseball (MLB) . They find
that pay inequality is associated with poor performance. Frick, Prinze
and Winklemann (2003) look at the effects of pay inequality in all four
major leagues in North America. They find that inequality improves
team performance in basketball and worsens team performance in baseball.
They find no statistically significant effect of inequality on
performance in football and hockey.
|
  |

|
This paper looks at the effects of inequality on performance in MLB.
It differs from that of DeBrock,
Hendricks and Koenker (2004) in that it uses the most recent data. While
the previous authors use data from 1985 through 1998, I use data from the latest two
seasons: 2003 and 2004.
Another difference is that I use a different measure of pay inequality.
Rather than the Herfindahl index, I
use the percentage of payroll earned by the best paid 20% of players.
I chose the share earned by the top
20% players for two reasons: it is somewhat easier to calculate, and its
magnitude is easier to interpret.
|
  |


|
2. Data
The data on pay inequality was constructed in the following way. From
the USA Today salary database, I collected annual salaries for each
player in all MLB teams during the 2003 and 2004 seasons. I summed the
salaries of all players for each team and each season to obtain the
total payroll. The active roster in baseball is 25, but the database
includes salaries of disabled players as well. Therefore, the number of
players for each team ranges from 25 to 31. As the measure of pay
inequality, I calculated the percentage of payroll earned by the highest
paid 20% of players. For example, for a 30 player team I summed the
salaries of the highest paid 6 players and divide that amount by total
payroll. If every player earned the same amount, the best paid 20% would
earn exactly 20% of the payroll. When pay is unequal, this measure is
higher than 20%. The higher the share of payroll earned by the top 20%
of players, the higher the pay inequality.
To measure performance I use the percentage of games won in the
regular season. This data comes from BaseballReference.com. It does not
include performance during league championships or the World Series.
However, with 162 games per regular season, the winning
percentage can be regarded as a reasonable measure of performance.
This is also the
measure used by DeBrock, Hendricks and Koenker (2004).
|
   
|
|


|
In addition to pay inequality and performance, I use data on the total
payroll of each team. This is a measure of financial resources which
could be an important determinant of performance. I measure payroll in
current dollars and do not adjust for inflation.
While 2003 dollars are
not exactly comparable to 2004 dollars, 2003 inflation was low enough
not to influence the results significantly.
Table 1 shows the descriptive statistics of each variable. In the first
row we see that on average the highest paid 20% of players earn about
61% of the total payroll. This implies that on a 30 player team, the six
best paid players earn more than the remaining 24 combined. According to
this measure, the team with the most equitable pay is the New York
Yankees during the 2003 season when the top 20% of players earned only 42%
of total payroll. The team with the highest inequality was the Colorado
Rockies during the 2004 season. On that team, five players earned more
than 78% of the team’s total payroll.
The second row in Table 1 shows that the average winning percentage is
50% which has to be the case since for every game won there is a game
lost. The Detroit Tigers have the lowest winning percentage in the data
with only 26% of games won during the 2003 season. The maximum winning
percentage in the data is for the St. Louis Cardinals, who won nearly 65%
of their games during the 2004 season. Finally, the last row in Table
1 shows that the average payroll is about 70 million dollars. The range
of payroll is quite striking. It goes from less than 20 million dollars
for the Tampa Bay Rays to over 184 million for the New York Yankees.
Table 1: Descriptive Statistics
| |
mean |
median |
st.dev. |
min |
max |
|
Top20share (in %) |
61.0 |
61.4 |
8.0 |
42.2 |
78.3 |
| Games Won (in %) |
50.0 |
51.6 |
8.2 |
26.5 |
64.8 |
| Payroll (in mil. USD)
|
70.0 |
65.3 |
30.3 |
19.6 |
184.2 |
|
 

 |
 

|
3. Empirical
Results
I estimate three different specifications. The dependent variable in
each specification is performance, as measured by the percentage of
games won. Pay inequality and total payroll are the independent
variables. Table 2 shows the results. In the first specification, I
regress performance on the share earned by the top 20% of players. The
coefficient on the share of top 20% is negative and statistically
significant. This indicates that teams with higher pay inequality tend
to win fewer games. A one percentage point increase in the share of
payroll earned by the top 20% of players is associated with about half
of a percentage point decline in the percentage of games won.
Table 2: Regression Results
|
Dependent variable: winning percentage (in %) |
|
|
(1) |
(2) |
(3) |
|
Intercept |
77.3
(7.43)** |
59.9
(9.51)**
|
37.35
(15.37)* |
|
Top20share (in %) |
-0.45
(0.12)** |
-0.27
(0.13)*
|
-0.28
(0.13)* |
|
Payroll (in mil. USD) |
|
0.10
(0.04)**
|
|
|
Log of Payroll |
|
|
7.09
(2.43)** |
|
R-squared |
0.19 |
0.29 |
0.29 |
|
Adjusted R-squared |
0.18 |
0.26 |
0.27 |
|
Number of observations is
60.
Standard errors are in parentheses.
** significant at 1%, * significant at 5% |
In the second specification I include total payroll
as an independent variable. Payroll is a measure of the financial
resources which can affect performance - the higher the payroll, the
higher the quality of players and, generally, the better the
performance. Therefore, including payroll may increase the precision of
the estimated coefficient on pay inequality. More importantly, it is
possible that pay inequality is correlated with total payroll. If low
payroll teams tend to have more pay inequality, then the coefficient on
pay inequality in specification (1) is biased. Indeed, the correlation
coefficient between the share earned by the top 20% of players and total
payroll is -0.5. Teams with high pay inequality may perform worse not
because of pay inequality, but because they are also the teams with a
lower payroll. Therefore, in order to measure the effect of pay
inequality on performance, I need to control for total payroll.
Once I control for total payroll, the coefficient on the share of the
top 20% remains statistically significant but the magnitude drops
substantially. Holding payroll
constant, a one percentage point increase in the share earned by the
highest paid 20% is associated with a 0.27 percentage point decline in
the percentage of games won. The
impact of inequality on performance does not seem enormous. For example,
a five percentage point increase in inequality for the team with median
inequality would shift the team up 13 spots in the inequality ranking,
but its performance ranking would drop by only 2 spots. The coefficient
on total payroll is positive and statistically significant. A one
million dollar increase in total payroll is associated with about 0.1
percentage point increase in the percentage of games won. This indicates
that greater financial resources tend to improve performance. Adding
payroll as an independent variable led to an increase in R-squared from
about 0.19 to 0.29.
Finally, in specification (3) I include the logarithm of payroll instead
of payroll. I want to verify that the result in specification (2) is
robust to different functional forms. In addition, the effect of an
additional one million dollars may be smaller for a team with a 100
million payroll than for one with a 20 million payroll. Thus, including
payroll in logarithm seems appropriate. The coefficient on the share of
the top 20% remains statistically significant with roughly the same
magnitude. The log of payroll is statistically significant. A one
percent increase in payroll is associated with about 0.8 percentage
points increase in the percentage of games won.
|


|



|
4. Conclusion
The analysis in this paper shows that pay inequality within MLB teams has a negative effect on
performance. The effect remains statistically significant even after
controlling for total payroll. The result is the same as that of DeBrock
et al. (2004) who use data from 1985 through 1998. My paper confirms
their finding using the most recent data and using a different measure
of pay inequality.
The fact that pay inequality leads to worse performance implies that
managers should strive for pay equality in their teams. For example,
instead of hiring two low-priced players and one superstar, performance
may be better if three medium-priced players are hired. Given these
results, it is surprising that there is not a more equal distribution of
pay in baseball. One possible explanation is that managers may care
about attendance as well as winning. They may be willing to sign up an
expensive superstar who will attract fans even though it will increase
pay inequality and may hinder performance.
The conclusions above are subject to a number of limitations. First, it
is unclear to what extent the results can be generalized to other
sports. Each sport requires a different degree of cooperation among team
members. Therefore, the relationship between pay inequality and
performance is likely to differ across sports. Second, the error terms
for each team could be correlated over time. For example, if a team wins
a lot of games one year given its payroll and pay inequality, that team
is likely to win a lot of games the next year as well. Therefore, the
estimation procedure may need to correct for this autocorrelation.
Finally, there may be other variables that affect performance, e.g. coach
salary or quality of training facilities. Including these in the
regression would increase the precision of my estimates as well as eliminate potential omitted variable bias.
The channels through which pay inequality affects performance are not
clear. I can think of two possibilities. One is that pay inequality
leads to tensions within the team and impairs performance. The other
possibility is that baseball requires players of similar quality. Pay
inequality is probably associated with skill inequality, and it may be
the skill inequality that drives down performance. An excellent pitcher
cannot win the game when the outfielders cannot catch or throw. It may
be possible to distinguish these two channels empirically. Using
statistics on individual player skill level, one could construct a
measure of skill inequality for a team and include it as an additional
control. The coefficient on pay inequality in that case would capture
the effect of pay inequality on
performance while holding skill inequality constant. A negative impact
of pay inequality would then support the idea that pay
inequality leads to tensions which affect performance. This
investigation, however, is left for future research.
|



|
 |
References:
DeBrock, Lawrence, Wallace Hendricks, and Roger
Koenker. 2004. Pay and performance: The impact of salary distribution on
firm-level outcomes in baseball. Journal of Sports Economics 5
(August): 243–261.
Frick, Bernd, Joachim Prinz, and Karina Winkelmann.
2003. Pay inequalities and team performance: Empirical evidence from the
North American major leagues. International Journal of Manpower
24: 472-491.
|
|
 |
Appendix:
Data with
documentation and results:
MLB.xls
|
|
| |
(back to the top) |
|
| * I would like to thank Mary
Mar, Youghwan Song, Stephen Schmidt and two anonymous referees for their
helpful comments. I am also grateful to many Union College students for
their useful feedback. |