Template:Short description Template:More citations needed
In economics, the Lorenz curve is a graphical representation of the distribution of income or of wealth. It was developed by Max O. Lorenz in 1905 for representing inequality of the wealth distribution.
The curve is a graph showing the proportion of overall income or wealth assumed by the bottom Template:Nowrap of the people, although this is not rigorously true for a finite population (see below). It is often used to represent income distribution, where it shows for the bottom Template:Nowrap of households, what percentage Template:Nowrap of the total income they have. The percentage of households is plotted on the Template:Mvar-axis, the percentage of income on the Template:Mvar-axis. It can also be used to show distribution of assets. In such use, many economists consider it to be a measure of social inequality.
The concept is useful in describing inequality among the size of individuals in ecology<ref name=EcolgyArticle>Template:Cite journal</ref> and in studies of biodiversity, where the cumulative proportion of species is plotted against the cumulative proportion of individuals.<ref name=natureArticle>Template:Cite journal</ref> It is also useful in business modeling: e.g., in consumer finance, to measure the actual percentage Template:Nowrap of delinquencies attributable to the Template:Nowrap of people with worst risk scores. Lorenz curves were also applied to epidemiology and public health, e.g., to measure pandemic inequality as the distribution of national cumulative incidence (y%) generated by the population residing in areas (x%) ranked with respect to their local epidemic attack rate.<ref>Template:Cite journal</ref>
ExplanationEdit
Template:Lorenz curve global income 2011.svg Data from 2005. Points on the Lorenz curve represent statements such as, "the bottom 20% of all households have 10% of the total income."
A perfectly equal income distribution would be one in which every person has the same income. In this case, the bottom Template:Nowrap of society would always have Template:Nowrap of the income. This can be depicted by the straight line Template:Math; called the "line of perfect equality."
By contrast, a perfectly unequal distribution would be one in which one person has all the income and everyone else has none. In that case, the curve would be at Template:Math for all Template:Math, and Template:Math when Template:Math. This curve is called the "line of perfect inequality."
The Gini coefficient is the ratio of the area between the line of perfect equality and the observed Lorenz curve to the area between the line of perfect equality and the line of perfect inequality. The higher the coefficient, the more unequal the distribution is. In the diagram on the right, this is given by the ratio Template:Math, where Template:Mvar and Template:Mvar are the areas of regions as marked in the diagram.
Definition and calculationEdit
The Lorenz curve is a probability plot (a P–P plot) comparing the distribution of a variable against a hypothetical uniform distribution of that variable. It can usually be represented by a function Template:Math, where Template:Mvar, the cumulative portion of the population, is represented by the horizontal axis, and Template:Mvar, the cumulative portion of the total wealth or income, is represented by the vertical axis.
The curve Template:Mvar need not be a smoothly increasing function of Template:Mvar, For wealth distributions there may be oligarchies or people with negative wealth for instance.<ref>Template:Cite journal</ref>
For a discrete distribution of Y given by values Template:Math, ..., Template:Math in non-decreasing order Template:Math and their probabilities <math>f(y_j) := \Pr(Y=y_j)</math> the Lorenz curve is the continuous piecewise linear function connecting the points Template:Math, Template:Math, where Template:Math, Template:Math, and for Template:Math: <math display="block">\begin{align} F_i &:= \sum_{j=1}^i f(y_j) \\ S_i &:= \sum_{j=1}^i f(y_j) \, y_j \\ L_i &:= \frac{S_i}{S_n} \end{align}</math>
When all Template:Math are equally probable with probabilities Template:Math this simplifies to <math display="block">\begin{align} F_i &= \frac i n \\ S_i &= \frac 1 n \sum_{j=1}^i \; y_j \\ L_i &= \frac{S_i}{S_n} \end{align} </math>
For a continuous distribution with the probability density function Template:Mvar and the cumulative distribution function Template:Mvar, the Lorenz curve Template:Mvar is given by: <math display="block"> L(F(x)) = \frac{\int_{-\infty}^x t\,f(t)\,dt}{\int_{-\infty}^\infty t\,f(t)\,dt} = \frac{\int_{-\infty}^x t\,f(t)\,dt}{\mu} </math> where <math>\mu</math> denotes the average. The Lorenz curve Template:Math may then be plotted as a function parametric in Template:Mvar: Template:Math vs. Template:Math. In other contexts, the quantity computed here is known as the length biased (or size biased) distribution; it also has an important role in renewal theory.
Alternatively, for a cumulative distribution function Template:Math with inverse Template:Math, the Lorenz curve Template:Math is directly given by: <math display="block"> L(F) = \frac{\int_0^F x(F_1)\,dF_1}{\int_0^1 x(F_1)\,dF_1} </math>
The inverse Template:Math may not exist because the cumulative distribution function has intervals of constant values. However, the previous formula can still apply by generalizing the definition of Template:Math: <math display="block"> x(F_1) = \inf \{y : F(y) \geq F_1\}</math> where Template:Math is the infimum.
For an example of a Lorenz curve, see Pareto distribution.
PropertiesEdit
A Lorenz curve always starts at (0,0) and ends at (1,1).
The Lorenz curve is not defined if the mean of the probability distribution is zero or infinite.
The Lorenz curve for a probability distribution is a continuous function. However, Lorenz curves representing discontinuous functions can be constructed as the limit of Lorenz curves of probability distributions, the line of perfect inequality being an example.
The information in a Lorenz curve may be summarized by the Gini coefficient and the Lorenz asymmetry coefficient.<ref name=EcolgyArticle />
The Lorenz curve cannot rise above the line of perfect equality.
A Lorenz curve that never falls beneath a second Lorenz curve and at least once runs above it, has Lorenz dominance over the second one.<ref>Template:Cite journal</ref>
If the variable being measured cannot take negative values, the Lorenz curve:
- cannot sink below the line of perfect inequality,
- is increasing.
Note however that a Lorenz curve for net worth would start out by going negative due to the fact that some people have a negative net worth because of debt.
The Lorenz curve is invariant under positive scaling. If Template:Math is a random variable, for any positive number Template:Mvar the random variable Template:Mvar has the same Lorenz curve as Template:Math.
The Lorenz curve is flipped twice, once about Template:Math and once about Template:Math, by negation. If Template:Math is a random variable with Lorenz curve Template:Math, then Template:Math has the Lorenz curve:
The Lorenz curve is changed by translations so that the equality gap Template:Math changes in proportion to the ratio of the original and translated means. If Template:Math is a random variable with a Lorenz curve Template:Math and mean Template:Math, then for any constant Template:Math, Template:Math has a Lorenz curve defined by: <math display="block">F - L_{X+c}(F) = \frac{\mu_X}{\mu_X + c} ( F - L_X(F))</math>
For a cumulative distribution function Template:Math with mean Template:Mvar and (generalized) inverse Template:Math, then for any Template:Mvar with 0 < Template:Math :
- If the Lorenz curve is differentiable:<math display="block">\frac{d L(F)}{d F} = \frac{x(F)}{\mu}</math>
- If the Lorenz curve is twice differentiable, then the probability density function Template:Math exists at that point and: <math display="block">\frac{d^2 L(F)}{d F^2} = \frac{1}{\mu\,f(x(F))}\,</math>
- If Template:Math is continuously differentiable, then the tangent of Template:Math is parallel to the line of perfect equality at the point Template:Math. This is also the point at which the equality gap Template:Math, the vertical distance between the Lorenz curve and the line of perfect equality, is greatest. The size of the gap is equal to half of the relative mean absolute deviation: <math display="block">F(\mu) - L(F(\mu)) = \frac{\text{mean absolute deviation}}{2\,\mu}</math>
ExamplesEdit
Both Template:Math and Template:Math, for Template:Math, are well-known functional forms for the Lorenz curve.<ref>Template:Cite journal</ref>
See alsoEdit
- Distribution (economics)
- Distribution of wealth
- Welfare economics
- Income inequality metrics
- Gini coefficient
- Hoover index (a.k.a. Robin Hood index)
- ROC analysis
- Social welfare (political science)
- Economic inequality
- Zipf's law
- Pareto distribution
- Mean deviation
- The Elephant Curve
ReferencesEdit
Further readingEdit
External linksEdit
- WIID Template:Webarchive: World Income Inequality Database, a source of information on inequality, collected by WIDER (World Institute for Development Economics Research, part of United Nations University)
- glcurve: Stata module to plot Lorenz curve (type "findit glcurve" or "ssc install glcurve" in Stata prompt to install)
- Free add-on to STATA to compute inequality and poverty measures
- Free Online Software (Calculator) computes the Gini Coefficient, plots the Lorenz curve, and computes many other measures of concentration for any dataset
- Free Calculator: Online and downloadable scripts (Python and Lua) for Atkinson, Gini, and Hoover inequalities
- Users of the R data analysis software can install the "ineq" package which allows for computation of a variety of inequality indices including Gini, Atkinson, Theil.
- A MATLAB Inequality Package Template:Webarchive, including code for computing Gini, Atkinson, Theil indexes and for plotting the Lorenz Curve. Many examples are available.
- A complete handout about the Lorenz curve including various applications, including an Excel spreadsheet graphing Lorenz curves and calculating Gini coefficients as well as coefficients of variation.
- LORENZ 3.0 is a Mathematica notebook which draw sample Lorenz curves and calculates Gini coefficients and Lorenz asymmetry coefficients from data in an Excel sheet.