Open main menu
Home
Random
Recent changes
Special pages
Community portal
Preferences
About Wikipedia
Disclaimers
Incubator escapee wiki
Search
User menu
Talk
Dark mode
Contributions
Create account
Log in
Editing
Long tail
(section)
Warning:
You are not logged in. Your IP address will be publicly visible if you make any edits. If you
log in
or
create an account
, your edits will be attributed to your username, along with other benefits.
Anti-spam check. Do
not
fill this in!
==Statistical meaning== [[File:Longtail.svg|frame|right|The tail becomes bigger and longer in new markets (depicted in red). For example, if time progresses left to right, traditional retailers have focused on the area to the left of the chart, while online bookstores derive more sales from the area to the right.]] ''The long tail'' is the name for a long-known feature of some statistical distributions (such as [[Zipf's law|Zipf]], [[power law]]s, [[Pareto distribution]]s and [[Lévy skew alpha-stable distribution|general Lévy distributions]]). In "long-tailed" distributions a high-frequency or high-amplitude population is followed by a low-frequency or low-amplitude population which gradually "tails off" [[asymptotically]]. The events at the far end of the tail have a very low probability of occurrence. As a [[rule of thumb]], for such population distributions the majority of occurrences (more than half, and where the [[Pareto principle]] applies, 80%) are accounted for by the first 20% of items in the distribution. [[Power law]] distributions or functions characterize an important number of behaviors from nature and human endeavor. This fact has given rise to a keen scientific and social interest in such distributions, and the relationships that create them. The observation of such a distribution often points to specific kinds of mechanisms, and can often indicate a deep connection with other, seemingly unrelated systems. Examples of behaviors that exhibit long-tailed distribution are the occurrence of certain words in a given language, the income distribution of a business or the intensity of earthquakes (see: [[Gutenberg–Richter law]]). [[Chris Anderson (writer)|Chris Anderson]]'s and [[Clay Shirky]]'s articles highlight special cases in which we are able to modify the underlying relationships and evaluate the impact on the frequency of events. In those cases the infrequent, low-amplitude (or low-revenue) events – the long tail, represented here by the portion of the curve to the right of the 20th percentile – can become the largest area under the line. This suggests that a variation of one mechanism (internet access) or relationship (the cost of storage) can significantly shift the frequency of occurrence of certain events in the distribution. The shift has a crucial effect in probability and in the customer demographics of businesses like [[mass media]] and online sellers. However, the long tails characterizing distributions such as the [[Gutenberg–Richter law]] or the words-occurrence [[Zipf's law]], and those highlighted by Anderson and Shirky are of very different, if not opposite, nature: Anderson and Shirky refer to frequency-rank relations, whereas the Gutenberg–Richter law and the Zipf's law are probability distributions. Therefore, in these latter cases "tails" correspond to large-intensity events such as large earthquakes and most popular words, which dominate the distributions. By contrast, the long tails in the frequency-rank plots highlighted by Anderson and Shirky would rather correspond to short tails in the associated probability distributions, and therefore illustrate an opposite phenomenon compared to the Gutenberg–Richter and the Zipf's laws.
Edit summary
(Briefly describe your changes)
By publishing changes, you agree to the
Terms of Use
, and you irrevocably agree to release your contribution under the
CC BY-SA 4.0 License
and the
GFDL
. You agree that a hyperlink or URL is sufficient attribution under the Creative Commons license.
Cancel
Editing help
(opens in new window)