numpy.random.RandomState.pareto

原文：https://docs.scipy.org/doc/numpy/reference/generated/numpy.random.RandomState.pareto.html

译者：飞龙 UsyiyiCN

校对：（虚位以待）

RandomState.pareto(a, size=None)

从具有指定形状的Pareto II或Lomax分布绘制样品。

Lomax或Pareto II分布是偏移的Pareto分布。经典的帕累托分布可以通过加上1并乘以标度参数m从Lomax分布获得（参见注释）。The smallest value of the Lomax distribution is zero while for the classical Pareto distribution it is mu, where the standard Pareto distribution has location mu = 1. Lomax也可以被认为是广义帕累托分布的简化版本（在SciPy中可用），比例设置为1并且位置设置为零。

Pareto分布必须大于零，并且在上面是无界的。它也被称为“80-20规则”。在这种分布中，80％的权重在该范围的最低20％，而另外20％的权重在剩余的80％的范围内。

参数：	shape：float，> 0。形状的分布。 size：int或tuple的整数，可选输出形状。如果给定形状是例如`（m， n， k）`，则 `m * n * k`默认值为None，在这种情况下返回单个值。

也可以看看

scipy.stats.distributions.lomax.pdf: 概率密度函数，分布或累积密度函数等。
scipy.stats.distributions.genpareto.pdf: 概率密度函数，分布或累积密度函数等。

笔记

帕累托分布的概率密度为

其中 $a$ 是形状， $m$ 刻度。

帕累托分布以意大利经济学家Vilfredo Pareto命名，是一个幂律概率分布，用于许多现实世界的问题。在经济学领域之外，它通常被称为布拉德福德分布。帕累托开发了分布来描述一个经济体中财富的分布。它还发现在保险，网页访问统计，油田大小和许多其他问题，包括Sourceforge [R180]中项目的下载频率。它是所谓的“胖尾”分布之一。

参考文献

[R180]

（1，2） Francis Hunt和Paul Johnson，On the Pareto Distribution of Sourceforge projects。

[R181]

Pareto，V。（1896）。政治经济学课程。洛桑。

[R182]

Reiss，R.D.，Thomas，M。（2001），Statistical Analysis of Extreme Values，Birkhauser Verlag，Basel，第23-30页。

[R183]

维基百科，“帕累托分布”，http://en.wikipedia.org/wiki/Pareto_distribution

例子

从分布绘制样本：

>>> a, m = 3., 2.  # shape and mode
>>> s = (np.random.pareto(a, 1000) + 1) * m

显示样本的直方图，以及概率密度函数：

>>> import matplotlib.pyplot as plt
>>> count, bins, _ = plt.hist(s, 100, normed=True)
>>> fit = a*m**a / bins**(a+1)
>>> plt.plot(bins, max(count)*fit/max(fit), linewidth=2, color='r')
>>> plt.show()

（源代码，png，pdf）

../../_images/numpy-random-RandomState-pareto-1.png