Matplotlib 中的平行坐标图

使用传统的绘图类型可以相对直接地查看二维和三维数据。即使是四维数据，我们也常常可以找到一种显示数据的方法。但是，超过四的维度变得越来越难以显示。幸运的是，平行坐标图提供了一种查看更高维度结果的机制。

来自维基百科的示例平行坐标图

几个绘图包提供平行坐标图，例如 Matlab 、 R 、 VTK type 1 和 VTK type 2 ，但我不知道如何使用 Matplotlib 创建一个。

Matplotlib 中是否有内置的平行坐标图？我当然没有在画廊中看到一个。
如果没有内置类型，是否可以使用 Matplotlib 的标准功能构建平行坐标图？

编辑：

根据下面 Zhenya 提供的答案，我开发了以下支持任意数量轴的概括。按照我在上面的原始问题中发布的示例的绘图样式，每个轴都有自己的比例。我通过标准化每个轴点的数据并使轴的范围为 0 到 1 来实现此目的。然后我返回并将标签应用于每个刻度线，在该截距处给出正确的值。

该函数通过接受可迭代的数据集来工作。每个数据集都被视为一组点，其中每个点位于不同的轴上。 __main__ 中的示例在两组 30 行中为每个轴获取随机数。线条在导致线条聚集的范围内是随机的；我想验证的行为。

这个解决方案不如内置解决方案好，因为你有奇怪的鼠标行为，而且我通过标签伪造数据范围，但在 Matplotlib 添加内置解决方案之前，它是可以接受的。

 #!/usr/bin/python
import matplotlib.pyplot as plt
import matplotlib.ticker as ticker

def parallel_coordinates(data_sets, style=None):

    dims = len(data_sets[0])
    x    = range(dims)
    fig, axes = plt.subplots(1, dims-1, sharey=False)

    if style is None:
        style = ['r-']*len(data_sets)

    # Calculate the limits on the data
    min_max_range = list()
    for m in zip(*data_sets):
        mn = min(m)
        mx = max(m)
        if mn == mx:
            mn -= 0.5
            mx = mn + 1.
        r  = float(mx - mn)
        min_max_range.append((mn, mx, r))

    # Normalize the data sets
    norm_data_sets = list()
    for ds in data_sets:
        nds = [(value - min_max_range[dimension][0]) /
                min_max_range[dimension][2]
                for dimension,value in enumerate(ds)]
        norm_data_sets.append(nds)
    data_sets = norm_data_sets

    # Plot the datasets on all the subplots
    for i, ax in enumerate(axes):
        for dsi, d in enumerate(data_sets):
            ax.plot(x, d, style[dsi])
        ax.set_xlim([x[i], x[i+1]])

    # Set the x axis ticks
    for dimension, (axx,xx) in enumerate(zip(axes, x[:-1])):
        axx.xaxis.set_major_locator(ticker.FixedLocator([xx]))
        ticks = len(axx.get_yticklabels())
        labels = list()
        step = min_max_range[dimension][2] / (ticks - 1)
        mn   = min_max_range[dimension][0]
        for i in xrange(ticks):
            v = mn + i*step
            labels.append('%4.2f' % v)
        axx.set_yticklabels(labels)

    # Move the final axis' ticks to the right-hand side
    axx = plt.twinx(axes[-1])
    dimension += 1
    axx.xaxis.set_major_locator(ticker.FixedLocator([x[-2], x[-1]]))
    ticks = len(axx.get_yticklabels())
    step = min_max_range[dimension][2] / (ticks - 1)
    mn   = min_max_range[dimension][0]
    labels = ['%4.2f' % (mn + i*step) for i in xrange(ticks)]
    axx.set_yticklabels(labels)

    # Stack the subplots
    plt.subplots_adjust(wspace=0)

    return plt

if __name__ == '__main__':
    import random
    base  = [0,   0,  5,   5,  0]
    scale = [1.5, 2., 1.0, 2., 2.]
    data = [[base[x] + random.uniform(0., 1.)*scale[x]
            for x in xrange(5)] for y in xrange(30)]
    colors = ['r'] * 30

    base  = [3,   6,  0,   1,  3]
    scale = [1.5, 2., 2.5, 2., 2.]
    data.extend([[base[x] + random.uniform(0., 1.)*scale[x]
                 for x in xrange(5)] for y in xrange(30)])
    colors.extend(['b'] * 30)

    parallel_coordinates(data, style=colors).show()

编辑 2：

这是绘制 Fisher 的 Iris 数据时上述代码的结果示例。它不如来自维基百科的参考图像那么好，但如果你只有 Matplotlib 并且你需要多维图，它是可以通过的。

此答案中平行坐标图的示例结果

原文由 Nathan 发布，翻译遵循 CC BY-SA 4.0 许可协议

阅读 616

#!/usr/bin/python import numpy as np import matplotlib.pyplot as plt import matplotlib.ticker as ticker #vectors to plot: 4D for this example y1=[1,2.3,8.0,2.5] y2=[1.5,1.7,2.2,2.9] x=[1,2,3,8] # spines fig,(ax,ax2,ax3) = plt.subplots(1, 3, sharey=False) # plot the same on all the subplots ax.plot(x,y1,'r-', x,y2,'b-') ax2.plot(x,y1,'r-', x,y2,'b-') ax3.plot(x,y1,'r-', x,y2,'b-') # now zoom in each of the subplots ax.set_xlim([ x[0],x[1]]) ax2.set_xlim([ x[1],x[2]]) ax3.set_xlim([ x[2],x[3]]) # set the x axis ticks for axx,xx in zip([ax,ax2,ax3],x[:-1]): axx.xaxis.set_major_locator(ticker.FixedLocator([xx])) ax3.xaxis.set_major_locator(ticker.FixedLocator([x[-2],x[-1]])) # the last one # EDIT: add the labels to the rightmost spine for tick in ax3.yaxis.get_major_ticks(): tick.label2On=True # stack the subplots together plt.subplots_adjust(wspace=0) plt.show()

import pandas import matplotlib.pyplot as plt from pandas.tools.plotting import parallel_coordinates data = pandas.read_csv(r'C:\Python27\Lib\site-packages\pandas\tests\data\iris.csv', sep=',') parallel_coordinates(data, 'Name') plt.show()

Matplotlib 中的平行坐标图

你尚未登录，登录后可以

请问： Python中是否有方式可以像前端的TSLint一样进行代码的自动风格格式检查？

为什么 pypi 的页面上的新版本在通过 pip 获取不到？

请问一下Python 可以进行强类型开发吗？

python中最好的单元测试是使用的什么呢？

duckdb 的 python sdk 读取 csv 的时候，如何指定列的字段类型？

Python类属性与实例属性自增行为差异？

可以打印全局命名空间：`globals()` 如何打印内置命名空间呢？

Stack Overflow 翻译