How do I output lists as a table in Jupyter Notebook?


80

I know that I have seen an example of this somewhere before, but for the life of me I cannot find it by googling.

I have some rows of data:

data = [[1,2,3],
        [4,5,6],
        [7,8,9],
        ]

I want to output this data as a table, e.g.

+---+---+---+
| 1 | 2 | 3 |
+---+---+---+
| 4 | 5 | 6 |
+---+---+---+
| 7 | 8 | 9 |
+---+---+---+

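Obviously I could use a library like prettytable, or download pandas or something similar, but I am not interested in doing that.

I just want to output my rows as a table in a Jupyter notebook cell. How do I do this?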


Do you want to use only the print function? Do the numbers have a fixed width (1 digit, 3 digits)?
tglaria '16

I wrote a Pythonic abstraction here. Code without the hassle. :) jupyter_table_class.py
not_python

Answers:


85

I just discovered that tabulate has an HTML option and is rather simple to use.
This is quite similar to Wayne Werner's answer:

from IPython.display import HTML, display
import tabulate
table = [["Sun",696000,1989100000],
         ["Earth",6371,5973.6],
         ["Moon",1737,73.5],
         ["Mars",3390,641.85]]
display(HTML(tabulate.tabulate(table, tablefmt='html')))

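I am still looking for something simple to use for more complex table layouts, e.g. with LaTeX syntax and formatting to merge cells and do variable substitution in a notebook:
Allow references to Python variables in Markdown cells #2958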


Aligning strings did not work for me! It does not align strings to the left!
Mojtaba Khodadadi '18

@MojtabaKhodadadi I haven't checked it closely, but it looks like you can set default column arguments for strings and numbers here.
ruffsl

Nowadays, tabulate.tabulate(table, tablefmt='html') on its own seems to work (tried with Jupyter 6.0.3 and JupyterLab 2.0.1). Nice!
zonksoft '20

82

There is a nice trick: wrap the data in a pandas DataFrame.

import pandas as pd
data = [[1, 2], [3, 4]]
pd.DataFrame(data, columns=["Foo", "Bar"])

It displays the data like this:

  | Foo | Bar |
0 | 1   | 2   |
1 | 3   | 4   |

14
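As someone who absolutely loves Python for everything but data science, it makes me really sad to see nine-line, quadruple-import, triple-function-call answers get upvoted when the best-looking answer is literally "a pandas DataFrame." My heuristic is: "If it's long, it's probably wrong!"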
one_observation

1
You can even display the DataFrame as HTML using to_html(); see stackoverflow.com/a/29665452/2866660
wvengen

Thanks! Yes, the accepted answer should definitely be changed to this one.
Helen

59

I finally re-found the Jupyter/IPython documentation that I was looking for.

I needed this:

from IPython.display import HTML, display

data = [[1,2,3],
        [4,5,6],
        [7,8,9],
        ]

display(HTML(
   '<table><tr>{}</tr></table>'.format(
       '</tr><tr>'.join(
           '<td>{}</td>'.format('</td><td>'.join(str(_) for _ in row)) for row in data)
       )
))

(I may have slightly mangled the comprehensions, but display(HTML('some html here')) is what we needed.)
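
The nested comprehension above is compact but hard to read. Below is a more explicit sketch of the same idea. The helper name `rows_to_html` is hypothetical, and escaping cell values with `html.escape` is an addition that is not part of the original answer.

```python
from html import escape

def rows_to_html(rows):
    """Build an HTML <table> string from an iterable of rows (hypothetical helper)."""
    parts = ["<table>"]
    for row in rows:
        # One <td> per value; escape() keeps characters such as < and & from breaking the markup
        cells = "".join("<td>{}</td>".format(escape(str(value))) for value in row)
        parts.append("<tr>{}</tr>".format(cells))
    parts.append("</table>")
    return "".join(parts)

data = [[1, 2, 3],
        [4, 5, 6],
        [7, 8, 9]]
table_html = rows_to_html(data)
```

In a notebook cell, `display(HTML(table_html))` with the imports shown above should render the table.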


13

tabletext fits this well

import tabletext

data = [[1,2,30],
        [4,23125,6],
        [7,8,999],
        ]

print(tabletext.to_text(data))

Result:

┌───┬───────┬─────┐
│ 1 │     2 │  30 │
├───┼───────┼─────┤
│ 4 │ 23125 │   6 │
├───┼───────┼─────┤
│ 7 │     8 │ 999 │
└───┴───────┴─────┘

4

If you don't mind using a bit of HTML, something like this should work.

from IPython.display import HTML, display

def display_table(data):
    html = "<table>"
    for row in data:
        html += "<tr>"
        for field in row:
            html += "<td><h4>%s</h4></td>"%(field)
        html += "</tr>"
    html += "</table>"
    display(HTML(html))

Then use it like this:

data = [[1,2,3],[4,5,6],[7,8,9]]
display_table(data)
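
An alternative to calling `display` explicitly is to give an object a `_repr_html_` method, which Jupyter uses when the object is the last expression in a cell. The following is a minimal sketch under that assumption. The class name `HtmlTable` and its optional `header` argument are hypothetical and not part of the answer above.

```python
from html import escape

class HtmlTable:
    """Wrap rows so that Jupyter renders them as an HTML table (hypothetical class)."""

    def __init__(self, rows, header=None):
        self.rows = [list(row) for row in rows]
        self.header = list(header) if header is not None else None

    def _repr_html_(self):
        # Jupyter looks for this method when showing the value of a cell's last expression
        html = ["<table>"]
        if self.header is not None:
            html.append("<tr>" + "".join(
                "<th>{}</th>".format(escape(str(h))) for h in self.header) + "</tr>")
        for row in self.rows:
            html.append("<tr>" + "".join(
                "<td>{}</td>".format(escape(str(v))) for v in row) + "</tr>")
        html.append("</table>")
        return "".join(html)

table = HtmlTable([[1, 2, 3], [4, 5, 6], [7, 8, 9]], header=["a", "b", "c"])
```

Ending a cell with `table` should render it. Outside a notebook, `table._repr_html_()` returns the markup as a plain string.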



2

You can try the following function:

def tableIt(data):
    for lin in data:
        print("+---"*len(lin)+"+")
        for inlin in lin:
            print("|",str(inlin),"", end="")
        print("|")
    print("+---"*len(lin)+"+")

data = [[1,2,3,2,3],[1,2,3,2,3],[1,2,3,2,3],[1,2,3,2,3]]

tableIt(data)

2

OK, so this was a bit harder than I thought:

def print_matrix(list_of_list):
    number_width = len(str(max([max(i) for i in list_of_list])))
    cols = max(map(len, list_of_list))
    output = '+'+('-'*(number_width+2)+'+')*cols + '\n'
    for row in list_of_list:
        for column in row:
            output += '|' + ' {:^{width}d} '.format(column, width = number_width)
        output+='|\n+'+('-'*(number_width+2)+'+')*cols + '\n'
    return output

This should work for a variable number of rows, columns, and digits (for numbers):

data = [[1,2,30],
        [4,23125,6],
        [7,8,999],
        ]
print(print_matrix(data))
>>>>+-------+-------+-------+
    |   1   |   2   |  30   |
    +-------+-------+-------+
    |   4   | 23125 |   6   |
    +-------+-------+-------+
    |   7   |   8   |  999  |
    +-------+-------+-------+
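
`print_matrix` above formats cells with the `d` presentation type, so it only accepts integers, and it uses a single width for every column. A possible generalization is sketched below. The name `format_grid` is hypothetical, cells are converted with `str()`, and each column is sized independently. It assumes at least one row is passed.

```python
def format_grid(rows):
    """Return a bordered text table; cells are converted with str() (hypothetical helper)."""
    text = [[str(cell) for cell in row] for row in rows]
    ncols = max(len(row) for row in text)
    # Pad short rows with empty cells so every row has the same number of columns
    text = [row + [""] * (ncols - len(row)) for row in text]
    # Each column is as wide as its widest cell
    widths = [max(len(row[i]) for row in text) for i in range(ncols)]
    border = "+" + "+".join("-" * (w + 2) for w in widths) + "+"
    lines = [border]
    for row in text:
        # Right-align every cell within its column width
        cells = " | ".join(cell.rjust(w) for cell, w in zip(row, widths))
        lines.append("| " + cells + " |")
        lines.append(border)
    return "\n".join(lines)

print(format_grid([[1, 2, 30], [4, 23125, "six"], [7.5, 8]]))
```

Right alignment is an arbitrary choice here; swapping `rjust` for `ljust` or `center` changes it.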

1

A set of general-purpose functions to render any Python data structure (nested dicts and lists) as HTML.

from IPython.display import HTML, display

def _render_list_html(l):
    o = []
    for e in l:
        o.append('<li>%s</li>' % _render_as_html(e))
    return '<ol>%s</ol>' % ''.join(o)

def _render_dict_html(d):
    o = []
    for k, v in d.items():
        o.append('<tr><td>%s</td><td>%s</td></tr>' % (str(k), _render_as_html(v)))
    return '<table>%s</table>' % ''.join(o)

def _render_as_html(e):
    # Return an HTML fragment only: this function is called recursively,
    # so wrapping the result in <html><body> here would nest those tags.
    if isinstance(e, list):
        return _render_list_html(e)
    elif isinstance(e, dict):
        return _render_dict_html(e)
    else:
        return str(e)

def render_as_html(e):
    display(HTML(_render_as_html(e)))
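
To see what kind of markup this recursive approach produces, here is a condensed, self-contained sketch of the same idea that returns the string instead of displaying it. The name `to_html_fragment` is hypothetical, and the `html.escape` calls are an addition that the functions above do not make.

```python
from html import escape

def to_html_fragment(obj):
    """Recursively render lists as <ol> and dicts as two-column tables (hypothetical helper)."""
    if isinstance(obj, list):
        items = "".join("<li>{}</li>".format(to_html_fragment(e)) for e in obj)
        return "<ol>{}</ol>".format(items)
    if isinstance(obj, dict):
        rows = "".join(
            "<tr><td>{}</td><td>{}</td></tr>".format(escape(str(k)), to_html_fragment(v))
            for k, v in obj.items())
        return "<table>{}</table>".format(rows)
    # Leaf value: escape it so that text such as "<3" is shown literally
    return escape(str(obj))

nested = {"a": 1, "b": [2, 3]}  # made-up sample data
fragment = to_html_fragment(nested)
```

Passing `fragment` to `display(HTML(...))` in a notebook should show a two-row table whose second row contains a numbered list.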

1

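I used to have the same problem. I could not find anything that helped, so I ended up writing the PrintTable class; the code is below, along with sample output. Usage is simple: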

ptobj = PrintTable(yourdata, column_captions, column_widths, text_aligns)
ptobj.print()

or in one line:

PrintTable(yourdata, column_captions, column_widths, text_aligns).print()

Output:

-------------------------------------------------------------------------------------------------------------
  Name                                     | Column 1   | Column 2   | Column 3   | Column 4   | Column 5    
-------------------------------------------------------------------------------------------------------------
  Very long name 0                         |          0 |          0 |          0 |          0 |          0  
  Very long name 1                         |          1 |          2 |          3 |          4 |          5  
  Very long name 2                         |          2 |          4 |          6 |          8 |         10  
  Very long name 3                         |          3 |          6 |          9 |         12 |         15  
  Very long name 4                         |          4 |          8 |         12 |         16 |         20  
  Very long name 5                         |          5 |         10 |         15 |         20 |         25  
  Very long name 6                         |          6 |         12 |         18 |         24 |         30  
  Very long name 7                         |          7 |         14 |         21 |         28 |         35  
  Very long name 8                         |          8 |         16 |         24 |         32 |         40  
  Very long name 9                         |          9 |         18 |         27 |         36 |         45  
  Very long name 10                        |         10 |         20 |         30 |         40 |         50  
  Very long name 11                        |         11 |         22 |         33 |         44 |         55  
  Very long name 12                        |         12 |         24 |         36 |         48 |         60  
  Very long name 13                        |         13 |         26 |         39 |         52 |         65  
  Very long name 14                        |         14 |         28 |         42 |         56 |         70  
  Very long name 15                        |         15 |         30 |         45 |         60 |         75  
  Very long name 16                        |         16 |         32 |         48 |         64 |         80  
  Very long name 17                        |         17 |         34 |         51 |         68 |         85  
  Very long name 18                        |         18 |         36 |         54 |         72 |         90  
  Very long name 19                        |         19 |         38 |         57 |         76 |         95  
-------------------------------------------------------------------------------------------------------------

Code for the PrintTable class:

# -*- coding: utf-8 -*-

# Class
class PrintTable:
    def __init__(self, values, captions, widths, aligns):
        if not all([len(values[0]) == len(x) for x in [captions, widths, aligns]]):
            raise Exception()
        self._tablewidth = sum(widths) + 3*(len(captions)-1) + 4
        self._values = values
        self._captions = captions
        self._widths = widths
        self._aligns = aligns

    def print(self):
        self._printTable()

    def _printTable(self):
        formattext_head = ""
        formattext_cell = ""
        for i,v in enumerate(self._widths):
            formattext_head += "{" + str(i) + ":<" + str(v) + "} | "
            formattext_cell += "{" + str(i) + ":" + self._aligns[i] + str(v) + "} | "
        formattext_head = formattext_head[:-3]
        formattext_head = "  " + formattext_head.strip() + "  "
        formattext_cell = formattext_cell[:-3]
        formattext_cell = "  " + formattext_cell.strip() + "  "

        print("-"*self._tablewidth)
        print(formattext_head.format(*self._captions))
        print("-"*self._tablewidth)
        for w in self._values:
            print(formattext_cell.format(*w))
        print("-"*self._tablewidth)

Demonstration

# Demonstration

headername = ["Column {}".format(x) for x in range(6)]
headername[0] = "Name"
data = [["Very long name {}".format(x), x, x*2, x*3, x*4, x*5] for x in range(20)] 

PrintTable(data,
           headername,
           [40, 10, 10, 10, 10, 10],  # a first column 40 wide matches the output shown above
           ["<",">",">",">",">",">"]).print()
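
PrintTable expects the column widths to be passed in explicitly. If you would rather derive them from the data, one option is sketched below. The helper `auto_widths` is hypothetical and not part of the class above; it simply takes the longest caption or cell in each column.

```python
def auto_widths(captions, values):
    """Return, per column, the length of the longest caption or cell (hypothetical helper)."""
    widths = [len(str(c)) for c in captions]
    for row in values:
        for i, cell in enumerate(row):
            # Compare against the string form, since that is what gets printed
            widths[i] = max(widths[i], len(str(cell)))
    return widths

captions = ["Name", "Column 1", "Column 2"]
values = [["Very long name {}".format(x), x, x * 2] for x in range(12)]
widths = auto_widths(captions, values)
```

The resulting list could then be passed as the `column_widths` argument of `PrintTable`.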

1

I recently used prettytable to render a nice ASCII table. It is similar to the postgres CLI output.

import pandas as pd
from prettytable import PrettyTable

data = [[1,2,3],[4,5,6],[7,8,9]]
df = pd.DataFrame(data, columns=['one', 'two', 'three'])

def generate_ascii_table(df):
    x = PrettyTable()
    x.field_names = df.columns.tolist()
    for row in df.values:
        x.add_row(row)
    print(x)
    return x

generate_ascii_table(df)

Output:

+-----+-----+-------+
| one | two | three |
+-----+-----+-------+
|  1  |  2  |   3   |
|  4  |  5  |   6   |
|  7  |  8  |   9   |
+-----+-----+-------+

0

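I want to output a table where each column has the smallest possible width, where columns are padded with white space (but this can be changed), where rows are separated by newlines (but this can be changed), and where each item is formatted using str (but...).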


def ftable(tbl, pad='  ', sep='\n', normalize=str):

    # normalize the content to the most useful data type
    strtbl = [[normalize(it) for it in row] for row in tbl] 

    # next, for each column we compute the maximum width needed
    w = [0 for _ in tbl[0]]
    for row in strtbl:
        for ncol, it in enumerate(row):
            w[ncol] = max(w[ncol], len(it))

    # a string is built iterating on the rows and the items of `strtbl`:
    #   items are  prepended white space to an uniform column width
    #   formatted items are `join`ed using `pad` (by default "  ")
    #   eventually we join the rows using newlines and return
    return sep.join(pad.join(' '*(wid-len(it))+it for wid, it in zip(w, row))
                                                      for row in strtbl)

The function signature, ftable(tbl, pad='  ', sep='\n', normalize=str), with its default arguments, is intended to provide maximum flexibility.

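You can customize

  • the column padding,
  • the row separator (e.g., pad='&', sep='\\\\\n' to produce the bulk of a LaTeX table),
  • the function used to normalize the input to a common string format. By default, for maximum generality, it is str, but if you know that all your data is floating point, lambda item: "%.4f"%item could be a reasonable choice, and so on.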
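
As an illustration of these options, the sketch below repeats `ftable` so that it runs on its own and calls it with a LaTeX-style separator and a fixed-precision formatter. The sample values are made up, and the spaces added around `&` and before `\\` are only for readability.

```python
def ftable(tbl, pad='  ', sep='\n', normalize=str):
    # Same logic as the function above, repeated so the example is self-contained
    strtbl = [[normalize(it) for it in row] for row in tbl]
    w = [0 for _ in tbl[0]]
    for row in strtbl:
        for ncol, it in enumerate(row):
            w[ncol] = max(w[ncol], len(it))
    return sep.join(pad.join(' ' * (wid - len(it)) + it for wid, it in zip(w, row))
                    for row in strtbl)

values = [[1.0, 2.5], [10.0, 3.25]]  # made-up sample data
# Columns joined by " & ", rows joined by " \\" plus a newline, four decimals per number
body = ftable(values, pad=' & ', sep=' \\\\\n', normalize=lambda item: "%.4f" % item)
print(body)
```

Because `sep` is only placed between rows, the last row has no trailing `\\`; the surrounding `tabular` environment still has to be written by hand.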

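Superficial testing:

I need some test data, possibly with columns of different widths, so that the algorithm has to be a little more sophisticated (but just a little bit;)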

In [1]: from random import randrange

In [2]: table = [[randrange(10**randrange(10)) for i in range(5)] for j in range(3)]

In [3]: table
Out[3]: 
[[974413992, 510, 0, 3114, 1],
 [863242961, 0, 94924, 782, 34],
 [1060993, 62, 26076, 75832, 833174]]

In [4]: print(ftable(table))
974413992  510      0   3114       1
863242961    0  94924    782      34
  1060993   62  26076  75832  833174

In [5]: print(ftable(table, pad='|'))
974413992|510|    0| 3114|     1
863242961|  0|94924|  782|    34
  1060993| 62|26076|75832|833174
Licensed under cc by-sa 3.0 with attribution required.