带R的甘特图


Answers:


109

现在有几种优雅的方法可以在R中生成甘特图。

使用坎德拉

library(candela)

data <- list(
    list(name='Do this', level=1, start=0, end=5),
    list(name='This part 1', level=2, start=0, end=3),
    list(name='This part 2', level=2, start=3, end=5),
    list(name='Then that', level=1, start=5, end=15),
    list(name='That part 1', level=2, start=5, end=10),
    list(name='That part 2', level=2, start=10, end=15))

candela('GanttChart',
    data=data, label='name',
    start='start', end='end', level='level',
    width=700, height=200)

在此处输入图片说明

使用DiagrammeR

library(DiagrammeR)

mermaid("
gantt
dateFormat  YYYY-MM-DD
title A Very Nice Gantt Diagram

section Basic Tasks
This is completed             :done,          first_1,    2014-01-06, 2014-01-08
This is active                :active,        first_2,    2014-01-09, 3d
Do this later                 :               first_3,    after first_2, 5d
Do this after that            :               first_4,    after first_3, 5d

section Important Things
Completed, critical task      :crit, done,    import_1,   2014-01-06,24h
Also done, also critical      :crit, done,    import_2,   after import_1, 2d
Doing this important task now :crit, active,  import_3,   after import_2, 3d
Next critical task            :crit,          import_4,   after import_3, 5d

section The Extras
First extras                  :active,        extras_1,   after import_4,  3d
Second helping                :               extras_2,   after extras_1, 20h
More of the extras            :               extras_3,   after extras_1, 48h
")

在此处输入图片说明

GitHub上找到此示例以及更多内容DiagrammeR


如果您的数据存储在中data.frame,则可以创建要传递给的字符串mermaid()通过将其转换为正确的格式。

考虑以下:

df <- data.frame(task = c("task1", "task2", "task3"),
                 status = c("done", "active", "crit"),
                 pos = c("first_1", "first_2", "first_3"),
                 start = c("2014-01-06", "2014-01-09", "after first_2"),
                 end = c("2014-01-08", "3d", "5d"))

#   task status     pos         start        end
#1 task1   done first_1    2014-01-06 2014-01-08
#2 task2 active first_2    2014-01-09         3d
#3 task3   crit first_3 after first_2         5d

使用dplyrtidyr(或您喜欢的任何数据争用资源):

library(tidyr)
library(dplyr)

mermaid(
  paste0(
    # mermaid "header", each component separated with "\n" (line break)
    "gantt", "\n", 
    "dateFormat  YYYY-MM-DD", "\n", 
    "title A Very Nice Gantt Diagram", "\n",
    # unite the first two columns (task & status) and separate them with ":"
    # then, unite the other columns and separate them with ","
    # this will create the required mermaid "body"
    paste(df %>%
            unite(i, task, status, sep = ":") %>%
            unite(j, i, pos, start, end, sep = ",") %>%
            .$j, 
          collapse = "\n"
    ), "\n"
  )
)

正如@GeorgeDontas在评论中提到的那样,有一点小技巧可以允许将x轴的标签更改为日期,而不是'w.01,w.02'。

假设您将上述美人鱼图保存在中m,请执行以下操作:

m$x$config = list(ganttConfig = list(
  axisFormatter = list(list(
    "%b %d, %Y" 
    ,htmlwidgets::JS(
      'function(d){ return d.getDay() == 1 }' 
    )
  ))
))

这使:

在此处输入图片说明


使用timevis

timevis GitHub

timevis使您可以在R中创建丰富且完全交互式的时间线可视化。时间线可以包含在Shiny应用程序和R降价文档中,也可以从R控制台和RStudio Viewer查看。

library(timevis)

data <- data.frame(
  id      = 1:4,
  content = c("Item one"  , "Item two"  ,"Ranged item", "Item four"),
  start   = c("2016-01-10", "2016-01-11", "2016-01-20", "2016-02-14 15:00:00"),
  end     = c(NA          ,           NA, "2016-02-04", NA)
)

timevis(data)

这使:

在此处输入图片说明


剧情地使用

我偶然发现了这篇文章,提供了另一种使用方法plotly。这是一个例子:

library(plotly)

df <- read.csv("https://cdn.rawgit.com/plotly/datasets/master/GanttChart-updated.csv", 
               stringsAsFactors = F)

df$Start  <- as.Date(df$Start, format = "%m/%d/%Y")
client    <- "Sample Client"
cols      <- RColorBrewer::brewer.pal(length(unique(df$Resource)), name = "Set3")
df$color  <- factor(df$Resource, labels = cols)

p <- plot_ly()
for(i in 1:(nrow(df) - 1)){
  p <- add_trace(p,
                 x = c(df$Start[i], df$Start[i] + df$Duration[i]), 
                 y = c(i, i), 
                 mode = "lines",
                 line = list(color = df$color[i], width = 20),
                 showlegend = F,
                 hoverinfo = "text",
                 text = paste("Task: ", df$Task[i], "<br>",
                              "Duration: ", df$Duration[i], "days<br>",
                              "Resource: ", df$Resource[i]),
                 evaluate = T
  )
}

p

这使:

在此处输入图片说明

然后,您可以添加其他信息和注释,自定义字体和颜色等(有关详细信息,请参见博客文章)。


确实很好。但是,在我看来,使用存储在数据框中的数据自动创建传递给美人鱼的字符串相当困难。
乔治·唐塔斯

是否可以将日期显示为x轴标签,而不是“ w.01”,“ w.02”等?
乔治·唐塔斯

2
覆盖甘特图以允许自定义日期轴,而不是0-52周的比例:github.com/rich-iannone/DiagrammeR/issues/77
George Dontas

该代码在Rstudio中使用DiagramteR和Mermaid可以完美地工作,但是在PowerBI中使用它时出现错误错误消息:未创建任何图像。R代码未创建任何视觉效果。确保您的R脚本导致绘制到R默认设备的图。任何想法有人谢谢Peddie
PeddiePooh '16

1
timevisin中的解决方案R看起来很简单。:-)
苏曼·卡纳尔

29

一个简单的ggplot2甘特图。

首先,我们创建一些数据。

library(reshape2)
library(ggplot2)

tasks <- c("Review literature", "Mung data", "Stats analysis", "Write Report")
dfr <- data.frame(
  name        = factor(tasks, levels = tasks),
  start.date  = as.Date(c("2010-08-24", "2010-10-01", "2010-11-01", "2011-02-14")),
  end.date    = as.Date(c("2010-10-31", "2010-12-14", "2011-02-28", "2011-04-30")),
  is.critical = c(TRUE, FALSE, FALSE, TRUE)
)
mdfr <- melt(dfr, measure.vars = c("start.date", "end.date"))

现在绘制情节。

ggplot(mdfr, aes(value, name, colour = is.critical)) + 
  geom_line(size = 6) +
  xlab(NULL) + 
  ylab(NULL)

我只能创建两次数据:-)
George Dontas 2010年

@ gd047:这需要两只手的facepalm。愚蠢现在已解决。
Richie Cotton

1
这非常好,但是我最想找的是一种显示每个任务多个条形的方法(如您在我给出的示例中所看到的),例如一个用于基线,一个用于实际任务持续时间。有没有办法做这样的事情?
乔治·唐塔斯

12

考虑使用该软件包projmanr(2017年8月23日在CRAN上发布的0.1.0版)。

library(projmanr)

# Use raw example data
(data <- taskdata1)

taskdata1

  id name duration pred
1  1   T1        3     
2  2   T2        4    1
3  3   T3        2    1
4  4   T4        5    2
5  5   T5        1    3
6  6   T6        2    3
7  7   T7        4 4,5 
8  8   T8        3  6,7

现在开始准备甘特:

# Create a gantt chart using the raw data
gantt(data)

在此处输入图片说明

# Create a second gantt chart using the processed data
res <- critical_path(data)
gantt(res)

在此处输入图片说明

# Use raw example data
data <- taskdata1
# Create a network diagram chart using the raw data
network_diagram(data)

在此处输入图片说明

# Create a second network diagram using the processed data
res <- critical_path(data)
network_diagram(res)

在此处输入图片说明


7

试试这个:

install.packages("plotrix")
library(plotrix)
?gantt.chart


5

您可以使用GoogleVis软件包

datTL <- data.frame(Position=c(rep("President", 3), rep("Vice", 3)),
                    Name=c("Washington", "Adams", "Jefferson",
                           "Adams", "Jefferson", "Burr"),
                    start=as.Date(x=rep(c("1789-03-29", "1797-02-03", 
                                          "1801-02-03"),2)),
                    end=as.Date(x=rep(c("1797-02-03", "1801-02-03", 
                                        "1809-02-03"),2)))

Timeline <- gvisTimeline(data=datTL, 
                         rowlabel="Name",
                         barlabel="Position",
                         start="start", 
                         end="end",
                         options=list(timeline="{groupByRowLabel:false}",
                                      backgroundColor='#ffd', 
                                      height=350,
                                      colors="['#cbb69d', '#603913', '#c69c6e']"))
plot(Timeline)

在此处输入图片说明

来源:https//cran.r-project.org/web/packages/googleVis/vignettes/googleVis_examples.html


4

我使用并修改了Richie的上述示例,就像一个魅力一样。修改后的版本,以显示他的模型如何转换为摄取CSV数据,而不是手动提供的文本项。

注意:Richie的答案丢失,表明上面/下面的代码需要2个软件包(reshapeggplot2)。

rawschedule <- read.csv("sample.csv", header = TRUE) #modify the "sample.csv" to be the name of your file target. - Make sure you have headers of: Task, Start, Finish, Critical OR modify the below to reflect column count.
tasks <- c(t(rawschedule["Task"]))
dfr <- data.frame(
name        = factor(tasks, levels = tasks),
start.date  = c(rawschedule["Start"]),
end.date    = c(rawschedule["Finish"]),
is.critical = c(rawschedule["Critical"]))
mdfr <- melt(dfr, measure.vars = c("Start", "Finish"))


#generates the plot
ggplot(mdfr, aes(as.Date(value, "%m/%d/%Y"), name, colour = Critical)) + 
geom_line(size = 6) +
xlab("Duration") + ylab("Tasks") +
theme_bw()


3

对我而言,Gvistimeline是执行此操作的最佳工具,但所需的在线连接对我来说没有用。因此,我创建了一个名为的包vistime,它使用plotly(类似于@StevenBeaupré的答案),因此您可以放大等:

https://github.com/shosaco/vistime

vistime:使用plotly.js创建交互式时间表或甘特图。这些图表可以包含在Shiny应用程序中,并可以通过plotly_build()进行操作。

install.packages("vistime")    
library("vistime")  

dat <- data.frame(Position=c(rep("President", 3), rep("Vice", 3)),
              Name = c("Washington", "Adams", "Jefferson", "Adams", "Jefferson", "Burr"),
              start = rep(c("1789-03-29", "1797-02-03", "1801-02-03"), 2),
              end = rep(c("1797-02-03", "1801-02-03", "1809-02-03"), 2),
              color = c('#cbb69d', '#603913', '#c69c6e'),
              fontcolor = rep("white", 3))

vistime(dat, events="Position", groups="Name", title="Presidents of the USA")

在此处输入图片说明


3

我知道,这是一个非常老的问题,但也许值得在这里留下-对我对这个问题的答案不满意-几个月前,我制作了一个用于制作基于ggplot2的甘特图的基本程序包:ganttrify(该软件包的自述文件中有更多详细信息) 。

输出示例: 在此处输入图片说明


2

PlotPrjNetworks为项目管理提供了有用的网络工具。

library(PlotPrjNetworks)
project1=data.frame(
task=c("Market Research","Concept Development","Viability Test",
"Preliminary Design","Process Design","Prototyping","Market Testing","Final Design",
"Launching"),
start=c("2015-07-05","2015-07-05","2015-08-05","2015-10-05","2015-10-05","2016-02-18",
"2016-03-18","2016-05-18","2016-07-18"),
end=c("2015-08-05","2015-08-05","2015-10-05","2016-01-05","2016-02-18","2016-03-18",
"2016-05-18","2016-07-18","2016-09-18"))
project2=data.frame(
from=c(1,2,3,4,5,6,7,8),
to=c(2,3,4,5,6,7,8,9),
type=c("SS","FS","FS","SS","FS","FS","FS","FS"),
delay=c(7,7,7,8,10,10,10,10))
GanttChart(project1,project2)

在此处输入图片说明


感谢@George对于这一点,是有办法使用任务的名字从和?我应该张贴它作为一个新的问题?根据文档github.com/cran/PlotPrjNetworks/blob/master/R/PlotPrjNetworks.R从并且to应该是整数,对此有没有快速的解决方法?
Mohsen Sichani

1

我想用每个任务的几个栏来改进ggplot-Answer。

首先生成一些数据(dfrP是其他答案的data.frame,dfrR是具有实现日期的其他data.frame,而mdfr是适合以下ggplot()声明的合并):

library(reshape2)
tasks <- c("Review literature", "Mung data", "Stats analysis", "Write Report")
dfrP <- data.frame(
  name        = factor(tasks, levels = tasks),
  start.date  = as.Date(c("2010-08-24", "2010-10-01", "2010-11-01", "2011-02-14")),
  end.date    = as.Date(c("2010-10-31", "2010-12-14", "2011-02-28", "2011-04-30")),
  is.critical = c(TRUE, FALSE, FALSE, TRUE)
)
dfrR <- data.frame(
  name        = factor(tasks, levels = tasks),
  start.date  = as.Date(c("2010-08-22", "2010-10-10", "2010-11-01", NA)),
  end.date    = as.Date(c("2010-11-03", "2010-12-22", "2011-02-24", NA)),
  is.critical = c(TRUE, FALSE, FALSE,TRUE)
)
mdfr <- merge(data.frame(type="Plan", melt(dfrP, measure.vars = c("start.date", "end.date"))),
  data.frame(type="Real", melt(dfrR, measure.vars = c("start.date", "end.date"))), all=T)

现在,使用构面作为任务名称绘制此数据:

library(ggplot2)
ggplot(mdfr, aes(x=value, y=type, color=is.critical))+
  geom_line(size=6)+
  facet_grid(name ~ .) +
  scale_y_discrete(limits=c("Real", "Plan")) +
  xlab(NULL) + ylab(NULL)

如果没有is.critical-information,您还可以将Plan / Real用作颜色(我希望这样做),但是我想使用其他答案的data.frame使其具有更好的可比性。


1

在ggplot中发现geom_segment很棒。根据以前的解决方案,可以使用数据,但无需融合。

library(ggplot2)

tasks <- c("Review literature", "Mung data", "Stats analysis", "Write Report")
dfr <- data.frame(
  name        = factor(tasks, levels = tasks),
  start.date  = as.Date(c("2010-08-24", "2010-10-01", "2010-11-01", "2011-02-14")),
  end.date    = as.Date(c("2010-10-31", "2010-12-14", "2011-02-28", "2011-04-30")),
  is.critical = c(TRUE, FALSE, FALSE, TRUE)
)

ggplot(dfr, aes(x =start.date, xend= end.date, y=name, yend = name, color=is.critical)) +
  geom_segment(size = 6) +
  xlab(NULL) + ylab(NULL)

甘特图


By using our site, you acknowledge that you have read and understand our Cookie Policy and Privacy Policy.
Licensed under cc by-sa 3.0 with attribution required.