zhengchuanpan / gman Goto Github PK

GMAN: A Graph Multi-Attention Network for Traffic Prediction (GMAN, https://fanxlxmu.github.io/publication/aaai2020/) was accepted by AAAI-2020.

License: Apache License 2.0

Python 100.00%

gman traffic-prediction aaai2020

gman's Introduction

GMAN: A Graph Multi-Attention Network for Traffic Prediction (AAAI-2020)

This is the implementation of Graph Multi-Attention Network in the following paper:
Chuanpan Zheng, Xiaoliang Fan*, Cheng Wang, and Jianzhong Qi. "GMAN: A Graph Multi-Attention Network for Traffic Prediction", Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence (AAAI-20), 2020, 34(01): 1234-1241.

Data

The datasets are available at Google Drive or Baidu Yun, provided by DCRNN, and should be put into the corresponding data/ folder.

Requirements

Python 3.7.10, tensorflow 1.14.0, numpy 1.16.4, pandas 0.24.2

Results

Third-party re-implementations

A Pytorch implementaion by VincLee8188 is available at GMAN-Pytorch.

Citation

If you find this repository useful in your research, please cite the following paper:

@inproceedings{ GMAN-AAAI2020,
  author     = "Chuanpan Zheng and Xiaoliang Fan and Cheng Wang and Jianzhong Qi"
  title      = "GMAN: A Graph Multi-Attention Network for Traffic Prediction",
  booktitle  = "AAAI",
  pages      = "1234--1241",
  year       = "2020"
}

gman's People

Contributors

Stargazers

Watchers

Forkers

yaoxy2010 wfccross mcdragon ustcfd tijsmaas reachcool ammieqi gscr10 theonll jdc08161063 relevation-143 jiaodaxiaozi jialewang97 ccfbupt coder-lhj ltthacker domilay dongyann ydsun wumingyao shuqincao ccdllyy suzhu1988 xwu-ut bczhu mingyangzhang zqzhen091 xiaolinhan ericaguoqiuyu cxlz vivi-der krisandchris josephstalin117 supershujiale xhfei1224 aouedions11 xrosliang woniuhu zhaoyuanm draruncs vananle shuowang-ai captainsparrow11 mc-o zay113 798283635 vicky-51 csq121605366 hatemhunish dawnywu 2020-ai-zx zhaozhongningouc thubiter wmx1129 liweiowl xingzai0617 udeshmg edwardzx nizhengguo818 dhs4654 hawksilent raki-j fys1997 hujilin1229 messham87 iewobx sinofeng yueming-github safarzadeh-reza huihuimhuihui semink songshipeng liujiachang fanxlxmu alexeiga seyun52 w169376 alex-ht lichunyan3 kage1999 duykhuongnguyen cassia151 yahya-alezzi sadia42 py-rex superyfan zdqf maxrubby potassiumwings soroushazi d-stiv max-chen2020 pluto0418 luna-98 songrui643 liesgame pettepiero allem40306 pangmomo8 mieuxmin

gman's Issues

求指点：如何解决AttributeError: 'numpy.bytes_' object has no attribute 'delta'

utils.py中 timeofday = (Time.hour * 3600 + Time.minute * 60 + Time.second) // Time.freq.delta.total_seconds() 这一句报错

Traceback (most recent call last):
File "/Users/crowd/PycharmProjects/GMAN/METR/train.py", line 55, in
mean, std) = utils.loadData(args)
File "/Users/crowd/PycharmProjects/GMAN/METR/utils.py", line 73, in loadData
timeofday = (Time.hour * 3600 + Time.minute * 60 + Time.second) // Time.freq.delta.total_seconds()
AttributeError: 'numpy.bytes_' object has no attribute 'delta'

我没有修改过作者源码请问这个问题大家是怎么解决的

自己电脑上生不成data/GMAN(PeMS）文件，求

Validation error nan

I've been trying to run the MATR example, and from the first iteration I'm receving validation error "nan", as a consequence the model stops learning after 10 iterations. Is there are problem with the code?

ZeroDivisionError: float division by zero

在生成SE时，preprocess_transition_probs()的normalized_probs = [float(u_prob)/norm_const for u_prob in unnormalized_probs]出现问题：ZeroDivisionError: float division by zero

您好，请问这两个文件在哪里啊

您好，请问这两个文件在哪里啊
'data/PeMS.h5' ， 'data/GMAN(PeMS)'

请问下tensorflow版本是1.x吗？

The code snippet to create SE file

Could you please share the code snippet to create the SE file?

你能把剩下的代码传上来吗？想学习学习，万分感谢！！

AttributeError: 'numpy.bytes_' object has no attribute 'delta'

GMAN-master/PeMS/utils.py", line 74, in loadData
// Time.freq.delta.total_seconds()
AttributeError: 'numpy.bytes_' object has no attribute 'delta'

作者您好，请问如何解决呢，我的环境：tf-1.14-py3

求生成Adj.tx文件的代码

作者可以上传生成Adj.tx文件的代码吗

有谁复现出了PeMS上的结果吗？

我在TensorFlow2上兼容模式跑的，还把patience调成了20，测试集平均MAE为1.66，与报告的水平有差距

testing time: 36.1s
MAE RMSE MAPE
train 1.32 2.87 2.78%
val 1.59 3.72 3.61%
test 1.66 3.82 3.74%
performance in each prediction step
step: 01 0.99 1.88 1.96%
step: 02 1.21 2.47 2.50%
step: 03 1.38 2.97 2.93%
step: 04 1.52 3.35 3.30%
step: 05 1.62 3.65 3.61%
step: 06 1.71 3.90 3.86%
step: 07 1.78 4.09 4.08%
step: 08 1.85 4.25 4.26%
step: 09 1.90 4.38 4.42%
step: 10 1.95 4.49 4.55%
step: 11 1.99 4.59 4.67%
step: 12 2.03 4.67 4.78%
average: 1.66 3.72 3.74%
total time: 3.4min

作者可以提供生成Adj.txt的代码吗？

Reproducing the results

Hello,
Thank you very much for sharing your code with the community.

After many attempts with different hyperparameters we have not been able to reproduce any results from the paper (or even get close). Was anyone been able to reproduce the results or do the authors have any pointers in how to achieve this?
Thank you.

Time Features - Ordinality

Doesn't the way time features were encoded introduce ordinality?

For example, if Sunday is encoded as 1 and Thursday is encoded as 5 - doesn't that let the model think Thursday is more important than Sunday.

Is this understanding correct? If yes, could you help to understand why that decision was taken during model design?

1 pytorch 版本

想问下有没有pytorch实现版本

The length of the PEMS data

In the paper, it mentioned that "traffic speed prediction on the PeMS dataset (Li et al. 2018b)), which contains 6 months of data recorded by 325 traffic sensors ranging from January 1st, 2017 to June 30th, 2017 in the Bay Area." But in the referred paper, it said the data was collected from Jan 1st 2017 to May 31th 2017. Can you provide the 6 month data instead?

The data shape is different from DCRNN, GraphWavnet.

The previous works data shape is:
train shape X(36465, 12, 325, 2) Y(36465, 12, 325, 2)
val shape X(5209, 12, 325, 2) Y(5209, 12, 325, 2)
test shape X(10419, 12, 325, 2) Y(10419, 12, 325, 2)
Your is:
trainX: (36458, 12, 325) trainY: (36458, 12, 325)
valX: (5189, 12, 325) valY: (5189, 12, 325)
testX: (10400, 12, 325) testY: (10400, 12, 325)
I'm confused about it.

数据

对这个工作非常感兴趣，请问能否提供下完整的数据，包括SE？

你好，请问这个文件是在哪呀？

Adj_file = '../data/Adj.txt' SE_file = '../data/SE.txt'

numpy版本

numpy 1.18.4 pypi_0 pypi

请问文章中的计算时间是在什么设备上、怎样的超参下得到的？

按照文中的时间，应该与GraphWavenet比较接近，然而我在8700+1080ti的配置下跑需要约1400s/epoch，GraphWaveNet只需要240s/epoch（Batch Size=16）

请问下loadData()里面 Time = df.index报错是为什么啊？

我使用的DCRNN下载下来的METR.h5文件，使用pandas对其进行读取，生成Time Embedding时，代码中TIME = df.index报错，如下：
ssh://[email protected]:22/home/tank/anaconda3/envs/lpb/bin/python3.6 -u /home/tank/lxl/GMAN/GMAN-master/METR/analyzeData.py
Traceback (most recent call last):
File "/home/tank/lxl/GMAN/GMAN-master/METR/analyzeData.py", line 37, in
print(df.index)
File "/home/tank/anaconda3/envs/lpb/lib/python3.6/site-packages/pandas/core/indexes/base.py", line 852, in repr
attrs = self._format_attrs()
File "/home/tank/anaconda3/envs/lpb/lib/python3.6/site-packages/pandas/core/indexes/datetimelike.py", line 381, in _format_attrs
freq = self.freqstr
File "/home/tank/anaconda3/envs/lpb/lib/python3.6/site-packages/pandas/core/indexes/extension.py", line 54, in fget
result = getattr(self.data, name)
File "/home/tank/anaconda3/envs/lpb/lib/python3.6/site-packages/pandas/core/arrays/datetimelike.py", line 1104, in freqstr
return self.freq.freqstr
AttributeError: 'numpy.bytes' object has no attribute 'freqstr'

Process finished with exit code 1

请问是我的数据集不对吗？还是我的Pandas版本(1.1.4)不对啊,为什么无法获取到这个index呢？万分感谢

你好，请问PeMS.h5这个文件在哪啊？

运行时报错：
File data/PeMS.h5 does not exist

请问下这个训练为什么这么慢，每个batch训练时占用的显存特别小

你好，请问下我跑这个代码时为什么训练速度特别慢，感觉是一个batch一个batch跑的，显存只占用了306MB，没有并行跑起来，跑一个epoch可能就得跑好几个小时，请问下这是为什么？您训练时遇到这样的问题了吗？十分感谢

HELP，NotImplementedError: reshaping is not supported for Index objects

Traceback (most recent call last):
File "D:/GitHub源代码/GMAN-master/GMAN-master/METR/train.py", line 55, in
mean, std) = utils.loadData(args)
File "D:\GitHub源代码\GMAN-master\GMAN-master\METR\utils.py", line 72, in loadData
dayofweek = np.reshape(Time.weekday, newshape = (-1, 1))
File "D:\Software\Anaconda3\envs\tensorflow\lib\site-packages\numpy\core\fromnumeric.py", line 232, in reshape
return _wrapfunc(a, 'reshape', newshape, order=order)
File "D:\Software\Anaconda3\envs\tensorflow\lib\site-packages\numpy\core\fromnumeric.py", line 57, in _wrapfunc
return getattr(obj, method)(*args, **kwds)
File "D:\Software\Anaconda3\envs\tensorflow\lib\site-packages\pandas\core\indexes\base.py", line 1149, in reshape
raise NotImplementedError("reshaping is not supported "
NotImplementedError: reshaping is not supported for Index objects

about model performance

great work!
I have a question about the computation of attention coefficient. Did you ever do experience to compare the model performance with STE block and without STE block？

Why this model is CPU intensive ?

Masking in Loss function

I have seen various masking applications in the code yet it wasn't mentioned in paper. Especially in the mae_loss(), masking is applied. What is the purpose of this application?

group spatial attention

论文提到采用了把节点分组的方式，理乱上减少了计算的复杂度，请问在代码中计算空间注意力这一块儿，哪里体现了分组计算呢？

为什么训练结果不收敛

一些关于GMAN的问题

model.py的line142这里x和y的shape不应该一致吗？还有请问楼主tf是啥版本的。感谢

Inconsistencies with the paper

Hello, firstly I would like to thank you for sharing the code. I was looking at the Spatial Attention component (line 56 in model.py) and I've noticed some differences from what is presented in the paper:

I can not find where you're splitting the vertices into G partitions (and doing the intra/inter group attention). As far as I can understand the spatialAttention function does only the intra-group spatial attention without any restrictions.
After you're computing eq 7 (line 86 in model.py) the output is projected again using 2 FC layers, which in the paper are not described. What is the reason for it?
Looking at eq 7 the input of function f3 is the previous hidden representation where in you're code you're also using the static graph embeddings (e_{v,tj})

Looking forward for your reply.