Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Where can I download the four datasets? #7

Open
zhangruiouc opened this issue May 31, 2024 · 5 comments
Open

Where can I download the four datasets? #7

zhangruiouc opened this issue May 31, 2024 · 5 comments

Comments

@zhangruiouc
Copy link

Hello, where can I download the four datasets including NYCBike1, I want to download the datasets from the relevant links of STResNet and STDN, but the relevant links are not working.

@Echo-Ji
Copy link
Owner

Echo-Ji commented Jun 23, 2024

Hi, sorry for the late reply.

You can download the raw datasets here: #9

@zhangruiouc
Copy link
Author

Thanks for your reply! I am trying to apply your idea of spatial-temporal heterogeneity of the ST-SSL model to the prediction of catch fish in the fishing area, and I would appreciate it if you could release the code for the processing raw dataset! I heard from your homepage that you have graduated your PhD this year, so I wish you a happy graduation and a bright future!

@zhangruiouc
Copy link
Author

Hello, I have checked the adj_mx.npz file in the dataset. This matrix is a real symmetric matrix. For example, if the values of matrix points at positions [2,3] and [3,2] are 1, does it mean that there is a traffic correlation between region 3 and region 2? If there is a correlation in traffic flow between Region 3 and Region 2, how was this correlation obtained?
image

@Echo-Ji
Copy link
Owner

Echo-Ji commented Nov 18, 2024

Actually you only see part of the data of the adjacency matrix. Usually, a region is connected to the 8 surrounding regions in a grid dataset, so each row of the adjacency matrix has 8 positions that are 1.0. This also implies our data processing method as follows.

The public datasets used in our paper are all constructed on a grid basis. To transform the grid-based public datasets into graph-based data, we construct a traffic flow graph, where a node in the graph is a region in the grid data and there exists an edge between two regions if they are adjacent. The item of the adjacency matrix $a_{mn} = 1$ if there is an edge between region $r_m$ and $r_n$, otherwise $a_{mn} = 0$.

@Echo-Ji
Copy link
Owner

Echo-Ji commented Nov 18, 2024

Thanks for your reply! I am trying to apply your idea of spatial-temporal heterogeneity of the ST-SSL model to the prediction of catch fish in the fishing area, and I would appreciate it if you could release the code for the processing raw dataset! I heard from your homepage that you have graduated your PhD this year, so I wish you a happy graduation and a bright future!

You can refer to our new branch for the code of data preprocessing!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants