Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

preprocess_step1 takes too long time #12

Open
qiaopr opened this issue Dec 26, 2021 · 6 comments
Open

preprocess_step1 takes too long time #12

qiaopr opened this issue Dec 26, 2021 · 6 comments

Comments

@qiaopr
Copy link

qiaopr commented Dec 26, 2021

Hi,How fast are you preprocess_step1.py files running?Mine has take several hours to process only 150 piece of data. It's unbeliviable!

@RichardHGL
Copy link
Owner

Did you modify any parameters? Such as graph size. I think it won't be so slow.
When I processed the data, I just keep it run in the backend and get the data the next day.
I guess any dataset can be processed in one day.

@qiaopr
Copy link
Author

qiaopr commented Dec 26, 2021

I exactly followed the preprocess >Freebase>README.md.I didn't modify any parameters. The data I'm processed is CWQ.

@LLLiaomeng
Copy link

I also faced this problem.

@JasonCen-sweetdreams
Copy link

Also ran into this problem. It seems that the preprocess_step1 takes a lot of time to output only a few, while my CPU, GPU and memory usage are in a healthy state. sad :(

@RichardHGL
Copy link
Owner

Okay, I'll check this problem next month. You can also try to look into ppr_util.py, I think the majority of time is spent on the calculation of ppr for every graph.

@RichardHGL
Copy link
Owner

The preprocessed datasets can be found in Readme of this repo. If you find any possible ways to improve the efficiency of preprocessing, please kindly let me know.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

4 participants