-
Notifications
You must be signed in to change notification settings - Fork 7
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Support ScaleFactor 100 and 1000 #93
Comments
Hi, thanks for the feedback. I have been working on this extension on scalability already. It only can generate SF10 in v0.1.0. Currently I have extended it to sf30, working on the SF100 now. I am not quite familiar on Spark application optimization, but good news I am moving forward step by step. Hopefully it would support SF300 in the next few weeks. |
Collaboration is welcome if you are an Spark expert. :) |
Thanks for the reply @qishipengqsp , happy to hear that larger sfs are already work in progress. Unfortunately, I'm not an Spark expert, but have some more knowledge in Flink, if this might help. If you want, I can have a look. Is this the branch you are currently working on: https://github.com/ldbc/ldbc_finbench_datagen/tree/sf100 ? |
@ChrizZz110 Thanks for your help and apologize for this late response. Just come back from the LDBC 18th TUC, and start to catch up these thing I left behind. Yes. I am working on that branch, but it is not much different from the main branch. I just created this branch for SF10 parameters controlling the generation process. Currently, I am stuck in this error:
|
Hi,
thanks for working on this data generator. We are using the generated FinBench datasets for our research and would kindly ask to support larger SFs in the generator than the currently supported factor 10. Especially for systems focussing on large-scale graphs, this would be a great extension.
The text was updated successfully, but these errors were encountered: