You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
In fact, if we group by the duration of commits of projects where the duration is the years between the oldest and the newest commits of a project and count the number of projects in a group, I got the following,
Looks to me that the most recent dump (2019-06-01) has wrong time stamp. Below is how we may reproduce the error.
In http://ghtorrent-downloads.ewi.tudelft.nl/mysql/mysql-2019-06-01.tar.gz, I found the following,
I loaded the CSV files to a PostgreSQL database, and then do a query. These projects are,
Having compared the dates in the CSV file with the commit log in these projects. The dates are indeed wrong. Similarly, I also found these,
In fact, if we group by the duration of commits of projects where the duration is the years between the oldest and the newest commits of a project and count the number of projects in a group, I got the following,
where
count
is the number of projects whoseduration
is given on the left column.The text was updated successfully, but these errors were encountered: