Hello everybody
I am a researcher in data science and interested in the database history of the IMDB. We were able to obtain a snapshot of the IMDB files (via the old format, meaning the *.list.gz files), as well as a lot of diff-files that make it possible to restore the database to any recorded point in time.
However, some of the diff files are missing which makes it impossible to reconstruct older data files, which is a shame since we would like to analyze historical data, that is as large as possible. (Also see the Post by Paolosg in this topic: https://getsatisfaction.com/imdb/topics/api_bulk_data_access ) ). His interest is basically the same as mine, however it was replied that diff-files are no longer kept.
I was wondering if there was any other way of obtaining historical data of the IMDB (gladly also via the new interface) or if not, whether old snapshots are available, from which we could try to restore the history using the diff-files we have.
Does anybody know whether any of that is obtainable?
Best Regards,
Leon
I am a researcher in data science and interested in the database history of the IMDB. We were able to obtain a snapshot of the IMDB files (via the old format, meaning the *.list.gz files), as well as a lot of diff-files that make it possible to restore the database to any recorded point in time.
However, some of the diff files are missing which makes it impossible to reconstruct older data files, which is a shame since we would like to analyze historical data, that is as large as possible. (Also see the Post by Paolosg in this topic: https://getsatisfaction.com/imdb/topics/api_bulk_data_access ) ). His interest is basically the same as mine, however it was replied that diff-files are no longer kept.
I was wondering if there was any other way of obtaining historical data of the IMDB (gladly also via the new interface) or if not, whether old snapshots are available, from which we could try to restore the history using the diff-files we have.
Does anybody know whether any of that is obtainable?
Best Regards,
Leon


