Getting a proper list of movie titles and year from IMDB offline text data

  • 1
  • Question
  • Updated 6 years ago
  • Answered
Archived and Closed

This conversation is no longer open for comments or replies and is no longer visible to community members. The community moderator provided the following reason for archiving: Old thread

I'm trying to get a definitive list of all movie titles and matching years from downloaded IMDB data extracts.

I've tried achieving this from the movies.list file, but it seems this file is full of TV as well, which doesn't seem to make a whole lot of sense.

How can I go about getting only movies?
Photo of Troy Heron

Troy Heron

  • 3 Posts
  • 0 Reply Likes
  • frustrated

Posted 6 years ago

  • 1
Photo of Emperor

Emperor, Champion

  • 6418 Posts
  • 3004 Reply Likes
I'd assume you'd need to do more filtering but I'm not sure how easy that is from the text files, although I'd have thought some spreadsheet software would do it.

Or just use the database:

www.imdb.com/search/title?sort=year,d...
Photo of Dan Dassow

Dan Dassow, Champion

  • 13457 Posts
  • 13801 Reply Likes
As Emperor indicated, you will need to filter the list. Any title preceded by a quote (") is a television show or television episode.

Fortunately, the list of titles is sorted in lexicographic order. In other words, the list of television shows and television episodes will be together in the list. You will simply need to delete the titles starting with a quote (").

However, the list will still have television movies, videos and video games, which are respectively indicated with (TV), (V) and (VG). You will need to filter out these titles to get the list you desire.

This conversation is no longer open for comments or replies.