Help get this topic noticed by sharing it on Twitter, Facebook, or email.

Navigating GitHub to find simple Dataset Spreadsheets for Middle School Students

I'm a teacher working with Middle schoolers on a data visualization project. I'm looking for the simple spreadsheets (I called these 'datasets' - hoping I'm using the term properly) that used to contain the raw data that Gapminder pulled from. It seems that now all of this is housed at GitHub - and there are frequently a bunch of associated files - .csv files and others - that work together in some way I haven't yet figured out, to produce those awesome Gapminder graphics.
My question: can you help direct me to where I can find the spreadsheets with the data? When I've tried downloading the .csv files and open them in Excel or Google Sheets, they don't render properly, or else contain a bunch of HTML.
1 person has
this question
+1
Reply
  • Thanks for your question, Stephen!
    This is correct, we are moving to have all our data on Github as a bunch of CSV files, because of better versioning, revisions, branches and all the awesome things.

    The downside is that you can't download excel versions anymore (we will think later how we can possibly fix that)

    For now you can try the following that would hopefully help:

    Install CSVhub google chrome extension (you are using google chrome, do you?)
    https://chrome.google.com/webstore/de...

    this will pretty-print the csv files that you view on github, with columns

    * * *

    Excel used to work just fine with CSV files but now their understanding of what "comma-separtated values" are has changed, and they use semicolons to separate values which makes no sense.

    When opening CSV files in Excel you need to use import, not just open:

    File --> Import... --> Text file --> [pick a csv file] --> delimited --> pick comma as a delimiter --> press next and then finish. Import in the same sheet. Now you should be able to see the file correctly

    * * *

    if you are browsing Systema Globalis
    https://github.com/open-numbers/ddf--...

    you can find the information about the column headers (we call them concepts) in ddf--concepts.csv

    and the country codes are in here: ddf--entities--geo--country.csv

    * * *

    Let me know if it helped!
    Angie
  • (some HTML allowed)
    How does this make you feel?
    Add Image
    I'm

    e.g. happy, confident, thankful, excited indifferent, undecided, unconcerned kidding, amused, unsure, silly sad, anxious, confused, frustrated

  • Hi Angie -

    Thanks for the quick and thorough response!

    I still have some questions / follow up items:

    - I installed the CSVHub Chrome extension - particularly hoping that I might be able to find an 'export' option to get a table or spreadsheet out of it... but instead I got:



    I'm also concerned about going the Chrome Extension route, b/c my students don't have the same level of access to the Chrome Store as teachers do - so it might be a solution for me, but not one for them that I could easily roll out.

    - - -

    I also tried importing the CSV file into excel, per your instructions. It got a little murky at the end - I think because I'm on a mac, using 'Microsoft Excel for Mac' (2015, version 15.22).
    The csv file I chose to try to open was:

    https://github.com/open-numbers/ddf--...

    but when it imported, there was a bunch of HTML - looked like a table - and then the data was all in a single column.

    So, I'm still a little confused on how to get at this data for my students.

    - Am I trying to import the wrong csv file into excel?
    - Is there a way around the limitation of the CSVHub Chrome Extension not being able to render larger files?

    Thanks in advance for your ongoing help.

    - Steve
  • (some HTML allowed)
    How does this make you feel?
    Add Image
    I'm

    e.g. happy, confident, thankful, excited indifferent, undecided, unconcerned kidding, amused, unsure, silly sad, anxious, confused, frustrated

  • Hi Steve,

    Thanks for your reply.

    We're aware that the Github interface is less easy to navigate compared to the google spreadsheet. We're discussing on how to make this possibly a little more user friendly for people unfamiliar with github.

    For the csv's, be sure to download the 'raw' version:



    https://raw.githubusercontent.com/ope...

    I suspect you see HTML in excel because you downloaded the actual GitHub webpage which has the data embedded in it, instead of downloading the raw csv data file.

    Best regards,
    Jasper Heeffer / Gapminder Foundation
  • (some HTML allowed)
    How does this make you feel?
    Add Image
    I'm

    e.g. happy, confident, thankful, excited indifferent, undecided, unconcerned kidding, amused, unsure, silly sad, anxious, confused, frustrated