MITx and HarvardX make some of their MOOC data publicly available
MITx and HarvardX deserve huge congratulations for making data associated with a number of their MOOCs publicly available. Four months ago, I wrote that the “community would benefit from access to the data that HarvardX and MITx have, as other individuals/groups could run additional analyses. Granted, I imagine this might require quite a lot of effort, not least in the development of procedures for data sharing.” It seems that the researchers at MITx and HarvardX have tackled the issues involved to make the data available, and have developed thoughtful procedures to ensure de-identification. While some of the steps taken may limit analyses (e.g., the de-identification process document notes that “rows with 60 or more forum posts were deleted,” thus eliminating highly active users), this is a big step in the right direction and it should be celebrated.
Now… can we have some qualitative data? If any institutions are interested in making those available, I’d love talk to you, give you input, and work with you toward that goal.