for those who would make a difference

Tag: Open format

Open Formats & Open Source for Better Government

The Government of Canada is currently reliant on proprietary file formats and proprietary software applications, which lock it into a licensing bind with a single software manufacturer — Microsoft.  There is not only a question of cost — as we pay a monopoly corporation for per-seat licenses to run software that already dominates the market — but more importantly, there is the question of future access to our own data.  In this post, I’d like to share my thoughts on both issues.

Before you dismiss the idea of a major institution losing access to its stored data as ludicrous, consider this quote from Natalie Ceeney, chief executive of the UK National Archives:

“If you put paper on shelves, it’s pretty certain it is going to be there in a hundred years. If you stored something on a floppy disc just three or four years ago [2003-04], you’d have a hard time finding a modern computer capable of opening it. Digital information is in fact inherently far more ephemeral than paper. The pace of software and hardware developments means we are living in the world of a ticking time bomb when it comes to digital preservation.”

The UK National Archives includes a collection of 900 years of written material. As of 2007 they estimated that 580 terabytes of their data (the equivalent of 580,000 encyclopedias) was stored in file formats which have since become extinct.

Continue Reading

Open data is data that delivers results

Image via Wikipedia

I struck a nerve around open data, as I mentioned in my earlier article, when I stated that “XML is simply a markup language, a container for data. Is it one of the most preferred containers? Absolutely. However, open government data is not synonymous with XML. Open government data is simply government-owned data that can be mined in order to create useful information. It can be in XML, PDF, text files, print outs, etc… The key point is that the data is being released for others to use to create value from it, not the format that it is released in”.

Initial comments on twitter argued that open data had to be XML, then opened up to being any open, non-proprietary format. For developers I would absolutely agree that this makes sense. It’s much easier for developers to work with open formats like CSV and XML vs. proprietary formats like PDF. Developers, however, are not the leaders of open government.

Continue Reading