Written by: Karen Suhaka | May 13, 2013

It’s exciting times right now in the government transparency community.  Alan deLevie posted this very thorough summary of the recently released Open Data Policy, and it’s reception across the community,  in Robert Richard’s wonderful Legal Informatics Research Network google group.  I’ve pasted the whole thing below for your enjoyment.

 

Written by Alan deLevie:

Today the White House released its Open Data Policy. Excerpt:

Pursuant to the Executive Order of May 9, 2013, Making Open and Machine Readable the New Default for Government Information, this Memorandum establishes a framework to help institutionalize the principles of effective information management at each stage of the information’s life cycle to promote interoperability and openness. Whether or not particular information can be made public, agencies can apply this framework to all information resources to promote efficiency and produce value.

John Wonderlich, of the Sunlight Foundation, wrote a blog post, praising the Executive Order:

By requiring agencies to publicly list all their data that could be made public, the President is not just reaffirming that decisions about disclosure should be based on the public interest, he’s also giving the public (and Congress) tools to enforce them. When open data procedures are incorporated into agency processes from the start, we’ll start to see more systems designed for bulk access from the start, and we’ll be better able to recoup all the missed opportunities in legacy datasets that are still closed. We’ll be able to evaluate agencies’ transparency against what they’ve defined as their candidates for release, and clearly identify areas where agencies avoid disclosure altogether.

Of note is the White House’s decision to publish the draft using GitHub Pages. This means that the draft is stored using Git version control, and Pull Requests can be submitted via Github.com. The policy memo is stored here within the Project Open Data repository. Non-technical users can use tools such as Prose.io or GitHub’s built-in text editor to submit Pull Requests.
Open Data hacker Josh Tauberer asked on Twitter: “So who decides what pull requests to merge? @waldojaquith @philipashlock”. Tauberer wrote in greater detail on his personal blog about the memo:

TL:DR: The new E.O. and memorandum are good for transparency and lock in almost all of the generally accepted notions of open government data. But it misses the mark on the requirement of “open licenses.”

Anthea Watson Strong, a Google employee with the Politics and Elections team expressed reservations with the use of Git:
“What do we think of the WH putting their open data memo into Git? Not the most user friendly to non-eng, right?https://github.com/project-open-data/project-open-data.github.io/issues …” (link).
“I can’t imagine middle america is going to fork the policy document to improve the content… right?” (link).
“Heck, I’ve had a GitHub account for a while as an observer, but not being a coder, I’ve never *actually* submitted a pull request.” (link).
Derek Willis, of the New York Times, expressed interest in a section of the memo called the “Mosaic Effect” (link):

Mosaic Effect: The Mosaic Effect occurs when the information in an individual dataset, in isolation, may not pose a risk of identifying an individual (or threatening some other important interest such as security), but when combined with other available information, could pose such risk. Before disclosing potential PII or other potentially sensitive information, agencies must consider other publicly available data – in any medium and from any source – to determine whether some combination of existing data and the data intended to be publicly released could allow for the identification of an individual or pose another security concern. [source]

Willis also wrote: “More than pull requests, I’d like to see full git blame for individual sections of this policy memo…” (link). “git blame” allows people to see who wrote and modified each line of a given document in a git repository.
The memo’s commit history shows a single commit from GitHub’s new government hire, Ben Balter.
On GitHub’s blog, Balter wrote about the memo:

Millions of people around the world use GitHub every day to build software together, but the GitHub Way™ isn’t limited to code.
We are humbled to see that the White House drafted and released the official Open Data Policy of the United States on GitHub! The Presidential Memorandum calls for the creation of the aptly named Project Open Data, with the goal of making government data “available, discoverable, and usable – in a word, open”.

Today’s news marks the first time a government entity has published law as a living, collaborative document. We’re excited to see how the Open Data Policy evolves with the input of the community, and we hope this is just the first of many.
By choosing GitHub Pages, government employees as well as citizens are empowered to continuously propose and discuss potential improvements, creating a living policy document to “unlock the potential of government data”.

Civic hackers and public servants at all levels are using GitHub to collaborate on everything from information about elected officials to the location of the nearest bike rack. While adding a [thumbs up] to a pull request isn’t going to replace ballots at your local polling place any time soon, one thing’s for sure: the revolution is only just beginning.

 

About BillTrack50 – BillTrack50 offers free tools for citizens to easily research legislators and bills across all 50 states and Congress. BillTrack50 also offers professional tools to help organizations with ongoing legislative and regulatory tracking, as well as easy ways to share information both internally and with the public.