I'm going to start with the page that you can see up on the screen now. This is the home page for data.gc.ca. This is our one stop shop for all of the open data that the Government of Canada makes available at any time to citizens, researchers, voluntary organizations, the private sector, the media. It maintains access and discovery of all open data. It also has some other open government activities, but I'm going to focus mostly on the open data.
The page has been designed with large tiles, as you can see, so that you can easily find what all the features on the site are, so that you can quickly jump to the information that you're looking for. Key, I think, for our conversation today is that tile in the top left-hand corner, the search data.
I'm going to proceed as if I was an average open data user. I click on “search data” and I'm going to pretend that I'm somebody who's looking to buy a new house and I'm interested in what the safety is in the neighbourhood that I'm considering buying a house in. I would type in crime, for example, hit submit, and all the data that is currently made available from the Government of Canada related to crime would come up.
Oftentimes there is a lot of data. We need to be able to help the user filter down those results to a smaller amount, so that they can find what they're looking for a little bit faster. I could reorder the data sets alphabetically or by the date that they were actually created or last modified, and their relevance. I'm going to leave it at relevance for now.
The left-hand side provides a whole variety of filters that can be used to narrow down the search results. I'm going to keep proceeding as if I'm looking for neighbourhood safety information, so I'm going down and see that under subject there's law. I click on law. The number of data sets comes down, still a fair amount, but as I move down the data sets looking for the information I'm looking for, I see crime statistics for Canada, the provinces and the territories, and I know that that's the information that I'm looking for.
I click on that data set and this is the metadata. This slide provides all of the information that we have on that data set, who the publisher is—in this case it's Statistics Canada—what subject it falls under, the date it was last published, and a short description and title of the data set. Those for us are the mandatory metadata fields that must be completed by any department or agency that is making data available.
Before I go into the specific information related to this data set, because I want to point out some of the features for each data set, I want to highlight the licence, which is right here. This licence is a significant point of progress for us working within open data.
As Corinne mentioned, it used to be that individual departments would make their information and data available under a variety of licences, most of which were several pages long and written in legalese that was very difficult to understand by the layman and oftentimes there were specific restrictions on the reuse of that data. For example, most often you couldn't reuse that data commercially.
Our new licence is written in plain language. It's extremely simple. It's based on best practices for open licensing internationally. We are sharing this licence with other jurisdictions within Canada, promoting adoption on a pan-Canadian level, so that data users will be able to bring data together from multiple jurisdictions within Canada at both the provincial and the jurisdictional level, and that can be combined and mashed together.
I'll just go back to the data set and point out a couple more features.
Back before we launched the most recent version of data.gc.ca this past summer, we held a series of round tables across the country with the open data community to hear what it was they would most like to see in the revised, revamped open data portal.
They wanted the ability to rate the data sets themselves and tell us what they thought of the data. They wanted to be able to provide individual comments on that data in the hopes that we could perhaps improve that data. They wanted to be able to share the data easily with others.
All of those features have been incorporated into the new data.gc.ca. You'll see up here on the right side that you can rate the data. It's a five maple-leaf scale. You would simply pick the rating that you're giving the data. You can provide individual comments below and then submit, and it becomes part of the ongoing consumer rating of that data. You can share the data via Facebook, Google, or Twitter, and you can provide comment on the data and share those comments with all other users of that data.
If I want to download the data, I simply click on one of these buttons. The data is made available in different file formats to ensure flexibility of use by the individual users. The data sets are made available in French and English, and there is additional supporting documentation to help the users use the data, and all of these are one-button downloads. Press the button and the data downloads—I won't do that right now.
I should just mention before I leave this page that there's an openness rating down at the bottom. We've incorporated the use of an international openness scale that's used by other jurisdictions to indicate the level of openness of the data sets. It's based on the five-star scale. Most of the data that we hold is three stars and above. This speaks to whether or not the data is being made available in a well-structured format, whether or not you require proprietary software in order to be able to open the data instead of an open software program, and we, the U.S., the U.K., and a variety of other jurisdictions, are using this scale.
If I go back to the search page and pretend that I wasn't able to find the data set that I was looking for, there's a button that says “Can't find what you're looking for?” up at the top. We're very keen to get feedback and information from potential open data users on what data they'd like to see that we haven't yet made available. That helps us to prioritize our work, by working with individual departments to have that data made available. If you didn't have the data and you clicked on that, you'd see a variety of data sets that have already been requested. So you'd look at that first to see if the data that you're looking for has already been requested, and if you see it, you could add your voice to those who have already requested that data.
Behind the scenes what we do is we work with this list and individual departments to find that data and to try to make that data available, and then we update here, on this page, when we've been able to make that data available.
So, for example, here, with the national household survey released in May 2013, now when you click on that it would take you to the actual data set. If you couldn't find the data that you're looking for, you could submit a new data set and it would become part of this list, and again, other individuals would be able to come in after you and add their thumbs-up or their support for getting that data set as well.
Now I'll go back to the home page to show you a couple of other features specific to open data. I'll start with the showcase. We use this area of the site to provide examples and illustrate the use and the utility of open data. We keep a whole section called open data in action, which provides information on specific projects within the Government of Canada, most of which are collaborative, working with other jurisdictions, that use open data specifically to inform a particular policy area.
The oil sands monitoring portal is a joint initiative between Environment Canada here within the federal government and the Alberta government. It specifically focuses on open data. Together the two jurisdictions make more open data available out to the academic world to support greater research.
Also available through the showcase is an apps gallery, which provides access to a comprehensive listing of the apps that have been made available and developed by the Government of Canada using open data. These apps are downloadable for mobile devices.
If I click on the left-hand side, for example, to find a specific app for my phone, I can click on “mobile” and see all of the apps that are currently made available by the Government of Canada for download into mobile phones. “Recalls and safety alerts”, for example, is an app using open data that can be downloaded. I can download the app straight from the site.
I'll give you just a couple more features of the site. About data.gc.ca, I want to point out that a variety of the information resources that are put on this site are for the open data layperson, designed to get them interested in open data and explain what can be done.
We talk about the licence, making it clear that the data that is available can be released and reused on an unrestricted basis. There's a section on frequently asked questions. There's also “Open Data 101”, a handbook on open data to get people who have yet to start really using open data off the ground with the basics of what open data is, how it can used, and how to work with open data.
At the other end of the scale, the site has a developers' corner, which is really for the open data user who has some experience already. These are potential developers, for the most part, people who are interested in building applications using federal data or federal data combined with data from either the private sector or other public sector jurisdictions.
Here we have a little bit more sophisticated information around working with data sets; how to use an application programming interface, a software tool that makes access to data that changes frequently within the federal government more easy to use if you're building an app that will want to access that data on an ongoing basis; information about our metadata element set; and then Open Data 101.
That really brings me to the end of the tour.