Government Operations Committee on May 13th, 2014

Evidence of meeting #25 for Government Operations and Estimates in the 41st Parliament, 2nd session. (The original version is on Parliament’s site, as are the minutes.) The winning word was sets.

A video is available from Parliament.

On the agenda

Study on the Government's Open Data Practices

MPs speaking

Also speaking

Mary Francoli Assistant Professor, Communication Studies, Carleton University, As an Individual

Denis Deslauriers Director of the Information Technology and Telecommunications Service, City of Quebec, As an Individual

Alton Hollett Assistant Deputy Minister, Economics and Statistics Branch, Department of Finance, Government of Newfoundland and Labrador

Harout Chitilian Vice-chair of the executive committee responsible for administrative reform, youth, smart city initiatives and information technology, City of Montreal

Jean-Pierre Fortin IT Strategy and Planning Advisor, Information Technology Service, City of Montreal

Gordon O'Connor Carleton—Mississippi Mills, CPC

8:45 a.m.

NDP

The Chair NDP Pierre-Luc Dusseault

Good morning everyone. This is our 25th meeting. We are continuing our study on the government's open data practices, which is well on its way and indeed, nearly over.

We have several witnesses today, several experts: Ms. Francoli, Assistant Professor, Communication Studies, at Carleton University; Denis Deslauriers, Director of the Information Technology and Telecommunications Service, City of Quebec; as well as Alton?? Hollett, Assistant Deputy Minister, Economics and Statistics Branch, Department of Finance for the Government of Newfoundland and Labrador. We also have two witnesses joining us by videoconference from the City of Montreal a little later on: Mr. Chitillian, Vice-Chair of the executive committee responsible for administrative reform, youth, smart city initiatives and information technology; and Mr. Fortin, IT Strategy and Planning Advisor, Information Technology Service.

We will begin by hearing the witnesses present in the room, beginning with Ms. Francoli, who has 10 minutes for her presentation, just as all the others. Following that, once each witness will have spoken for 10 minutes at most, the committee members will ask the witnesses their questions.

I thank you for being here today.

Ms. Francoli, you have the floor for 10 minutes.

8:45 a.m.

Professor Mary Francoli Assistant Professor, Communication Studies, Carleton University, As an Individual

Thank you very much for inviting me to speak to your committee. I'm going to focus my opening remarks on the Open Government Partnership, or the OGP, in which Canada is a member.

The OGP secures commitments from governments to improve transparency, accountability, and citizen engagement; to fight corruption; and to harness technology to strengthen governance. A requirement of membership in the initiative is that each country agree to have an independent review of its national action plan and of its progress every two years. This is called the independent reporting mechanism, and it's part of a system of checks and balances built into the OGP. I'm the independent researcher for Canada. Our first progress report was published in February of this year. The foundation for that report and for my remarks today is based on stakeholder feedback that made up the bulk of the report.

Canada's national action plan to the Open Government Partnership focuses on more than just open data, but given the parameters of your study, I'm going to confine my comments to open data as much as possible.

There are a lot of different issues that I could talk about in relation to open data, but given the limited time, I thought I would focus this morning on some of the main areas of concern that users raised during my stakeholder interviews and meetings. So it's a bit of a critical analysis of our open data strategy. I'm not speaking as much to some of the positive things, but I'm certainly happy to speak to those later during the questions and answers.

I organized my comments today around seven main concerns or points that the majority of stakeholders that I spoke to during the course of my study raised as issues with regard to the open data strategy.

The first is the diversity of data sets. Currently the data.gc.ca portal is largely dominated by geospatial data. There are few to no data sets in many other areas, including employment insurance, health, and issues related to specific demographics such as seniors or aboriginal persons. A lot of the users that I spoke to during the course of my study found that quite limiting, just the nature of the data sets themselves.

The second point is the quality of data. A couple of the points relate to that. Quality of data was perhaps what the majority of stakeholders were most concerned about. There is a widespread belief that the quality of the data in the data portal will suffer, and will continue to suffer in the long term, as a result of steps that have been taken to cut data collection at its point of origin.

A prime example of this—and I have to say this was the example that was given by almost everybody I spoke to during the course of my study—is the cancellation of the mandatory long-form census. Those sorts of measures around data collection have led to concern about the availability of updated and comparable data sets at smaller units of geography in the future. We’re already starting to see that be the case. Even in the last few days there have been a few news stories and reports about the loss of data from the last census exercise.

Those are the first two points.

The third point is the fragmented nature of the data sets that are found on the data.gc.ca portal. Data users noted that it isn’t uncommon for data sets to be released in what they said were bits and pieces instead of in complete and wider-reaching data sets. Sometimes they said they're also separated from their methodology and their quality description. What data users were finding is that when they were trying to work with the data, they had to spend quite a bit of time and really did need quite a high level of expertise to be able to combine data sets and make them really useful.

I think it came out during the course of the conversations I had that this problem might be a function of a bit of a difference in the definition of “data set” amongst data scientists and data users and government. I think there's a bit more conversation needed around the definition of a “data set”.

Another problem related to quality and the nature of the data is the format of the data sets on the portal.

In the past there have been some inconsistencies in the format of many of the data sets. I know that's an issue that the Treasury Board Secretariat has been working on. In the process of developing standards we really need to make sure that good metadata is included with the data sets. Missing and inconsistent metadata makes analysis really difficult; it makes it difficult for data users. The impression that I had from some of the users was that the standards for formatting are set a bit on the lower side, and that some of the metadata from certain data sets had potentially been removed in the name of standardization and consistency.

That brings me to my fifth point, which is the data portal itself. A lot of the people I spoke with had significant concerns about the data portal. I just came back from Open Government Partnership meetings in Dublin—they were European regional meetings—and heard many of the same concerns coming from civil society actors and assessors of action plans, coming from other countries that either have a data portal or are considering starting up a data portal. Data.gc.ca, as you know, is managed out of TBS, the Treasury Board Secretariat, which has the responsibility for the open government file. That centralization of the portal means that the data on the portal is effectively removed from its creators and its curators. It's removed from those, then, who have the highest degrees of specialization and understanding of the data itself. That puts TBS in the perhaps unenviable position of being a middleman, managing relationships and queries between those who are using the data and those who collected the data.

Some thinking needs to go into that issue, and perhaps the location of the data portal should be thought about. Some people I spoke with indicated that NRCan would perhaps seem a more logical home for the data portal, given that the majority of the data sets do belong to them and they have a high degree of expertise in data collection, presentation, and analysis.

Another issue with the portal is the search function. Users did quite widely indicate that it's not particularly user-friendly or well-designed, and they really thought that, at a minimum, with the portal, the functionality of the search function should be improved.

My second last point is that there is a growing data divide that's being created right now. Releasing data sets alone really doesn't have that much potential. It's not going to lead to any kind of significant change. You need people who can take the data and use the data. That requires expertise; it also requires resources. The raw format that the data sets are released in really does privilege data scientists, people who have high degrees of expertise in the use of raw data. Many others, non-governmental organizations, for example, would benefit greatly from the data sets and the information, but they're not able to use them because they lack the resources and they lack the expertise. If we're, in Canada, widely acknowledging that open data is important, then we need to think about potentially developing a mechanism for addressing that data divide and making sure that the data is accessible to a wider range of people than just people with a high degree of expertise—data scientists.

The final point that I'll make today is that open data is not open government. There has been a lot going on with open data, including the important study that you are undertaking with this particular committee. It's where a lot of other governments as well have placed their energy. We're certainly not alone in Canada in focusing on open data.

While there is certainly room for improvement, we have done some good things when it comes to open data. To be focused and careful of time, I didn’t necessarily go over all of those good things. I'm happy to talk to them during the questions. My worry, after talking to a range of stakeholders, and conducting the Canadian evaluation of our open government progress, is that open data is becoming privileged at the expense of other areas of open government and some of the other commitments that we have made in our OGP action plan to the international community and to Canadians.

I'll close there. As I said, I'm happy to answer questions. I've provided the clerk with the link to a full copy of the report, and I can provide any other research that you might find useful.

Thank you very much.

8:55 a.m.

NDP

The Chair NDP Pierre-Luc Dusseault

Thank you for that presentation, Ms. Francoli.

Without further ado, I give the floor to Mr. Deslauriers, from the City of Quebec, who has a maximum of 10 minutes for his presentation.

Mr. Deslauriers, thank you for being here. You have the floor.

8:55 a.m.

Denis Deslauriers Director of the Information Technology and Telecommunications Service, City of Quebec, As an Individual

Thank you, Mr. Chairman.

Good morning, everyone.

I am both pleased and honoured to be here today in order to share our experience with open data, in the hope of helping to improve the Canadian government operations with respect to data belonging to its citizens.

Like the Canadian government, we began in 2011 to release data, in response to requests from citizens.

For your information, the City of Quebec has a population of 575,000 people. As compared to Canada's population, which is about 35 million, that represents a ratio of 1 to 60. The City of Quebec currently has 65 data sets, as compared to 11,000 for the Canadian government, which is a ration of 1 to 180. The City of Quebec currently has 5,000 downloads per month on its website, where has the Canadian government has had approximately 15,000 per month over the last few months, including 5,000 during last February's EDOC, which is a ratio of 1 to 3. Since our launch in 2011, citizens have downloaded something 118,000 times on the City of Quebec's website. According to what I read this morning on your site, the Canadian government tallied 100,000. These numbers lead us to the question of whether volume necessarily equals usefulness.

Over the last few years, the City of Quebec has participated in several community events, events organized by the community rather than the city. For example, programming marathons that last between one and three days. We have also asked colleges and universities students to create, using our data, applications that truly benefit citizens, rather than creating bogus projects at the end of the school year that will be seen only by their professors.

We became aware that the one-to-three-day traditional programming marathons were much too short to allow the creation of quality applications, even if we provided funding.

We also realized that final projects submitted by students provided much better quality applications. However, at the end of the school year, students go off to work and forget about their applications. What is more, professors cleared the servers to prepare for the next session, and applications disappear, no matter good they are.

Among the problems we can raise, transparency and usefulness seem the most important to me. Take for example police data collected by each city in Canada, data which I have consulted myself. I cannot help but wonder how useful it is for a citizen to find out how many officers are working or the numbers of officers eligible for retirement. Although it is made accessible for reasons of transparency, does this data truly improve a citizen's daily life? I truly wonder.

As to statistics per province on break-ins, attempted murders, drug trafficking, gambling and so forth, they may well interest statisticians, but do they improve taxpayers' daily lives? I find all this a bit much.

Our mayor recently announced he would be making crime data available and break it down by neighbourhood. This will certainly have an impact on property values in neighbourhoods with higher crime rates. Furthermore, the residents' reaction will be to ask the city to double, triple or even quadruple patrols. In one hour, rather than having one pair of eyes watching, we will have four. Is that sufficient? Forgive me for my skepticism.

I would prefer the residents become more aware and stop asking themselves what government can do for them, thinking that government is responsible for surveillance and security. No, that should not be the case. As a citizen, one should be responsible for reporting events that seem suspicious. Thus, instead of having four pairs of eyes per hour, there would be 100, 1,000 or 10,000. If we raise citizens' awareness about the quality of life in their own neighbourhoods, we will improve safety in those neighbourhoods and bring property values back up. Generally speaking, citizens are unconcerned by their provincial or municipal crime rates. They are mainly interested by what is going on around them, in their own backyard.

I believe there are changes to be made, and I have other suggestions to that end.

What would you say if we provided real data on waiting times at border crossings, so that people could head to the right stations? What if we provided information about wait times in hospitals, medical clinics, and all the places where permits are issued? So that people can choose the least busy ones for themselves? That would certainly help mitigate problems.

What would you say if we provided information as simple as georeferenced data on our parks, fountains and public washrooms? These are things people regularly complain about, as they do not know where fountains and public washrooms are located.

Imagine the consequences if we provided the rate of success for kidney surgeries by hospital. Of the four hospitals in my own city, one has a success rate of 62%, and the others, 80%. What would people do? They would choose one of the hospitals with an 80% score. This data would allow us to improve as communities. We could try to find a solution to any given hospital problems, by checking whether the hospital received proper funding over the last few years, whether its staff is competent or has received the necessary training. It might be appropriate to reduce investments in hospitals with a success rate of 80%, to spend two years attempting to correct the situation in the one with the 60% score, and thus, rebalance supply. In my opinion citizens would be better off.

As for developers, I think we could make their lives simpler. We have already begun to do so this year by adopting the Creative Commons International licence for the cities of Gatineau, Montreal, Quebec and Sherbrooke, as well as for the Quebec provincial government. Developers no longer have to worry about which licence must be taken into account when using this data.

We have also begun standardizing data. For the first time, datasets for events and new ideas have been standardized among cities. We now host this dataset on our website and are uniting portals into a single stop for these four cities and the Quebec provincial government, in order to facilitate the use of the website for citizens.

We must not shy away from encouraging their use. We have kept citizens in the dark for a very long time. We felt we were in a better position to know what was in their interest. It is time to let them in. To that end, we have to show them how to go about it, as they have lost the habit. We must encourage use.

We recently held a municipal programming marathon. These four cities each contributed a $5,000 prize. Then, during one week, we asked the winning teams to polish their products in preparation for a provincial competition. We provided four organizations with $12,500 in funding for that purpose. It cost my city $8,000 for an application that is compatible with iPhones, Windows Phone and Android, as well as online in an adapted format. That application will be maintained for one year. That was one of the requirements to be able to win the contest.

We can also try to create wealth. Universities have begun to design products with us, for example, an application that helps you find a parking spot downtown during events. We encouraged them to partner with a not-for-profit organization, or NFPO. They continued to improve the application to make it possible, for example, to prolong the parking permit through a smartphone, without having to return to the parking meter, or, getting a discount at a nearby restaurant before attending a show. We are creating wealth at the same time.

We are supporting two CEGEP initiatives that will create a not-for-profit so that applications developed by students at the end of the year can be used permanently by future student cohorts. If these applications become too cumbersome to maintain, CEGEPs may ask the students to take them over and set up a business. That would create jobs.

We also need to think about facilitating data consultation by citizens. We have a great deal of data, but could we provide citizens with the required tools to view them easily and automatically generate graphs in columns and pie charts, rather than providing them with raw data? Ultimately, these measures are not focused on the developer, but rather the ordinary citizen. We could provide citizens with mapping that would allow them to view data on their area and benefit from it. Imagine the possibilities.

We could also invest in playing a role as liaison and catalyst for the community. Some software engineering students are quite good at developing applications, but not great at designing. In an another faculty, students who specialize in multimedia design are very good at designing, but have never developed applications. All these students are asking to work with us. We play a middleman role with citizens and try to maximize their contributions. According to the testimony we have heard, this is greatly appreciated. These students offer real benefits to citizens. They are happy to return the favour, because they know that their tuition fees are largely subsidized by citizens.

I will be pleased to answer your questions.

Thank you.

9:05 a.m.

NDP

The Chair NDP Pierre-Luc Dusseault

Thank you for your presentation.

I will now give the floor to Mr. Hollett, who is the Assistant Deputy Minister for the Department of Finance for the Government of Newfoundland and Labrador.

Thank you for being here this morning. You have at most 10 minutes.

9:05 a.m.

Alton Hollett Assistant Deputy Minister, Economics and Statistics Branch, Department of Finance, Government of Newfoundland and Labrador

Thank you very much.

I'm very happy to be here this morning. I'm always interested and happy to talk to people about things that we're interested in as well.

I have brought along some slides, but unfortunately there wasn't time to have them translated into French, so you're not going to get the benefit of pictures that would probably make some of the things I'm going to say a little clearer. I'm going to have to take a slightly different approach to what I say because of that, but that's fine.

I looked at the questions of interest to the committee, and I thought the best way I could respond is to talk about where our government is going. We've recently announced an open government initiative, of which the open data is one component. I'll talk about that and ultimately tell you how we're dealing with that and where we're going. I will also go through some work that we've done in the past with data sharing, which will tell you why we're taking the approach we're taking. With all the things that I'm hearing, which Mary and others are saying, we've had the same experiences.

I will do that. I'll talk about our experience. I'm going to talk about the Newfoundland and Labrador community accounts data sharing initiative, from which we've learned an awful lot. It is the foundation for the way that we think about open data. I'm going to talk a bit about meeting user needs, and of course I'm going to end up with a little about where we're actually going.

We've had a long tradition of data sharing, and also supporting the users of our data. The Newfoundland and Labrador Statistics Agency has always been very interested in providing data to people in our 400 communities, scattered around 6,000 miles of coastline. We've given open access to a very wide range of information. We bought the data from Statistics Canada. We've developed it from internal sources, and we just put it out there. Statistics Canada has said that it's a peculiar thing to do; we're buying data and giving it away. However, we've always thought it was very important to do that.

To us, the open data initiative that we see across North America is essentially a focus on things we have always done. We're happy to see that focus. We're very engaged in the idea of open data, and very committed to it. However, it's not something that's absolutely new to us, by any stretch of the imagination.

Regarding the system of community accounts, on my slide I call it the “flagship” of Newfoundland data sharing, and it certainly is. We released it to the public in 2000. It has data for 400 communities, 200 neighbourhoods in our larger communities. It's actually a fully developed data set, in the sense that everything is documented; you can get back to the source. We have applications there, mapping, and so on.

The other thing we've done.... Dr. Doug May of Memorial University and I have partnered in this, and we've been at it for many, many years. The way we've packaged our data in the system of community accounts is that we use a well-being framework. My slides will be available, I think, and you'll be able to see it. I have a schematic there that shows an overview of that well-being framework. The reason we did that is that we wanted to make the data meaningful to people. The idea of the well-being framework is that we present data that gives statistics and measurements of factors that contribute to well-being in people's lives.

When you look at this at a community level, it's very powerful. People very quickly become experts because they know their communities. If you give them a number, all of a sudden it starts putting a quantitative dimension to basically knowing themselves. We have found that to be very effective.

In working with people at the OECD and the Australian Bureau of Statistics, and places like that, we found that we're probably 10 years ahead of things because that's where things seem to be going right now. That was very gratifying. In the beginning I was afraid we were going up the wrong alley, but we weren't.

We found it very useful. When you use that framework, you look at income, employment, unemployment, demographics, and so on. It helps to give you a sense of what data you should put in your system. It also helps with your prioritizing. You get people coming and they're asking, when are we going to have this, and when are we going to have that? We found it to be very effective, and we found that our communities and our neighbourhood people really liked that approach. A lot of people just don't know what the possibilities are.

In terms of lessons learned, which is a driving force in terms of what we're doing with open data, we found that people came to us and said, “You're a statistics agency and we'd like some data”. We asked what they wanted and they asked what we had. That's a hard question for a statistics agency to answer. It's very hard, as you can imagine. You can think of it as a warehouse that's full of all kinds of wonderful data. Most people don't know what they want, and a lot of people don't know the possibilities. This is the power of the conceptual framework that we've put in there with well-defined objectives, and so on and so forth.

Our experience with open data versus more developed data sets is that the majority of users really are not coming to us looking for the open data type of data. We find that most people, as Mary said, who use these data, who are looking for these data, are academics or seasoned data users, and quite often it requires a lot of work to actually use them. Of course, we've always provided those kinds of data when people ask for them, and I wouldn't want to give the impression that we don't think that because people are not asking for the data right now that good open data initiatives that are well delivered and well structured can't develop an appetite and develop a lot of interest and a lot of usage of those types of data. But I think we have to be realistic about where we are today. The market for raw open data, if you want to think of it that way, at this point in time, is certainly not very well developed, and if it is, it's clustered in specific places.

We view the open data approach as really most simplistic—and I don't mean that in a negative way. It's pretty elemental, how the concept defines open data, and then, of course, there's the value-added data, which is the community accounts type of data, in terms of my example. We look at data in terms of a sort of spectrum. There's data, information, and knowledge. From our perspective, the open data, the raw data, would be sort of just the data end of it, but when you do things with that data to make it more useful, you turn data into information, and when more fully developed, you begin to turn it into knowledge.

We've always put a lot of emphasis on trying to provide information and knowledge data. I do believe that in the future, when I get into a visioning mood, I really think we'll put a lot of effort into open data. We'll learn a lot about it, and eventually end up coming back to data that are better supported, better defined, and not simply dumped out of administrative data sets because they were never designed for those kinds of reasons. It will go full cycle. The market for open raw data will be there, and probably bigger than it is today, but I think most of the demand will not be there over the medium to long term.

In my slides, which you won't see.... I had a couple of slides there that I refer to as repairing data usage. The behind-the-scenes challenge is a messy business, and it really is. I encourage you to take a look at the slides when Marc puts them up, because I'm not going to get into it now. It is by no means simple or straightforward to take a set of data that people would consider raw data, and even to do marginal work to turn it into something that's going to be useful to pretty much any user. It doesn't matter how technically strong and numerically literate an academic person or any seasoned data user is. Administrative data files are nightmares to deal with, and that's where a lot of the open data interest actually lies.

These data sets, to be useful, require a lot of support. This is one of the main reasons why our government has had us into the data side of this, as a professional and well-developed statistics agency, to make sure. We want to be a leader in our province in providing a good data product. We don't want to get out there and just churn it out and have all our staff on the phone all the time trying to answer questions as to what this is.

We want to make sure that.... The value-added will vary across the spectrum, but the value needs to be there if this is going to be successful. I would argue, based on experience, that if we don't put effort into making the data sets clean, even the rawest form, if we don't make them clean and well-defined so they can be used properly and efficiently, we are creating a resource nightmare for our organizations in trying to deal with people who are going to be coming looking for help, looking for how to interpret, how to use—where do they come from, what do they mean, what can you do with them? Ultimately, I think this could be the foundation for the failure of open data initiatives, which I think are a very good way for governments to go.

In terms of what data can be shared, what we find at this particular point in time is that it's really a challenge to know which way people are going and which way people actually want to go. For many of the sites we look at, there is no obvious organizational framework. You see that the offerings are all over the place when you look across the different sites that are out there. The word I have on my slide is “spurious”, and in many cases the quality is questionable.

But as for the way we think of it, we think of data as answers to questions, so where we start.... We've been going with the open data, and we've been encouraging our stakeholders across government to do so, the people who are into open information but don't really understand the open data as well as we do because we spend our lives at it.

First, we have to decide what questions we actually want to answer. Once we know what kinds of questions we want to answer, that begins to give us some idea of what the objectives of the initiatives are going to be. Who is the target audience? Are they highly skilled? Are they less skilled? Do we know what they want? Then, based on all of that, what's the best way to provide it across the spectrum? That's from raw data to knowledge, if you want to think of it that way.

As for what we've done in the approach we've taken, our government is fully committed to open government, to open data. There's absolutely no question about that. What we've done is establish a preliminary website. It's almost a demonstration website, but it's not something that will be withdrawn. It's something that will be made bigger. There, our Office of Public Engagement is beginning to consult.

I'm finishing now, Mr. Chair, because I'm sure I must be close to 10 minutes.

They're doing a consultation to see if we can engage with people and see where their interests might lie. A big thing we're doing that's going to be very useful for a variety of reasons is that we're actually building an inventory of all data sets across government. That is not simple. It's a big job, but we do have it under way. Of course, we're making sure as we go that privacy, confidentiality, and all that sort of thing is appropriately dealt with.

Based on our consultations, and also on our judgment, because I sort of feel that we're not going to get an awful lot of feedback from our consultations based on experience.... As I said earlier, you ask them what they want, and they don't really know for sure—

9:15 a.m.

NDP

The Chair NDP Pierre-Luc Dusseault

I will have to ask you to please conclude.

9:15 a.m.

Assistant Deputy Minister, Economics and Statistics Branch, Department of Finance, Government of Newfoundland and Labrador

Alton Hollett

Yes, absolutely.

That's the essence of what I have to say or what I can say. I'm happy to answer questions.

9:15 a.m.

NDP

The Chair NDP Pierre-Luc Dusseault

Thank you very much for your presentation.

It's now the turn of the City of Montreal officials, who should be in a great mood this morning. We will hear from Mr. Chitilian, the Vice-Chair of the Executive Committee, and also from Mr. Fortin.

On behalf of the 10 members of the Standing Committee on Government Operations and Estimates, I would like to thank you for being at our meeting this morning.

You have the floor. You have 10 minutes.

9:20 a.m.

Harout Chitilian Vice-chair of the executive committee responsible for administrative reform, youth, smart city initiatives and information technology, City of Montreal

Thank you very much.

Committee members, Mr. Chairman, it is a great honour for us to present the point of view of the City of Montreal on open data.

With me is a senior official from our public service, who will answer questions later on.

Let me start by giving a quick history of the open data policy and open data initiatives at the City of Montreal. We started in 2011 and put in place, first and foremost, an open data policy, and then we went ahead to develop a website to free up the data to the public.

We started gauging the interest of the public in the data we started liberating and freeing up. We noticed that in the early goings, the initiative of the City of Montreal was perceived as a compliancy issue. Everybody was doing it, so we also had to do it, especially because there was a lot of pressure from the bottom up. Now, almost three years after the development of this policy, we recognize that it's not a compliancy issue anymore; it’s a management-transparency issue. In other words, the city has to be transparent towards the citizens; therefore, we will and we do free up data that demonstrates how we use public funds.

Also, we will use the open data policy in order to develop solutions around three key issues we face on a daily basis in the urban environment. The first and foremost issue is transport. All of the future transport systems we are developing have built-in open data mechanisms and they will also have mechanisms for crowdsourced data, equally. It's one thing to have the public administration data, but where you get the synergy is if you join this data with that which comes from the citizens themselves.

The second issue we will concentrate on is sustainable development. There again, we have a lot of data, but we need to share that data—to help university students and help companies add value to that data; to develop policies and solutions to tackle most of the issues we will face in the future.

Last but not least is emergency services. Again, we have some data, but we need data provided to us by the citizens in order to build lasting solutions for this issue. How do we achieve that? On our end, we will tackle the three Ps that I always repeat to our people, to our citizens. First, we are always adapting our policy. Therefore, just like the City of Quebec and the Government of Quebec, we went with the Creative Commons 4.0 licensing. Second, we are reviewing all the processes of the city; therefore, existing systems and new systems have to have built-in open data mechanisms. They have to produce open data.

Finally, the greatest challenge of any public administration is to change the culture of its people. The data we have in our organization belongs to the people who are the different public servants in different services. Therefore, the greatest challenge we have is bringing all these people up to par with our policy decisions because a lot of the services are still very hesitant to free up their data.

On this, I will let Mr. Fortin, follow up with four specific areas where he's going to develop this idea.

Thank you.

9:20 a.m.

Jean-Pierre Fortin IT Strategy and Planning Advisor, Information Technology Service, City of Montreal

Thank you, Mr. Chitilian.

I will continue by speaking to the points of interest which were submitted to us when we were invited to appear. In the invitation, it was suggested that we talk about the needs of users and about socio-economic benefits, if there were any, as well as the ways different governments could collaborate. It was also suggested that we talk about best practices, and of the ones we felt were better than others.

But before doing so, I think it is important to specify certain statistics.

Up until now, the City of Montreal has released 107 datasets. They deal with subjects of interest to citizens, which Mr. Chitilian mentioned, including transportation, administrative data and services close to the people, such as sports, recreation, culture and so on.

As far as the needs of users and citizens are concerned, I believe that others before us have already said this, but we really have to insist on the fact that beyond the accessibility of data, citizens are asking for information and structured data so they can improve the way they use the services the city has to offer and the way they can access these services. Structuring the data of course depends on the availability of platforms. In this situation, we are not talking about platforms which would only receive data, but really about systems which make the way these data can be used understandable. Taken to its logical conclusion, we could even draw a parallel with environments involving business intelligence. Otherwise, these data would really be of interest to no one.

I also think that citizens need to feel that their city is transparent, and therefore their government as well. In return, we can hope that the public's cynicism towards its institutions would go down. So what we are talking about is creating and maintaining a relationship of trust.

As for the socio-economic benefits, I have just mentioned the first one, namely the feeling of belonging and of pride people have when they contribute to a more open society, one which is more dynamic and which makes sense. Another effect is that this creates bottom up work and initiatives, that is, initiatives which create value based on this data. For example, citizens could take the initiative and create applications for their fellow citizens.

If you do your job well and if you like your community, you can expect that a virtuous circle will develop. It would be a kind of ecosystem which includes a city that has data and makes it available, that includes supporters or creators of solutions who use these data, as well as informed and engaged citizens. So you would find yourself in what could be called a virtuous circle.

Regarding best practices, at the City of Montreal, we have always found inspiration in Europe's best practices. This includes both top down initiatives, where governments strongly participate, and, to the contrary, initiatives which strongly call on community involvement.

We also are clear on the fact that the British government contributes not only because it publishes wide ranges of datasets, but also because of the open quality of this data. We were recently consulting an index on open data of various governments, and Great Britain ranked first precisely because of the fact that its data is so open.

Which leads me to talk about the choice of licence.

As with our friends from Quebec City, whom I would like to recognize, and officials from the Government of Quebec and those of the other Quebec towns, we believe that this is an extremely open licence which has very few restrictions. It's the Creative Commons 4.0 licence, whose only requirement has to do with attribution.

In our opinion, it is essential that governments which want to work together agree on a licence which is as open as possible. Otherwise, even though there might be common standards, if the licence does not allow for combining data in a very general way, the work will be in vain. This is why we are working very hard to get all of Canada's public organizations to adopt a licence which is as open as possible and which, of course, comes with the fewest restrictions.

In addition, as far as collaboration is concerned, there are licences and standards, but in this case, as in other countries, the process is moving forward by trial and error. Everybody wants to do their own thing. On the other hand, we are witnessing a form of industrialization of all our processes, and because there are so many platforms, it will probably not be necessary for everyone to develop and maintain their own. Perhaps we can think about sharing these platforms, which would be defined based on common criteria and interests. At the end of the day, we might have super platforms, within which all public organizations could deposit their data. The level of interpretation of these data would largely exceed the level of interpretation of each order of government.

For example, it might be interesting for a Canadian citizen to not only know the extent to which people engage in recreational activities or use public transit, but also, generally speaking, to have an idea about the way in which Canadians engage in recreational activities in their hometowns. For that type of information to be available, the data would obviously have to be combined and integrated into common platforms.

I will stop here. I am ready to take your questions.

9:30 a.m.

NDP

The Chair NDP Pierre-Luc Dusseault

Thank you for sharing your expertise in this field with us.

We will now move on to questions from committee members. Since we are also talking with witnesses via videoconference, I would ask you to please indicate whom you are putting your questions to.

Mr. Ravignat, you have the floor for five minutes.

9:30 a.m.

NDP

Mathieu Ravignat NDP Pontiac, QC

Thank you, Mr. Chairman.

My first question would be for Madam Francoli.

You began your presentation by saying that open data is only one part of open government. This is a rather narrowcasted study but I would still like to ask you about it. Regarding the relationship with other segments of open government and open data, it's hard to view open data and only open data without talking more broadly about open government.

Do you have any thoughts with regard to other principles of open government and the relationship to open data?

9:30 a.m.

Prof. Mary Francoli

That's a good question.

We've actually defined or structured our action plan around different aspects of open government in Canada in a really rational way. In Canada we talk about open government in terms of open information, open data, and open dialogue. The open information bit of things relates more to access to information and it relates more to what we often hear referred to as unstructured data. That is, files that public servants might have on their computers and information that's generated more in a documentary form and not just in a raw data set. The open information bit is around that. The open data, obviously, we know; we've talked about that. The open dialogue bit is around engaging citizens in an ongoing and meaningful way. That's really necessary to a good open data strategy as well. All of those things really do work hand in hand.

9:30 a.m.

NDP

Mathieu Ravignat NDP Pontiac, QC

Is there an open dialogue piece going on right now with regards to data in the government's initiative, or have they abandoned that dialogue piece?

9:30 a.m.

Prof. Mary Francoli

No. I mean engagement in dialogue around the open data bit has actually been not too bad compared to some of the other areas of open government. There have been various consultations that have been held since we joined the Open Government Partnership. As part of our membership in that body, we had to have a consultation around the development of our first action plan. We had to have the consultation around the post-year-one self-assessment. We didn't do a good job at those consultations, and the government has acknowledged in its own self-assessment report that we didn't do a good job in citizen engagement around those.

On the open data side of things, it was actually a little bit better. The consultation process involved an online consultation, but it also involved a series of round table discussions where people were engaged on a face-to-face basis. So really, in terms of the various consultations we've done around different aspects of open government, open data is actually probably the strongest.

9:35 a.m.

NDP

Mathieu Ravignat NDP Pontiac, QC

Now at the same time you kind of have to find a way to build in continuous improvement with regard to the data sets and with regard to the what's available and the format they're in. At TBS I guess there are practices in place or structures in place that would allow that feedback mechanism to ensure that. Because the portal is rather young, and we'll give some benefit of the doubt to the government, which I'm not one to do often, but it's a rather young thing and it needs to improve, obviously, given your presentation and other witnesses we've heard. But in order to do that, you have to build in a kind of continuous improvement capacity. Is that currently going on?

9:35 a.m.

Prof. Mary Francoli

I think that's something that could definitely be strengthened. I think you raise a really good point to say that the data portal is really young, and our commitments in general to various aspects of open government are pretty young. So really we're in the second year of our commitments under our national action plan. We're trying to develop a second one. It's been a learning curve, and there's work to do for sure.

I think part of that work is developing a more ongoing dialogue between a range of different users. I heard academics mentioned a few times this morning. I'm really happy to be part of that discussion. It's always nice to get out of your office, certainly, to make your work meaningful, and not just say it's on the syllabus 100 times a day. But it's not just academics; it's people maybe you wouldn't think of at first blush. A lot of non-governmental organizations really want to be engaged in this dialogue around open data and tell the government how they think the data portal can be improved. Data scientists want to be involved in the conversation on a more ongoing way to say, “These are the problems we're seeing realistically with being able to use the data sets”, so some of the problems that I spoke to directly during my opening remarks.

Even though we've done what I think is a fairly good job with citizen consultation and engagement in relation to our particular open data commitments, that kind of mechanism for an ongoing and sustained dialogue amongst different actors is something that certainly needs to be developed in a much better way, and I think the government has acknowledged that. So right now, if you go to the data portal, you'll see that they're running a consultation on how to do consultations. So they're kind of asking people, “How do you think we can do this? How can we develop a mechanism for engaging people, and who needs to be part of that discussion?”

We're not the only government struggling with that. I mean I certainly heard that from other national governments at the OGP meetings in Dublin last week.

9:35 a.m.

NDP

The Chair NDP Pierre-Luc Dusseault

Thank you for your answers.

Mr. Trottier, you also have five minutes.

May 13th, 2014 / 9:35 a.m.

Conservative

Bernard Trottier Conservative Etobicoke—Lakeshore, ON

Thank you, Mr. Chairman.

I would like to thank the witnesses for taking part in this morning's meeting.

Mr. Deslauriers

and also Mr. Hollett,

You spoke about the importance of data in specific communities. People want to have data and information about what is happening in their neighbourhood. That's very important. I know that it was said that a great deal of data in the Government of Canada's portal were geospatial data. I think that's very important.

As you mentioned, Mr. Deslauriers, people want to have information on the crime rate in their area; they are not interested in macrodata.

Are there any problems with the way geospatial data are provided in the Government of Canada's portal and in all of the other portals? Is there anything we can do to help people get information on what is happening in their own neighbourhood, be it about the crime rate, the environment, transportation or the challenges of daily life?

I would like Mr. Deslauriers to answer first and then Mr. Hollett.

9:40 a.m.

Director of the Information Technology and Telecommunications Service, City of Quebec, As an Individual

Denis Deslauriers

I believe so. The closer you get to people's daily lives, the more they are interested. For example, during the month of August you could ask people who live in heat islands whether they would be interested in knowing how the heat spreads throughout their neighbourhood and what they could do to play down its effects. If you want to be an open government, it means that you have to be open to suggestions from citizens and to their involvement in the development and maintenance of any solutions.

So yes, I think that there is a way to do that. We already provide some information, but it's not complete. For instance, there is also data on heat which comes from the provincial or federal governments. It would be interesting to get this information at the same time in a simple manner.

When I looked at the Canadian site, I saw a TIFF map, but it was completely useless for me, since I am an ordinary citizen. Unless I am an expert, that is useless to me, because I cannot use the data. I don't have the ability to do that, even though I work in IT. You have to be able to easily use this data, to draw conclusions and to take measures at the community level with your neighbours and the people in your neighbourhood.

We need to find ways to help citizens easily consult this data and then act accordingly. The point is that people should be able to do something without asking government to fix the problem. Ideally, people should be able to do things on their own initiative which we would simply support. This would be much less onerous and much more sustainable in the long term.

9:40 a.m.

Conservative

Bernard Trottier Conservative Etobicoke—Lakeshore, ON

Mr. Hollett, maybe you could comment on that. You mentioned some of the things you're doing by community. I'm not sure how Newfoundland and Labrador defines a community. Is it latitude and longitude? There are certain challenges at the very micro-level to define what is the relative data for a specific location. I know all governments around the world must deal with this challenge. What's the approach that Newfoundland and Labrador has taken?

9:40 a.m.

Assistant Deputy Minister, Economics and Statistics Branch, Department of Finance, Government of Newfoundland and Labrador

Alton Hollett

It's not so hard for us, because we have 600 communities spread around 6,000 miles of coastline, and almost every one of them is separated from every other community. So basically our approach to defining a community is: it's Fogo, it's Harbour Grace, it's whatever. That hasn't been a problem for us at all.

That's one of the reasons we did the neighbourhoods. It's easy to do the communities themselves, but then people are interested in the larger areas, such as, if you know Newfoundland—some people here certainly do—the larger communities of Corner Brook, St. John's, and Clarenville. So we broke those into neighbourhoods of 1,000 population, and we did that with residents of those neighbourhoods as well, by the way. We had consultations with them and discussions. They essentially designed the neighbourhoods, but then we superimposed Statistics Canada geography over those so we could standardize the approaches, and so on, that we made.

I would like to just respond a little bit to some of the points that you made.

The geospatial aspect of it is critical and at our statistics agency we have very strong geospatial capacity there. One of the reasons it's so important is that I find in Newfoundland and Labrador—and I would predict that there's not much difference elsewhere—that what you said is absolutely right. Most people don't care about the macrodata. Obviously governments care, because we have to do our best to manage our economies and we have to have those indicators. But what really means a lot to people in communities and neighbourhoods is to take a look at data about themselves. We found that to be very....

The way we've always looked at it is that when you think about economic or social development, putting data into that equation is a very powerful new thing to add there. But if you deliver the data at the right level and in the right forums—we call it accessible, which means you can understand it and it's easy to get, easy to manipulate, that sort of thing—you actually turn people, who may not have much in the way of quantitative background at all, into experts. If I tell somebody in Arnold's Cove what the unemployment rate is in Arnold's Cove, they have a number that begins to put a dimension around their community. But they're experts on that, because they know what's happening to the guy next door. They know who's going to Alberta, they know who's working in the fish plant, and so on and so forth.

We've taken that kind of an approach for getting data into people's hands at the micro-level. I really believe that the buzzwords that we hear about evidence-based decision-making, and all that sort of thing. I think that the secret is moving in there at the micro-level and making data available to people in a way that it means something to them, so that they can actually start thinking that way and understand an awful lot more, and for that matter understand what governments are dealing with, because we have to deal with the realities. Lots of times people don't have enough information to know what the realities are and the dimensions of that in the same way that we do.

9:45 a.m.

NDP

The Chair NDP Pierre-Luc Dusseault

Thank you, Mr. Hollett. I'll stop you.

Mr. Trottier, your time is up.

Ms. Day, you have five minutes.

9:45 a.m.

NDP

Anne-Marie Day NDP Charlesbourg—Haute-Saint-Charles, QC

Thank you, Mr. Chairman.

I would like to thank all of the witnesses for their participation. We always appreciate their testimony.

My first questions are for Ms. Francoli.

On May 10th last, I was reading an article in La Presse. Amongst other things, the piece was about the poor quality of the search engine on the donnees.gc.ca website, and about the lack of communication following feedback from users. The piece also criticized the lack of transparency which came as a result of the elimination of the long-form census, in addition to the fact that the short-form census had cost taxpayers another $22 million. All of this showed that there were serious problems.

You remember that the Government of Canada participated in an international initiative to get governments to give more power to their citizens, to promote this fact, to improve transparency, to fight against corruption and to take advantage of new technologies in order to strengthen governance. This is the government's responsibility, and not that of citizens.

In your view, how does the Government of Canada fare compared to other G8 members in the race towards transparent data and as far as its commitment is concerned?