I have been thinking a lot lately about the increasing importance of the "public data layer" -- meaning, data that we will need ("we" applied broadly: the general public, NGOs, government, scientists, journalists) to make sense of what's going on in an increasingly busy, but increasingly quantifiable, world. First, some of the drivers here. In general, there is more data being generated than ever before, much of which has a bearing on "public" issues. A few of the specific drivers include:
Increasing role of "platforms" in regulated spaces (transportation, health, finance, education, etc) -- these are enormous generators of data with direct and indirect bearing on public issues.
Sensors & IoT (publicly and privately owned) -- same as above.
Abundance of media -- as we have seen with the recent US election, the rise of social & independent media is democratizing but also problematic.
Personal health data -- the cost of gene sequencing is dropping like a rock, which will lead to an explosion of health data. This data will provide personal value but can also provide enormous societal value.
Why will this be important? Because all of these data have the potential to increase collective intelligence and societal knowledge. More specifically, we have the potential to redesign the way we make policy and handle regulation given these inputs. If we do this right, we can get smarter at policymaking, and design regulatory systems with both greater effectiveness and lower costs of implementation and compliance. So, what infrastructure will we need to handle and process all of this public data? It seems to be forming into a few broad categories:
Data pooling & analysis platforms -- tools and APIs that make sense of these data -- generic/foundational tools like Composable Analytics and Stae, and more specific, vertically-oriented projects & tools, like OpenTraffic and Aerostate.
"Regulation 2.0" platforms -- specifically designed to facilitate a data-driven policymaking and regulatory process -- for example, MeWe, Airmap, SeamlessGov.
Foundational and application-layer blockchains -- on the pure tech side, this is the most interesting area of development. Blockchains give us both public data access and data integrity in a way that's not been possible before. Much of the focus is still on "foundational" blockchains like Bitcoin, Ethereum, Tezos and Zcash, but eventually this technology will reach the application layer and we'll have more explicitly "public" applications. I also expect that blockchains and Regulation 2.0 platforms will get ever closer and ultimately merge.
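To make the data-integrity point concrete, here's a minimal sketch in plain Python -- not any real blockchain implementation, and the sensor records are made up -- of the core idea that chaining hashes makes a public record tamper-evident:

```python
import hashlib
import json

def block_hash(record: dict, prev_hash: str) -> str:
    """Hash a record together with the previous block's hash."""
    payload = json.dumps(record, sort_keys=True) + prev_hash
    return hashlib.sha256(payload.encode()).hexdigest()

def build_chain(records):
    """Chain records so each block commits to everything before it."""
    chain, prev = [], "0" * 64  # genesis value
    for r in records:
        h = block_hash(r, prev)
        chain.append({"record": r, "prev": prev, "hash": h})
        prev = h
    return chain

def verify(chain) -> bool:
    """Recompute every hash; editing any earlier record breaks the chain."""
    prev = "0" * 64
    for b in chain:
        if b["prev"] != prev or block_hash(b["record"], prev) != b["hash"]:
            return False
        prev = b["hash"]
    return True

chain = build_chain([{"sensor": "air-quality-14", "pm25": 12.1},
                     {"sensor": "air-quality-14", "pm25": 48.7}])
assert verify(chain)
chain[0]["record"]["pm25"] = 5.0   # quietly "fix" a published reading
assert not verify(chain)           # the tampering is now detectable
```

Real blockchains add consensus, incentives, and replication on top, but this is the kernel of why they're interesting for public data: anyone can check that the record hasn't been quietly rewritten.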
That's the vision -- where it seems clear that we are heading, and where we need to head. So, the more important question is, how will we actually get there? A bunch of questions/thoughts on my mind are:
Broad vs narrow? Strikes me that we will see the most traction in narrow applications first -- the thin edge of the wedge, that solves a concrete problem. Also, the "personal data layer" hasn't arrived in one broad platform either.
Open standards + distribution magnets: dating back to my work around open transit data, a key learning was that open standards need distribution magnets. The thing that got transit agencies to publish data in the open GTFS format was Google Maps.
Portal access vs. real access -- the natural tendency of data owners is to offer access via silos and portals (e.g., Uber Movement). This is something, but it's not the real thing -- the more important question is how to get actual data moving.
Government isn't the only audience: public data is of course useful for policymaking and regulation, but it's equally important for scientific research and journalism. These areas could end up being the initial leaders.
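Part of why GTFS became the distribution magnet it did is that the format itself is radically simple: a feed is just a zip of plain CSV files that anyone can parse. A minimal sketch (the stop IDs and names below are made up for illustration):

```python
import csv
import io

# A GTFS feed is a zip archive of CSV files; stops.txt is one of the
# required files, with stop_id, stop_name, stop_lat, stop_lon columns.
# A tiny, made-up excerpt:
stops_txt = """stop_id,stop_name,stop_lat,stop_lon
101,Main St & 1st Ave,40.7410,-73.9896
102,Main St & 2nd Ave,40.7423,-73.9887
"""

stops = list(csv.DictReader(io.StringIO(stops_txt)))
for s in stops:
    print(s["stop_id"], s["stop_name"], float(s["stop_lat"]), float(s["stop_lon"]))
```

No SDK, no license key, no portal -- which is exactly the kind of "real access" the previous point argues for.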
People often ask me how I ended up working in venture capital, and more specifically in a role that deals with policy issues ("policy" broadly speaking, including public policy, legal, "trust & safety", content & community policy, etc.). Coming from a background as a hacker / entrepreneur with an urban planning degree, how I ended up here can be a little puzzling. The way I like to describe it is this: From the beginning, I've been fascinated with the "experience" of things -- the way things feel. Things, meaning products, places, experiences, etc. I've always been super attuned to the details that make something "feel great", and I'd say the overriding theme through everything I've done is the pursuit of the root cause of "great experiences".

From there, I naturally have been drawn to design: the physical construction of things. I love to make and hack, and I geek out over the minor design details of lots of things, whether that's the seam placement on a car's body panels, or the design of a crosswalk, or the entrance to a building, or the buttery UI of an app. Design is the place where people meet experience.

But over time, I came to realize something else: what we design and how we design it is not an island unto itself. It's shaped -- and enabled, and often constrained -- by the rules and policies that underlie the design fabric. That's true for cars, parks, buildings, cities, websites, apps, social networks, and the internet. The underlying policy is the infrastructure upon which everything is built. This first really hit me right after college (16 years ago now), when I was reading Cities Back from the Edge: New Life for Downtown
I've been struck recently by the power and surprise of unintended consequences. For example, a recent Slate article digs into the flip side of the life-saving potential of automated vehicles: our reliance on car-crash deaths for organ donors:
"An estimated 94 percent of motor-vehicle accidents involve some kind of a driver error. As the number of vehicles with human operators falls, so too will the preventable fatalities. In June, Christopher A. Hart, the chairman of the National Transportation Safety Board, said, “Driverless cars could save many if not most of the 32,000 lives that are lost every year on our streets and highways.” Even if self-driving cars only realize a fraction of their projected safety benefits, a decline in the number of available organs could begin as soon as the first wave of autonomous and semiautonomous vehicles hits the road—threatening to compound our nation’s already serious shortages." [#]
Or, with gene editing, what if we are successful at eradicating illness and preserving life forever? What new challenges will that present? How will we eat? How will we not consume all of earth's natural resources? Or perhaps the life-saving potential will ultimately be canceled out by the life-harming potential -- it's clearly just as possible to use gene editing to weaponize mosquitoes as it is to sterilize them. Or, with the democratization of media -- on the one hand radically increasing freedom of expression, but also laying the foundation for the "fake news" problem. I don't think anyone who believed in the power of social networks to enable free speech and political organizing online really saw that coming, and it's a real, hard problem. Or, with artificial intelligence -- how do we avoid being blinded by the shiny newness of helpful automation while ignoring potential existential threats? Bill Gates has weighed in on this:
Mechanisms for identifying and amplifying truth -- this is a tough but important one. We have two problems, in parallel: first, how do we discern truth from untruth? And second, how do we give truth the attention it needs to "win"? The big platforms like Facebook are experimenting with this now, and we'll likely see more tools and services that help here.
, a book chronicling the revitalization of many smaller downtowns across America, written by my old friend Norman. Before I started the book, my main thinking was: "I want to be an architect, because architects design places." Norman had told me, "you don't want to be an architect," but I didn't believe him. I distinctly remember, about halfway through the book, having an a-ha moment, where I scrawled in the margin: "I don't want to be an architect! I want to do *this*!" -- where *this* was engaging in the planning and community engagement process that ultimately shaped the design. It hit me that this is where the really transformative decisions happened. I spent the next three years at Project for Public Spaces (PPS), working on the design of public spaces across the US (including Times Square and Washington Square Park in NYC), with an emphasis on the community process that shaped the policies, that would shape the design, that would determine the experience. The goal was all about experience, but the guiding philosophy at PPS was that you got to great experience by engaging at the people/community/policy level, and letting the design grow from there.

Being a hacker and builder, I've always been drawn to computers and the internet. During my six years leading the "labs" group at OpenPlans, a now-shuttered incubator for software and media businesses at the intersection of cities, data, and policy, I made a similar journey -- from experience, to design, to policy -- but this time focused on tech & data policy and the underpinnings of that other world we inhabit: the internet. I started out building product -- head in the code, focused on the details -- and emerged focusing on issues like open data policy, open standards, and how we achieve an open, accessible, permissionless environment for innovation. The most satisfying achievement at OpenPlans was working with NYC's MTA (which operates the buses and subways) to open up its data.

So the common thread is: great places (physical AND virtual) are a joy and a pleasure to inhabit. Creating them and cultivating them is an art more than a science, and is a result of the Experience ↔ Design ↔ Policy dynamic. To apply this idea a little further to the web/tech world: I think of the "policy" layer as including public policy issues (like copyright law or telecom policy) which affect the entire ecosystem, but also -- and often, more importantly -- internal policy issues, like a company's mission/values, community policies, data/privacy policies, API policies, relationship to adjacent open source communities, etc. These are the foundation upon which a company (or community, in the case of a cryptocurrency) is built, and the more thoughtfully and purposefully these are designed, the easier time the company/community will have in making hard decisions down the road. So if you think of companies like Kickstarter, or Etsy, or DuckDuckGo (all USV portfolio companies), they've invested considerable effort into their policy foundations. But it's not just "feel good" or "fuzzy bunny", mission-driven companies that this applies to. USV portfolio company Cloudflare announced yesterday that they've been fighting a National Security Letter from the FBI, under gag order, since 2013, in order to protect their users' data, reinforcing their longstanding commitment to their users. This **very hard** decision was borne directly from the hard work they did at the founding of the company, to ground their activities (and the subsequent design of their product, and the experience they provide to their users) in foundational policy decisions. Or look at all the trouble that Twitter has been having recently combating the abuse problem. Or Facebook with the fake news problem. Policy in the spotlight, with a huge impact on product, design, and experience. Or look at the internal turmoil in the Bitcoin and Ethereum communities over the past 12 months as they've dealt with very difficult technical/political decisions. Lucky for us, there is so much innovation in this space, and every new cryptocurrency that launches is learning from these examples -- take Tezos, an emerging cryptocurrency that explicitly ships with mechanisms to handle future governance issues (democracy, coded). So I guess the purpose of this post is to draw that through line, from Experience, to Design, to Policy, and show how it actually shapes nearly everything we encounter every day. What a profound and exciting challenge.
"I am in the camp that is concerned about super intelligence," Gates wrote. "First the machines will do a lot of jobs for us and not be super intelligent. That should be positive if we manage it well. A few decades after that though the intelligence is strong enough to be a concern. I agree with Elon Musk and some others on this and don't understand why some people are not concerned." [#]
All of these consequences are made more serious by the fact that in a connected world, change can take place very quickly, and it can be hard, or impossible, to manage or roll back. A single person in a single place now has more power to impact the world (the whole world!) than ever before. As Kevin Esvelt, the geneticist who is the subject of the New Yorker article linked above, said: "as a single scientist, I can alter an organism in a laboratory that will have more of an effect on all your lives than anything the legislature across the river can do." [#] So what to do? These kinds of changes are coming (seemingly) faster than ever. I like Esvelt's suggestion that, in the case of gene editing, we should be building "undo" functionality into anything we deploy:
"With CRISPR and gene-drive technology, it might be possible for just one engineered mosquito, or fly, or any other animal or seed, to eventually change the fundamental genetics of an entire species. As Esvelt puts it, “A release anywhere could be a release everywhere.” Recognizing the possibility of an irreversible error, however, he and Church, in their earliest experiments, began to build drives capable of restoring any DNA that had been removed. Both say that if an edit cannot be corrected it should not be attempted. They also suggest retaining, in its original form, some part of any population that has been edited—a kind of molecular Noah’s Ark." [#]
That's one approach that seems reasonable and will hopefully be effective, at least in some cases. But for most of what we're doing there is no natural "undo" function, so we must think about other ways to manage -- or at the very least, quantify and understand -- the consequences of what we're making.