Loading
Settings
1
1

A set-theoretic view of Ethereum coteries

5 USDC if you know who ^ is without googling it

by mike
friday – october 20, 2023

Acknowledgements
Special thanks to Data Always, Tim, Justin, Barnabé, Thomas, stokes, Vitalik, Danny, Izzy, Davide, Toni, & Dankrad for discussions and comments! :-)

tl;dr; Many different entities compose Ethereum’s consensus layer. In a recent Bankless episode, Danny presented a framework of four groups: app-layer users, holders, stakers, and node operators. He claimed that a healthy ecosystem has a clear distinction between each of these groups and that none of them are controlled by a single actor.

We explore this mental model further by presenting each group as a set and considering different relations between the sets. We begin by establishing the preliminary character list and notation, which we use to present four observations (that follow directly from definitions) and four desiderata (that are not guaranteed but are nice-to-haves). With this groundwork, we describe four “simplified” cases (labeled Cases 1-4) and four “extended” cases (labeled Cases 5-8) to unpack the structure of the protocol.

The simplified cases deal with the relative sizes of the base groups, whereas the extended cases focus on more targeted situations: (a) the difference between decentralized staking pools and centralized staking providers, (b) the implications of minimum viable issuance, (c) the concerns around an application that grows “too big to fail”, and (d) how restaking introduces another layer into the protocol. This article raises more questions than it answers, but the mental model seems useful as we consider the design space of the consensus layer.

Article Description
Minimum viable issuance Anders’ mega-thread on MVI
How Lido Threatens Ethereum Danny’s Bankless episode
Concerns around centralization of stake Izzy’s analysis of hyptothetical distributions


Acronyms

source expansion
NOs node operator(s)
UASF user-activated soft fork
CEX centralized exchange

Preliminaries

Before diving in, let’s lay a bit of the groundwork. In this article, we describe each group of network participants as belonging to a set. Danny’s recent Bankless episode inspired this framing and it seems worth expanding. To begin, we consider the following groups.

Participants

Note: We don’t include the set of people who run nodes but don’t stake.

Notation

It would be more accurate to talk about the cardinalities of these sets, but since this is not a very formal piece, I am just going to abuse this notation for readability’s sake and to avoid writing |A| everywhere. Throughout the article, we mainly define the cardinality of each set as the “number of unique individuals” who constitute each group. When comparing node operators and solo stakers, we use the “consensus layer size” of each, implying that a single node operator could appear larger than the entire set of solo stakers because they control more validators than all the solo stakers combined. Lastly, we consider node operators as holders in that they represent ETH delegated through them, but we acknowledge that the node operators do not own all the ETH that they stake.

Grounding observations

Given these definitions, it’s useful to explicitly note a few relationships that directly follow.

Observation 1; UsersHolders
Since each holder is a user (even if their only use is to hold ETH), this relation is always true. We could also write this as UsersHolders (i.e., users are a superset of holders).

Observation 2; HoldersStakers
Since each staker is a holder (they either stake the ETH themselves or receive ETH to stake on behalf of the true owner), this relation is always true. We could also write this as HoldersStakers (i.e., holders are a superset of stakers).

Observation 3; StakersNO,NONOs
Since the total set of stakers is composed of pools (which delegate to node operators) and solo stakers, we can say that the stakers are larger than any single node operator. We will explore the situation where a single node operator comes to represent a large proportion of the total stakers. We could also write this as StakersNO,NONOs (i.e., stakers are a superset of each node operator).

Observation 4; StakersSolo
Since the total set of stakers is composed of pools (which delegate to node operators) and solo stakers, we can say that the stakers are larger than the solo stakers. This statement says nothing about the relative size of solo stakers versus other node operators. We could also write this as StakersSolo (i.e., stakers are a superset of solo stakers).

Desiderata

Beyond the four observations above, we can also identify four corresponding outcomes that are “desirable” from the protocol perspective. These are ~by no means~ guaranteed, but rather what we intuitively design for in a healthy ecosystem.

Desiderata 1; UsersHolders
The simplest goal is that the set of people interacting with dApps, NFTs, stablecoins, DeFi, etc., is a much larger set than the collection of participants holding significant amounts of ETH (as adoption increases, it seems reasonable to assume that Users would grow faster than Holders). This may become increasingly true as we move towards a fee-abstracted world where a user doesn’t need to hold ETH to pay for gas. Note that we don’t specifically focus on “read-only” users who consume blockchain data, but rather are more concerned with those who transact in the Ethereum ecosystem in some way.

Desiderata 2; HoldersStakers
The ratio between holders and stakers corresponds to the “proportion of ETH supply staked” – 22% and counting as of October 2023. This value plays an important role in the Ethereum protocol. Too low of a value (e.g., <1%) presents the clear issue of insufficient economic security (it is too cheap, in ETH terms, to attack the network). Conversely, too high of a value (e.g., >99%) may have second-order effects that are hard to predict. Staking limits (e.g., through an issuance curve that approaches negative infinity as the staked supply increases) and MEV burn (which also reduces the net issuance by diminishing the MEV rewards) are the main “arrows in the quiver” to achieve this outcome – see Vitalik’s Paths towards single-slot finality for additional ideas.

The exact impact of having nearly the entire supply staked (HoldersStakers) is uncertain. One issue it presents is on the social governance layer. With most of the supply locked in the consensus layer and a majority of users interacting only with derivate versions of ETH, the staking pool DAOs or DeFi protocols that issue these derivatives have immense power in the protocol. Another negative aspect is the elimination of the “medium of exchange” property of ETH the asset. While ETH still behaves like “collateral money” and LSTs denominated in ETH preserve the asset’s “unit of account” nature, the lack of circulation could pose real threats. Additionally, with a large majority of the ETH supply staked, there is no ETH that could be deployed to counteract a malicious consensus-layer actor that controls a majority of the stake.

Desiderata 3; StakersNO,NONOs
From a consensus perspective, the protocol security is improved if the set of Stakers is significantly larger than any individual NO (to prevent finality delays, reorg attacks, strong censorship, etc.). It’s important to note that some node operators have a coordination layer between them, while others may be completely independent. One of the main points of disagreement with regards to staking pools, using Lido for example, is whether to treat them as a single node operator with 32% of the stake or 31 distinct operators with around 1% stake each. There are reasonable arguments on both sides and this article isn’t aimed at addressing that discussion. On the other hand, centralized staking providers, using Coinbase for example, are best understood as single node operators with 1019% of the total stake.

Desiderata 4; Solo
This simply states that we want the set of solo stakers to be far from non-empty. It seems likely that solo stakers will only ever constitute a relatively small portion of the total stakers (current estimates are around 5% of the total stake), but solo stakers do represent a much larger portion of the total nodes in the system.

With this framework, let’s examine a few hypothetical distributions. We start with four “simplified” cases (labeled Cases 1-4). We call them simplified because they only focus on the sets we have defined so far and follow from the observations above. We then analyze four “extended” cases (labeled Cases 5-8). Each of the extended cases explores a more realistic aspect of the staking ecosystem; the goal of these thought experiments is to tease out how the simplified model can be made more realistic. We conclude with a set of open questions, each associated with one of the cases.

Four “simplified” cases

Case 1
UsersHoldersStakers;StakersNOs;Solo

“Balanced” (best outcome)


Case 2
UsersHoldersStakers>NO1;NO1NO2,NO3,Solo

“Winner-take-most” (medium outcome)


Case 3
UsersHoldersStakers;StakersNOs;Solo

“Full supply for staking pool distribution” (medium outcome)


Case 4
UsersHoldersStakers>NO1;NO1NO2,NO3,Solo

“Too big” (bad outcome)

Four “extended” cases

While the above cases are (hopefully) easy to follow and intuitive, they lack a bit of grounding in reality. Each of the next four cases extends this model to better reflect what we are seeing today and what we might expect in the coming years. None of these examples aim to be comprehensive either, they just add some nuance (again… hopefully lol).


Case 5 – Staking protocols vs centralized staking providers

“Decentralized staking protocol centralized staking provider”


Case 6 – Minimum-viable issuance and solo-stakers

“Solo stakers are priced out by falling rewards resulting from minimum-viable issuance”


Case 7AppX getting too big to fail

AppX is too big to fail, making them the ultimate arbiter of consensus-layer truth.”


Case 8 – Restaking finding mass adoption

“All ETH is restaked, adding another layer of delegation and incentive (mis)alignment.”

Open Questions