# 4844 Data Challenge Wishlist ## P2P and node analysis * Pre vs post 4844 launch p2p analysis in relation to: * Mesh propagation * Gossipsub IWANT/IHAVE * Sync * Peering * Request/response * Anything that changes that you would expect to remain stable or changes in unexpected ways * How do blob sidecars propagate to the network? * Timing analysis * Regional analysis (e.g. Sydney vs Berlin) * Mesh propagation vs IWANT/IHAVE vs request/response * What does blob TX gossip look like? * Peering -- how does 4844 affect peering, discovery, scoring, clustering, etc? Assymetries depending on node resources? How about client type? * DHT analysis -- does the upgrade impact the DHT health, distribution, node count? * Impact on minimum requirements to run a node? To run a validator? To sync a node? What types of low-resource machines and configurations can handle the additional load, which ones cannot? ## Chain and market analysis * Which protocols are using blobs? Frequency, distribution, and other patterns * Analysis of do blob-tx (type 3) usage -- placement in block, gas usage in addition to data, operation complexity, etc * Blob data usage * Wasted blob space tracker -- how much of the 128kb is 'zero' * Blob-sharing, are there emergent markets for sharing blobs for efficiency? [(context)](https://twitter.com/dapplion/status/1727728292747256204?s=61&t=0DRnRlakpNdFGGFYSqq29Q) * Blob custering -- single consumer spanning across multiple blobs * Blob data decomposition -- diffs, similarity tests, spectral decomps, any structured way to understand and compare disparate blobs * Mempool * Transaction timing collusion and cooperation. How do large consumers spread their requests/txs in relation and in reaction to eachother? * Do blob-txs hit the public mempool or get inserted in other places in the MEV pipeline (e.g. builders) * Market * Is there a relationship between data-gas and normal gas -- data-gas vs normal gas price, count, complexity, etc * Roll-up market analysis -- costs of posting to L1, blob counts, mainnet blob-spacing, L2 block batching, etc * General analysis of Ethereum tx market before and after 4844 launch * Consensus-layer * How does the introduction of blobs affect attestation performance, missed blocks, sync committees, etc * What does 4844 do to validator incentives? * Is there interesting insight to gain about chain/validator performance in relation to hobbiests, pools, regions, etc