The observed 3.8 bps HL discount is not a mispricing. It is the
perp-spot basis —
the natural wedge between a perpetual futures contract and the spot market. Our initial comparison
conflated HL perpetual prices with Binance spot prices, producing a measurement that combines
structural premium components with any genuine cross-exchange divergence.
E[funding] — Expected
funding rate
payments. When longs dominate, they pay shorts to keep the perp tethered to spot.
Positive funding ⇒ perp trades at a premium; negative funding ⇒ discount.
The 8-hour funding cycle on both HL and Binance perps creates a sawtooth pattern in the basis.
inventory_premium — The compensation demanded by
market makers
for holding directional risk. When aggregate MM inventory is skewed long, ask prices widen;
when skewed short, bid prices widen. This component reflects the cost of warehousing risk
in a leveraged perpetual contract.
liquidation_pressure —
Liquidation
cascades create asymmetric selling pressure in perp markets. Forced closes are market orders
that consume liquidity at the worst possible time. The existence of this tail risk compresses
perp prices relative to spot, especially during high
open interest regimes.
ε — Transient microstructure noise: stale quotes,
clock drift between venues, bar alignment artifacts, and latency-induced phantom spreads.
This term should be mean-zero over sufficiently long windows.
Measurement confound: Our comparison was HL ETH-PERP vs Binance ETHUSDT spot.
To isolate genuine cross-exchange effects, we need four price series:
Differences ⇒ deposit/withdrawal friction, oracle lag
Key question: Is the basis stationary? Does it
mean-revert,
exhibit regime-switching (trending vs mean-reverting states), or correlate with
the funding rate cycle? If the basis is cointegrated with funding, it is not exploitable.
If it exhibits independent mean-reverting episodes, basis trades become viable.
An ADF test
on the basis time series, conditioned on funding state, is the first diagnostic.
The maker/taker asymmetry observed in HL trade data is textbook
adverse selection.
Every resting limit order is a free option granted to the market: an informed taker can pick off stale
quotes when they know the true price has moved. The maker's PnL is the residual after accounting for
this option cost.
Πmaker≈spread_capture+rebates−adverse_selection−inventory_costProfitable MM requires spread_capture + rebates > adverse_selection + inventory_cost
spread_capture — The half-spread earned on each fill. If the
bid-ask spread
is 2 bps and the maker is filled on one side, they capture ~1 bp per unit.
This is the gross revenue of market making.
rebates — Exchange-paid maker rebates. HL's fee schedule provides
rebates to liquidity providers, directly subsidizing the quoting function.
These rebates can be the difference between profitable and unprofitable MM.
adverse_selection — The expected loss from being picked off by
informed flow. Measured by
markouts:
the change in mid-price after a maker fill, measured at fixed time horizons.
This is the single most important metric for evaluating maker strategy quality.
inventory_cost — The cost of holding directional exposure.
Includes funding payments, margin opportunity cost, and the risk of adverse price moves
while the position is open. In volatile markets, this term dominates.
Markout Analysis
The critical metric is markouts. For each maker fill, compute the
mid-price
change at fixed horizons after the fill:
Short-term information. Most HFT adverse selection materializes here.
10s
mid(t+10s) − fill_price
Medium-term. Cross-exchange information propagation window.
30s
mid(t+30s) − fill_price
Informed flow signal. If markout is still negative here, the fill was truly toxic.
60s
mid(t+60s) − fill_price
Asymptotic. Should converge to the "true" adverse selection component.
If a maker buys at price p and the mid-price falls
at subsequent horizons ⇒ the maker was adversely selected.
The markout curve (plotting expected mid-change vs horizon) is the fingerprint of flow toxicity.
A healthy maker fill has positive markouts (mean-reversion); a toxic fill has persistently
negative markouts (trend continuation).
HL consistently shows deeper near-touch liquidity than venues with comparable volume.
This is not a coincidence — it follows directly from the
market maker's
loss function.
E[L]≈λ·Δt·σ·qExpected MM loss per quote cycle
λ — Toxic flow intensity.
The arrival rate of informed orders that trade against stale quotes.
Higher λ ⇒ more frequent adverse selection events.
Δt — Quote exposure latency.
The time between a maker observing a price signal and successfully cancelling or updating
their resting orders. This is the variable HL's architecture directly reduces.
σ —
Volatility.
The magnitude of price moves during the exposure window.
Higher vol ⇒ stale quotes are further from true price ⇒ larger losses per pick-off.
q — Quoted size.
The volume resting at each price level. Larger size ⇒ larger loss when picked off.
Makers balance quoted depth against expected adverse selection cost.
The Δt Mechanism
HL's
HyperBFT consensus
provides sub-second block finality with deterministic ordering. This directly reduces Δt —
the quote exposure window — compared to slower settlement systems.
Slow Cancel System (high Δt)
Makers quote wide spreads to compensate for stale quote risk
Quoted size is small (limit loss per pick-off)
Book is shallow near the touch
Depth exists far from mid (less likely to be picked off)
Effective spread for large orders is wide
Fast Cancel System (low Δt) — HL
Makers quote tight spreads (low pick-off risk per unit time)
Quoted size is large (can cancel before adversely selected)
Book is deep near the touch
Liquidity concentrates at best bid/ask
Effective spread for large orders is narrow
The relationship is multiplicative: halving Δt roughly halves the expected loss per quote cycle,
allowing makers to either halve their spread or double their quoted size at the same risk budget.
This is why execution-layer performance translates directly to observable book quality.
Cross-correlation analysis at 10-second resolution shows that Binance leads HL
(correlation 0.28) but HL also leads Binance (correlation 0.22). The gap is modest.
This is not a simple leader-follower relationship — HL contains
independent price information, not just lagged copying of the dominant venue.
0.28
Binance → HL (10s lag corr)
0.22
HL → Binance (10s lag corr)
0.06
Information Share Gap
Sources of HL's Independent Signal
Leveraged positioning flow — HL perps allow up to 50x leverage.
Traders expressing high-conviction directional views choose HL for capital efficiency.
These leveraged entries and exits reveal information about sentiment that does not
first appear on spot venues.
Liquidation information — The HL liquidation engine generates
forced market orders that are visible in trade flow before their price impact
propagates to other venues. Observing liquidation cascades on HL provides a leading
signal of impending volatility.
Crypto-native sentiment — HL attracts a different participant
base than Binance. DeFi-native traders, on-chain analysts, and protocol insiders
may preferentially trade on HL, embedding information that Binance's participant
base does not yet possess.
Inventory stress from HL makers — When HL market makers accumulate
large directional positions, their hedging activity on other venues transmits HL-originated
information to Binance and other CEXs. The inventory signal travels through the
inter-venue hedging channel.
Permanent/transitory decomposition; which venue adjusts to deviations from equilibrium
Same cointegration requirement; identifies the common efficient price
Hypothesis: HL's information share is between 0.15 and 0.35
(Binance holds the majority but HL is not negligible). If HL's share exceeds 0.25,
the venue generates enough independent signal to support cross-exchange strategies
that use HL flow as a leading indicator.
§5 — The Microstructure Smell: 2–4x Trade Count
HL records 2–4x the trade count per hour of Binance despite being a smaller venue by
notional volume. This is a strong microstructure anomaly that demands explanation.
Higher trade count at lower volume implies smaller average trade size — a fingerprint
of specific market-making behaviors.
Candidate Explanations
Quote-churn-heavy MM activity — Market makers on HL may be
using aggressive quote update strategies: cancel-replace cycles that generate many
small fills as the book reshuffles. Each quote update that partially fills before
cancellation registers as a trade.
Smaller fill sizes — If HL's matching engine splits large
incoming orders across many resting levels (or if resting order sizes are small),
a single aggressive order generates multiple trade prints. This would inflate
trade count without inflating volume.
Inventory recycling — MMs flattening inventory through
rapid round-trips. Buy 1 ETH, sell 1 ETH, repeat — each cycle generates
2 trades but zero net volume in terms of position change. High-frequency inventory
recycling is a sign of tight risk limits and fast hedging.
Rebate farming — If HL's fee structure rewards maker volume
with tiered rebates, participants may generate wash-like volume to reach higher
rebate tiers. This is not necessarily wash trading (which requires self-matching)
but can involve coordinated flow that inflates trade counts.
Diagnostic Metrics Needed
Metric
HL
Binance
What It Reveals
Median trade size (ETH)
TBD
TBD
If HL median is much smaller, confirms fragmentation
Trade size distribution (p10/p50/p90)
TBD
TBD
Shape reveals whether size is bimodal (retail + MM) or unimodal
Intertrade duration (median, ms)
TBD
TBD
Very short durations (<100ms) indicate algorithmic activity
Cancel-to-fill ratio
TBD
TBD
High ratio confirms quote-churn MM strategy
Queue lifetime (median, ms)
TBD
TBD
Short lifetimes confirm aggressive cancel-replace cycling
§6 — Timestamp Caveat
10 seconds of alpha in crypto is enormous.
Before believing any cross-exchange lead-lag signal at this timescale, every timestamp
artifact must be eliminated. A spurious 10-second lead is trivially generated by
clock drift, stale data, or bar alignment errors.
Required sanity checks before trusting any cross-exchange timing signal:
#
Check
Failure Mode
Mitigation
1
Synchronize clocks perfectly
Local machine clock drift vs exchange server clock
Use NTP-synced collection servers; verify drift < 1ms
2
Use exchange timestamps, not local receipt
Network latency varies by venue (HL websocket vs Binance REST)
Parse T field from exchange response; never use time.time()
3
Correct for bar alignment
10s bars starting at :00 vs :05 create phantom 5s leads
Align all bars to common epoch; use tick-level data where possible
4
Remove stale prints
Old trades reported late inflate lagged correlation
Filter trades where |server_time - trade_time| > threshold
5
Compare mids, not trades
Last-trade price lags mid-price; different venues have different last-trade staleness
Using bar close as a predictor of same-bar close on another venue
Strictly use bar(t) to predict bar(t+1); never same-bar comparisons
Only after all six checks pass should cross-correlation results be interpreted as genuine
information flow. The correct test is: does venue A's bar(t) predict venue B's
bar(t+1) out-of-sample, after accounting for venue B's own autoregressive structure?
§7 — Research Agenda
Ordered by expected value of information, highest first:
Estimate the fraction of efficient price innovation attributable to HL vs Binance.
Requires synchronized mid-price series at 1s frequency, VECM estimation, and Cholesky
decomposition of the innovation covariance matrix. Upper and lower bounds from permuting
the Cholesky ordering give the information share range.
2
Markout Curves (1s through 60s)
For every HL maker fill, track mid-price evolution at 1s, 5s, 10s, 30s, 60s.
Segment by: time of day, volatility regime, trade size, aggressor direction.
The markout curve shape reveals whether HL maker flow is predominantly informed or uninformed,
and at what horizon information is incorporated.
Volume-synchronized probability of informed trading adapted for HL's perpetual market.
Compute flow toxicity in volume-time (not clock-time) buckets. Rising VPIN preceding
liquidation cascades would confirm that toxic flow is detectable before cascades begin.
4
Basis-State Modeling
Model the perp-spot basis as a function of observable state variables.
Use the basis state vector (below) as features in a regime-switching model to identify
exploitable mean-reverting episodes vs trending basis regimes.
5
Event Studies (CPI, FOMC, ETF Headlines)
Measure information share dynamics around macro events. Hypothesis: HL's information
share spikes during crypto-native events (liquidation cascades, protocol announcements)
but drops during macro events (FOMC, CPI) where Binance's larger participant base
processes information faster.
6
Impulse Response / Overshoot / Reversion
Estimate the impulse response function from a structural VAR on HL and Binance returns.
Measure: overshoot magnitude, reversion half-life, and whether the permanent price
impact originates from HL or Binance. This reveals the causal structure of cross-exchange
price formation.
Basis State Vector
The following features define the microstructure state at any point in time. A model predicting
basis dynamics should condition on this vector:
Feature
Source
Frequency
Hypothesis
perp-spot basis
HL perp mid − Binance spot mid
1s
The target variable; should be mean-reverting conditioned on other features
funding rate
HL funding rate (current + predicted)
8h (interpolated)
Positive funding ⇒ basis should be positive; deviation ⇒ opportunity
order imbalance
HL (bid_volume − ask_volume) / total_volume
1s
Persistent imbalance predicts basis direction in the short term
liquidation intensity
HL liquidation volume / total volume (rolling 5min)
5min
High liquidation intensity ⇒ basis compression (forced selling)
Asymmetric depth predicts short-term price pressure direction
open interest change
ΔOI over rolling 1h window
1min
Rising OI + rising price ⇒ new longs; rising OI + falling price ⇒ new shorts
cross-venue spread
|HL best ask − Binance best bid| (and inverse)
1s
When cross-venue spread inverts, arbitrage flow normalizes the basis
Uncertainty: Whether the basis is sufficiently
stationary to model parametrically, or whether regime-switching dynamics dominate. Assumption: HL and Binance prices are cointegrated
at the instrument-matched level (perp-perp, spot-spot). Implication: If cointegration breaks during stress
events, all basis-trading strategies fail simultaneously — the tail risk is correlated
with the events where you most need the hedge to work.