Data Inventory

Complete catalog of trade data stored in Backblaze B2. Nine datasets spanning Hyperliquid L1 on-chain data, Binance spot, Bybit perps, and derived market data.
Backblaze B2 b2:jthor-trade-data/ 9 Datasets Apr 2020 – May 2026
1.59 TB
Total archive size
263,322
Objects in bucket
$9.54/mo
Storage cost
$0
Egress (3x/day free)

Coverage Timeline

Horizontal coverage per dataset, April 2020 through May 2026 (38 months total span).
Binance Spot Trades
Jan 2020 – May 2025 (5.4 yr)
Bybit Perp Trades
Mar 2020 – May 2025 (5 yr)
HL L2 Book
Apr 2023 – May 2026 (3.1 yr)
HL Asset Contexts
May 2023 – May 2026 (3 yr)
HL Trades (node_trades)
Mar – Jun 2025 (92 d)
HL Fills (node_fills)
May – Jul 2025 (64 d)
HL Fills by Block
Jul 2025 – May 2026 (293 d)
2020 2021 2022 2023 2024 2025 2026

Dataset Summary

All datasets in b2:jthor-trade-data/ ordered by size.
Dataset Files Size Coverage Format Granularity
Binance Spot Trades 828 888.1 GB Jan 2020 – May 2025 CSV Monthly + daily, 18 symbols
Bybit Perp Trades 13,477 209.8 GB Mar 2020 – May 2025 GZ CSV Daily, 11 symbols
HL L2 Book 237,656 165.7 GB Apr 2023 – May 2026 LZ4 NDJSON Hourly per coin, 10 coins
HL Fills by Block 7,022 164.4 GB Jul 2025 – May 2026 LZ4 NDJSON Hourly, batched by L1 block
HL Fills (node_fills) 1,507 29.6 GB May – Jul 2025 LZ4 NDJSON Hourly
HL Trades (node_trades) 1,542 14.1 GB Mar – Jun 2025 LZ4 NDJSON Hourly
HL Asset Contexts 1,077 8.1 GB May 2023 – May 2026 LZ4 CSV Daily
Coherence Research 213 794 MB Various Anomaly detection datasets

Dataset Details

Schema, unique fields, and notes for each dataset.

HL Fills (node_fills)

29.6 GB
1,507 files May 25 – Jul 27, 2025 (64 days) LZ4 NDJSON, hourly

Per-wallet fill data from HL L1. Each record is one fill event attributed to a specific wallet address, with realized PnL, position tracking, and maker/taker classification.

Unique value: wallet addresses, per-fill PnL, maker/taker flag
[wallet, {coin, px, sz, side, time, closedPnl, fee, dir, crossed, startPosition, hash, tid, oid, ...}]

HL Trades (node_trades)

14.1 GB
1,542 files Mar 22 – Jun 21, 2025 (92 days) LZ4 NDJSON, hourly

Matched trades from HL L1. Unlike fills, each record shows both sides of the trade — the maker and taker are both visible in the side_info array.

Unique value: both parties visible in side_info array
{coin, side, time, px, sz, hash, side_info: [{user, startPosition, dir, ...}, ...]}

HL Fills by Block

164.4 GB
7,022 files Jul 27, 2025 – May 15, 2026 (293 days) LZ4 NDJSON, hourly

Successor to node_fills. Fills grouped by L1 block number with block timestamps. Continuous live stream that is still actively recording.

Unique value: fills grouped by block, continuous live stream
{local_time, block_time, block_number, events: [...fills...]}

HL Asset Contexts

8.1 GB
1,077 files May 20, 2023 – May 15, 2026 (1,092 days / 3 years) LZ4 CSV, daily

Daily snapshots of all HL perp market context: funding rates, open interest, oracle prices, mark prices, and volume. Covers entire HL history.

Unique value: funding rates, open interest, oracle prices
time, coin, funding, open_interest, prev_day_px, day_ntl_vlm, premium, oracle_px, mark_px, mid_px

HL L2 Book

165.7 GB
237,656 files Apr 15, 2023 – May 15, 2026 (1,127 days / 3.1 years) LZ4 NDJSON, hourly per coin

Full order book snapshots (bid/ask depth) for 10 coins. Hourly files per coin, enabling order flow analysis, liquidity measurement, and spread decomposition.

Coins: BTC, ETH, SOL, HYPE, XRP, DOGE, AVAX, SUI, AAVE, WIF
{time, levels: [[{px, sz, n}, ...], [{px, sz, n}, ...]]} // [bids, asks] full depth

Binance Spot Trades

888.1 GB
828 files Jan 2020 – May 2025 (5.4 years) CSV (monthly + daily), 18 symbols

Every spot trade on Binance for 18 USDT pairs. Monthly bulk archives plus daily files for recent data. The largest dataset by far.

Symbols: BTCUSDT, ETHUSDT, SOLUSDT, XRPUSDT, DOGEUSDT, BNBUSDT, AVAXUSDT, SUIUSDT, AAVEUSDT, WLDUSDT, WIFUSDT, ENAUSDT, TRUMPUSDT, VIRTUALUSDT, POPCATUSDT, MOODENGUSDT, BERAUSDT, PENGUUSDT
trade_id, price, qty, quote_qty, time, is_buyer_maker, is_best_match

Bybit Perp Trades

209.8 GB
13,477 files Mar 2020 – May 2025 (5 years) GZ CSV, daily, 11 symbols

Daily perpetual futures trade data from Bybit. Compressed CSV files per symbol per day. Provides a third venue for cross-exchange comparison alongside Binance and HL.

Symbols: BTCUSDT, ETHUSDT, SOLUSDT, XRPUSDT, DOGEUSDT, BNBUSDT, AVAXUSDT, SUIUSDT, AAVEUSDT, WIFUSDT, HYPEUSDT
timestamp, symbol, side, size, price, tickDirection, trdMatchID, grossValue, homeNotional, foreignNotional

Coherence Research

794 MB
213 files Open source anomaly detection datasets Various formats

Benchmark datasets for anomaly detection model development and validation.

Contains: credit_card, NAB, NASA turbofan, PhysioNet ECG, SKAB, UCI power

Binance Derived DOWNLOADING

TBD
In progress on Maryland instance BTC + ETH primary Multiple formats

Supplementary market data for basis/funding analysis. Currently being downloaded and will be uploaded to B2 on completion.

1m klines (spot + UM perps): BTC, ETH, ETHBTC
Mark price klines (UM): BTC, ETH
Premium index klines (UM): BTC, ETH
Funding rates (monthly): BTC, ETH
L2 bookDepth (UM): BTC, ETH (from 2023)

Cross-Exchange Field Mapping

How fields align across the three exchanges for unified analysis.
Concept HL Fills HL Trades Binance Spot Bybit Perps
Price px px price price
Size sz sz qty size
Timestamp time (ms) time (ms) time (us) timestamp (ms)
Side / Aggressor side (B/A) side is_buyer_maker side (Buy/Sell)
Symbol coin coin Symbol (e.g. BTCUSDT) symbol
Notional px * sz px * sz quote_qty foreignNotional
Trade ID tid hash trade_id trdMatchID
Wallet / User wallet side_info[].user
PnL closedPnl
Maker/Taker crossed is_buyer_maker tickDirection

Storage Cost

Backblaze B2 pricing for the archive.

Monthly Cost Breakdown

Storage (1.59 TB @ $6/TB/mo)$9.54
Class B transactions$0.00
Class C transactions$0.00
Egress (3x storage/day free)$0.00
Total$9.54/month

Free Egress Budget

Daily free download3x stored = 4.77 TB/day
Full archive download~8 hours (at free tier)
B2 S3-compatible APIYes
RegionUS West
Redundancy11 nines durability
← Back to project overview