SF persists most of its compute output to long-term storage so research and audit are reproducible.Documentation Index
Fetch the complete documentation index at: https://docs.simplefunctions.dev/llms.txt
Use this file to discover all available pages before exploring further.
R2 archives
Daily themed dumps to Cloudflare R2:data-dump-markets— market_price_snapshots, market_indicators, market_regimesdata-dump-compute— every compute table (33 of 37 tables)data-dump-thesis— thesis_videos, cross_venue_pairs, legislation_market_pairsdata-dump-obs— observability tables (cron_run_log, health_alerts, cost_events)
src/app/api/cron/data-dump-*. Bucket: sf-data-dump.
HuggingFace datasets
Public datasets athuggingface.co/datasets/SimpleFunctions/*:
sf-index-history— full SF Index history.sf-settled-markets— resolved markets (volume > $10K filter).sf-calibration-scorecards— model calibration over time.sf-world-state-history— daily world snapshots.
hf-export, hf-settled-markets, hf-sf-index-history, hf-calibration, hf-backfill-world-state. They write via lib/hf-client.ts which supports LFS for files >10 MiB.
Wayback / Common Crawl
wayback-submit-core and wayback-submit-answer push canonical SF pages to Wayback Machine for citation tracking. citation-monitor polls back to see who’s citing us.
Provenance
Every dump row carries thetrace_id from the producing cron. The dump_at timestamp is embedded in the file path.
Access
sf data-dump grant flow (TBD).
Next steps
Provenance
trace_id in every dump row.Surface map
Full table list and surface inventory.