Historical Options-Chain Data Vendors — Raxx Use-Case Comparison
Status: research-only. This document does NOT constitute legal or tax advice. Before executing any data licensing agreement, consult a licensing attorney and a CPA licensed in Pennsylvania on COGS treatment. Last updated: 2026-04-23. Vendor prices and terms sourced as of that date — SaaS pricing and redistribution terms change frequently; verify freshness before any financial commitment.
TL;DR
Alpaca Algo Trader Plus (your current $90.75/mo subscription) does NOT solve the blocker. Historical options data on Alpaca goes back only to February 2024 — roughly 26 months of history as of today. The markov-fit backtest requires Jan 2007 – Dec 2025 (~18 years, covering 2008–09 stress, 2020 COVID, 2022 bear). The blocker is real.
ORATS at the $199/mo tier (Data API + historical) is the best-fit option for Raxx's
strategy scope, with genuine 2007-to-present coverage, purpose-built iron-condor
backtest tooling, and a confirmed use by spintwig (the practitioner ORATS cites as
the closest analog to Application A in markov-fit-analysis.md). However, ORATS's
standard terms prohibit redistribution of raw data to third parties under a
single-user license. A commercial/SaaS arrangement requires a separate enterprise
agreement — that conversation must happen with ORATS's sales team before any
licensing commitment. Do not assume the $199/mo retail API covers SaaS use.
The second-best path, if ORATS's enterprise terms are prohibitive, is Databento OPRA for raw chain history (historical pay-as-you-go) combined with ORATS for the backtester-as-a-service model (ORATS backtester runs on their servers; Raxx presents results — different license surface than redistributing raw data). A licensing attorney should review both paths before contracting.
1. Executive Summary
Top recommendation: ORATS Data API (historical tier)
- Price to investigate: $199/mo (historical) + potentially a one-time $2,000 data export for the full 2007-to-present archive — amounts are from third-party sources and must be verified directly with ORATS; see §2 for sourcing.
- Why it fits: 2007-to-present options chain history; purpose-built iron condor
backtest engine; same vendor used by spintwig (the exact practitioner source in
markov-fit-analysis.md§6.3). Coverage verified for SPX/SPY/mega-cap universe. - Hard gate: Standard retail API terms appear to restrict use to a single user for internal business purposes only, prohibiting redistribution to third parties (including Raxx's end users). Enterprise/commercial licensing terms are not publicly disclosed. Must be negotiated before adoption.
Runner-up: Databento OPRA historical (pay-as-you-go)
- Price to investigate: Historical data pay-as-you-go (pricing per dataset/query); live data (not needed for backtest-only use case) starts at $199/mo Standard plan. Historical archive starts March 28, 2023 — insufficient on its own for the 2007-to-present backtest requirement.
- Why it's a runner-up: Raw tick-level OPRA data, clean API, transparent pricing. History only goes to March 2023, meaning a multi-source strategy (Databento for recent + historicaloptiondata.com or CBOE DataShop for 2007–2022) would be needed. More assembly work but potentially cleaner commercial terms for internal backtesting use vs. redistribution.
- Hard gate: Databento distinguishes "internal use" from "redistribution." Using their data to run Raxx's MBT backtester internally (not displaying raw chain data to users) may qualify as internal use. A licensing attorney should assess.
Alpaca re-investigation verdict
Your current $90.75/mo Algo Trader Plus subscription does not solve the blocker.
Historical options data on Alpaca covers only from February 2024 forward. The
2007–2019 training window specified in markov-fit-analysis.md §3, Application A,
requires 13 years of data unavailable through Alpaca. Additionally, Alpaca's support
documentation explicitly states that API data cannot be redistributed for business
purposes — so even if history were adequate, Raxx's SaaS redistribution use case
would require a separate commercial arrangement.
Total cost of ownership — worked estimate
The table below uses ORATS at the $199/mo historical API tier as the scenario. These are estimates pending enterprise pricing confirmation.
| Users | Current infra cost/mo | Add ORATS ~$199/mo | New break-even (approx) | Delta from 14–22 baseline |
|---|---|---|---|---|
| At break-even (current) | $242–$392/mo fixed | +$199/mo | ~26–34 users | +12 users |
| 10 users | loss | +$19.90/user share | deeper loss | not material at this scale |
| 100 users | $13.53/user margin | −$1.99/user share | ~$11.54/user margin | −$2/user |
| 250 users (cap) | ~$16.30/user margin | −$0.80/user share | ~$15.50/user margin | −$0.80/user |
Key takeaway: At scale (100+ users), a $199/mo data license costs under $2/user and is easily absorbed by the existing margin. It shifts break-even from ~14–22 users to ~26–34 users — a meaningful but not fatal change. The real constraint is the license negotiation, not the dollar amount.
These figures use the cost structure from
docs/business/founders-pro-pricing-rationale.md§2. Verify current ORATS enterprise pricing before incorporating into the model. Bring to the CPA for COGS classification.
2. Vendor Matrix
Prices marked with † were sourced from third-party review articles, community sources, or indirect search-result summaries as of 2026-04-23. Verify each directly with the vendor before any commitment. Prices marked with ‡ were sourced from the vendor's own documentation pages as indexed in search results as of this date.
| Vendor | History depth | Coverage for Raxx use case | Price (retail/individual) | Price (enterprise/SaaS) | API shape | Redistribution posture | Primary source |
|---|---|---|---|---|---|---|---|
| ORATS | 2007 – present (EOD)†; intraday from Aug 2020† | SPX, SPY, 5,000+ equity options; IVR, IV surface, greeks†; iron condor backtest engine built-in† | $99/mo (entry)† / $199/mo (historical)† / $299/mo (pro)† | Not publicly listed — must contact sales | REST API; flat file export option†; hosted backtester† | Standard terms: internal use, single user, no redistribution‡; enterprise terms unknown — must inquire | https://orats.com/data-api; https://orats.com/terms-conditions |
| Alpaca Algo Trader Plus | Feb 2024 – present ONLY‡ | Real-time SIP, options chain, greeks; insufficient for 2007 backtest‡ | $90.75/mo (current subscription)† | Not applicable — same terms | REST + WebSocket‡ | Explicitly prohibits redistribution for business purposes‡ | https://docs.alpaca.markets/docs/historical-option-data; https://alpaca.markets/support/redistribute-alpaca-api |
| Databento OPRA | Mar 28, 2023 – present‡ (10+ yrs of additional history announced May 2025 — verify) | Full OPRA consolidated feed, all 17 US options exchanges, tick-level NBBO‡ | Historical: pay-as-you-go; live $199/mo Standard‡ (as of Jun 3, 2025) | Plus $1,399/mo; Unlimited $3,500/mo†; annual contracts | REST API; flat file (DaaS)‡ | Distinguishes internal use vs. redistribution; SaaS backend (internal analytics) may qualify as internal use†; confirm with attorney | https://databento.com/blog/introducing-new-opra-pricing-plans; https://databento.com/blog/subscriber-status |
| CBOE DataShop | Not publicly disclosed; official CBOE source | Options EOD summary, intraday intervals, Cboe-exchange data†; full market coverage unclear | EOD Summary: pricing not publicly listed; enterprise/subscription required; Intraday Open-Close data cited at $2,000/mo† | Not applicable — all enterprise-grade | Flat file download / subscription†; no retail REST API | CBOE's own data; typical terms for institutional use; commercial use requires direct agreement | https://datashop.cboe.com/option-eod-summary; https://datashop.cboe.com/ |
| OptionMetrics IvyDB US | 1996 – present (academic†) | Daily options prices, IV, greeks for US equity + index options†; widely cited in academic literature† | Academic only — priced per institutional access†; commercial pricing: bespoke, contact sales† | Enterprise / commercial: negotiated, no public price† | WRDS platform or direct delivery†; no self-serve REST API† | Academic license ≠ commercial license†; commercial deployment requires dedicated agreement — treat as enterprise-tier | https://optionmetrics.com/data-products/; https://wrds-www.wharton.upenn.edu/pages/about/data-vendors/optionmetrics/ |
| OptionsDX | SPX: free for older years, $0 – $20 per dataset†; depth varies by symbol† | SPX, VIX, SPY, BTC; EOD and intraday to minute-level†; greeks, IV†; small universe relative to Raxx's equity-universe scope | Per dataset: $0 (older, free)† – $20 (recent)†; no recurring API subscription† | Not available — dataset purchase only, no API† | Flat file download (CSV)†; no API† | Licensing terms not publicly disclosed in search results — confirm commercial use before relying on this for production backtest | https://www.optionsdx.com/; https://www.optionsdx.com/product/spx-option-chain/ |
| Polygon.io | Full OPRA tick history (depth unclear for pre-2018 period)†; generally strong for recent data | All US equity options, greeks, IV†; broad universe | Stocks + Options + FX bundle: $449/mo†; options-only tiers unclear | Enterprise with redistribution license: must contact sales; retail API explicitly prohibits commercial redistribution‡ | REST API + WebSocket‡; flat file option† | Standard terms explicitly prohibit commercial or business use and redistribution‡ — enterprise redistribution license required; treat as a red flag until enterprise terms confirmed | https://polygon.io/legal/market-data-terms-of-service; https://polygon.io/business-options |
| Tradier | Current chain data; options history: available via OCC option symbol lookup†; depth and history length not documented† | Real-time chain + greeks via ORATS integration†; limited or no deep historical chain archive | Free sandbox; subscription for brokerage API†; market data pricing not published† | Not positioned for historical backtest use† | REST API†; brokerage-API-oriented | Not positioned as a data redistribution vendor; greeks/IV powered by ORATS (underlying vendor is ORATS)† | https://documentation.tradier.com/brokerage-api/markets/get-options-chains; https://docs.tradier.com/docs/historical-data |
| IEX Cloud | Discontinued — service shut down 2024‡ | N/A — service no longer operational | N/A | N/A | N/A | N/A | https://www.saasworthy.com/product/iex-cloud — service discontinued |
| Bloomberg B-PIPE | Comprehensive; decades of history† | Complete — but irrelevant at Raxx scale | $31,980/yr per terminal†; B-PIPE $50,000–$200,000+/yr† | Enterprise only; no retail path | Proprietary terminal / API† | Enterprise agreements — far beyond Raxx scale | https://costbench.com/software/financial-data-terminals/bloomberg-terminal/ |
| Refinitiv (LSEG) Eikon | Comprehensive† | SPX iron condor example published by LSEG†; enterprise | $3,600–$24,000/yr†; commercial API separately priced | Enterprise only† | LSEG Developer API†; REST | Enterprise licensing required — far beyond Raxx scale | https://developers.lseg.com/en/article-catalog/article/finding-expired-options-and-backtesting-a-short-iron-condor-stra |
| FactSet | Comprehensive† | Enterprise financial data | $12,000/yr†; commercial API separately priced | Enterprise only† | REST / proprietary | Enterprise licensing — far beyond Raxx scale | https://costbench.com/best/best-financial-data-terminals/ |
| historicaloptiondata.com (DeltaNeutral) | 2002 – present (EOD)†; SPX back to 1990† | SPX, SPY, individual equities; EOD; greeks levels 1–3†; legitimate backtest source† | Per dataset: SPX $3.85/mo single symbol†; annual subscription available†; price for full equity universe unknown | Not applicable — dataset purchase only; no API† | Flat file download (CSV)† | Commercial use terms not disclosed in search results; confirm before production use | https://historicaloptiondata.com/product/allspx-sp-500/ |
3. Use-Case Fit Analysis
Raxx's specific requirements
From agent_prompt.txt and markov-fit-analysis.md:
- Strategy: Iron condor, credit spread, cash-secured put
- Underlyings: SPX/SPY (index), QQQ, IWM, mega-cap equities (AAPL, MSFT, etc.)
- Parameters: IVR > 50; 30–45 DTE; 15–20 delta; credit ≥ 30% width; 50% profit take; 2× stop; 21 DTE exit
- Backtest window: Jan 2007 – Dec 2025 (training: 2007–2019; test: 2020–2025)
- Data needed per trade: strike, expiry, bid/ask, greeks (delta, IV), underlying
price — consistent with NBBO mid fill modeling per
mbt-paper-trading-engine.md§4 - Frequency: EOD snapshots at a minimum; intraday beneficial for fill simulation
Vendor fit by requirement
| Requirement | ORATS | Databento OPRA | DeltaNeutral/historicaloptiondata | OptionsDX |
|---|---|---|---|---|
| History to 2007 | YES† | NO (starts Mar 2023‡) | YES (SPX to 1990†; equities to 2002†) | Partial — older years free; not full universe† |
| SPX/SPY coverage | YES† | YES‡ | YES† | YES (SPX, SPY explicitly†) |
| Broad equity universe (AAPL, MSFT, etc.) | YES (5,000+ symbols†) | YES (full OPRA†) | Partial — individual equity datasets vary† | NO — small curated list† |
| EOD chain with greeks and IV | YES (proprietary indicators†) | YES (NBBO, greeks via OPRA†) | YES (L2/L3 options†) | YES† |
| Intraday data | YES (Aug 2020+†) | YES (tick-level†) | NO — EOD only† | YES (minute-level†) |
| Iron condor backtester | YES (built-in, 300M+ pre-computed†) | NO — raw data only | NO — raw data only | NO — raw data only |
| API access (not flat file only) | YES† | YES‡ | NO — flat file† | NO — flat file† |
| Retail price under $500/mo | YES ($199/mo retail†) | YES (historical PAYG†) | YES (per dataset, under $50/mo for SPX†) | YES† |
| Commercial SaaS redistribution | UNKNOWN — must negotiate† | UNCLEAR — internal-use distinction†; needs attorney review | UNKNOWN — terms not disclosed† | UNKNOWN — terms not disclosed† |
Conclusion: For the markov-fit backtest (Application A, markov-fit-analysis.md),
ORATS is the highest-fit single vendor. It has the history depth, the iron condor
tooling, and the exact track record (spintwig backtests). The gap is exclusively on
the commercial SaaS redistribution license. DeltaNeutral/historicaloptiondata is a
viable raw-data supplement for pre-2023 history if Databento is the primary API layer.
OptionsDX is useful only for SPX/SPY-specific backtests; it does not cover the broader
equity universe needed for a production MQ-A signal.
4. License-Risk Summary
Understanding OPRA redistribution fees (applies to Databento, Polygon.io, and any OPRA-sourced vendor)
The Options Price Reporting Authority (OPRA) charges fees for redistribution of their consolidated options data. Key thresholds as of 2026-04-23:
- Redistributor flat fee: $1,500/mo to anyone who displays OPRA data externally to customers, in an app, tool, or website. Applies regardless of user count. Source: https://www.marketdata.app/education/options/opra-fees/
- Per-user display fee (real-time, non-professional): $1.25/user/mo. Source: https://www.marketdata.app/education/options/opra-fees/
- Per-user display fee (professional): $31.50/user/mo. Source: https://www.marketdata.app/education/options/opra-fees/
- Historical / delayed data (>15 min): No per-user display fee from OPRA. Source: https://www.marketdata.app/education/options/opra-fees/
- Non-display (backend) use: Still requires licensing even if data is not shown to users. Source: https://www.marketdata.app/education/options/opra-fees/
Implication for Raxx's MBT backtester: If Raxx uses historical options chain data exclusively for internal backtesting — running simulations on the server, never displaying raw chain data to users — this may qualify as "non-display" use rather than "redistribution." Non-display use still requires licensing from OPRA (or the OPRA- accredited vendor), but avoids the $1,500/mo redistributor fee and per-user display fees. A licensing attorney must confirm this interpretation before any contract.
Vendor-specific license flags
| Vendor | License risk level | Specific flag |
|---|---|---|
| ORATS (retail tier) | HIGH — standard terms prohibit SaaS redistribution | Terms explicitly state "internal business purposes only" and "not to distribute/redistribute to third parties"‡; single-user license per term‡; enterprise terms unknown. Do not adopt without enterprise agreement. |
| Polygon.io (standard plans) | DISQUALIFIED for commercial use under standard terms | Terms explicitly prohibit "business or commercial purposes" and redistribution under standard plans‡; enterprise plan may be available but requires contact with sales; cost unknown; startup discount offered† |
| Alpaca Market Data | HIGH — redistribution prohibited | Support doc states "cannot redistribute Alpaca API data for business purpose"‡; applies to options chain data |
| Databento OPRA | MEDIUM — internal use vs. redistribution distinction may help | Distinguishes internal use from redistribution†; historical data PAYG for backtesting is a cleaner use case†; SaaS backend analytics may qualify as internal; attorney review required before contract |
| CBOE DataShop | MEDIUM — enterprise posture; terms negotiated | No retail self-serve path; institutional licensing required; likely cleaner IP posture since CBOE owns the data source; but higher cost floor |
| OptionMetrics IvyDB | MEDIUM — commercial license must be separate from academic | Academic license explicitly ≠ commercial license; commercial pricing bespoke; contact required |
| OptionsDX | UNKNOWN | No public terms found in research; confirm before use |
| historicaloptiondata.com | UNKNOWN | No public commercial terms found; confirm before use |
| Bloomberg / Refinitiv / FactSet | LOW risk but HIGH cost — enterprise agreements standard | Enterprise licensing is the default; terms clear; simply out of reach at Raxx's current scale |
Red flags to raise with a licensing attorney before any contract
- Does Raxx's use of historical options chain data to run MBT's backtest engine qualify as "internal non-display use" under OPRA licensing rules, avoiding the $1,500/mo redistributor fee?
- Do ORATS's enterprise/commercial terms permit Raxx to run backtests on behalf
of paying users (i.e., user triggers
mbt.run_backteston Raxx's server, ORATS data is consumed server-side, and the backtest result — not raw data — is returned to the user)? - Does Polygon.io's business/enterprise tier permit the same server-side-only use case, and what is the enterprise price?
5. Alpaca Re-Investigation
What we currently pay for
- Subscription: Algo Trader Plus at $90.75/mo (confirmed from dashboard 2026-04-23;
see
docs/business/founders-pro-pricing-rationale.md§9.1) - Included per documentation:
- Real-time SIP data (stocks + options)
- Real-time options chains with greeks
- Historical bars and trades (stocks: multi-year; options: Feb 2024 – present only)
- 10,000 API calls/min
- Unlimited WebSocket symbols
The hard gap
Alpaca's historical options data begins February 2024 only. Source: https://docs.alpaca.markets/docs/historical-option-data
The markov-fit backtest requires:
- Training window: Jan 2007 – Dec 2019 (from markov-fit-analysis.md §3, Application A)
- Test window: Jan 2020 – Dec 2025
- Minimum: ~18 years of EOD options chain snapshots for SPX/SPY and liquid equities
Alpaca provides ~26 months of historical options data (Feb 2024 – Apr 2026) against a requirement of ~216 months. This is an 88% data gap by time period.
The redistribution constraint
Alpaca's support documentation explicitly states: "You cannot redistribute Alpaca API data for business purpose." Source: https://alpaca.markets/support/redistribute-alpaca-api
Even setting aside the history gap, the standard Alpaca terms prohibit the commercial redistribution scenario implied by Raxx's SaaS model. Note: "redistribution" in the context of MBT's internal backtester (where options data is consumed server-side and a backtest result is returned to the user) may or may not constitute "redistribution" under Alpaca's specific definition — a question for a licensing attorney. However, given the history gap is fatal on its own, this is a secondary concern.
Verdict
The blocker is real. Alpaca Algo Trader Plus does not solve it. A supplemental historical options data source is required for any backtest window predating February 2024. The existing $90.75/mo subscription remains valuable for:
- Real-time options quotes and chains (for MBT's live paper-trading fill simulation)
- Real-time market data for MQ-A regime signal inputs (VIX, SPY)
- Post-Feb-2024 historical options data as supplemental confirmation data
But it cannot replace a dedicated historical options data vendor for the markov-fit backtest or any backtesting covering market stress periods prior to 2024.
6. Cost Delta to Raxx Pricing
Base case from docs/business/founders-pro-pricing-rationale.md §2:
- Break-even (lean fixed costs): ~14 users
- Break-even (with attorneys/CPA active): ~22 users
- Margin at 100 users: ~$13.53/user/mo
- Margin at 250 users (Founders cap): ~$16.30/user/mo
Scenario: Add ORATS at $199/mo (retail historical tier, pending enterprise confirmation)
This is an illustrative estimate only. Enterprise pricing may differ materially.
| Metric | Before ORATS | After ORATS +$199/mo | Delta |
|---|---|---|---|
| Fixed costs (lean) | $242/mo | $441/mo | +$199 |
| Fixed costs (attorneys active) | $392/mo | $591/mo | +$199 |
| Break-even (lean) | ~14 users | ~$441 / $17.95 = ~25 users | +11 |
| Break-even (attorneys) | ~22 users | ~$591 / $17.95 = ~33 users | +11 |
| Margin at 100 users | $13.53/user | $13.53 − $1.99 = $11.54/user | −$1.99 |
| Margin at 250 users | $16.30/user | $16.30 − $0.80 = $15.50/user | −$0.80 |
At 250 users (the Founders cap), a $199/mo data license costs $0.80/user/mo — well within the existing margin. It shifts Founders break-even from ~14–22 users to ~25–33 users. That is material at the pre-launch stage but manageable once the cohort reaches 30+ paying users.
This analysis is sensitive to enterprise pricing. If ORATS's enterprise/commercial tier (required for SaaS use) costs $500/mo or more, the numbers change:
| Enterprise price | Break-even (lean) | Margin at 100 users |
|---|---|---|
| $199/mo | ~25 users | $11.54/user |
| $299/mo | ~30 users | $10.54/user |
| $500/mo | ~41 users | $8.54/user |
| $1,000/mo | ~69 users | $3.54/user |
At $1,000/mo+ (Databento Plus tier), the economics become strained at the Founders $19/mo price point. At $500/mo or below, the model remains viable.
Flag for the CPA: A data licensing fee for a SaaS product is typically classified as COGS (cost of goods sold) on a gross-margin view of the business, similar to Heroku and Alpaca. Confirm classification and treatment for the PA LLC before committing.
7. Questions for Attorney and CPA
Questions for a licensing attorney (IP / contract focus)
-
OPRA non-display vs. redistribution classification. If Raxx's MBT engine consumes historical options chain data server-side — no raw data is displayed to users; only computed backtest results (P&L, win rate, drawdown) are returned — does this qualify as "non-display use" under OPRA's fee schedule, avoiding the $1,500/mo redistributor fee? Cite: https://www.marketdata.app/education/options/opra-fees/
-
ORATS enterprise terms review. If Raxx executes ORATS's enterprise/commercial agreement, what specific provisions would govern: (a) running backtests on behalf of paying users (server-side computation, results returned); (b) caching ORATS data in Raxx's own database for repeated backtest access; (c) displaying derived metrics (e.g., "regime state" probabilities derived from ORATS IV data) in Antlers? Which of these uses requires a redistribution license vs. an internal-use license? Source: https://orats.com/terms-conditions
-
Polygon.io business/enterprise tier review. Same questions as #2 for Polygon.io's enterprise tier, if ORATS terms are not acceptable. Polygon.io's standard terms explicitly prohibit commercial use and redistribution; is the enterprise tier's scope sufficient for Raxx's use case, and at what price? Source: https://polygon.io/legal/market-data-terms-of-service
-
Alpaca redistribution clarification. Does Alpaca's prohibition on "redistributing for business purposes" apply to: (a) raw options chain data passed through to a user's browser, or (b) server-side use of data to simulate fills in MBT? The distinction matters for the MBT real-time fill simulation path (not the backtest path). Source: https://alpaca.markets/support/redistribute-alpaca-api
Questions for a CPA (PA-licensed, SaaS/digital-goods focus)
-
COGS classification for data licensing fees. Is a monthly options data licensing fee (e.g., ORATS at $199–$500/mo) appropriately classified as COGS or as an operating expense for a Pennsylvania LLC treated as a disregarded entity or S-Corp? How does this classification affect the gross margin view in
founders-pro-pricing-rationale.md? -
Variable vs. fixed cost treatment. A data licensing fee that does not scale with user count (fixed monthly fee) is a semi-fixed cost, similar to Alpaca. Confirm whether the IRS and PA FTB treat a flat-rate data subscription differently from a per-unit COGS for purposes of Schedule C or Form 1120-S.
-
Prepaid data archive classification. If Raxx purchases a one-time historical data archive (e.g., ORATS one-time export cited at ~$2,000† for full 2007-to-present history) in addition to a monthly subscription, is that a capital expenditure to be amortized or an expensed operating cost? What amortization schedule applies?
8. Recommendation
This section presents research findings only. It does NOT constitute legal or business advice. Take this packet to a licensing attorney and CPA before any contract commitment.
Research finding: A two-step process is required
Step 1 — Commercial license confirmation (licensing attorney, 1–2 weeks). Before any vendor selection can be finalized, a licensing attorney must confirm whether Raxx's specific use case (server-side backtest computation, results returned to users, no raw data displayed) qualifies as "internal use" or triggers "redistribution" fees under each candidate vendor's terms. This is the true blocker — not the dollar amount.
Step 2 — Vendor selection (after license path is confirmed).
If ORATS enterprise terms permit server-side SaaS use at under ~$500/mo, ORATS is
the recommendation:
- Coverage: 2007 – present, matching the backtest requirement exactly
- Quality: same data source used by spintwig (the practitioner reference in
markov-fit-analysis.md §6.3)
- Built-in tooling: iron condor backtester with 300M+ pre-computed runs available
- Integration path: REST API aligns with Raxx's Flask/Celery stack
If ORATS enterprise terms are not viable (prohibitive cost or licensing posture), the fallback path is: 1. Databento OPRA for historical data Mar 2023 – present (PAYG, pending attorney confirmation of "internal use" classification) 2. DeltaNeutral / historicaloptiondata.com for 2002–2022 archive (flat file, per-dataset purchase, pending confirmation of commercial use terms) 3. These two sources would provide coverage from 2002 to present at potentially lower total cost, at the expense of more data assembly work.
IEX Cloud: Disqualified — service has shut down. Polygon.io (standard): Disqualified under standard terms for commercial use. Tradier: Not a historical data provider; powered by ORATS underneath; not a path. Bloomberg / Refinitiv / FactSet: Out of scope at current scale. OptionMetrics IvyDB: Academic posture; commercial license bespoke; investigate only if ORATS and Databento both fail. OptionsDX: SPX/SPY only; no API; flat-file only; acceptable for narrow SPX-only backtest validation but not for the full Raxx equity universe.
Summary of required actions
- Engage Matthew Crosby (IP, already engaged — per
reference_attorneys.md) or a contract/licensing attorney to review ORATS's enterprise terms and confirm the OPRA non-display use classification. Timeline: before any data vendor contract. - Request ORATS enterprise pricing. Contact: https://orats.com/faq (contact form); mention SaaS backtesting use case, ~250 end users, server-side only computation. Do not sign the standard retail API agreement for SaaS use.
- Bring the cost delta analysis (§6 above) to the CPA for COGS classification before incorporating the data line item into Raxx's pricing model or Founders Pro economics.
- Once license path is confirmed, file an implementation card with
product-managerto obtain ORATS test credentials, validate data coverage against the 2007–2025 backtest spec, and integrate the historical chain feed intombt.run_backtest.
9. Jurisdiction Flags
- Federal (OPRA): OPRA fee schedule is set by the Options Price Reporting Authority (a national market system plan under SEC oversight). The $1,500/mo redistributor fee and per-user display fees are federal in nature — they apply regardless of Raxx's state of incorporation. Source: https://www.opraplan.com/
- Delaware / Pennsylvania: Entity formation state does not change OPRA fee applicability. It is a federal market data licensing regime.
- EU / International: Most US options data vendors (ORATS, Databento, CBOE DataShop) do not explicitly flag EU data-transfer restrictions in publicly available terms. However, if Raxx accepts EU users, GDPR Article 46 (data transfers to third countries) may interact with vendor data processing agreements. This is a separate flag from OPRA licensing. Confirm with the licensing attorney before onboarding EU users.
10. Timing / Deadlines
No hard regulatory deadlines apply directly to vendor selection. However:
- Dependency on markov-fit backtest (from
markov-fit-analysis.md): Application A (Regime-Aware Entry Gate) is Priority 1. The data licensing review is the stated hard blocker before any backtest work begins. Resolving this within 2–4 weeks unblocks ~3–4 weeks of data-scientist backtest work, then 3–4 weeks of feature development. Total estimated delay to regime-gate feature: ~2–3 months from today if licensing confirmation is prompt. - ORATS pricing freshness: Vendor SaaS pricing changes without notice. The $199/mo figure for the historical API tier comes from third-party review sources (FlashAlpha, Bullish Bears) and must be verified directly with ORATS before any contract.
- Databento pricing freshness: The June 3, 2025 pricing restructure (Standard at $199/mo, live data only; historical PAYG preserved) is sourced from Databento's own blog but should be re-verified since it was published before today's date.
Sources
| Source | URL | Used for |
|---|---|---|
| ORATS Data API page | https://orats.com/data-api | Pricing tiers, historical depth, API description |
| ORATS Terms & Conditions | https://orats.com/terms-conditions | Redistribution and single-user license terms |
| ORATS University — Historical Data | https://orats.com/university/historical-data | Historical data description |
| ORATS Backtest API guide | https://docs.orats.io/backtest-api-guide/strategies.html | Iron condor backtest methodology |
| Alpaca Historical Option Data docs | https://docs.alpaca.markets/docs/historical-option-data | Feb 2024 history start date |
| Alpaca support — redistribution policy | https://alpaca.markets/support/redistribute-alpaca-api | Business redistribution prohibition |
| Alpaca About Market Data API | https://docs.alpaca.markets/docs/about-market-data-api | Algo Trader Plus tier description |
| Databento OPRA pricing changes | https://databento.com/blog/introducing-new-opra-pricing-plans | Jun 2025 Standard plan ($199/mo), Plus/Unlimited tiers |
| Databento subscriber status | https://databento.com/blog/subscriber-status | Internal use vs. redistribution distinction |
| Databento OPRA dataset page | https://databento.com/datasets/OPRA.PILLAR | OPRA historical start date (Mar 28, 2023) |
| Databento options overview | https://databento.com/options | Options data capabilities |
| CBOE DataShop | https://datashop.cboe.com/ | Official CBOE data source; product list |
| CBOE DataShop — Option EOD Summary | https://datashop.cboe.com/option-eod-summary | EOD product description |
| CBOE DataShop — Options Summary Monthly | https://datashop.cboe.com/options-summary-monthly-subscription | Monthly subscription product |
| marketdata.app — OPRA fees explainer | https://www.marketdata.app/education/options/opra-fees/ | $1,500/mo redistributor fee; $1.25/user NP display fee; historical data exemption |
| OPRA home | https://www.opraplan.com/ | Regulatory authority for options consolidated data |
| SEC OPRA fee schedule filing | https://www.sec.gov/files/rules/sro/nms/2025/34-104267.pdf | 2025 fee amendment |
| OptionMetrics data products | https://optionmetrics.com/data-products/ | Product description; commercial licensing note |
| OptionMetrics via WRDS | https://wrds-www.wharton.upenn.edu/pages/about/data-vendors/optionmetrics/ | Academic vs. commercial licensing distinction |
| OptionsDX home | https://www.optionsdx.com/ | Free / low-cost SPX historical data |
| OptionsDX SPX product | https://www.optionsdx.com/product/spx-option-chain/ | $0–$20 per dataset pricing |
| Polygon.io Market Data Terms of Service | https://polygon.io/legal/market-data-terms-of-service | Explicit prohibition on commercial use and redistribution |
| Polygon.io Business Options | https://polygon.io/business-options | Enterprise redistribution path |
| Polygon.io pricing | https://polygon.io/pricing | Pricing tier structure |
| Tradier Get Options Chains docs | https://documentation.tradier.com/brokerage-api/markets/get-options-chains | ORATS-powered greeks; chain data |
| IEX Cloud status (discontinued) | https://www.saasworthy.com/product/iex-cloud | IEX Cloud service shutdown |
| Bloomberg Terminal pricing | https://costbench.com/software/financial-data-terminals/bloomberg-terminal/ | $31,980/yr terminal; B-PIPE enterprise floor |
| LSEG iron condor backtest example | https://developers.lseg.com/en/article-catalog/article/finding-expired-options-and-backtesting-a-short-iron-condor-stra | Enterprise Refinitiv path for iron condor backtesting |
| spintwig SPX 45-DTE iron condor backtest | https://spintwig.com/short-spx-iron-condor-45-dte-s1-signal-options-backtest/ | Practitioner reference using ORATS data |
| FlashAlpha vs. ORATS comparison 2026 | https://flashalpha.com/articles/flashalpha-vs-orats-options-data-api-backtesting | ORATS pricing context ($199–$299/mo); history depth |
| historicaloptiondata.com (DeltaNeutral) | https://historicaloptiondata.com/ | 2002+ history; SPX back to 1990; per-dataset pricing |
| DeltaNeutral SPX product | https://historicaloptiondata.com/product/allspx-sp-500/ | $3.85/mo single symbol SPX reference |
| Bullish Bears ORATS review | https://bullishbears.com/orats-option-scanner-review/ | $99/$299/mo individual/pro pricing reference |
| Databento — market data licensing intro | https://databento.com/blog/introduction-market-data-licensing | Licensing framework overview |
| Internal: markov-fit-analysis.md | docs/data-science/markov-fit-analysis.md | Backtest spec; data requirements; ORATS call-out |
| Internal: mbt-paper-trading-engine.md | docs/architecture/mbt-paper-trading-engine.md | Fill model; data needs; MQ-A integration points |
| Internal: multi-tenant-alpaca.md | docs/architecture/multi-tenant-alpaca.md | Alpaca scope; one shared server-side key |
| Internal: founders-pro-pricing-rationale.md | (agent-a7301c16de2386345 worktree) | Cost structure; break-even analysis; $90.75/mo Alpaca confirmed |
Before acting on any vendor recommendation in this document, consult: - A licensing attorney (IP / contract focus) for OPRA classification and vendor terms review (Matthew Crosby is currently engaged for IP — confirm whether data licensing is in scope, or identify a contract specialist). - A CPA licensed in Pennsylvania for COGS classification of a recurring data licensing fee and one-time archive purchase treatment. Jurisdiction: federal (OPRA), Pennsylvania (entity), potential EU flag if international users onboarded.