Investor brief

We trade the gap between weather science and market prices.

polyAether is an automated system that trades daily city-temperature prediction markets. Its edge is disciplined probability calibration — a persistent, measurable mispricing of uncertainty — not speed, forecasting genius, or hype. This page explains exactly how that works, what it can and cannot do, and where we honestly stand today.

Live paper validation Weather markets Edge: calibration, not speed ~80 curated stations

01 — The opportunity

Prediction markets misprice weather uncertainty.

On venues like Polymarket, anyone can trade on tomorrow's high temperature in a city. Each degree band — say 89–90 °F — is its own contract that pays $1 if that band contains the day's official high and $0 if it does not. A full market is a row of these contracts covering every plausible outcome, and their prices, read together, are the crowd's probability distribution over tomorrow's weather.

Here is the quiet flaw in that distribution. The people trading these markets are, for the most part, casually uncertain. They know weather is hard to predict, so they hedge their bets — and they hedge too much. They spread their money across too many outcomes, buying the unlikely tails "just in case." The result is a market that systematically overprices unlikely outcomes and underprices the likely middle. It charges too much for the surprises and too little for the boring, probable answer.

The core mispricing

Independent research on comparable temperature markets finds the market's implied uncertainty runs roughly 1.3× the true forecast error. The crowd behaves as if tomorrow is about a third more uncertain than the science actually says it is.

That 1.3× number is the whole thesis in a single figure. It means a genuinely well-calibrated forecaster — one whose stated probabilities match reality over the long run — can price each outcome more accurately than the crowd, and quietly trade the difference. We are not betting that we know tomorrow's weather better than the national weather service. We are betting that we can turn the same public science into sharper probabilities than a market of over-cautious humans does.

An intuitive example

Imagine the forecast for a city tomorrow points clearly at a high of about 91 °F, with the normal day-ahead wobble of a degree or two. A calibrated model might say the 91–92 band is worth about 38 cents on the dollar — a 38% chance. But the market, nervous about being wrong, has spread its money out: it prices that same band at only 30 cents, and pads the far-off ≤86 and ≥97 tails with money they don't deserve.

Buying the 91–92 band at 30 cents when it is genuinely worth 38 is not a prediction that we will win this particular day. We might not — weather is weather. It is a purchase of a favorable price. Do it once and it's a coin flip with a good edge. Do it across hundreds of independent city-days, and the 8-cent gap between price and value is what shows up in the results.

02 — How the edge works

Turn the best public forecasts into a calibrated price.

The pipeline is four honest steps: forecast, calibrate, compare, trade. None of them is magic. The discipline is in doing each one correctly, every day, for every market.

1 — Forecast

We start where every serious forecaster starts: the public numerical weather models. We pool a ~122-member super-ensemble drawn from the three leading systems — the American GFS, the German ICON, and the European ECMWF. Each member is one plausible run of the atmosphere. Together they don't give us a single guess; they give us a distribution — a spread of possible highs for tomorrow, with the shape of that spread telling us how confident the physics actually is. A tight cluster means a near-certain day; a wide fan means genuine uncertainty. This is the raw material we price against.

2 — Calibrate

Raw ensembles are good but not honest enough to trade. They have two well-known flaws, and we correct both. First, every station has a persistent local bias — a given model may run consistently warm or cool at one specific airport because of terrain, coastline, or urban heat the model can't resolve. We bias-correct to the exact settlement station, not the city in general. Second, ensembles are famously under-dispersed: they act more confident than they should, packing their members too tightly. We widen the spread to match how far reality has actually landed from the ensemble mean, historically. The output is a probability for every temperature band that we would be willing to bet is true, not just plausible.

3 — Compare

Now we lay our calibrated probability next to the market's price for the same band. The difference — our probability minus the market's price, net of the spread and expected slippage — is the edge on that contract. If we think a band is worth 38 cents and it trades at 30, that's an eight-cent gross edge; subtract realistic execution costs, and what's left is the number we actually act on. Most bands will show no edge, or a negative one, and we simply pass on them.

4 — Trade

Where the edge clears our threshold after costs, we buy — and only there. Size is set by conviction and edge, never all-in, and every position is capped and diversified across cities so no single day, city, or weather system can dominate the book. The system is happiest making many small, favorable purchases and letting the arithmetic of a real edge do the compounding.

Forecast

Pool a ~122-member super-ensemble (GFS + ICON + ECMWF) into a probability distribution over the day's high.

Calibrate

Bias-correct to the exact settlement station and fix the ensemble's known under-dispersion.

Compare

Our probability for each band minus the market's price is the edge — net of spread and slippage.

Trade

Buy the underpriced bands; size by conviction, capped and diversified across cities.

Our probability — buy zone Our probability — other bands Market-implied price

Where our probability exceeds the market's price (green, above the line), the outcome is underpriced — we buy. The overpriced tails, where the line sits above our bars, we let go. This is the 1.3× mispricing made visible: the crowd's line is flatter and fatter in the tails than the physics warrants.

Golden hour

Late in the local day, once the afternoon peak has largely occurred, the day's high is nearly determined — yet the market is often still priced as if it were uncertain. These late, near-certain, fast-converging trades are our highest-conviction opportunities: the physics has already resolved, and only the price hasn't caught up.

03 — A worked example

One market, end to end, with numbers.

The pipeline is easiest to trust when you watch it run once. Here is a single, illustrative city-day from the first calibrated probability to the settled result.

The setup. It's the afternoon before settlement for a city market. Our calibrated super-ensemble, bias-corrected to the exact airport this contract settles on, produces a distribution centered near 91 °F. Converted to bands, our probabilities read: 89–90 at 34%, 91–92 at 38%, 93–94 at 16%, and the rest scattered across the tails.

The comparison. We read the order book. The market is pricing 91–92 at 30 cents, 89–90 at 31 cents, and — tellingly — it's paying up for the unlikely ≥97 tail at 6 cents, well above our 2% read. On 91–92, our 38% against a 30-cent price is an eight-cent gross edge. After the spread and expected slippage, call it roughly five cents of net edge per contract. That clears our threshold.

38%

our probability, 91–92 band

30¢

market price for the same band

~5¢

net edge per contract, after costs

$1.00

payout if the band is correct

The trade. Sizing runs through fractional Kelly against that five-cent edge, then gets clipped by the per-market cap and checked against how much same-region exposure the book already holds. Suppose that math yields a modest position of, say, 120 contracts at 30 cents — about $36 at risk on this single band. We buy, and only this band; we ignore the padded ≥97 tail rather than short it.

The settlement. The next day, the official high at the exact settlement station rounds to 92 °F. The 91–92 contracts pay $1 each: 120 contracts return $120 against the $36 staked. On this particular day, the trade won. The point of the example, though, is not the win — it's that we bought a 38-cent value for 30 cents. Some days that same disciplined purchase settles at zero. The edge is the price gap, repeated; the individual outcome is noise.

Why this is illustrative, not a promise

The numbers above are a clean, representative walk-through, not a recorded trade. Real markets are noisier: edges are often smaller, some clear days show none at all, and execution eats into the gap. We show the mechanism so you can judge it — not to imply this is a typical result.

04 — Why it's defensible

The moat is precision, not latency.

The hard part of this business isn't the forecast — good public forecasts are available to everyone. The hard part is resolution-correctness: knowing, to the letter, how a given market decides who won. Each contract settles on one specific airport weather station, reads temperature rounded a specific way, and draws the boundary of "the day" on a specific local clock. Get any one of those details wrong and every trade you place is subtly, permanently biased — you'll think you have an edge while quietly bleeding it away. Most naive bots make exactly this mistake, and it's invisible until the losses add up.

Our durable advantage is that we did this unglamorous work and verified it against reality. We reverse-engineered the exact settlement rule and checked it against 220 real, resolved market-days across New York, Los Angeles, and London — confirming the precise station each market reads, the round-half-up convention, the use of hourly-only observations, and the correct local timezone for the day boundary. When our reconstructed settlement matched the venue's actual settlement across all 220 days, we knew our forecasts were aimed at the right target.

Why this is durable when speed is not

There was once a latency game in these markets — a race to react to new observations a few seconds faster than the next bot. That window has largely decayed; it's not where the money is anymore, and chasing it is a treadmill. Calibration accuracy is different. It doesn't get arbitraged away by someone with a faster server, because it isn't about speed — it's about being right about probabilities in a market structurally inclined to be wrong about them. That correctness compounds quietly, and it doesn't evaporate the moment a competitor upgrades their hardware.

Right station, right rule

Forecasts target the exact airport the market settles on — not a city-center approximation that can drift several degrees and quietly reverse the edge.

Verified against 220 days

Our reconstructed settlement matched real resolutions across 220 market-days in three cities, confirming station, rounding, and timezone.

Calibrated, not just accurate

Every forecast is scored against reality (Brier, PIT). We trade only when the model provably beats the market's implied probabilities.

Not a latency bet

The old speed edge has decayed. Ours is precision, which a faster competitor cannot simply out-run.

05 — Technology and speed

An always-on loop, built for correctness first.

We don't compete on raw speed, but the system is still fast, disciplined, and always awake. Speed here buys us cleaner execution and fresher reads — not the edge itself.

The engine runs a continuous loop that never sleeps. Every five minutes it scans across ~80 curated stations, refreshing forecasts and re-reading each open market for changes. When it evaluates a market, it reads the live order book in about 23 milliseconds and reaches a trading decision in roughly 0.2 microseconds — the decision logic itself is trivially cheap once the calibrated probabilities are in hand, because the hard thinking happened upstream in the forecast and calibration. The loop's job is to notice, quickly and reliably, when a market's price has drifted away from our probability, and to act before that gap closes on its own.

The station list is curated, not exhaustive. We only cover cities where we've verified the settlement rule and where the market has enough liquidity to be worth trading. Breadth matters — it's what turns a small per-trade edge into a diversified book — but breadth without verified settlement is just noise, so we grow the coverage deliberately.

~23 ms

order-book read

~0.2 µs

trading decision

5 min

full scan interval

~80

curated stations

Being always-on matters for a strategy whose best opportunities cluster at specific, unpredictable moments — the "golden hour" late in each city's local day, when the high has essentially resolved but the price hasn't caught up. A system that only wakes up on a schedule would miss those. Ours is watching every market, all day, everywhere it trades, so that when a clean edge appears it's already looking.

06 — Risk management

Discipline is the product.

Any single forecast can be wrong, and some will be. The entire system is built so that no single trade, city, or day can meaningfully hurt the book. Four controls do most of that work, and each exists for a specific failure it prevents.

Fractional Kelly sizing. The Kelly criterion tells you the mathematically growth-optimal bet size for a given edge — but full Kelly is famously volatile and unforgiving of a mis-estimated edge. So we bet a fixed fraction of it. Positions scale up with conviction and edge and shrink when either is thin, but they are never all-in. This keeps every bet small and repeatable, which is exactly what a small-edge, high-repetition strategy needs.

Hard caps. On top of sizing sit blunt, non-negotiable limits: a per-market cap so no single contract can balloon, a total-exposure cap on how much of the book is at risk at once, and a daily-loss limit that auto-halts trading if a day goes badly. These aren't clever — they're guardrails that don't depend on any model being right.

Correlation cap. The subtle danger in weather is that trades which look independent aren't. A heat wave that pushes ten cities above their bands at once is one bet, not ten — and a naive book would happily over-concentrate into it. We budget same-region and weather-system exposure as a single position, so a correlated event can't quietly become the whole portfolio.

Kill switch. Finally, a single control stops everything, instantly. If a data feed goes stale, a forecast source misbehaves, or the system's own behavior drifts outside expected bounds, the kill switch halts all trading rather than letting a malfunction compound. When something is wrong, doing nothing is the correct move — and the system is built to reach for that.

Fractional Kelly sizing

Positions scale with conviction and edge, never all-in — small, repeatable bets sized below the growth-optimal point for safety.

Hard caps

Per-market, total-exposure, and daily-loss limits that auto-halt the book regardless of what any model believes.

Correlation cap

A heat wave across many cities is one bet, not many — same-region exposure is budgeted as a single position.

Kill switch

Stale data, a misbehaving source, or drift outside bounds halts all trading instantly. Doing nothing is a valid state.

07 — The economics, honestly

Base hits, not a moonshot.

This is a small-edge, high-repetition business: modest expected value per trade, compounded across many diversified, largely-uncorrelated city-days. It is not a get-rich-quick machine, and we won't present it as one.

The arithmetic is deliberately unglamorous. A few cents of net edge on a contract is not much on its own — but a real edge, applied across a broad, diversified book of independent city-days, is how the number grows. Because the bets are largely uncorrelated (New York's weather has little to do with Los Angeles's on the same day), the ups and downs of individual trades tend to wash out, leaving the small persistent edge visible over time. Diversification isn't a nicety here; it is the mechanism.

A realistic illustrative sketch

To make the shape concrete — and to be explicit that this is a sketch, not a forecast or a promise — imagine a book placing on the order of a few dozen small, edged bets a day, each with a few cents of net edge after costs, most of them independent. Even a genuine edge like that produces losing days and losing weeks; the signal only separates from the noise over hundreds of trades. The realistic outcome of such a strategy, if the edge is real, is a modest, grinding, positive drift — not a chart that goes up and to the right in a straight line. Any month can be red.

Please read this as a sketch

The description above illustrates the structure of the return — small edges, many bets, diversification smoothing the path. It is not a projection, a target, or a track record. We have no proven returns yet, and nothing here should be read as one.

Capacity is the honest ceiling

The binding constraint on this strategy is liquidity, not ideas. Weather prediction markets are thin. Past a certain size, our own orders start moving the price against us, eating the very edge we're trying to capture — so there is a real ceiling on how much capital this can deploy before returns degrade. More cities and more venues raise that ceiling over time, but it is finite and honest to state plainly: this is a strategy that works at a disciplined size, not one that scales indefinitely. Returns come from discipline and coverage, never from leverage.

08 — Where we are today

Current status — read this first

Live paper validation — no capital at risk, no track record yet.

The system runs 24/7, making the exact decisions it would with real money and scoring them against real settled temperatures — but placing no real orders. We are strictly in paper mode. We will commit capital only once the paper record shows a genuine, statistically-validated calibration edge — our forecasts demonstrably beating the market's implied probabilities over a meaningful sample. Until that bar is cleared, there is no proven return and no track record, and we say so plainly on every page.

09 — Investor FAQ

The sharp questions, answered straight.

Do you have a track record?

No. This is the most important thing to be clear about. The system is in live paper validation — it makes real decisions against real settled outcomes but risks no capital, so there is no realized return to point to. We are deliberately not raising against a track record we don't have. What we can show is the mechanism, the 220-day settlement verification, and — as it accumulates — the paper calibration record.

What's the capacity? How much can this actually take?

Limited, and we'd rather say so up front. Weather prediction markets are thin, so beyond a certain size our own orders move prices against us and erode the edge. Capacity grows as we add verified cities and venues, but it is finite. This is a disciplined-size strategy, not one that absorbs unlimited capital — anyone sizing an allocation should treat the liquidity ceiling as the binding constraint.

What's the edge's half-life? Won't it get competed away?

The old latency edge in these markets has already largely decayed — we don't rely on it. Our edge is calibration correctness against a structural crowd bias (the ~1.3× over-pricing of uncertainty), which is harder to arbitrage away because it isn't about speed. That said, no edge is permanent: as more sophisticated participants enter, the mispricing can compress. Our answer is to keep improving the model and expanding coverage, and to be honest that the edge must be re-earned, not assumed.

What's the regulatory picture?

The strategy depends on prediction-market venues that restrict access in some jurisdictions, and the regulatory treatment of these markets is still evolving. Rules can change in ways that affect where and how we can trade. We treat single-venue and regulatory exposure as first-class risks, not footnotes — see the risk factors below.

What does a bad month look like?

Red, and that's expected. Because individual bets are near coin-flips with a small edge, even a genuine edge produces losing days and losing weeks — the signal only separates from noise over hundreds of trades. A bad month is a run of unfavorable settlements within normal variance; the controls (caps, correlation limits, kill switch) exist to make sure a bad month stays a bad month and never becomes a blow-up. If a bad stretch instead reflects a broken edge, the paper-first discipline and monitoring are designed to catch that before real capital is committed.

What are you actually asking for at this stage?

Understanding, not a wire transfer. This page exists to explain the thesis and the mechanism honestly while we build the paper record. The right time to discuss capital is after that record demonstrates the calibration edge — not before.

10 — Risk factors

What could go wrong.

Unproven edge. The strategy is in validation; a real, persistent edge has not yet been demonstrated with live results, and it may turn out to be smaller than hoped — or absent.
Liquidity & capacity. Thin markets cap deployable capital and can widen execution costs; our own size can move prices against us.
Competition & decay. Edges attract other traders and erode over time; the model must keep improving to stay ahead of the crowd.
Model & execution risk. Forecast error, data-source failure, or software bugs can cause losses despite the controls.
Venue & regulatory. Reliance on prediction-market venues that restrict some jurisdictions; rules and access can change.
Weather is uncertain. Even a perfectly calibrated model loses individual bets; the edge is only visible over many trades, never a single day.
Correlated events. A regional heat wave can turn many seemingly-independent bets into one; correlation controls mitigate but cannot eliminate this.