Every model update, methodology change, data source addition, and platform release, documented publicly in chronological order. CERES does not make undisclosed changes to its prediction system.
Following live deployment, the initial IPC Phase 3+ coefficients (arXiv v1) were found to produce logit saturation: all 43 monitored countries returned P(IPC 3+) > 0.99, eliminating discriminative utility. Root cause: the additive stack of composite_stress (5.80), convergence_score (2.20), and n_independent (0.40 × 4 signals) dominated the intercept for any country with three or more elevated signals. Coefficients adjusted: intercept −2.10 to −4.50, composite_stress β 5.80 to 3.20, convergence_score β 2.20 to 1.40, n_independent β 0.40 to 0.20. Recalibrated model produces P = 0.036–0.994 across the monitored-country CSS range. Same fix applied to admin1.py, admin2.py, and scenario.py. Updated coefficients documented in Appendix C of the preprint and on the Methodology page. arXiv v2 planned Q3 2026.
Added donor-facing /impact page with live activity statistics and downloadable datasets. Embeddable risk widget at /widget for partner org integration. Three CSV export endpoints: current predictions, full archive, and grading ledger. Named-contact alert routing for Institutional subscribers, setting specific contacts per region for Tier I alerts. Public changelog added.
Comprehensive mobile CSS overrides across every page. Navigation, dashboard, account, subnational, validation, API access, methodology, tracker, about, data, and login pages all optimised for 320–768px viewports. Mobile bottom sheet for dashboard detail view. Hamburger menu confirmed functional.
Subscribers can now set a watchlist of specific regions and a probability threshold. Alerts and digests will be filtered to watchlist regions only if configured. Available to all paid subscribers via the account page. GET and PUT /v1/auth/watchlist endpoints added.
Initial public release of CERES. Predictions issued at national level across 43 high-risk countries, with Admin1 disaggregation where data permits, including Ethiopia, Somalia, Sudan, South Sudan, Kenya, Yemen, Niger, Mali, Burkina Faso, Haiti, Afghanistan, Syria, DRC, Zimbabwe, and Mozambique. All predictions timestamped and graded at T+90 days.
Logistic regression model initialised with author-specified coefficients informed by 87 IPC transition records across 31 countries (2011–2023). Four back-validation cases (Somalia 2011, South Sudan 2017, Ethiopia 2022, Yemen 2021) used to verify directional plausibility. Input-perturbation resampling (n=2,000 replications) for 90% sensitivity intervals. All performance metrics (Brier score, SI coverage, Tier I precision/recall) are prospective targets. T+90 grading windows open June 2026 as the first predictions reach their horizon; because observed IPC outcomes publish on a 2–4 month lag, the first graded results are expected Aug–Oct 2026. See Validation page for live progress.
Pipeline ingests six core model inputs. Climate: CHIRPS dekadal precipitation (SPI-3, SPI-6, rainfall deficit) and MODIS NDVI 16-day composite (vegetation stress). Conflict: UCDP GED georeferenced events (4-week rolling, fatality count, actor type). Food: IPC acute food insecurity phase, WFP VAM food consumption score and rCSI, and FAO GIEWS market and cereal prices. Two supplementary sources support hypothesis corroboration without entering the model: FEWS NET outlooks as a cross-check and UNHCR Admin1 displacement data. Grading: IPC cadre outcomes at T+90.
CERES is built on the Hypothesis Generation Engine (HGE), Northflow Technologies' institutional-grade hypothesis synthesis infrastructure. HGE synthesises multi-source signals into ranked, evidenced, falsifiable hypotheses. CERES is Adapter #5. The same calibration standards, uncertainty quantification, and auditability requirements apply as to all HGE adapters.
Public API launched at ceres-core-production.up.railway.app. Endpoints: /v1/predictions (all regions + per-region), /v1/hypotheses (ranked driver hypotheses), /v1/admin1 (Admin1 stress signals), /v1/grades (grading ledger + aggregate metrics), /v1/archive (weekly snapshot history), /v1/export/hdx (HXL-tagged CSV for HDX). Auth: Bearer API key header. Rate limits: 500/month (Free), 10,000/month (Professional), unlimited (Institutional).
Every weekly pipeline run archives a snapshot of all region predictions. The /tracker page displays the full time series per region with sparklines, trend indicators, and verification outcomes. Archive statistics: total runs, regions tracked, snapshot count, earliest run date.
CERES disaggregates crisis signals to Admin1 (province/state/region) level across the monitored Admin1 regions. The /subnational page displays composite stress scores and sub-scores (drought, conflict, food access, IPC stress, price stress) for all monitored administrative units. Sortable, filterable, downloadable.
TIER-1: P(IPC 3+) ≥ 0.70 or P(IPC 4+) ≥ 0.50, high-probability escalation within 90 days, immediate alert. TIER-2: P(IPC 3+) 0.45–0.69 or CRITICAL convergence, elevated risk, enhanced monitoring. TIER-3: P(IPC 3+) < 0.45, watch status, included in weekly digest. Thresholds author-specified at initialisation; prospective calibration ongoing as graded outcomes accumulate from Aug–Oct 2026.
CERES documents every change to the prediction system, model coefficients, data sources, and methodology in this public log before or at the time of deployment. Changes are never made retroactively to improve the appearance of historical accuracy. If a model change would affect the interpretation of past predictions, this is explicitly noted. This log is the authoritative record of CERES system provenance.