Source: smktrans package
(technical report).
File: quit_probability_calibration_targets_20260216_v2.csv.
The STPM estimates annual transition probabilities from current smoker to former smoker using a closed-form demographic accounting formula. The formula adjusts for differential mortality by smoking status, immigration/emigration survivorship, initiation, and relapse:
where a = age, y = year,
s = sex, d = IMD quintile,
π = smoothed HSE proportions (nnet::multinom),
l(x) = birth cohort survivorship (HMD + ONS),
μs = smoking-status-specific mortality (52 disease RRs from tobalcepi),
prelapse = Hawkins 2010 + Jackson 2019 estimates by years since quit.
Calibration targets average these single-year-of-age probabilities into strata:
| Table | Stratification | Years | Purpose |
|---|---|---|---|
| 3 | Sex × Age (25–44, 45–64, 65–74) | 2011–2016 | Calibration |
| 4 | IMD quintile × 3-year period | 2011–2016 | Calibration |
| 5 | Sex × Age | 2017–2019 | Validation |
| 6 | IMD quintile | 2017–2019 | Validation |
Typical range: 6–12% annual quit probability. Uncertainty: Beta(n=20) assumption with 0.9 correlation between samples (100 draws).
Source: Smoking Toolkit Study (STS), waves 2007–2019. Pipeline: analysis/sts-quit-targets/.
Outcome variable: trylyc — whether a past-year smoker made ≥1 quit attempt in the past 12 months.
where bState 2 = stopped smoking in past year, bState 3 = current smoker. Denominator: past-year smokers only.
Stratified by: Age × Sex × 3-year period, and IMD × 3-year period. Ages 25–74.
Typical range: 20–45%. Multi-stage MICE imputation (SynthSmoke-compliant). IMD imputed via multinomial logit from HSE 2011. Weights post-stratified to ONS 2011 population.
Synthetic population: 9,842 agents. 2,000 smokers (20.3%), 2,133 ex-smokers, 258 in quit tunnel, 5,451 never-smokers.
The ABM has a 12-month quit tunnel (NEWQUITTER → ONGOINGQUITTER1–11) before reaching EXSMOKER. STPM has no tunnel: anyone who quits is immediately a former smoker. This means STPM's "former smoker" maps to ABM's tunnel occupants + EXSMOKER.
These plots show how the COM-B input variables correlate with IMD quintile at baseline. Q1 = least deprived, Q5 = most deprived. Click to open full-size lightbox.
cCigConsumptionPrequit frozen at quit start.
mNonSmokerSelfIdentity fake beta (0.001) for maintenance.
Exemplar agents stratified by IMD × sex (10 agents across 5 IMD quintiles × 2 sexes).Track all agents who were SMOKER in the 2011 December snapshot (n=2,119) across 29 years.
Each panel tracks one COM-B variable across all maintenance ticks for each exemplar agent. Only variables that change over time are shown — this reveals whether endogenous variables (addiction decay, cessation aids, non-smoker identity) and exogenous variables (regional prevalence, age transitions) are working as expected. The first panel shows the resulting P(maintenance).
The 10 exemplar agents were selected to be stratified by IMD × sex (one per IMD quintile per sex). Their month-0 P(maintenance) values are compared to the full population below.
| Population (n=2000) | Exemplar (n=6) | |
|---|---|---|
| Median monthly P(maint) | 0.213 | 0.234 |
| Mean monthly P(maint) | 0.224 | 0.218 |
| Logged mean (simulation) | — | 0.254 |
Implemented in comb_theory.py, class QuitMaintenanceTheory.
where:
Current intercept: b₀ = -0.7672137.
Source: data/intermediate_data/com_b/maintenance_SEM_coefficients_202509019_v2.csv.
Per Harry's email, this intercept represents the log-odds of still being abstinent roughly one month after a quit attempt (reference category: "more than a week and up to a month" since quit). It is already approximately a monthly probability — no 6-month conversion needed.
Monthly decay factor: e−0.0368×4.33 = 0.8526. After 12 months: a₁₂ = a₀ · 0.852612 = a₀ · 0.1475.
On relapse, cCigAddictStrength resets to its pre-quit value.
Each agent's maintenance probability is computed from their syn pop attributes. Two scenarios:
| Baseline scenario | Best-case scenario | |
|---|---|---|
| Cessation aids | None (all OFF) | All ON (e-cig, NRT, varenicline, behavioural support, cytisine) |
| Regional prevalence | 0.20 (static) | 0.20 (static) |
| Smoking neighbours | 1 (static) | 1 (static) |
| Combined logit boost from aids | 0 | +2.43 |
An agent must survive 12 consecutive monthly maintenance checks to reach EXSMOKER. The 12-month survival probability is the product of 12 monthly probabilities (with addiction decay applied each month):
Quick conversion guide — if monthly P(maint) were constant:
| Monthly P(maint) | Annual survival P12 | Interpretation |
|---|---|---|
| 0.30 | 0.000005 (0.0005%) | Virtually impossible |
| 0.50 | 0.000244 (0.02%) | ~1 in 4,000 |
| 0.70 | 0.0138 (1.4%) | ~1 in 72 |
| 0.80 | 0.0687 (6.9%) | ~1 in 15 |
| 0.90 | 0.282 (28%) | ~1 in 4 |
| 0.95 | 0.540 (54%) | Majority survive |
| Baseline (no aids) | Best case (all aids ON) | |||
|---|---|---|---|---|
| Per month | Over 12 months | Per month | Over 12 months | |
| Median | 0.213 | 5.98e-08 | 0.739 | 4.73e-02 |
| Mean | 0.224 | 8.66e-06 | 0.729 | 6.46e-02 |
| Max | 0.615 | 0.004545 | 0.914 | 0.341137 |
The predictor sum (C + O + M, excluding intercept) averages -0.537 across all baseline smokers. Even with b₀ = 0, the mean logit would be -0.537, giving P(maint) ≈ 0.369 and P(12mo) ≈ 6.34e-06.
Largest negative contributors to the logit (mean β·x across baseline smokers):
| Variable | β | SE | mean(x) | mean(β·x) | Prevalence |
|---|---|---|---|---|---|
cCigAddictStrength | −0.195 | 0.051 | 2.03 | −0.396 | 91% have urge > 0 |
cCigConsumptionPrequit | −0.029 | 0.008 | 12.3 | −0.357 | All smokers |
oSocialHousing | −0.391 | 0.169 | 0.48 | −0.188 | 48% in social housing |
oEducationalLevelBelowDegree | −0.175 | 0.160 | 0.58 | −0.102 | 58% below degree |
Meanwhile, the largest positive contributors (cessation aids: e-cigarette β=+0.45, varenicline β=+0.72, cytisine β=+0.79) contribute almost nothing at baseline because <10% of smokers use them. They only activate when an agent enters the quit tunnel and is stochastically allocated aids.
First, the conversion from monthly P(maintenance) to 12-month tunnel survival. The curve is P12 — it drops off steeply below ~0.85/month.
Combined variable range tornado: each variable is varied from its minimum to maximum plausible runtime value (e.g. cessation aids 0→1, addiction 0→5, regional prevalence 0.10→0.30). The intercept is varied by ±1 SE (0.700). All other terms held at population means.
cCigAddictStrength = STS sturge variable: self-reported strength of urge
to smoke, integer 0–5 (0 = none, 5 = extreme). β = −0.195.
Three bias scenarios: current (−0.767), moderate uplift (0.0), and strong uplift (+1.0). At the current intercept, addiction strength barely differentiates agents — all curves collapse near zero. The intercept dominates. At b₀ = +1.0, the curves separate and addiction strength starts to matter as intended.
The attempt SEM intercept (−2.169) was estimated on a 6-month window (per Harry). Converting to monthly:
This uses the complementary formula: P(≥1 event in 6 months) is a union of monthly events, so P(no event in 1 month) = (1 − P6mo)1/6.
Per Harry's email: the maintenance SEM intercept represents the log-odds of still being abstinent roughly one month after a quit attempt (reference category: "more than a week and up to a month" since quit). This is already approximately a monthly probability.
Harry's caveats:
cCigAddictStrength coefficient may partially compensate for the latter).Harry's suggestion: treat the intercept as-is for monthly maintenance, or slightly downscale it and add uncertainty.
Sweep from b₀ = −1.0 to +1.0. For each bias value, 10 replicate runs of 120 ticks (2011–2020). The plot shows: (1) number of baseline smokers who reached EXSMOKER by 2020, (2) number still in the tunnel at the 2020 snapshot, and (3) the STPM verification quit rate (right axis, 0–1 scale). The dashed blue line is the mean STPM target.
| Scenario | b₀ | Rationale | Expected effect |
|---|---|---|---|
| Current (SEM as-is) | −0.767 | Harry says intercept is already monthly | ~0 agents reach EXSMOKER. Tunnel is a revolving door. |
| Moderate uplift | 0.0 | Remove negative intercept; let predictors drive variation | ~8 EXSMOKER per 10 years (sweep data). Still very low. |
| Strong uplift | +0.5 | Compensate for SEM underestimation of transition probability | ~40 EXSMOKER per 10 years. Some agents survive. |
| Calibrate to real-world | TBD | Target ~3–5% of attempters surviving 12 months | Requires b₀ ≈ +0.8 to +1.2 (rough estimate from sweep). |
cCigAddictStrength
(−0.40) and cCigConsumptionPrequit (−0.36).