`op`

Back to L3 | Browse all axes | Browse all options

Axis op on sub-layer L3_A_step_op (layer l3).

Sub-layer

L3_A_step_op

Axis metadata

Default: 'lag'
Sweepable: False
Status: operational

Operational status summary

Operational: 46 option(s)
Future: 5 option(s)

Options

`adaptive_ma_rf` – operational

AlbaMA – RF-driven adaptive moving average smoother for a single time series.

Goulet Coulombe & Klieber (2025) ‘Adaptive Moving Average for Macroeconomic Monitoring’ (arXiv:2501.13222 §2). A random forest fit with a single regressor – the time index – on the target series y (i.e. RF(y_t ~ t)). Per-observation leaf membership induces a weight matrix w_τt whose row sums to 1, so the smoother is a learned-bandwidth moving average of y; the realised window adapts to local volatility / regime. Paper p.8 defaults: n_estimators = B = 500, min_samples_leaf = 40, max_features = 1. sided = 'two' (default) fits one forest on the full sample (retrospective smoother); sided = 'one' fits an expanding-window forest per t (real-time nowcasting variant, paper §3.3 / p.10).

Atomic primitive: existing ma_window uses a fixed length; hamilton_filter is a regression on lags rather than a moving average; neither composes into AlbaMA without a learned window selector.

When to use

Replicating AlbaMA recipes; macro indicator monitoring under regime shifts.

When NOT to use

Multivariate denoising of a predictor panel (AlbaMA smooths a single target series).

References

macroforecast design Part 2, L3: ‘feature engineering is a DAG of typed transforms; cascade-depth bounds the longest chain at cascade_max_depth.’
Goulet Coulombe & Klieber (2025) ‘An Adaptive Moving Average for Macroeconomic Monitoring’, arXiv:2501.13222.

Related options: savitzky_golay_filter, hamilton_filter, hp_filter, ma_window

Last reviewed 2026-05-05 by macroforecast author.

`asymmetric_trim` – operational

Albacore-family rank-space transformation (Goulet Coulombe et al. 2024).

Per-period sort: panel Π of shape (T, K) is mapped to O where O[t, r] = sort(Π[t, :])[r] (ascending). Asymmetric trimming emerges in the downstream nonneg ridge (ridge(coefficient_constraint=nonneg)) that learns rank-position weights – this op does the rank-space transformation only.

Optional smooth_window > 0 applies a centred moving average to each rank-position time series (paper §3 mentions 3-month MA for noisy components; users can chain ma_window explicitly when they want a different window).

Operational from v0.8.9 (B-6). Layer scope (l2, l3) so the L3 DAG can dispatch it at recipe time. Algorithm spec: docs/replications/maximally_forward_looking_algorithm_notes.md.

When to use

Building Albacore_ranks-style core inflation indicators; supervised asymmetric trimming where the band is learned from data.

When NOT to use

Symmetric trimmed-mean targets (use a fixed-window ma_window instead).

References

macroforecast design Part 2, L3: ‘feature engineering is a DAG of typed transforms; cascade-depth bounds the longest chain at cascade_max_depth.’
Goulet Coulombe, Klieber, Barrette & Goebel (2024) ‘Maximally Forward-Looking Core Inflation’, technical report (R package: assemblage).

Related options: ma_window, ma_increasing_order, scaled_pca

Last reviewed 2026-05-05 by macroforecast author.

`boruta_selection` – future

(no schema description for boruta_selection)

TBD: option doc not yet authored for this value. The encyclopedia falls back to the bare schema description above. PRs adding a full OptionDoc entry under macroforecast/scaffold/option_docs/l3.py are welcome.

`cumsum` – operational

Cumulative sum of a series.

Running total Σ_{s ≤ t} y_s. Inverts diff (modulo an initial constant). Used to recover level forecasts from differenced predictions or to build cumulative-shock features.

When to use

Building cumulative-impact features; recovering levels from differenced forecasts.

References

macroforecast design Part 2, L3: ‘feature engineering is a DAG of typed transforms; cascade-depth bounds the longest chain at cascade_max_depth.’

Related options: diff, level

Last reviewed 2026-05-05 by macroforecast author.

`dfm` – operational

Dynamic factor model – Kalman state-space factor extraction.

statsmodels DynamicFactor MLE estimate of latent factors with idiosyncratic AR(1) errors. Differs from pca in that factors are smoothed via the Kalman filter and respect a factor-VAR transition.

When the panel is mixed-frequency (FRED-SD), the runtime auto-routes to DynamicFactorMQ (Mariano-Murasawa 2003).

When to use

Smoothed factors with an explicit dynamic; mixed-frequency panels (FRED-SD).

References

macroforecast design Part 2, L3: ‘feature engineering is a DAG of typed transforms; cascade-depth bounds the longest chain at cascade_max_depth.’
Mariano & Murasawa (2003) ‘A new coincident index of business cycles based on monthly and quarterly series’, JAE 18(4): 427-443.

Related options: pca, scaled_pca

Last reviewed 2026-05-05 by macroforecast author.

`diff` – operational

First difference: y_t - y_{t-1}.

Computes the simple first difference on the input column. The first observation becomes NaN. Combine with lag to recover level features when the L2 layer already differenced the panel.

When to use

I(1) variables that need a stationary representation in addition to the L2-applied tcode.

When NOT to use

When the panel is already differenced by L2.B (avoids double-differencing).

References

macroforecast design Part 2, L3: ‘feature engineering is a DAG of typed transforms; cascade-depth bounds the longest chain at cascade_max_depth.’

Related options: level, log_diff, pct_change

Last reviewed 2026-05-05 by macroforecast author.

`feature_selection` – operational

Filter columns by variance / correlation / lasso pre-screen.

Drops columns failing one of three criteria configured via params.method:

variance – drop columns with variance below params.threshold.
correlation – drop columns with pairwise correlation above params.threshold (keeps the first).
lasso – fit a Lasso pre-screen and keep columns with non-zero coefficients.

When to use

Trimming the panel before expensive downstream estimators (NN, SVM, kernel) when high-dim noise dominates.

When NOT to use

Tree models – they handle irrelevant features natively.

References

macroforecast design Part 2, L3: ‘feature engineering is a DAG of typed transforms; cascade-depth bounds the longest chain at cascade_max_depth.’

Related options: scale, pca

Last reviewed 2026-05-05 by macroforecast author.

`fourier` – operational

Fourier basis features – sin/cos at fixed harmonics.

Generates sin/cos pairs at harmonic frequencies of the calendar period (params.period, params.n_harmonics). Captures smooth periodic patterns without the indicator-explosion of season_dummy.

When to use

Smooth seasonality (annual / weekly cycles) where dummies would over-fit.

References

macroforecast design Part 2, L3: ‘feature engineering is a DAG of typed transforms; cascade-depth bounds the longest chain at cascade_max_depth.’

Related options: season_dummy, wavelet

Last reviewed 2026-05-05 by macroforecast author.

`genetic_algorithm_selection` – future

(no schema description for genetic_algorithm_selection)

TBD: option doc not yet authored for this value. The encyclopedia falls back to the bare schema description above. PRs adding a full OptionDoc entry under macroforecast/scaffold/option_docs/l3.py are welcome.

`hamilton_filter` – operational

Hamilton (2018) regression-based detrend (HP-filter alternative).

Regression-based two-sided alternative to the HP filter advocated by Hamilton (2018) for its better real-time properties. Default lookback h = 8 (quarterly) / 24 (monthly). Uses statsmodels hamilton_filter.

When to use

Real-time / one-sided detrending where HP’s two-sided smoothing is inappropriate.

References

macroforecast design Part 2, L3: ‘feature engineering is a DAG of typed transforms; cascade-depth bounds the longest chain at cascade_max_depth.’
Hamilton (2018) ‘Why You Should Never Use the Hodrick-Prescott Filter’, RES 100(5): 831-843.

Related options: hp_filter

Last reviewed 2026-05-05 by macroforecast author.

`holiday` – operational

Holiday / event dummy variables.

0/1 indicators for calendar holidays (US federal by default; params.country selects locale via the holidays package). For business / financial macro series.

When to use

Daily / weekly business-cycle series where holidays create discrete level shifts.

When NOT to use

Pure macro series at monthly+ frequency where holidays are absorbed by season_dummy.

References

macroforecast design Part 2, L3: ‘feature engineering is a DAG of typed transforms; cascade-depth bounds the longest chain at cascade_max_depth.’

Related options: season_dummy

Last reviewed 2026-05-05 by macroforecast author.

`hp_filter` – operational

Hodrick-Prescott filter – trend / cycle decomposition.

statsmodels hpfilter with smoothing parameter params.lamb (1600 for quarterly, 129600 for monthly per Ravn-Uhlig 2002). Returns the cyclical component by default; the trend can also be returned via params.return = 'trend'.

When to use

Extracting business-cycle gaps from trending series.

When NOT to use

Real-time / one-sided forecasting – HP introduces look-ahead bias unless restricted to expanding_window_per_origin.

References

macroforecast design Part 2, L3: ‘feature engineering is a DAG of typed transforms; cascade-depth bounds the longest chain at cascade_max_depth.’
Hodrick & Prescott (1997) ‘Postwar U.S. Business Cycles: An Empirical Investigation’, JMCB 29(1): 1-16.

Related options: hamilton_filter, diff

Last reviewed 2026-05-05 by macroforecast author.

`interaction` – operational

Pairwise interaction terms only (no pure powers).

Subset of polynomial degree-2 features that contains only pairwise products x_i · x_j for i ≠ j. Cheaper than full polynomial expansion when interaction structure (not non-linearity in single inputs) is the target.

When to use

Capturing predictor-pair complementarities in linear models.

References

macroforecast design Part 2, L3: ‘feature engineering is a DAG of typed transforms; cascade-depth bounds the longest chain at cascade_max_depth.’

Related options: polynomial

Last reviewed 2026-05-05 by macroforecast author.

`kernel` – operational

Kernel-feature pre-step (Random Fourier / Nyström handle).

Generic handle for an explicit kernel-feature embedding; concrete dispatch is determined by params.kernel (rbf / poly / laplacian). For named variants use kernel_features (RBF Random Fourier) or nystroem.

When to use

Kernel-augmented linear / SVM pipelines.

References

macroforecast design Part 2, L3: ‘feature engineering is a DAG of typed transforms; cascade-depth bounds the longest chain at cascade_max_depth.’

Related options: kernel_features, nystroem

Last reviewed 2026-05-05 by macroforecast author.

`kernel_features` – operational

Random Fourier features – approximate RBF kernel via random projection.

sklearn RBFSampler: maps inputs to params.n_components random Fourier features whose dot product approximates the RBF kernel. Enables linear models to fit RBF-kernelised responses at training-set-size linear cost.

When to use

Kernel-augmented ridge / SVM at scale (n > 10k).

When NOT to use

Small-sample problems where exact kernel SVM is feasible.

References

macroforecast design Part 2, L3: ‘feature engineering is a DAG of typed transforms; cascade-depth bounds the longest chain at cascade_max_depth.’
Rahimi & Recht (2007) ‘Random Features for Large-Scale Kernel Machines’, NeurIPS.

Related options: kernel, nystroem

Last reviewed 2026-05-05 by macroforecast author.

`l3_feature_bundle` – operational

Internal sentinel: bundle column-producing children into the X matrix.

Synthetic node emitted by the runtime to collect the outputs of all leaf transforms into a single X artifact consumed by L4. Authors do not write this node by hand; the cascade builder inserts it.

Surfaced in operational_options so the schema completeness test stays consistent.

When to use

Never authored directly – inserted by the cascade builder.

References

macroforecast design Part 2, L3: ‘feature engineering is a DAG of typed transforms; cascade-depth bounds the longest chain at cascade_max_depth.’

Related options: l3_metadata_build

Last reviewed 2026-05-05 by macroforecast author.

`l3_metadata_build` – operational

Internal sentinel: assemble L3 metadata sink (lineage / pipeline definitions).

Synthetic node that produces the column_lineage and pipeline_definitions sinks from the cascade graph. Powers L7 lineage_attribution and transformation_attribution downstream.

When to use

Never authored directly – inserted by the cascade builder when L7 lineage hooks are active.

References

macroforecast design Part 2, L3: ‘feature engineering is a DAG of typed transforms; cascade-depth bounds the longest chain at cascade_max_depth.’

Related options: l3_feature_bundle

Last reviewed 2026-05-05 by macroforecast author.

`lasso_path_selection` – future

(no schema description for lasso_path_selection)

TBD: option doc not yet authored for this value. The encyclopedia falls back to the bare schema description above. PRs adding a full OptionDoc entry under macroforecast/scaffold/option_docs/l3.py are welcome.

`level` – operational

Pass-through: keep the series at its original level.

No-op transform; the column flows through unchanged. Used as an explicit anchor in the DAG so downstream ops can reference the level form even when the L2 transform_policy converted to log-differences. Useful when you want both level and diff branches in the same recipe.

When to use

Authoring branches that need the level form alongside a transformed branch.

References

macroforecast design Part 2, L3: ‘feature engineering is a DAG of typed transforms; cascade-depth bounds the longest chain at cascade_max_depth.’

Related options: diff, log, log_diff, pct_change

Last reviewed 2026-05-05 by macroforecast author.

`log` – operational

Natural logarithm: ln(y).

Element-wise natural log. Strictly positive series only; raises if any input is non-positive. Often paired with diff to produce log-changes (which are approximately equal to percentage changes for small movements).

When to use

Strictly-positive macro series (price levels, employment counts, GDP) before differencing.

When NOT to use

Series that can be negative or zero (rates, growth rates, balances).

References

macroforecast design Part 2, L3: ‘feature engineering is a DAG of typed transforms; cascade-depth bounds the longest chain at cascade_max_depth.’

Related options: log_diff, level, pct_change

Last reviewed 2026-05-05 by macroforecast author.

`log_diff` – operational

Log first difference: ln(y_t) - ln(y_{t-1}).

Composite of log then diff. The standard FRED-MD transformation code 5/6 representation; produces a stationary approximation of the percentage change and is symmetric in expansions vs contractions.

When to use

Strictly-positive trending series (real GDP, employment, prices); FRED-MD tcode 5/6 default.

When NOT to use

Series that take non-positive values.

References

macroforecast design Part 2, L3: ‘feature engineering is a DAG of typed transforms; cascade-depth bounds the longest chain at cascade_max_depth.’
McCracken & Ng (2016) ‘FRED-MD: A Monthly Database for Macroeconomic Research’, JBES 34(4): 574-589. https://doi.org/10.1080/07350015.2015.1086655

Related options: log, diff, pct_change

Last reviewed 2026-05-05 by macroforecast author.

`ma_increasing_order` – operational

MARX – moving averages of increasing order (Coulombe 2024).

Stacks moving averages with windows [1, 2, 4, 8, ..., w_max] into a multi-column block. Captures multi-scale persistence in a single op; popular feature in macroeconomic random forest pipelines.

Implements the MARX (Moving-Average-of-Random-eXogeneous) trick from Coulombe (2024).

When to use

Tree / RF models that benefit from multi-scale temporal features without manual lag selection.

References

macroforecast design Part 2, L3: ‘feature engineering is a DAG of typed transforms; cascade-depth bounds the longest chain at cascade_max_depth.’
Coulombe (2024) ‘The Macroeconomic Random Forest’, Journal of Applied Econometrics 39(7): 1190-1209.

Related options: ma_window, lag

Last reviewed 2026-05-05 by macroforecast author.

`ma_window` – operational

Trailing moving average over a fixed window.

Computes mean(y_{t-w+1..t}) for a user-specified window params.window. temporal_rule controls expanding vs rolling vs block-wise refit semantics. The first w-1 rows are NaN.

When to use

Smoothing noisy series; building short / medium / long-term momentum features.

References

macroforecast design Part 2, L3: ‘feature engineering is a DAG of typed transforms; cascade-depth bounds the longest chain at cascade_max_depth.’

Related options: ma_increasing_order, diff, scale

Last reviewed 2026-05-05 by macroforecast author.

`maf_per_variable_pca` – operational

Per-variable MAF via PCA on lag-panels – Coulombe et al. (2021 IJF) Eq. (7).

Implements the paper-exact Moving Average Factor (MAF) construction from Coulombe, Leroux, Stevanovic & Surprenant (2021 IJF) §2.2 Eq. (7). For each variable k = 1..K in the input panel:

Build the T × (n_lags + 1) lag-panel [X_{t,k}, L X_{t,k}, ..., L^{n_lags} X_{t,k}].
Run PCA retaining n_components_per_var components (paper default: 2).
Append the resulting factor columns to the output.

Output shape: (T, K · n_components_per_var). With defaults n_lags=12, n_components_per_var=2 the output is (T, 2K) – paper footnote 11: ‘We keep two MAFs for each series and they are obtained by PCA.’

Distinction from existing ma_increasing_order → pca(4) path: the existing stacked-PCA MAF cell runs a single PCA over all MA columns at once (stacked, 4 global components). This op runs separate PCA per variable, yielding 2K locally-structured factors rather than 4 global ones. Use this op when paper-Eq.7-exact replication is required.

First n_lags rows per variable are NaN (lag-panel boundary). temporal_rule is required; full_sample_once is rejected to enforce walk-forward boundaries.

Operational from v0.9.0 (phase-f16).

When to use

Paper-exact replication of Coulombe et al. (2021 IJF) MAF construction; when per-variable PCA factors are preferred over global stacked-PCA (4-component) path.

When NOT to use

When the 16-cell horse-race stacked-PCA MAF cell is sufficient (use ma_increasing_order → pca instead); or when K is large and 2K output columns would exceed T.

References

macroforecast design Part 2, L3: ‘feature engineering is a DAG of typed transforms; cascade-depth bounds the longest chain at cascade_max_depth.’
Coulombe, Leroux, Stevanovic & Surprenant (2021) ‘Macroeconomic Data Transformations Matter’, International Journal of Forecasting 37(4): 1338-1354. https://doi.org/10.1016/j.ijforecast.2021.05.005

Related options: maf, ma_increasing_order, ma_window

Last reviewed 2026-05-05 by macroforecast author.

`midas` – operational

MIDAS Almon / Exp-Almon / Beta weighted lag polynomial (Ghysels-Sinko-Valkanov 2007).

Parametric mixed-frequency aggregation: emits one column per HF predictor whose value is the weighted sum Σ_k ω_k(θ̂) · x_{t·m − k}. The weight kernel ω_k(θ) is fitted by NLS against the LF target_signal input.

Three weighting families:

almon – polynomial basis ω_k = Σ_q θ_q · k^q of order polynomial_order (default 2). Optional sum-to-one normalisation.
exp_almon (default) – ω_k ∝ exp(θ_1 k + θ_2 k²); numerically stable and the GSV 2007 §3 default.
beta – ω_k ∝ k_norm^{θ_1−1} (1 − k_norm)^{θ_2−1} for k_norm = (k+1)/(K+1); flexible monotone / hump kernel.

Defaults: freq_ratio = 3, n_lags_high = 12, sum_to_one = True, max_iter = 200. Optimiser: scipy.optimize.minimize (Nelder-Mead). Per-predictor theta_hat / weights / converged stashed in result.attrs['midas_fit'] for L7 inspection.

Requires a target_signal input port – shares routing with scaled_pca.

When to use

Mixed-frequency nowcasting where parametric lag weights are desired; reproducing GSV 2007 macro / asset-pricing applications.

When NOT to use

High-noise predictors where NLS optimiser diverges (use u_midas + downstream regularised regression instead).

References

macroforecast design Part 2, L3: ‘feature engineering is a DAG of typed transforms; cascade-depth bounds the longest chain at cascade_max_depth.’
Ghysels, Sinko & Valkanov (2007) ‘MIDAS Regressions: Further Results and New Directions’, Econometric Reviews 26(1): 53-90.
Ghysels, Santa-Clara & Valkanov (2004) ‘The MIDAS Touch: Mixed Data Sampling Regression Models’, UCLA / UNC working paper.

Related options: u_midas, scaled_pca, lag

Last reviewed 2026-05-05 by macroforecast author.

`nystroem` – operational

Nyström kernel approximation – subset-based feature map.

sklearn Nystroem constructs a low-rank approximation of an arbitrary kernel matrix using a random subsample of training points. More accurate than Random Fourier features for non-RBF kernels but with a larger memory footprint.

When to use

Non-RBF kernel-augmented linear models (poly / sigmoid).

References

macroforecast design Part 2, L3: ‘feature engineering is a DAG of typed transforms; cascade-depth bounds the longest chain at cascade_max_depth.’

Related options: kernel_features, kernel

Last reviewed 2026-05-05 by macroforecast author.

`nystroem_features` – operational

Alias for nystroem – explicit feature-stage name.

Identical to nystroem; preferred when a multi-stage pipeline names its kernel approximation explicitly in the lineage graph.

When to use

Multi-stage pipelines that separate kernel approximation from downstream linear fits.

References

macroforecast design Part 2, L3: ‘feature engineering is a DAG of typed transforms; cascade-depth bounds the longest chain at cascade_max_depth.’

Related options: nystroem

Last reviewed 2026-05-05 by macroforecast author.

`partial_least_squares` – operational

Partial least squares regression – supervised factor extraction.

Computes orthogonal latent components that maximise the covariance with the target (not just predictor variance, as in PCA). sklearn’s PLSRegression; params.n_components.

When to use

When a target-supervised reduction is preferable to PCA’s unsupervised projection.

References

macroforecast design Part 2, L3: ‘feature engineering is a DAG of typed transforms; cascade-depth bounds the longest chain at cascade_max_depth.’
Wold, Sjöström & Eriksson (2001) ‘PLS-regression: a basic tool of chemometrics’, Chemometrics and Intelligent Laboratory Systems 58(2): 109-130.

Related options: pca, scaled_pca

Last reviewed 2026-05-05 by macroforecast author.

`pca` – operational

Principal component analysis – linear factor extraction.

Eigendecomposition of the column covariance; returns the top params.n_components principal components. Implements the Stock-Watson (2002) diffusion-index workflow used throughout FRED-MD applications.

Combine with factor_augmented_ar or factor_augmented_var at L4 to build the diffusion-index forecaster. temporal_rule controls whether components are re-fit per origin (default: expanding_window_per_origin).

When to use

Reducing FRED-MD’s 100+ predictors to a handful of latent factors; factor-augmented forecasts.

References

macroforecast design Part 2, L3: ‘feature engineering is a DAG of typed transforms; cascade-depth bounds the longest chain at cascade_max_depth.’
Stock & Watson (2002) ‘Forecasting Using Principal Components from a Large Number of Predictors’, JASA 97(460): 1167-1179.

Related options: sparse_pca, scaled_pca, varimax, dfm, partial_least_squares

Last reviewed 2026-05-05 by macroforecast author.

`pct_change` – operational

Period-over-period percentage change: (y_t / y_{t-1}) - 1.

Strict simple growth rate; not equivalent to log_diff for large movements. Returns NaN where the previous observation is zero or NaN.

When to use

When a literal percentage interpretation is required (returns, inflation rates).

When NOT to use

Trend-following analysis where log_diff’s symmetry is preferable.

References

macroforecast design Part 2, L3: ‘feature engineering is a DAG of typed transforms; cascade-depth bounds the longest chain at cascade_max_depth.’

Related options: log_diff, diff

Last reviewed 2026-05-05 by macroforecast author.

`polynomial` – operational

Polynomial basis expansion – degree-d powers of input.

sklearn PolynomialFeatures of degree params.degree. Includes interaction terms by default; set params.interaction_only=True for products without pure powers.

When to use

Capturing low-order non-linearity for linear / kernel models.

When NOT to use

High dimension (degree > 3 with many predictors) – explodes the design matrix; use kernel methods instead.

References

macroforecast design Part 2, L3: ‘feature engineering is a DAG of typed transforms; cascade-depth bounds the longest chain at cascade_max_depth.’

Related options: interaction, kernel_features, polynomial_expansion

Last reviewed 2026-05-05 by macroforecast author.

`polynomial_expansion` – operational

Alias for polynomial – explicit expansion node in cascade pipelines.

Identical to polynomial but with a name that reads more clearly as a stage in a multi-step expansion pipeline.

When to use

Pipelines that explicitly stage expand → reduce sequences.

References

macroforecast design Part 2, L3: ‘feature engineering is a DAG of typed transforms; cascade-depth bounds the longest chain at cascade_max_depth.’

Related options: polynomial

Last reviewed 2026-05-05 by macroforecast author.

`random_projection` – operational

Johnson-Lindenstrauss random Gaussian projection.

Reduces dimensionality by multiplying with a random Gaussian matrix scaled to (approximately) preserve pairwise distances. Cheap baseline for dimensionality reduction; sklearn’s GaussianRandomProjection.

When to use

Sweep baselines / sanity checks against PCA’s structured reduction.

References

macroforecast design Part 2, L3: ‘feature engineering is a DAG of typed transforms; cascade-depth bounds the longest chain at cascade_max_depth.’

Related options: pca, kernel_features

Last reviewed 2026-05-05 by macroforecast author.

`recursive_feature_elimination` – future

(no schema description for recursive_feature_elimination)

TBD: option doc not yet authored for this value. The encyclopedia falls back to the bare schema description above. PRs adding a full OptionDoc entry under macroforecast/scaffold/option_docs/l3.py are welcome.

`regime_indicator` – operational

Discrete regime / state indicator from L1.G.

Lifts the L1.G regime sink (regime_indicator) into the feature panel as a categorical column. Required when L4 is configured for regime_use = predictor or conditional_intercept.

When to use

Regime-conditional forecasts where the model needs explicit access to the state.

References

macroforecast design Part 2, L3: ‘feature engineering is a DAG of typed transforms; cascade-depth bounds the longest chain at cascade_max_depth.’

Related options: season_dummy, time_trend

Last reviewed 2026-05-05 by macroforecast author.

`savitzky_golay_filter` – operational

Polynomial-fit smoothing filter (Savitzky & Golay 1964).

Local polynomial regression smoothing: each output value is the polynomial-fit centre value of a moving window. window_length (default 5) and polyorder (default 2) parameterise the kernel. Operational: runtime delegates to scipy.signal.savgol_filter (scipy is a hard dependency).

Used as the fixed-window baseline against which Goulet Coulombe & Klieber (2025) AlbaMA’s adaptive-window estimator is compared in the v0.9.x replication recipe.

When to use

Smoothing macro indicator series for monitoring; AlbaMA replication baseline.

When NOT to use

Series with strong non-linear trends – the polynomial fit smooths them out.

References

macroforecast design Part 2, L3: ‘feature engineering is a DAG of typed transforms; cascade-depth bounds the longest chain at cascade_max_depth.’
Savitzky & Golay (1964) ‘Smoothing and Differentiation of Data by Simplified Least Squares Procedures’, Analytical Chemistry 36(8).

Related options: hp_filter, hamilton_filter, ma_window, adaptive_ma_rf

Last reviewed 2026-05-05 by macroforecast author.

`scale` – operational

Standardise to zero mean and unit variance.

Computes (y - μ) / σ over the temporal_rule window (expanding_window_per_origin by default to avoid look-ahead). Required pre-step for distance-based learners (kNN, SVM, NN); ridge/lasso also benefit when columns are on different scales.

When to use

Pre-conditioning for distance-based or regularised learners; mandatory for SVM/NN.

When NOT to use

Tree-based models (RF/XGBoost/LightGBM) – scale-invariant by construction.

References

macroforecast design Part 2, L3: ‘feature engineering is a DAG of typed transforms; cascade-depth bounds the longest chain at cascade_max_depth.’

Related options: pca, kernel_features

Last reviewed 2026-05-05 by macroforecast author.

`scaled_pca` – operational

Scaled / weighted PCA (target-aware factor extraction).

Weights each column by its predictive correlation with the target before performing PCA. Implements the Huang-Jiang-Tu-Zhou (2022) scaled PCA for forecasting macro variables.

Reduces to plain PCA when all weights are equal.

When to use

When standard PCA’s leading factor is dominated by predictively-irrelevant variance.

References

macroforecast design Part 2, L3: ‘feature engineering is a DAG of typed transforms; cascade-depth bounds the longest chain at cascade_max_depth.’
Huang, Jiang, Tu & Zhou (2022) ‘Scaled PCA: A New Approach to Dimension Reduction’, Management Science 68(3): 1678-1695.

Related options: pca, partial_least_squares

Last reviewed 2026-05-05 by macroforecast author.

`season_dummy` – operational

Calendar dummy variables (month-of-year, quarter-of-year).

Generates params.n - 1 0/1 indicators for the calendar period (drops one to avoid multicollinearity with intercept). Standard frequentist seasonality control.

When to use

Capturing calendar seasonality in linear models when a smooth Fourier basis would over-shrink discrete jumps.

References

macroforecast design Part 2, L3: ‘feature engineering is a DAG of typed transforms; cascade-depth bounds the longest chain at cascade_max_depth.’

Related options: fourier, seasonal_lag

Last reviewed 2026-05-05 by macroforecast author.

`seasonal_lag` – operational

Lag at a seasonal period (e.g. y_{t-12} for monthly data).

Standard lag op restricted to the seasonal index (params.lag = 12 for monthly, 4 for quarterly). Useful for year-over-year features and seasonal AR terms.

When to use

Capturing year-over-year persistence; seasonal AR baselines.

When NOT to use

When season_dummy or X-13 deseasonalisation is preferred over lag-based seasonality.

References

macroforecast design Part 2, L3: ‘feature engineering is a DAG of typed transforms; cascade-depth bounds the longest chain at cascade_max_depth.’

Related options: season_dummy, ma_window

Last reviewed 2026-05-05 by macroforecast author.

`sliced_inverse_regression` – operational

sSUFF / Sliced inverse regression (scaled) – supervised dimension reduction (Huang-Jiang-Li-Tong-Zhou 2022).

Supervised dimension reduction extending scaled_pca to non-linear y → X dependence. Pipeline: (1) standardise X; (2) optional column-wise predictive scaling (scaling_method = scaled_pca reuses the Huang-Zhou OLS-slope; marginal_R2 uses sign(β_j)·√R²_j; none skips); (3) sort rows by y and partition into n_slices H contiguous slices; (4) compute weighted between-slice covariance Σ_S = Σ_h (n_h/n) · m̄_h · m̄_h^⊤; (5) take the top-n_components eigenvectors as factor loadings; (6) project the full panel onto these directions. The sSUFF augmentation (Huang-Zhou-Tong 2022) recovers latent factors with higher correlation than plain SIR in the macro-panel regime where signals are sparse over predictors.

Defaults: n_components = 2, n_slices = 10, scaling_method = 'scaled_pca'. Requires a target_signal input port; temporal_rule is required and rejects full_sample_once.

When to use

Supervised factor extraction from macro panels with non-linear y → X structure; alternative to scaled_pca when the predictive direction is non-monotone.

When NOT to use

Very small T (need ≥ 5·n_slices observations after dropping NaN); strictly linear y → X relationship (scaled_pca is sufficient).

References

macroforecast design Part 2, L3: ‘feature engineering is a DAG of typed transforms; cascade-depth bounds the longest chain at cascade_max_depth.’
Huang, Jiang, Li, Tong & Zhou (2022) ‘Scaled PCA: A New Approach to Dimension Reduction’, Management Science 68(3): 1678-1695.
Fan, Xue & Yao (2017) ‘Sufficient forecasting using factor models’, Journal of Econometrics 201(2): 292-306.
Li (1991) ‘Sliced Inverse Regression for Dimension Reduction’, JASA 86(414): 316-327.

Related options: scaled_pca, supervised_pca, partial_least_squares

Last reviewed 2026-05-05 by macroforecast author.

`sparse_pca` – operational

Sparse PCA – L1-penalised factor loadings (sklearn / Zou-Hastie-Tibshirani 2006).

Variant of PCA where loadings are pushed toward zero by an L1 penalty (params.alpha). Yields more interpretable factors at the cost of a small reconstruction loss; uses sklearn’s SparsePCA.

When to use

When you want factor loadings to map cleanly onto a small subset of original predictors (interpretability).

When NOT to use

When pure variance maximisation is more important than interpretability – use plain pca. For the Chen-Rohe (2023) SCA variant used in Zhou-Rapach (2025) Sparse Macro-Finance Factors, use sparse_pca_chen_rohe instead.

References

macroforecast design Part 2, L3: ‘feature engineering is a DAG of typed transforms; cascade-depth bounds the longest chain at cascade_max_depth.’

Related options: pca, scaled_pca, sparse_pca_chen_rohe, supervised_pca

Last reviewed 2026-05-05 by macroforecast author.

`sparse_pca_chen_rohe` – operational

Chen-Rohe (2023) Sparse Component Analysis – non-diagonal D variant.

Sparse component analysis solving min_{Z,D,Θ} ‖X − Z D Θ'‖_F s.t. Z ∈ S(T,J), Θ ∈ S(M,J), ‖Θ‖_1 ≤ ζ (Chen-Rohe 2023; Rapach & Zhou 2025 eq. 3). Differs from sparse_pca (sklearn / Zou-Hastie-Tibshirani 2006) in two ways: (1) the central matrix D is not restricted to be diagonal, which lets SCA explain more total variation for a given sparsity budget; (2) the single hyperparameter ζ ∈ [J, J√M] enters as an ℓ_1 budget constraint rather than a Lagrangian penalty.

Implementation: alternating maximisation of the equivalent bilinear convex-hull form max_{Z,Θ} ‖Z' X Θ‖_F over H(T,J) × H(M,J) (Zhou-Rapach 2025 eq. 4), iterating SVD-projection of Z and L1-budget projection of Θ. Used as the macro-side stage in Rapach & Zhou (2025) Sparse Macro-Finance Factors. Operational v0.9.1 dev-stage v0.9.0C-3.

Hyperparams: n_components (= J; default 4), zeta (= L1 budget; 0.0 defaults to J = most-binding boundary the paper finds optimal in CV), max_iter (default 200), random_state.

When to use

Sparse macro-finance factor extraction with non-diagonal D; the Rapach-Zhou (2025) macro-side procedure.

When NOT to use

When sklearn-style L1-penalised loadings are sufficient – prefer the cheaper sparse_pca.

References

macroforecast design Part 2, L3: ‘feature engineering is a DAG of typed transforms; cascade-depth bounds the longest chain at cascade_max_depth.’
Chen & Rohe (2023) ‘A New Basis for Sparse Principal Component Analysis’, Journal of Computational and Graphical Statistics. arXiv:2007.00596.
Rapach & Zhou (2025) ‘Sparse Macro-Finance Factors’ working paper – §2.1 eqs. (3)-(4).

Related options: sparse_pca, supervised_pca, scaled_pca, pca

Last reviewed 2026-05-05 by macroforecast author.

`stability_selection` – future

(no schema description for stability_selection)

TBD: option doc not yet authored for this value. The encyclopedia falls back to the bare schema description above. PRs adding a full OptionDoc entry under macroforecast/scaffold/option_docs/l3.py are welcome.

`supervised_pca` – operational

Supervised PCA (Giglio-Xiu-Zhang 2025) – screen-then-PCA on a target panel.

Two-stage supervised reduction:

For each target column g, rank panel columns by univariate correlation with g and keep the top ⌊q · M⌋ (q ∈ (0, 1] hyperparameter; default 0.5);
Run PCA on the screened sub-panel, returning P supervised components.

Refinement of Giglio-Xiu (2021) three-pass: screening makes the construction robust to weak factors and omitted-variable bias. Used as the asset-side stage of Rapach & Zhou (2025) Sparse Macro-Finance Factors for risk-premium estimation. Distinct from partial_least_squares (PLS uses covariance-maximising NIPALS over all columns; SPCA uses correlation-screened PCA on a sub-panel) and from scaled_pca (Huang-Jiang-Tu-Zhou 2022 weights every column; SPCA hard-screens).

Operational v0.9.1 dev-stage v0.9.0C-4. Hyperparams: n_components (= P; default 4), q (screening rate; default 0.5).

When to use

Cross-sectional asset-pricing factor extraction; weak-factor-robust supervised reduction; Rapach-Zhou (2025) replication.

When NOT to use

When the supervisory signal is dense (every panel column matters) – prefer scaled_pca or partial_least_squares.

References

macroforecast design Part 2, L3: ‘feature engineering is a DAG of typed transforms; cascade-depth bounds the longest chain at cascade_max_depth.’
Giglio, Xiu & Zhang (2025) ‘Test Assets and Weak Factors’, Journal of Finance, forthcoming.
Giglio & Xiu (2021) ‘Asset Pricing with Omitted Factors’, Journal of Political Economy 129(7): 1947-1990.
Rapach & Zhou (2025) ‘Sparse Macro-Finance Factors’ working paper – §2.2 eqs. (5)-(8).

Related options: partial_least_squares, scaled_pca, sparse_pca_chen_rohe, pca

Last reviewed 2026-05-05 by macroforecast author.

`target_construction` – operational

Build the supervised target (y) from the panel.

Constructs the regression target according to forecast_strategy (direct / iterated / cumulative_average) and the L1.F horizon set. Outputs the y artifact that L4 fit_model nodes consume.

Required as the leaf of every L3 DAG; the runtime auto-injects it when the user does not.

When to use

Always required – runtime auto-inserts when missing.

References

macroforecast design Part 2, L3: ‘feature engineering is a DAG of typed transforms; cascade-depth bounds the longest chain at cascade_max_depth.’

Related options: level, diff, log_diff

Last reviewed 2026-05-05 by macroforecast author.

`time_trend` – operational

Deterministic linear time trend (t = 1, 2, ...).

Adds a column t to the panel; with params.degree > 1 appends polynomial trends. Deterministic complement to stochastic detrending (HP / Hamilton).

When to use

Trend-stationary linear models where a deterministic trend is part of the DGP.

When NOT to use

Series with structural breaks – use regime_indicator or stochastic detrending instead.

References

macroforecast design Part 2, L3: ‘feature engineering is a DAG of typed transforms; cascade-depth bounds the longest chain at cascade_max_depth.’

Related options: hp_filter, hamilton_filter

Last reviewed 2026-05-05 by macroforecast author.

`u_midas` – operational

Unrestricted MIDAS lag stack – mixed-frequency aggregation primitive (Foroni-Marcellino-Schumacher 2015).

Atomic mixed-frequency primitive: stacks K = n_lags_high high-frequency lags of each predictor at low-frequency dates without imposing any weighting structure. For HF column col and LF index position t·m, emits col_lag0, col_lag1, …, col_lag{K-1} where col_lagk[t] = frame[col].iloc[t·m − k].

The downstream L4 OLS estimator recovers data-driven lag coefficients – the unrestricted MIDAS regression (paper §3.2 eq.(20)):

y_{t×k} = μ₀ + μ₁ y_{t×k−k} + ψ₀ x_{t×k−1} + ψ₁ x_{t×k−2} + … + ψ_K x_{t×k−K} + ε_{t×k}

where μ₀, μ₁, and ψ(L) are estimated by OLS (paper §3.2 p.11). Ridge regularisation is available as an explicit opt-in via regularization='ridge' in the paper_methods.u_midas(...) recipe; it deviates from the paper’s estimator choice and is not the default.

Lag-order selection: n_lags_high='bic' (default) runs BIC over K ∈ {1, …, ceil(1.5 × freq_ratio)}, fitting OLS at each candidate and selecting K* = argmin BIC (paper §3.2 p.11 + §3.5). Pass an integer to fix K. 'aic' is also accepted.

AR(1) y-lag: the paper_methods.u_midas(...) helper sets include_y_lag=True by default, prepending the lagged target y_lag1 as the leftmost design-matrix column (μ₁ term of eq.(20)). Set include_y_lag=False to match the simplified §2.3 eq.(14) form with no AR component.

Defaults: freq_ratio = 3 (quarterly target / monthly HF), n_lags_high = 'bic'; target_freq = 'low' subsamples the LF anchor dates. temporal_rule is required and rejects full_sample_once so the aggregation respects walk-forward boundaries.

Surfaces the Borup-Rapach-Schütte (2023) mixed-frequency ML-nowcasting workflow as a 1-line recipe via paper_methods.u_midas(...).

When to use

Macro nowcasting with monthly predictors and quarterly targets; mixed-frequency feature engineering when no parametric weight kernel is desired.

When NOT to use

When n_lags_high · n_HF_columns is large relative to T even after BIC selects a small K – BIC penalises over-parameterisation but cannot reduce the number of predictor columns. Use midas parametric weighting instead, or pair with downstream lasso / ridge (set regularization='ridge') to handle wide design matrices.

References

macroforecast design Part 2, L3: ‘feature engineering is a DAG of typed transforms; cascade-depth bounds the longest chain at cascade_max_depth.’
Foroni, Marcellino & Schumacher (2011/2015) ‘Unrestricted Mixed Data Sampling (MIDAS): MIDAS Regressions With Unrestricted Lag Polynomials’. Bundesbank Discussion Paper Series 1, No. 35/2011; published as JRSS-A 178(1): 57-82. DOI 10.1111/rssa.12043.
Borup, Rapach & Schütte (2023) ‘Mixed-frequency machine learning: Nowcasting and backcasting weekly initial claims with daily internet search-volume data’, International Journal of Forecasting 39(3): 1122-1144.

Related options: midas, lag, ma_window

Last reviewed 2026-05-05 by macroforecast author.

`varimax` – operational

Varimax-rotated factors (orthogonal rotation for interpretability).

Applies a varimax rotation to PCA loadings, maximising the variance of squared loadings within each factor. Produces factors that load heavily on a small subset of original predictors – useful for naming / labelling factors.

When to use

Factor analysis where downstream interpretation requires distinct, well-named factors.

References

macroforecast design Part 2, L3: ‘feature engineering is a DAG of typed transforms; cascade-depth bounds the longest chain at cascade_max_depth.’

Related options: pca, sparse_pca

Last reviewed 2026-05-05 by macroforecast author.

`varimax_rotation` – operational

Alias for varimax – rotation step in a multi-stage factor pipeline.

Identical operation to varimax but registered separately so a cascading L3 pipeline can declare pca → varimax_rotation as two visible nodes in its lineage.

When to use

Multi-stage pipelines that explicitly separate factor extraction from rotation.

References

macroforecast design Part 2, L3: ‘feature engineering is a DAG of typed transforms; cascade-depth bounds the longest chain at cascade_max_depth.’

Related options: varimax, pca

Last reviewed 2026-05-05 by macroforecast author.

`wavelet` – operational

Discrete wavelet transform – multi-scale time-frequency features.

Decomposes the series into wavelet detail and approximation coefficients at several scales (params.wavelet, params.level). Captures localised time-frequency patterns that Fourier basis cannot.

When to use

Series with localised oscillations or non-stationary cycles (financial / climate macro).

When NOT to use

Smooth seasonal patterns – use fourier instead.

References

macroforecast design Part 2, L3: ‘feature engineering is a DAG of typed transforms; cascade-depth bounds the longest chain at cascade_max_depth.’

Related options: fourier

Last reviewed 2026-05-05 by macroforecast author.

op

Sub-layer

Axis metadata

Operational status summary

Options

adaptive_ma_rf – operational

asymmetric_trim – operational

boruta_selection – future

cumsum – operational

dfm – operational

diff – operational

feature_selection – operational

fourier – operational

genetic_algorithm_selection – future

hamilton_filter – operational

holiday – operational

hp_filter – operational

interaction – operational

kernel – operational

kernel_features – operational

l3_feature_bundle – operational

l3_metadata_build – operational

lasso_path_selection – future

level – operational

log – operational

log_diff – operational

ma_increasing_order – operational

ma_window – operational

maf_per_variable_pca – operational

midas – operational

nystroem – operational

nystroem_features – operational

partial_least_squares – operational

pca – operational

pct_change – operational

polynomial – operational

polynomial_expansion – operational

random_projection – operational

recursive_feature_elimination – future

regime_indicator – operational

savitzky_golay_filter – operational

scale – operational

scaled_pca – operational

season_dummy – operational

seasonal_lag – operational

sliced_inverse_regression – operational

sparse_pca – operational

sparse_pca_chen_rohe – operational

stability_selection – future

supervised_pca – operational

target_construction – operational

time_trend – operational

u_midas – operational

varimax – operational

varimax_rotation – operational

wavelet – operational

`op`

`adaptive_ma_rf` – operational

`asymmetric_trim` – operational

`boruta_selection` – future

`cumsum` – operational

`dfm` – operational

`diff` – operational

`feature_selection` – operational

`fourier` – operational

`genetic_algorithm_selection` – future

`hamilton_filter` – operational

`holiday` – operational

`hp_filter` – operational

`interaction` – operational

`kernel` – operational

`kernel_features` – operational

`l3_feature_bundle` – operational

`l3_metadata_build` – operational

`lasso_path_selection` – future

`level` – operational

`log` – operational

`log_diff` – operational

`ma_increasing_order` – operational

`ma_window` – operational

`maf_per_variable_pca` – operational

`midas` – operational

`nystroem` – operational

`nystroem_features` – operational

`partial_least_squares` – operational

`pca` – operational

`pct_change` – operational

`polynomial` – operational

`polynomial_expansion` – operational

`random_projection` – operational

`recursive_feature_elimination` – future

`regime_indicator` – operational

`savitzky_golay_filter` – operational

`scale` – operational

`scaled_pca` – operational

`season_dummy` – operational

`seasonal_lag` – operational

`sliced_inverse_regression` – operational

`sparse_pca` – operational

`sparse_pca_chen_rohe` – operational

`stability_selection` – future

`supervised_pca` – operational

`target_construction` – operational

`time_trend` – operational

`u_midas` – operational

`varimax` – operational

`varimax_rotation` – operational

`wavelet` – operational