Build-to-Show · 2026-04-05

Anatomy of a Demand Forecasting Pipeline

A step-by-step walkthrough of building a demand forecasting pipeline — from exploratory analysis to predictive modeling — using a public chocolate sales dataset to demonstrate the methodology.

Demand Forecasting · Machine Learning · Python · Time Series · Feature Engineering

Note: This case study uses a public dataset (global chocolate sales, 2022–2024) to walk through the full demand forecasting workflow. The techniques and thinking apply directly to enterprise-scale supply chain forecasting — the kind of work I do professionally across global logistics networks.

The Problem

Every supply chain runs on a forecast. Get it right and you optimize inventory, reduce waste, and keep customers happy. Get it wrong and you're either sitting on excess stock or scrambling to fill shortages.

Most teams rely on historical sales data and a time-series model. That's a reasonable starting point — but it raises a critical question: what's actually driving your demand, and how much of it can a model learn?

This project dissects a demand forecasting pipeline end-to-end — from raw data to predictive model to actionable insight — to answer that question.

The Dataset

To demonstrate the workflow without proprietary data, I used a publicly available chocolate sales dataset covering:

  • 3,283 transactions across 6 countries (Australia, India, UK, USA, Canada, New Zealand)
  • 22 product SKUs across 5 categories (Specialty, Classic, Dark Chocolate, Premium, Baking & Beverages)
  • Time span: July 2022 – December 2024
  • Features: Sales person, country, product, date, revenue, boxes shipped

It's a simplified version of a real distribution network — enough to demonstrate the methodology without the complexity (or confidentiality) of enterprise data.

The Approach

Step 1: Understand the Signal Landscape

Before touching any model, I mapped the data to understand what signals exist:

  • Temporal patterns — Monthly revenue trends revealed clear seasonality, with Q4 and January showing distinct peaks
  • Geographic variance — Performance varied dramatically by country, with Australia leading at ~18% of total revenue despite not being the largest market by transaction count
  • Product mix dynamics — Specialty products dominated (39% of transactions) but Premium showed higher per-unit value
  • Sales channel effects — Individual salesperson performance varied by 3–5x, suggesting route-to-market is itself a demand signal

This step is critical in real engagements. Most teams jump straight to modeling. But understanding your signal landscape tells you what the model can learn from — and more importantly, what's missing.
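As a sketch, the two highest-value checks here (monthly revenue trend and country revenue share) reduce to a few pandas aggregations. The frame below is a tiny synthetic stand-in; the column names are assumptions, not the dataset's exact headers:

```python
import pandas as pd

# Hypothetical frame mirroring the dataset's shape (column names assumed)
sales = pd.DataFrame({
    "date": pd.to_datetime(["2022-07-05", "2022-07-20", "2022-08-03", "2022-08-15"]),
    "country": ["Australia", "India", "Australia", "UK"],
    "revenue": [5320.0, 1870.0, 4410.0, 2650.0],
})

# Temporal signal: monthly revenue totals reveal seasonality
monthly = sales.groupby(sales["date"].dt.to_period("M"))["revenue"].sum()

# Geographic signal: each country's share of total revenue
country_share = sales.groupby("country")["revenue"].sum() / sales["revenue"].sum()

print(monthly)
print(country_share.sort_values(ascending=False))
```

The same two groupbys, plotted over the full dataset, are what surfaced the Q4/January peaks and Australia's outsized revenue share.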

Step 2: Feature Engineering

Raw data rarely predicts well. The transformation layer is where domain expertise matters most.

Time-based features extracted:

  • Year, month, quarter, day of week, week of year
  • These capture seasonality, weekly buying patterns, and year-over-year trends

Derived metrics:

  • Price per box (revenue / boxes shipped) — reveals pricing strategy effects on demand
  • Product category groupings — reduces 22 SKUs to 5 meaningful segments

Encoded categorical signals:

  • Country, product, salesperson — each encoded to capture their individual demand influence

Note that this dataset only contains internal signals. In a production pipeline, this is where you'd also layer in external signals — weather data, economic indicators, event calendars, promotional schedules. That enrichment step is often where the biggest accuracy gains come from; I return to it in the Evolving This Pipeline section below.
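The transformations above can be sketched in a few lines of pandas. The column names and sample rows below are illustrative assumptions, not the dataset's exact schema:

```python
import pandas as pd

# Hypothetical transaction frame (column names assumed)
df = pd.DataFrame({
    "date": pd.to_datetime(["2022-07-05", "2023-12-18"]),
    "revenue": [5320.0, 8140.0],
    "boxes_shipped": [180, 220],
    "country": ["Australia", "UK"],
})

# Time-based features: seasonality, weekly patterns, year-over-year trend
df["year"] = df["date"].dt.year
df["month"] = df["date"].dt.month
df["quarter"] = df["date"].dt.quarter
df["day_of_week"] = df["date"].dt.dayofweek
df["week_of_year"] = df["date"].dt.isocalendar().week.astype(int)

# Derived metric: price per box exposes pricing effects on demand
df["price_per_box"] = df["revenue"] / df["boxes_shipped"]

# Categorical encoding: integer label codes are sufficient for tree models
df["country_code"] = df["country"].astype("category").cat.codes
```

Label codes (rather than one-hot vectors) are a deliberate choice here: tree-based models split on thresholds, so compact integer codes work without exploding the feature space.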

Step 3: Predictive Modeling

I tested two approaches to predict transaction-level revenue:

Random Forest Regression

  • Ensemble of decision trees, robust to non-linear relationships
  • Handles mixed feature types (categorical + numerical) naturally
  • Result: R² = 0.61, RMSE of ~$2,808

Gradient Boosting Regression

  • Sequential error-correction approach
  • Often stronger on tabular data
  • Result: R² = 0.10, RMSE of ~$3,886

The Random Forest significantly outperformed here — which itself is an insight. The demand patterns in this data are non-linear and interaction-heavy (country × product × time combinations matter more than any single feature), which is exactly where tree-based ensembles excel.

Cross-validation confirmed stability: RF achieved 0.61 ± 0.07 across folds, meaning the model generalizes rather than overfitting to the training set.
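A minimal sketch of this comparison with scikit-learn, using a synthetic stand-in for the engineered feature matrix (the real pipeline would feed in the Step 2 features):

```python
from sklearn.datasets import make_regression  # stand-in for the real feature matrix
from sklearn.ensemble import GradientBoostingRegressor, RandomForestRegressor
from sklearn.model_selection import cross_val_score

# Synthetic regression data standing in for the engineered features
X, y = make_regression(n_samples=500, n_features=8, noise=25.0, random_state=42)

models = {
    "random_forest": RandomForestRegressor(n_estimators=100, random_state=42),
    "gradient_boosting": GradientBoostingRegressor(random_state=42),
}

# 5-fold cross-validated R²: the mean ± std stability check described above
for name, model in models.items():
    scores = cross_val_score(model, X, y, cv=5, scoring="r2")
    print(f"{name}: R² = {scores.mean():.2f} ± {scores.std():.2f}")
```

Note that on this synthetic (mostly linear) data the ranking may differ from the chocolate dataset's; which ensemble wins depends on how interaction-heavy the real signal is.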

Step 4: What Drives Demand?

Feature importance analysis revealed the hierarchy of demand drivers:

  • Boxes Shipped (28.1%) — Volume is the strongest revenue predictor (unsurprising, but validates data quality)
  • Sales Person (20.8%) — Who sells matters: channel and relationship effects are real
  • Product (19.4%) — Product mix drives revenue variance
  • Country (12.5%) — Geographic market differences are significant
  • Month (9.8%) — Seasonality matters, but less than structural factors
  • Day of Week (5.7%) — Weekly patterns exist but are secondary
  • Quarter (2.1%) — Captured mostly by month already
  • Year (1.6%) — Minimal trend effect in this timeframe

The insight for supply chain teams: Structural factors (who, what, where) explain ~80% of demand variance. Temporal factors (when) explain ~20%. This means your master data quality and segmentation strategy matter more than your time-series model sophistication — a finding I've seen consistently in enterprise environments.
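Extracting this hierarchy from a fitted forest is essentially a one-liner on `feature_importances_`. The sketch below runs on synthetic data with the same feature names, purely to show the mechanics:

```python
import pandas as pd
from sklearn.datasets import make_regression
from sklearn.ensemble import RandomForestRegressor

# Synthetic stand-in; the real pipeline uses the engineered features
feature_names = ["boxes_shipped", "sales_person", "product", "country",
                 "month", "day_of_week", "quarter", "year"]
X, y = make_regression(n_samples=300, n_features=len(feature_names), random_state=0)

rf = RandomForestRegressor(n_estimators=100, random_state=0).fit(X, y)

# Impurity-based importances, ranked high to low (they sum to 1.0)
importances = (
    pd.Series(rf.feature_importances_, index=feature_names)
    .sort_values(ascending=False)
)
print(importances)
```

One caveat worth knowing: impurity-based importances can inflate high-cardinality features, so permutation importance on a held-out set is a useful cross-check in production.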

Step 5: Product Classification

Beyond forecasting revenue, I built a classifier to predict which product category a transaction belongs to, based on transaction characteristics.

  • Random Forest Classifier: 87% holdout accuracy (95.5% mean cross-validation accuracy)
  • Best performance on Specialty and Dark Chocolate categories
  • Weaker on Baking & Beverages (smallest category, fewer training examples)

Why this matters operationally: Accurate product classification enables automated inventory allocation, demand sensing by category, and dynamic assortment planning — all critical for multi-product distribution networks.
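A sketch of the classification setup, again on synthetic stand-in data (five classes mirroring the five product categories):

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import cross_val_score, train_test_split

# Synthetic transaction features with 5 category labels (stand-in data)
X, y = make_classification(n_samples=500, n_features=8, n_informative=5,
                           n_classes=5, random_state=1)

# Stratified split keeps small categories represented in both sets
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, stratify=y, random_state=1)

clf = RandomForestClassifier(n_estimators=100, random_state=1).fit(X_train, y_train)
holdout_acc = clf.score(X_test, y_test)
cv_acc = cross_val_score(clf, X, y, cv=5).mean()
print(f"holdout accuracy: {holdout_acc:.2f}, cross-validated: {cv_acc:.2f}")
```

The stratified split matters here precisely because of the Baking & Beverages problem above: without it, the smallest category can end up underrepresented in training and drag down its per-class recall further.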

Business Impact: What This Means in Dollars

Technical metrics like R² and RMSE matter to data scientists. Supply chain leaders need to know: what does better forecasting actually save?

Here's how the findings from this pipeline translate to real operational impact:

Forecast Accuracy → Inventory Reduction

Industry research consistently shows that a 1% improvement in forecast accuracy reduces inventory by 1–2% (Gartner, McKinsey). Our Random Forest model achieved R² = 0.61, well above a naive mean-prediction baseline (R² = 0) — and in a real deployment, even modest accuracy gains compound across thousands of SKUs.

For a distributor doing $20M in annual revenue (the scale of this dataset):

  • 10% improvement in forecast accuracy → ~$200K–400K in freed working capital from inventory reduction
  • Reduced safety stock buffers → lower warehousing costs, less product expiry, fewer markdowns
  • Better allocation across 6 markets → fewer stockouts in high-demand regions, less excess in low-demand ones
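The arithmetic behind that working-capital range is worth making explicit. The inventory figure below is an assumption: stock worth roughly 10% of annual revenue, a common rough ratio for distributors.

```python
# Back-of-envelope translation of the "1% accuracy → 1–2% inventory" rule
annual_revenue = 20_000_000                  # dataset-scale distributor
inventory_value = 0.10 * annual_revenue      # ASSUMED inventory-to-revenue ratio
accuracy_gain_pct = 10                       # forecast accuracy improvement, in %

low = inventory_value * accuracy_gain_pct * 0.01   # 1% inventory per 1% accuracy
high = inventory_value * accuracy_gain_pct * 0.02  # 2% inventory per 1% accuracy
print(f"freed working capital: ${low:,.0f} – ${high:,.0f}")
# → freed working capital: $200,000 – $400,000
```

Change the inventory ratio and the range shifts proportionally, which is why the estimate is given as a band rather than a point.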

Structural Insights → Smarter Segmentation

The feature importance analysis revealed that who sells, what product, and where matter 4x more than when. This has direct implications:

  • Salesperson explains 20.8% of variance → align top performers to high-value accounts; train or reassign underperformers. Estimated impact: 5–15% revenue lift from channel optimization
  • Product mix drives 19.4% of variance → tailor assortment by market instead of uniform distribution. Estimated impact: 10–20% reduction in slow-moving inventory
  • Geography accounts for 12.5% of variance → differentiate safety stock and reorder points by country. Estimated impact: 15–25% reduction in regional stockouts
  • Seasonality is only 9.8% → stop over-investing in seasonal models; fix your master data first. Estimated impact: redirected analytics effort toward higher-ROI problems

Product Classification → Operational Automation

The 87% classification accuracy enables:

  • Automated routing of new products to the correct inventory pool — eliminating manual categorization delays
  • Dynamic demand sensing by category — detecting shifts from Premium to Classic (or vice versa) in near real-time
  • Assortment planning — data-driven decisions about which categories to expand or contract in each market

The Compounding Effect

These aren't isolated improvements. Better forecasting reduces inventory, which frees cash, which funds better data infrastructure, which improves the next model. In my experience across enterprise supply chains, teams that invest in this pipeline see cumulative cost reductions of 8–15% of total supply chain spend within 12–18 months.

The key is not the model — it's the systematic approach: understand your signals, engineer the right features, validate rigorously, and let the data tell you where to invest next.

Evolving This Pipeline

This demo covers the core workflow with internal data. In a production environment, the next evolution steps would be:

External signal enrichment:

  • Weather data by region (chocolate demand is temperature-sensitive)
  • Holiday and event calendars by country
  • Economic indicators (consumer spending indices, inflation data)
  • Promotional calendars and pricing changes

Advanced temporal modeling:

  • Prophet or NeuralProphet for trend + seasonality decomposition
  • LSTM or Temporal Fusion Transformer for sequence learning
  • Hierarchical reconciliation across country → category → SKU levels
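Of these, bottom-up hierarchical reconciliation is the simplest to sketch: forecast at the lowest level, then aggregate upward, so every level of the hierarchy sums consistently. The numbers below are purely illustrative:

```python
import pandas as pd

# Hypothetical SKU-level forecasts (values illustrative)
sku_forecast = pd.DataFrame({
    "country": ["Australia", "Australia", "UK", "UK"],
    "category": ["Premium", "Classic", "Premium", "Classic"],
    "forecast": [1200.0, 800.0, 950.0, 600.0],
})

# Bottom-up reconciliation: higher levels are sums of their children,
# so country, category, and total forecasts can never disagree
category_level = sku_forecast.groupby(["country", "category"])["forecast"].sum()
country_level = sku_forecast.groupby("country")["forecast"].sum()
total = sku_forecast["forecast"].sum()

# Coherence check: every level aggregates exactly to the one above it
assert country_level.sum() == total
print(country_level)
```

Bottom-up is only one reconciliation strategy; top-down and optimal (MinT-style) reconciliation trade bias against variance differently, but all share this coherence property.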

Operational hardening:

  • Probabilistic forecasting (prediction intervals, not just point estimates)
  • Automated retraining pipelines with drift detection
  • A/B testing framework for forecast method selection
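Probabilistic forecasting, for example, falls out of a Random Forest almost for free: the spread of per-tree predictions gives a crude prediction interval around the point estimate. A sketch on synthetic data:

```python
import numpy as np
from sklearn.datasets import make_regression
from sklearn.ensemble import RandomForestRegressor

# Synthetic stand-in for the engineered feature matrix
X, y = make_regression(n_samples=300, n_features=6, noise=20.0, random_state=7)
rf = RandomForestRegressor(n_estimators=200, random_state=7).fit(X, y)

# Each tree gives one prediction; the spread across trees yields a
# rough 80% band around the ensemble's point forecast
per_tree = np.stack([tree.predict(X[:5]) for tree in rf.estimators_])
lower = np.percentile(per_tree, 10, axis=0)
upper = np.percentile(per_tree, 90, axis=0)
point = rf.predict(X[:5])

for p, lo, hi in zip(point, lower, upper):
    print(f"forecast {p:8.1f}  (80% band: {lo:8.1f} – {hi:8.1f})")
```

These tree-spread intervals are a heuristic, not calibrated quantiles; for production use, quantile regression or conformal prediction gives bands with actual coverage guarantees.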

Each of these layers builds on the foundation demonstrated here. The pipeline structure stays the same — the signals and models get richer.

Key Takeaways

  1. Feature engineering > model complexity. A well-featured Random Forest beat a more sophisticated Gradient Boosting model because the features captured the right signals.

  2. Structural factors dominate. Product, geography, and channel explain 4x more variance than time-based features. Invest in master data quality before investing in fancier algorithms.

  3. Know your ceiling. An R² of 0.61 with only internal data tells you where the model's limits are — and points you toward external signals as the next source of lift.

  4. Start simple, validate, then add complexity. This workflow — EDA → feature engineering → baseline model → enrichment — works at any scale. The dataset size changes; the methodology doesn't.


Built with Python (pandas, scikit-learn, matplotlib) using a publicly available dataset. The methodology reflects approaches I apply professionally in enterprise supply chain environments.

Esther Ho · AI x Supply Chain