How to Run Incrementality Tests That Prove Your Ad Spend Is Not Wasted
Are your ads actually driving incremental conversions? Here's how to run holdout tests that prove true ad impact.Practical approach with targeting strategies, creative frameworks, and metrics.
Every dollar you spend on ads is either creating a customer who would not have bought without the ad, or paying for a customer who would have bought anyway. The first type is incremental. The second type is waste. Most ad platforms report all conversions equally because they get paid regardless of incrementality. Your job is to figure out which conversions are real and which are phantom. Incrementality testing is how you do it. This guide covers the methodology, the math, and the practical execution across Google, Meta, and LinkedIn.
The uncomfortable truth is that a significant portion of ad-attributed conversions are not incremental. Studies consistently show that 20-60% of conversions claimed by ad platforms would have happened organically. That means if you are spending $100,000/month on ads, $20,000-$60,000 of it may be subsidizing behavior that was already going to happen. Incrementality testing quantifies this waste so you can reallocate it to channels and campaigns that actually move the needle.
- Incrementality measures whether your ads cause conversions or merely correlate with them. The difference can be 20-60% of your total attributed conversions.
- The gold standard method is a geo-holdout test: turn off ads in select regions and compare conversion rates to regions where ads continue running.
- Run incrementality tests for 2-4 weeks with at least 4 test regions and 4 control regions. Shorter tests produce unreliable results.
- Test your highest-spend channels first. If your biggest channel is only 30% incremental, reallocating that budget could fund your entire growth plan.
What Incrementality Actually Means
Incrementality answers a single question: would this conversion have happened if the ad had not been shown? If yes, the conversion is not incremental and the ad spend that "drove" it was wasted. If no, the conversion is incremental and the ad spend genuinely created value. This distinction is invisible in standard ad platform reporting because platforms take credit for every conversion that touches their ads, regardless of whether the ad was the actual cause.
Consider a branded search campaign on Google Ads. Someone searches for your company name and clicks your ad. Google reports a conversion. But that person already knew your company name, searched for it intentionally, and would have clicked the organic result if the ad had not been there. The conversion is real, but the ad's contribution is zero. You paid $5-15 for a click that would have been free. This is the most common example of non-incremental spend, and branded search campaigns are notorious for it.
Now consider a prospecting campaign on Meta targeting people who have never heard of your company. Someone sees your ad, visits your site for the first time, and signs up for a trial. This person had no prior awareness. Without the ad, they would not have visited. This conversion is incremental. The ad created a customer who otherwise would not exist.
Between these two extremes lies a spectrum. Retargeting campaigns are partially incremental: some of the people who saw your retargeting ad would have come back on their own, but some would not. Non-brand search campaigns are mostly incremental: the searcher has intent but might have chosen a competitor without your ad. The goal of incrementality testing is to quantify where each campaign falls on this spectrum so you can make budget decisions based on causal impact rather than reported conversions.
Incrementality testing reveals the true cost per incremental acquisition, which is almost always higher than the reported CPA
Why Platform Attribution Overstates Impact
Ad platforms have a structural incentive to claim credit for as many conversions as possible because their revenue depends on advertisers believing their ads work. This is not conspiracy. It is simply the attribution methodology. Every major platform uses some version of last-touch or view-through attribution that assigns credit to the ad if it was the last ad interaction before a conversion, or if the user merely saw the ad before converting. Both methods inflate reported performance.
Last-touch attribution gives 100% credit to the last ad clicked before a conversion. If a customer discovered you through a podcast, researched you on your blog, and then clicked a branded search ad before signing up, Google Ads claims the entire conversion. The podcast and blog that actually built awareness and intent get zero credit. Last-touch attribution systematically overcredits channels at the bottom of the funnel (branded search, retargeting) and undercredits channels at the top (display, social prospecting, content).
View-through attribution is even more problematic. If someone sees your Meta ad in their feed (without clicking it) and later converts through organic search, Meta claims a view-through conversion. The default attribution window for Meta is 1 day view-through and 7 days click-through. This means anyone who scrolled past your ad in the last 24 hours and then converted through any channel gets attributed to Meta. On a busy day with high reach, this captures a lot of conversions that Meta did not influence.
Multi-touch attribution models attempt to solve this by distributing credit across touchpoints. But they still operate within the platform ecosystem and cannot account for channels they do not track (word of mouth, organic brand awareness, PR). The only way to measure true incrementality is to run a controlled experiment where you remove the ad entirely and observe what happens to conversions.
The Three Incrementality Testing Methods
Incrementality Testing Approaches (Simplest to Most Rigorous)
Replace your real ad with a non-commercial public service announcement for a random subset of your audience. Both groups see an ad, but only the test group sees your real ad. Compare conversion rates between the two groups. This is the most rigorous method but requires platform integration (Meta offers this natively through their Conversion Lift tool). The control group sees a harmless ad, eliminating the possibility that the act of seeing any ad is what drives conversions.
Turn off ads completely in selected geographic regions for 2-4 weeks while keeping ads running in matched control regions. Compare conversion rates between the two groups. This is the most practical method for most advertisers because it does not require platform cooperation. The challenge is selecting regions that are comparable in size, demographics, and baseline conversion rates.
Turn ads off for a set period (1-2 weeks), measure conversions, then turn ads back on for the same period and compare. This is the simplest method but the least reliable because it does not control for time-based confounders like seasonality, product changes, or market events. Use this only as a directional signal when you cannot run a geo-holdout.
Setting Up a Geo-Holdout Test: Step by Step
The geo-holdout test is the practical gold standard for incrementality measurement. It does not require platform cooperation, works across all channels, and produces reliable results when designed correctly. Here is the step-by-step process.
Step 1: Select Your Test and Control Regions
Divide your markets into matched pairs of geographic regions. Each pair should be as similar as possible in population size, demographics, industry composition, and historical conversion rates. For US-based companies, DMA (Designated Market Area) regions work well because they represent distinct media markets with their own consumption patterns.
Select at least 4 regions for the test group (where ads are turned off) and 4 regions for the control group (where ads continue running). More regions improve statistical power. Assign regions randomly to test and control groups, then verify that the two groups have similar baseline metrics. If one group has systematically higher or lower conversion rates before the test, the results will be biased.
The test regions should represent 15-30% of your total market. Less than 15% produces too little data for statistical significance. More than 30% sacrifices too much potential revenue during the test period. The sweet spot is around 20%: enough data to be meaningful, small enough to limit the revenue impact of turning off ads.
Step 2: Establish Baseline Metrics
Before turning off ads, measure baseline conversion rates in all regions for 2-4 weeks. This pre-test period establishes the "normal" conversion rate for each region and confirms that your test and control groups are well-matched. If the test group has a 2.5% conversion rate and the control group has a 2.4% rate before the test, that is close enough. If the test group has 2.5% and the control has 3.5%, your region assignment is flawed and you need to re-randomize.
Track these baseline metrics: total conversions, conversion rate, revenue per conversion, and organic traffic volume. The organic traffic metric is critical because it provides a cross-check during the test. If organic traffic stays stable in the test regions while paid conversions drop, it confirms that the paid conversions were genuinely incremental (they were new customers, not existing customers using a different channel).
Step 3: Execute the Holdout
Turn off all ads in the test regions on the same day. Do not phase them out gradually because that introduces ambiguity about when the holdout actually started. Use platform geo-targeting exclusions to remove the test regions from all campaigns. On Google Ads, this means adding location exclusions at the campaign level. On Meta, this means excluding the test regions from your audience targeting. On LinkedIn, this means removing the test regions from location targeting.
Run the holdout for 2-4 weeks. Two weeks is the minimum for channels with a short consideration cycle (impulse purchases, free trials). Four weeks is better for channels with a longer consideration cycle (B2B enterprise sales, high-ticket items). The holdout period should be at least as long as your typical buying cycle to capture the full effect of removing ads on new customer acquisition.
During the holdout, do not change anything else. No new landing pages, no pricing changes, no product launches, no PR pushes. Any change that affects conversion rates in one group but not the other will contaminate the results. If an unexpected event occurs during the test (a viral social post, a competitor outage, a seasonal spike), extend the test period to dilute the impact of the anomaly.
Step 4: Measure and Calculate Incrementality
After the holdout period, compare conversion rates between the test group (no ads) and the control group (ads running). The difference is the incremental lift caused by your ads. Here is the formula:
Incremental lift = (Control conversion rate - Test conversion rate) / Control conversion rate
Example: If the control regions (ads running) had a 3.0% conversion rate and the test regions (no ads) had a 2.1% conversion rate, the incremental lift is (3.0 - 2.1) / 3.0 = 30%. This means 30% of the conversions attributed to your ads are truly incremental, and 70% would have happened without the ads.
Calculate the incremental CPA by dividing your total ad spend (in the control regions during the test period) by the number of incremental conversions. If you spent $50,000 on ads in the control regions and the ads produced 1,000 attributed conversions with 30% incrementality, you drove 300 incremental conversions. Your incremental CPA is $50,000 / 300 = $167. Your reported CPA from the platform was $50,000 / 1,000 = $50. The real cost of each new customer is 3.3x what the platform told you.
This number is not a reason to panic. It is a reason to optimize. Knowing that your true CPA is $167 instead of $50 lets you make informed decisions about budget allocation, bid strategies, and channel mix. It also lets you compare the true cost of paid acquisition to the true cost of organic acquisition, which is usually lower but slower.
Measure true incrementality across every channel
OSCOM Paid Ads connects your ad platforms, CRM, and analytics to show incremental revenue per channel, not just attributed conversions. Stop paying for customers who would have come anyway.
See how it worksChannel-Specific Incrementality Patterns
Different channels have different incrementality profiles. Understanding these patterns helps you design better tests and interpret results more accurately.
Branded Search: Lowest Incrementality
Branded search campaigns consistently show the lowest incrementality across all channels, typically 5-20%. People who search for your brand name already know you exist and are actively looking for you. They will find your organic listing if your ad is not there. The only scenario where branded search is highly incremental is when a competitor is bidding on your brand terms and their ad would steal the click. In that case, branded search is defensive spending with genuine incrementality.
Test branded search incrementality by pausing branded campaigns for two weeks and monitoring organic click-through rate and conversion rate. If organic fully absorbs the branded search traffic (organic clicks increase by roughly the amount that paid clicks decrease), the branded campaign has near-zero incrementality. If there is a gap (organic does not fully absorb, and some traffic goes to competitors or is lost entirely), the gap represents the incremental value of branded search.
Non-Brand Search: Moderate to High Incrementality
Non-brand search campaigns target people with intent but no brand preference. Someone searching for "project management software" has not decided which product to buy. Your ad influences their consideration set. Incrementality for non-brand search typically ranges from 40-70%, meaning a significant portion of these conversions would not have happened without the ad because the searcher would have chosen a competitor instead.
The incrementality of non-brand search depends heavily on your organic ranking for the same keywords. If you rank #1 organically for a keyword, the paid ad on that keyword has lower incrementality because you would have captured many of those clicks organically. If you rank on page 2 or not at all, the paid ad is the only way to reach those searchers, and incrementality approaches 100%.
Social Prospecting: Highest Incrementality
Prospecting campaigns on Meta and LinkedIn target people who are not actively searching for your product. They create awareness and demand where none existed. Incrementality for social prospecting typically ranges from 60-90% because these people would not have discovered your product without the ad. The challenge is that social prospecting conversions are harder to measure because the customer journey is longer and involves more touchpoints.
The high incrementality of social prospecting is counterintuitive to many advertisers who see low direct conversion rates and high CPAs. A social prospecting campaign with a $200 reported CPA and 80% incrementality has a $250 incremental CPA. A branded search campaign with a $30 reported CPA and 10% incrementality has a $300 incremental CPA. The "expensive" social campaign is actually cheaper per incremental customer than the "cheap" branded search campaign. Without incrementality data, most teams would cut the social budget and increase the branded search budget, doing the exact opposite of what the data recommends.
Retargeting: Mixed Incrementality
Retargeting campaigns show the widest range of incrementality because the audience is a mix of people at different stages. Some retargeting audiences are shopping cart abandoners who were moments away from converting and just need a reminder. These are moderately incremental (30-50%). Other retargeting audiences are website visitors who browsed one page and left. These are more incremental (50-70%) because without the retargeting ad, many would not have returned.
The incrementality of retargeting decreases with frequency. The first retargeting impression to a website visitor is highly incremental. The twelfth impression to the same person is essentially wasted because if they were going to convert, they would have done it after the first few impressions. Cap retargeting frequency at 3-5 impressions per user per week to maintain incrementality and prevent budget waste on over-saturated audiences.
Channels that look expensive on a CPA basis often deliver the highest incremental value because they reach audiences that would not have converted otherwise
Statistical Rigor: Getting Results You Can Trust
Incrementality tests are experiments, and experiments require statistical rigor. A poorly designed test produces false confidence, which is worse than no test at all because it leads to budget decisions based on noise.
Sample Size Requirements
The minimum sample size for a geo-holdout test depends on three factors: the baseline conversion rate, the expected incremental lift, and the confidence level you need. For a baseline conversion rate of 3%, an expected lift of 30%, and 95% confidence, you need approximately 5,000 conversions in the control group during the test period. If your total monthly conversions are 10,000, allocating 20% to the test group and 80% to the control group gives you 8,000 control conversions over a 4-week test, which is sufficient.
If your conversion volume is lower, you have three options: run the test longer (6-8 weeks instead of 2-4), increase the test group size (30-40% instead of 20%), or lower your confidence threshold (90% instead of 95%). Each option involves a tradeoff. Longer tests increase the cost of the holdout. Larger test groups sacrifice more revenue. Lower confidence increases the risk of acting on noise. Choose the tradeoff that fits your business constraints.
Controlling for Confounders
A confounder is any variable besides ad exposure that differs between your test and control groups and could affect conversion rates. Common confounders include seasonality (if your test regions have different seasonal patterns), competitive activity (if a competitor runs a regional promotion in your test regions), and economic conditions (if a major employer in one test region announces layoffs). The best defense against confounders is randomized region assignment combined with a pre-test baseline period that confirms the groups are equivalent.
After the test, check for confounders by comparing non-ad metrics between the groups. If organic traffic, direct traffic, and email conversion rates are all similar between test and control groups, your results are clean. If any of these metrics differ significantly, a confounder may be present and the results should be interpreted cautiously.
Using Meta Conversion Lift Studies
Meta offers a built-in incrementality testing tool called Conversion Lift. This tool randomly divides your target audience into a test group (who sees your ads) and a holdout group (who does not). It then compares conversion rates between the two groups to calculate incremental lift. This is the most statistically rigorous method available because the randomization is done at the user level rather than the geographic level, eliminating regional confounders.
To run a Conversion Lift study, go to the Experiments section in Meta Ads Manager and select "Conversion Lift." Choose the campaigns you want to test, set the holdout percentage (10-15% is typical), and select the conversion events you want to measure. Meta recommends running the study for at least 2 weeks with a minimum of $5,000 in spend during the test period. The results include the incremental conversion rate, the cost per incremental conversion, and the confidence interval.
The limitation of Meta Conversion Lift is that it only measures Meta's incrementality. It cannot tell you whether the conversions Meta is driving would have been driven by another channel instead. A conversion that is incremental to Meta (would not have happened without the Meta ad) might not be incremental to your overall marketing program (would have happened through Google Ads or organic). Use Meta Conversion Lift for channel-level optimization and geo-holdout tests for portfolio-level optimization.
Using Google Ads Experiments for Incrementality
Google Ads does not offer a native incrementality tool as robust as Meta's Conversion Lift, but you can build one using Campaign Experiments. Create an experiment that splits your audience 90/10 between the real campaign and a "ghost" campaign that is paused. The 10% in the ghost campaign represents your holdout. Compare conversion rates between the two groups after 2-4 weeks.
For branded search incrementality, Google offers a more direct approach: Brand Lift studies. These measure the impact of your ads on brand awareness, ad recall, and purchase intent through survey-based research. While not a direct conversion incrementality test, Brand Lift studies provide evidence of the upper-funnel impact that drives downstream conversions.
The most practical approach for Google Ads incrementality is the geo-holdout. Exclude your test regions from all Google campaigns and compare search conversion rates in those regions to control regions. Because Google Search captures intent-driven behavior, the difference between test and control regions reveals how much of that intent your ads are capturing versus how much would flow to organic results.
Interpreting Results and Making Budget Decisions
Once you have incrementality data, the temptation is to immediately cut all low-incrementality spend. This is premature. Incrementality is one input to budget allocation, not the only input. Consider these additional factors before making cuts.
Incremental Revenue vs. Incremental Conversions
Not all conversions have equal value. A channel with 30% incrementality that drives $500 average deal value contributes more incremental revenue than a channel with 70% incrementality that drives $50 average deal value. Calculate incremental revenue (incremental conversions x average conversion value) and incremental ROAS (incremental revenue / ad spend) for each channel. These revenue-weighted metrics provide a clearer picture of where your budget creates the most value.
The Halo Effect
Some channels have incremental impact that does not show up in direct conversion data. Branded search ads increase overall click-through rate from the SERP (because your brand occupies more real estate), which indirectly improves organic rankings. Social prospecting ads increase branded search volume, which improves brand awareness metrics. These halo effects are real but difficult to measure with a single-channel incrementality test. Account for halo effects qualitatively when interpreting results.
Competitive Defense
Branded search may show low incrementality in normal conditions, but if a competitor starts bidding on your brand terms, the incrementality of your branded campaign shoots up because every click you lose goes to a competitor. Run incrementality tests periodically (quarterly) to capture changes in the competitive landscape. A channel that was 10% incremental six months ago might be 40% incremental now because a new competitor entered the market.
Building an Incrementality Testing Calendar
Incrementality is not a one-time measurement. It changes as your market evolves, your brand awareness grows, and your competitive landscape shifts. Build a testing calendar that covers each major channel once per quarter. This gives you fresh data for budget planning and ensures you catch shifts in incrementality before they waste significant budget.
Q1: Test branded search incrementality. This is the channel most likely to have low incrementality and the easiest to test because pausing branded campaigns produces immediate, measurable changes in organic click behavior.
Q2: Test retargeting incrementality. As your audience pools grow throughout Q1, retargeting frequency increases and incrementality may decrease. A Q2 test catches the degradation.
Q3: Test social prospecting incrementality. Mid-year is the best time for this test because it avoids the seasonal distortions of Q4 (holiday season) and Q1 (new year budget flush). The results inform Q4 budget allocation.
Q4: Run a full-portfolio geo-holdout test. This tests all channels simultaneously by turning off all paid media in the test regions. The results show the total incremental impact of your paid media program, not just individual channels. This is the most important test of the year because it answers the fundamental question: how much of your revenue depends on paid media?
Common Pitfalls and How to Avoid Them
Pitfall 1: Testing Too Small
Holding out one small region for two weeks and comparing it to the rest of your market produces statistically meaningless results. One region might have a bad week for reasons completely unrelated to your ads. The signal-to-noise ratio is too low. Use at least 4 regions in each group and ensure the combined test group represents at least 15% of your market.
Pitfall 2: Ignoring Carryover Effects
Ads do not stop working the moment you turn them off. Brand awareness from previous ad exposure lingers in the test regions, especially for top-of-funnel campaigns. This carryover effect means that the first week of a holdout test may understate the true incrementality because residual awareness from previous ads is still driving some conversions. Wait at least one week before starting to measure the holdout effect. The cleanest data comes from weeks 2-4 of a 4-week holdout.
Pitfall 3: Testing During Anomalous Periods
Product launches, seasonal peaks, competitive disruptions, and PR events all distort incrementality measurements. A holdout test during your biggest product launch will understate incrementality because the product launch itself is driving conversions in both test and control groups, diluting the relative impact of ads. Run tests during stable periods with predictable baselines.
Pitfall 4: Failing to Act on Results
The most common pitfall is running a well-designed test, getting clear results, and then not changing anything because the results conflict with internal politics. If branded search is 10% incremental and your VP of demand gen built their career on branded search, the data will face resistance. Present incrementality results alongside a reallocation plan that shows where the freed-up budget will go and what incremental gains it will produce. Make the alternative concrete, not just the cut.
Prove your ad spend drives real growth
OSCOM Paid Ads connects to your CRM to measure incremental revenue per channel, not just platform-reported conversions. Know exactly which dollars are working.
Start measuring incrementalityFrom Incrementality Data to Budget Allocation
Once you have incrementality data for each channel, you can build an incremental budget allocation model. The model is simple in concept: allocate budget to maximize total incremental conversions given your total budget constraint. Channels with high incrementality and low incremental CPA get more budget. Channels with low incrementality and high incremental CPA get less. The math usually produces recommendations that look nothing like your current allocation because current allocations are based on reported CPA, which systematically overvalues low-incrementality channels.
A typical reallocation looks like this: branded search budget decreases by 30-50% (the organic results absorb most of the traffic), retargeting budget decreases by 20-30% (frequency caps are implemented to maintain incrementality), and social prospecting budget increases by 40-60% (the highest incrementality channel gets the budget freed from lower-incrementality channels). The net effect is the same total budget producing 20-40% more incremental conversions.
Implement the reallocation gradually. Shift 10-20% of the budget per week rather than making dramatic changes overnight. Monitor incremental CPA at the portfolio level. If the reallocation is working, your incremental CPA should decrease as budget moves toward higher-incrementality channels. If it increases, there may be diminishing returns in the channels you are scaling, and you need to find the optimal allocation through iterative testing.
Key Takeaways
- 1Incrementality measures causation, not correlation. 20-60% of platform-reported conversions are typically not incremental.
- 2Geo-holdout tests are the practical gold standard. Use 4+ test regions and 4+ control regions for 2-4 weeks.
- 3Branded search has the lowest incrementality (5-20%). Social prospecting has the highest (60-90%). Retargeting is mixed (30-60%).
- 4Calculate incremental CPA (ad spend / incremental conversions) to compare channels on a true-cost basis. The 'cheapest' channel on reported CPA is often the most expensive on incremental CPA.
- 5Test quarterly and adjust budget allocation based on the results. Incrementality shifts as brand awareness grows and competitive landscapes change.
- 6Reallocate budget from low-incrementality channels to high-incrementality channels to increase total conversions without increasing total spend.
Incrementality insights for paid media teams
Testing frameworks, channel benchmarks, and budget allocation models based on real incrementality data. Weekly.
Incrementality testing is the most underused tool in paid media because it often delivers uncomfortable results. It reveals that the channels leadership loves (branded search, retargeting) are the least incremental, while the channels they question (social prospecting, display) are the most incremental. The teams that act on incrementality data gain a structural advantage: they spend less to acquire each new customer because their budget goes to channels that actually create customers. The teams that ignore it continue paying for conversions that would have happened anyway, subsidizing their ad platforms instead of growing their business. The choice is straightforward. The data makes it obvious. The only question is whether you are willing to follow where the evidence leads.
Know your ROAS across every platform in one view
Oscom unifies Google, Meta, LinkedIn, and TikTok so you can see what's working, kill what isn't, and reallocate fast.