This repository contains partial replication code and data for "Search Engine Revenue from Navigational and Brand Advertising", appearing at ICWSM 2024.
- Curlie:
data/curlie.zip
has 1.3 million entity:URL pairs collected between Feb-Apr 2023 from Curlie:
|
-
Env:
pip3 install -r src/requirements.txt
-
Ad API Data Collection:
src/ad_apis
has scripts that collect data from the Google and Bing Ad APIs. These scripts require developer credentials to function, see https://developers.google.com/google-ads/api/docs/start and https://learn.microsoft.com/en-us/advertising/guides.src/ad_apis/google/get_historical_metrics.py
: gets Google historical metrics for a set of keywordssrc/ad_apis/google/get_forecast_metrics.py
: gets Google forecast metrics for a set of keywordssrc/ad_apis/bing/get_bid_prices.py
: gets Bing historical metrics for a set of keywords
-
Results Replication:
data/intermediate.zip
has intermediate measurements corresponding to each SERP. Per Northeastern IRB #20-03-04, we cannot share individual-level data from earlier steps in the analysis pipeline.src/results
has scripts to reproduce the Section 4 analysis using this intermediate data.python3 src/results/google_revenue.py run
python3 src/results/google_revenue.py make_plots
python3 src/results/microsoft_revenue.py run
python3 src/results/microsoft_revenue.py make_plots
python3 src/results/effects.py