-
Notifications
You must be signed in to change notification settings - Fork 109
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
change tracking update from nightly job to weekly #4345
Comments
Stats were collected 3 days in April (10, 11, 12) and June (05, 06, 07).
|
catalog cron job change: GSA/catalog.data.gov#962 |
Random follow-on notes: We suspect this issue is caused by bot tracking inflating the |
Not to limit bot traffic, but to exclude bot traffic from tracking count. We need to use above three days data, April (10, 11, 12) and June (05, 06, 07), combined with CloudWatch log data (containing user-agent info), to see the effect of excluding bot traffic to the tracking count. |
Stats shows catalog visits to
/_tracking
in June doubles in number compared to April stats. The count of unique dataset page visits is four times more. Together with recent questionable Solr performance, they makes the nightly tracking update job takes too long to finish. Give the fact the page visit stats is not critical to be processes nightly, we can change to a weekly job.Sketch
tracking-update is supposed to run nightly according to CKAN core. We need to change to default behavior in ckanext-geodatagov so it handles weekly data.The text was updated successfully, but these errors were encountered: