Send better user-agent values (and got config changes) #7309

chris48s · 2021-11-22T19:44:25Z

There isn't a lot of changed code in this PR, but it does two things that are only loosely linked.

The first is a bit of cleanup. The request --> got work left us with a slightly clunky arrangement where got.js exports a fetchFactory, which takes fetchLimitBytes and returns a function. That lets us inject fetchLimitBytes from config in the server, but sometimes it is useful to construct the fetch function outside the context of the server object.
We've also had the long-standing issue "Send better user-agent values" (#6268), which I looked at before and found clunky/fiddly for the same reason.
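
As a rough illustration of the cleanup described above, the sketch below contrasts a factory that needs fetchLimitBytes injected with a plain function that reads the limit from module-level config. Only fetchFactory and fetchLimitBytes come from this PR; `publicConfig` here is a stand-in object, `fetchWithConfigLimit` and `maxResponseBytes` are hypothetical names, and neither function performs a real HTTP request.

```javascript
// Stand-in for values parsed from yml/env vars (assumption: the real module
// reads these from the shields config rather than a literal object).
const publicConfig = { fetchLimitBytes: 10 * 1024 * 1024 }

// Before: a factory. Callers must already hold fetchLimitBytes, which in
// practice meant going through the server object.
function fetchFactory(fetchLimitBytes) {
  return function fetchWithLimit(url) {
    // Returns a description of the request instead of sending it.
    return { url, maxResponseBytes: fetchLimitBytes }
  }
}

// After: a plain function. The limit comes from config available at module
// level, so it can be used outside the context of the server.
function fetchWithConfigLimit(url) {
  return { url, maxResponseBytes: publicConfig.fetchLimitBytes }
}
```

With the second shape, a test or script can call the function directly without first constructing a server to obtain the limit.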

I did look at completely decoupling config parsing from the server, but some of the args can be set at runtime (see shields/server.js, lines 28 to 33 in 95a439a):

    if (+process.argv[2]) {
      config.public.bind.port = +process.argv[2]
    }
    if (process.argv[3]) {
      config.public.bind.address = process.argv[3]
    }

Because of that, the whole config object doesn't depend exclusively on the yml/env vars, and we'd have to validate the config a second time at server start, so this didn't seem like the right approach.
The approach I settled on was making a subset of config available outside the context of the server object, which seemed like the right tradeoff.

The other part of this adds a feature. Before this PR, every instance of shields (our own, self-hosted, dev copies) sent the exact same user agent value (Shields.io/2003a, for... reasons).
As of this PR:

- The default user agent is shields (self-hosted)/dev
- We (or users who self-host their own instance) can override the base value (the part before the slash)
- If either HEROKU_SLUG_COMMIT (we have this set in production) or DOCKER_SHIELDS_VERSION (this PR adds it to the image) is set, that will be used for the second part of the UA string (after the slash)
- If neither of those vars is set, it will be /dev

This should mean in most cases the userAgent string will be reasonably meaningful.
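
The rules above could be sketched roughly as follows. This is not the code from the PR: the `env` parameter, the 7-character SHA truncation, and the precedence of HEROKU_SLUG_COMMIT over DOCKER_SHIELDS_VERSION when both are set are assumptions made for illustration. Only the two environment variable names, the default base, and the /dev fallback come from the description.

```javascript
// Hypothetical sketch of building the user agent string.
// `env` is injectable so the function can be exercised without touching
// process.env (an assumption, not part of the PR).
function getUserAgent(
  userAgentBase = 'shields (self-hosted)',
  env = process.env
) {
  let version = 'dev' // fallback when neither variable is set

  if (env.HEROKU_SLUG_COMMIT) {
    // Assumption: "short SHA" means the first 7 characters of the commit.
    version = env.HEROKU_SLUG_COMMIT.substring(0, 7)
  } else if (env.DOCKER_SHIELDS_VERSION) {
    // The Docker image sets this to a tag or short SHA at build time.
    version = env.DOCKER_SHIELDS_VERSION
  }

  return `${userAgentBase}/${version}`
}
```

Under these assumptions, a production deployment with the base overridden to Shields.io would report the base followed by a slash and the short commit SHA, while an unconfigured local copy would report shields (self-hosted)/dev.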

- add userAgentBase setting
- send short SHA in user agent on heroku
- set a version (tag or short SHA) in Dockerfile and use it to report server version in UA for docker users

shields-ci · 2021-11-22T19:46:13Z

	Warnings
⚠️	📚 Remember to ensure any changes to `config.private` in `services/github/auth/acceptor.js` are reflected in the server secrets documentation
⚠️	This PR modified the server but none of its tests. That's okay so long as it's refactoring existing code.

	Messages
📖	✨ Thanks for your contribution to Shields, @chris48s!

Generated by 🚫 dangerJS against d41fc55

core/base-service/got.js

chris48s · 2021-11-22T19:46:53Z

core/base-service/got-config.js

+
+const fetchLimitBytes = bytes(publicConfig.fetchLimit)
+
+function getUserAgent(userAgentBase = publicConfig.userAgentBase) {


We will need to set USER_AGENT_BASE=Shields.io in prod before we deploy

services/validators.js

calebcartwright

I'm good with the changes, both code and strategic, and approving accordingly so you can move forward if you'd like. One minor inline item that could be ignored or handled in a follow up, but happy to re-:+1: if you decide to make any other changes

chris48s · 2021-11-25T17:30:49Z

I've set the env var, deployed and confirmed it is working in production by using the endpoint badge to make a request to an endpoint where I can see the header.
Hopefully this is all fine, and I think it is a good change. There is a slight worry at the back of my mind, though, that somewhere out there in the world is a rate limit or deny list with a specific exception for Shields.io/2003a hard-coded, which is going to break something now that we're sending a different suffix each time. We will have to see.
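
One way to run this kind of check is a small endpoint that echoes the incoming User-Agent header back in the endpoint badge JSON format, so the rendered badge shows what shields sent. The sketch below is an assumption about how such a check could be set up, not a description of the endpoint actually used; `buildBadgeBody` and `createEchoServer` are hypothetical names.

```javascript
const http = require('node:http')

// Build an endpoint-badge response whose message is the caller's user agent.
function buildBadgeBody(headers) {
  return {
    schemaVersion: 1,
    label: 'user-agent',
    message: headers['user-agent'] || 'none',
  }
}

// Create (but do not start) a server that answers every request with the
// badge body built from that request's headers.
function createEchoServer() {
  return http.createServer((req, res) => {
    res.setHeader('Content-Type', 'application/json')
    res.end(JSON.stringify(buildBadgeBody(req.headers)))
  })
}

// Usage (not run here): createEchoServer().listen(8080)
```

Pointing an endpoint badge at the public URL of such a server would render the user agent string as the badge message.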

calebcartwright · 2021-11-25T19:50:22Z

> but there is this slight worry at the back of my mind that somewhere out there in the world is a rate limit or deny list with a specific exception for Shields.io/2003a hard-coded which is going to break something now that we're sending a different suffix each time

This was in the back of my mind as well, though I actually think this change is perhaps the best way forward to surface any potential issues. I don't think we can (or should) perpetually pin a user agent, and it feels like changing it as we've done here may be the only way for us to truly discover any such cases, which will then allow us to figure out next steps

chris48s added 3 commits November 21, 2021 18:14

expose fetchLimitBytes/userAgent in got-config module

d7d09f6

export a function not a factory

de598bd

send better user-agent values

d2bde17

- add userAgentBase setting
- send short SHA in user agent on heroku
- set a version (tag or short SHA) in Dockerfile and use it to report server version in UA for docker users

chris48s added the core Server, BaseService, GitHub auth label Nov 22, 2021

shields-cd temporarily deployed to shields-staging-pr-7309 November 22, 2021 19:44 Inactive

calebcartwright reviewed Nov 23, 2021

calebcartwright previously approved these changes Nov 23, 2021

add a comment explaining fileSize

9144720

chris48s dismissed calebcartwright’s stale review via 9144720 November 24, 2021 20:20

chris48s temporarily deployed to shields-staging-pr-7309 November 24, 2021 20:20 Inactive

calebcartwright approved these changes Nov 24, 2021

chris48s added the squash when passing label Nov 25, 2021

Merge branch 'master' into got-config

d41fc55

shields-cd deployed to shields-staging-pr-7309 November 25, 2021 17:06 View deployment

repo-ranger bot merged commit 99bffd3 into master Nov 25, 2021

repo-ranger bot deleted the got-config branch November 25, 2021 17:11
