Agent actions token support #23452

aleksmaus · 2021-01-12T16:53:16Z

What does this PR do?

Implements the actions token exchange.
This includes the stub for the agent actions that just logs the action received (DEBUG log level).
The actions will need need to be wired in further to properly pass them to the apps/beats and receive the result back. @blakerouse let me know if you are going to handle that or if I should dig further.

The flow of the action token exchange is the following.

Fleet Server sends the ack_token string with that actions to the agent that serves as a "mark" of the latest action received by the agent.
Screenshot of the app/beat action logged from the stub:

Agent persists the ack_token value (currently in a separate file action_ack_token.yml

@blakerouse let me know if there is a better place for that, since I'm just getting familiar with the agent code.

Agent sends the ack_token string with the next check-in request to Fleet Server.
Fleet Server decodes/translates the ack_token string into the action doc sequence number and updates the agent record action_seq_no

This way the fleet server tracks the latest action received by the agent.

If the ack_token is not present in the check in payload the value that is stored with the agent record is used.

Why is it important?

This is needed to support the new Fleet Server agent actions handling on the agent side. Without this change the new action document in the .fleet-actions will cause the agent to go into a loop of check-ins and receiving the same action over and over, since there will be no indication that the agent action was received.

@blakerouse One question about the persisted ack_token on the agent side. We probably should remove the file every time the agent enrolls, since it creates the new agent record for the fleet. Thoughts?

elasticmachine · 2021-01-12T16:53:21Z

Pinging @elastic/ingest-management (Team:Ingest Management)

elasticmachine · 2021-01-12T17:18:49Z

💚 Build Succeeded

the below badges are clickable and redirect to their specific view in the CI or DOCS

Expand to view the summary

Build stats

Build Cause: Pull request #23452 updated
- Start Time: 2021-01-14T22:44:25.101+0000
Duration: 22 min 29 sec
Commit: b857a49

Test stats 🧪

Test	Results
Failed	0
Passed	1450
Skipped	4
Total	1454

💚 Flaky test report

Tests succeeded.

Expand to view the summary

Test stats 🧪

Test	Results
Failed	0
Passed	1450
Skipped	4
Total	1454

elasticmachine · 2021-01-12T20:56:02Z

Pinging @elastic/agent (Team:Agent)

blakerouse · 2021-01-12T21:49:57Z

x-pack/elastic-agent/pkg/agent/application/info/agent_id.go

@@ -43,11 +44,16 @@ func AgentConfigFile() string {
 	return filepath.Join(paths.Config(), defaultAgentConfigFile)
 }

-// AgentActionStoreFile is the file that will contains the action that can be replayed after restart.
+// AgentActionStoreFile is the file that contains the action that can be replayed after restart.


Being that we already have an action_store.yml why not place the action token inside of this file? Why the need to seperate it into its own file?

can change. was not sure what's the right/established pattern.

is there any case where the action_store.yml could get completely overwritten, thus loosing the marker?

i think that would be the correct place

It should only be overwritten on a re-enroll, which is what should happen in the token case. The code paths already handle that, so its the best place for it.

ruflin · 2021-01-13T10:10:19Z

x-pack/elastic-agent/pkg/agent/application/acktoken_store.go

+}
+
+type ackTokenSerializer struct {
+	AckToken string `yaml:"ack_token"`


Maybe worth to make the mutex directly part of this struct? Then you could use atsCached.Lock() which would make the code likely more readable.

nchaulet · 2021-01-13T11:55:54Z

x-pack/elastic-agent/pkg/fleetapi/checkin_cmd.go

@@ -22,6 +22,7 @@ const checkingPath = "/api/fleet/agents/%s/checkin"
 // CheckinRequest consists of multiple events reported to fleet ui.
 type CheckinRequest struct {
 	Status   string              `json:"status"`
+	AckToken string              `json:"ack_token"`


Should we omit if empty? otherwise we probably need to add the property to Kibana

Sure. Will update.

aleksmaus · 2021-01-13T14:07:22Z

Discussed with @blakerouse, going to consolidate the action_store.yml and action_ack_token.yml files into more generic state.yml storage format that can be used for both and extended further.
The agent will handle the migration on first start.

blakerouse

Looks really good, well tested! Please backport this to 7.x.

* Agent actions token support * Make check happy * Consolidate action store and the ack token store into state.yml store * Make state storage thread safe

* Agent actions token support * Consolidate action store and the ack token store into state.yml store * Make state storage thread safe

Agent actions token support

d7dfd91

aleksmaus requested review from ph, ruflin, scunningham and blakerouse January 12, 2021 16:53

botelastic bot added needs_team Indicates that the issue/PR needs a Team:* label Team:Ingest Management labels Jan 12, 2021

botelastic bot removed the needs_team Indicates that the issue/PR needs a Team:* label label Jan 12, 2021

aleksmaus requested a review from james-elastic January 12, 2021 16:59

Make check happy

7400c05

james-elastic approved these changes Jan 12, 2021

View reviewed changes

aleksmaus mentioned this pull request Jan 12, 2021

OSQuerybeat support + new agent actions support #22782

Closed

andresrc added the Team:Elastic-Agent Label for the Agent team label Jan 12, 2021

andresrc assigned aleksmaus Jan 12, 2021

blakerouse reviewed Jan 12, 2021

View reviewed changes

ruflin reviewed Jan 13, 2021

View reviewed changes

nchaulet reviewed Jan 13, 2021

View reviewed changes

ph removed their request for review January 13, 2021 13:33

aleksmaus added 2 commits January 14, 2021 17:26

Consolidate action store and the ack token store into state.yml store

3cd42af

Make state storage thread safe

b857a49

aleksmaus force-pushed the feature/agent_actions branch from 4321259 to b857a49 Compare January 14, 2021 22:43

aleksmaus requested review from blakerouse, ruflin and nchaulet January 15, 2021 00:38

blakerouse approved these changes Jan 19, 2021

View reviewed changes

aleksmaus merged commit a233f03 into elastic:master Jan 19, 2021

aleksmaus mentioned this pull request Jan 19, 2021

Agent actions token support (#23452) #23569

Merged

aleksmaus added a commit that referenced this pull request Jan 19, 2021

Agent actions token support (#23452) (#23569)

699321e

* Agent actions token support * Consolidate action store and the ack token store into state.yml store * Make state storage thread safe

mdelapenya mentioned this pull request Jan 22, 2021

Root cause analysis for the recent failures in the Fleet agent elastic/e2e-testing#640

Closed

cconboy mentioned this pull request May 11, 2021

Update FAQ from action_store.yml to state.yml elastic/observability-docs#648

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Agent actions token support #23452

Agent actions token support #23452

aleksmaus commented Jan 12, 2021

elasticmachine commented Jan 12, 2021

elasticmachine commented Jan 12, 2021 •

edited by jenkins-beats-ci bot

Loading

Build stats

Test stats 🧪

Test stats 🧪

elasticmachine commented Jan 12, 2021

blakerouse Jan 12, 2021

aleksmaus Jan 12, 2021

aleksmaus Jan 12, 2021 •

edited

Loading

blakerouse Jan 12, 2021

ruflin Jan 13, 2021

nchaulet Jan 13, 2021

aleksmaus Jan 13, 2021

aleksmaus commented Jan 13, 2021

blakerouse left a comment

Agent actions token support #23452

Agent actions token support #23452

Conversation

aleksmaus commented Jan 12, 2021

What does this PR do?

Why is it important?

elasticmachine commented Jan 12, 2021

elasticmachine commented Jan 12, 2021 • edited by jenkins-beats-ci bot Loading

💚 Build Succeeded

Build stats

Test stats 🧪

💚 Flaky test report

Test stats 🧪

elasticmachine commented Jan 12, 2021

blakerouse Jan 12, 2021

Choose a reason for hiding this comment

aleksmaus Jan 12, 2021

Choose a reason for hiding this comment

aleksmaus Jan 12, 2021 • edited Loading

Choose a reason for hiding this comment

blakerouse Jan 12, 2021

Choose a reason for hiding this comment

ruflin Jan 13, 2021

Choose a reason for hiding this comment

nchaulet Jan 13, 2021

Choose a reason for hiding this comment

aleksmaus Jan 13, 2021

Choose a reason for hiding this comment

aleksmaus commented Jan 13, 2021

blakerouse left a comment

Choose a reason for hiding this comment

elasticmachine commented Jan 12, 2021 •

edited by jenkins-beats-ci bot

Loading

aleksmaus Jan 12, 2021 •

edited

Loading