Conversation
Good catch on the bug fix; I'm not sure I agree with the second change, though.
pkg/agent/config.go
Outdated
```diff
-	if c.ScrapeTimeout > c.ScrapeInterval {
-		return fmt.Errorf("scrape timeout must be larger or equal to inverval for: %v", c.JobName)
-	}
 	if c.ScrapeTimeout == 0 {
-		c.ScrapeTimeout = c.ScrapeInterval
+		c.ScrapeTimeout = c.ScrapeInterval + model.Duration(3*time.Second)
 	}
+	if c.ScrapeTimeout <= c.ScrapeInterval {
+		return fmt.Errorf("scrape timeout must be larger than interval for: %v", c.JobName)
+	}
```
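For context, a runnable sketch of the validation behaviour after this change. This is a minimal sketch, not the actual patch: `scrapeConfig` is a trimmed-down stand-in for the real config type, and plain `time.Duration` replaces `model.Duration` to keep it self-contained:

```go
package main

import (
	"fmt"
	"time"
)

// scrapeConfig is a hypothetical stand-in for the real agent config type,
// reduced to the fields the validation touches.
type scrapeConfig struct {
	JobName        string
	ScrapeInterval time.Duration
	ScrapeTimeout  time.Duration
}

// validate mirrors the logic of the diff above: default the timeout to
// interval + 3s, then reject any timeout that does not exceed the interval.
func (c *scrapeConfig) validate() error {
	if c.ScrapeTimeout == 0 {
		c.ScrapeTimeout = c.ScrapeInterval + 3*time.Second
	}
	if c.ScrapeTimeout <= c.ScrapeInterval {
		return fmt.Errorf("scrape timeout must be larger than interval for: %v", c.JobName)
	}
	return nil
}

func main() {
	c := &scrapeConfig{JobName: "app", ScrapeInterval: 15 * time.Second}
	fmt.Println(c.validate(), c.ScrapeTimeout) // <nil> 18s: the default is interval + 3s
}
```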
I think we should only apply those defaults and checks if delta=true, because otherwise computing the profile will be instantaneous and we should allow shorter timeouts.
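A rough sketch of that suggestion, extending the stand-in `scrapeConfig` from the sketch above. The `hasDeltaProfile` flag is hypothetical; the real code would derive it from `ProfilingConfig.PprofConfig`:

```go
// validateTimeout applies the delta-specific default and check only when
// the job actually scrapes delta profiles; non-delta profiles are computed
// instantaneously, so shorter timeouts remain valid for them.
func (c *scrapeConfig) validateTimeout(hasDeltaProfile bool) error {
	if !hasDeltaProfile {
		return nil
	}
	if c.ScrapeTimeout == 0 {
		c.ScrapeTimeout = c.ScrapeInterval + 3*time.Second
	}
	if c.ScrapeTimeout <= c.ScrapeInterval {
		return fmt.Errorf("scrape timeout must be larger than interval for: %v", c.JobName)
	}
	return nil
}
```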
pkg/agent/profiles.go
Outdated
```diff
@@ -228,7 +228,7 @@ func (tg *TargetGroup) targetsFromGroup(group *targetgroup.Group) ([]*Target, []
 	}

 	if pcfg, found := tg.config.ProfilingConfig.PprofConfig[profType]; found && pcfg.Delta {
-		params.Add("seconds", strconv.Itoa(int(time.Duration(tg.config.ScrapeTimeout)/time.Second)-1))
+		params.Add("seconds", strconv.Itoa(int(time.Duration(tg.config.ScrapeInterval)/time.Second)))
```
I am not sure why this was removed. I always thought it would make sure we have a consistent scrape time, even when network latency is added (up to 1 second).
Let's take this example:

```
scrape_interval: 15s
scrape_timeout: 18s
profiling_time: 15s (was 14s before)
```

After this change, with a network latency of 500ms we would miss out on scrapes, because the real scrape interval becomes 15s plus the network latency. This creates problems when we look at things over time, e.g. in this histogram screenshot (similar to the Prometheus rate([twice_scrape_interval]) problem).
Here you can see a point missing at 11:12:47 because of that:
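To make the arithmetic concrete, a small standalone sketch of the scenario above. The numbers come from the example; the 1s of headroom reflects the removed `-1`, and the 14s figure assumes the old default of timeout == interval:

```go
package main

import (
	"fmt"
	"time"
)

func main() {
	interval := 15 * time.Second
	latency := 500 * time.Millisecond

	// Before: seconds = timeout - 1, and the timeout used to default to
	// the interval, so the profile ran for 14s.
	oldProfile := interval - 1*time.Second
	// After this change: seconds = interval, so the profile runs for 15s.
	newProfile := interval

	fmt.Println("old scrape fits the interval:", oldProfile+latency <= interval) // true: 14.5s <= 15s
	fmt.Println("new scrape fits the interval:", newProfile+latency <= interval) // false: 15.5s > 15s
}
```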
Co-authored-by: Christian Simon <simon@swine.de>
Fixes the scrape timeout validation.
Fixes #463