Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

feat: track content length histogram #1206

Merged
merged 2 commits into from
Feb 3, 2022
Merged
Show file tree
Hide file tree
Changes from 1 commit
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
151 changes: 128 additions & 23 deletions packages/gateway/src/durable-objects/summary-metrics.js
Original file line number Diff line number Diff line change
Expand Up @@ -3,6 +3,16 @@
* @property {number} totalWinnerResponseTime total response time of the requests
* @property {number} totalWinnerSuccessfulRequests total number of successful requests
* @property {number} totalCachedResponses total number of cached responses
* @property {number} totalContentLengthBytes total content length of responses
* @property {number} totalCachedContentLengthBytes total content length of cached responses
* @property {Record<string, number>} contentLengthHistogram
*
* @typedef {Object} ResponseWinnerStats
* @property {number} responseTime number of milliseconds to get response
* @property {number} contentLength content length header content
*
* @typedef {Object} ContentLengthStats
* @property {number} contentLength content length header content
*/

// Key to track total time for winner gateway to respond
Expand All @@ -11,11 +21,17 @@ const TOTAL_WINNER_RESPONSE_TIME_ID = 'totalWinnerResponseTime'
const TOTAL_WINNER_SUCCESSFUL_REQUESTS_ID = 'totalWinnerSuccessfulRequests'
// Key to track total cached requests
const TOTAL_CACHED_RESPONSES_ID = 'totalCachedResponses'
// Key to track total content length of responses
const TOTAL_CONTENT_LENGTH_BYTES_ID = 'totalContentLengthBytes'
// Key to track total cached content length of responses
const TOTAL_CACHED_CONTENT_LENGTH_BYTES_ID = 'totalCachedContentLengthBytes'
// Key to track content size histogram
const CONTENT_LENGTH_HISTOGRAM_ID = 'contentLengthHistogram'

/**
* Durable Object for keeping generic Metrics of gateway.nft.storage
*/
export class SummaryMetrics0 {
export class SummaryMetrics1 {
constructor(state) {
this.state = state

Expand All @@ -30,6 +46,17 @@ export class SummaryMetrics0 {
// Total cached requests
this.totalCachedResponses =
(await this.state.storage.get(TOTAL_CACHED_RESPONSES_ID)) || 0
// Total content length responses
this.totalContentLengthBytes =
(await this.state.storage.get(TOTAL_CONTENT_LENGTH_BYTES_ID)) || 0
// Total cached content length responses
this.totalCachedContentLengthBytes =
(await this.state.storage.get(TOTAL_CACHED_CONTENT_LENGTH_BYTES_ID)) ||
0
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

These need to be BigInt. Apparently DO storage supports every type supported by the Structured Clone algorithm which includes BigInt thankfully.

https://developers.cloudflare.com/workers/runtime-apis/durable-objects#transactional-storage-api

// Content length histogram
this.contentLengthHistogram =
(await this.state.storage.get(CONTENT_LENGTH_HISTOGRAM_ID)) ||
createHistogramObject()
})
}

Expand All @@ -47,6 +74,9 @@ export class SummaryMetrics0 {
totalWinnerResponseTime: this.totalWinnerResponseTime,
totalWinnerSuccessfulRequests: this.totalWinnerSuccessfulRequests,
totalCachedResponses: this.totalCachedResponses,
totalContentLengthBytes: this.totalContentLengthBytes,
totalCachedContentLengthBytes: this.totalCachedContentLengthBytes,
contentLengthHistogram: this.contentLengthHistogram,
})
)
default:
Expand All @@ -55,35 +85,110 @@ export class SummaryMetrics0 {
}

// POST
let data
switch (url.pathname) {
case '/metrics/winner':
const data = await request.json()
// Updated Metrics
this.totalWinnerResponseTime += data.responseTime
this.totalWinnerSuccessfulRequests += 1
// Save updated Metrics
await Promise.all([
this.state.storage.put(
TOTAL_WINNER_RESPONSE_TIME_ID,
this.totalWinnerResponseTime
),
this.state.storage.put(
TOTAL_WINNER_SUCCESSFUL_REQUESTS_ID,
this.totalWinnerSuccessfulRequests
),
])
/** @type {ResponseWinnerStats} */
data = await request.json()
await this._updateWinnerMetrics(data)
return new Response()
case '/metrics/cache':
// Update metrics
this.totalCachedResponses += 1
// Sabe updated metrics
await this.state.storage.put(
TOTAL_CACHED_RESPONSES_ID,
this.totalCachedResponses
)
/** @type {ContentLengthStats} */
data = await request.json()
await this._updateWinnerMetrics(data)
return new Response()
default:
return new Response('Not found', { status: 404 })
}
}

/**
* @param {ContentLengthStats} stats
*/
async _updatedCacheMetrics(stats) {
// Update metrics
this.totalCachedResponses += 1
this.totalCachedContentLengthBytes += stats.contentLength
this._updateContentLengthMetrics(stats)
// Sabe updated metrics
await Promise.all([
this.state.storage.put(
TOTAL_CACHED_RESPONSES_ID,
this.totalCachedResponses
),
this.state.storage.put(
TOTAL_CACHED_CONTENT_LENGTH_BYTES_ID,
this.totalCachedContentLengthBytes
),
this.state.storage.put(
TOTAL_CONTENT_LENGTH_BYTES_ID,
this.totalContentLengthBytes
),
this.state.storage.put(
CONTENT_LENGTH_HISTOGRAM_ID,
this.contentLengthHistogram
),
])
}

/**
* @param {ResponseWinnerStats} stats
*/
async _updateWinnerMetrics(stats) {
// Updated Metrics
this.totalWinnerResponseTime += stats.responseTime
this.totalWinnerSuccessfulRequests += 1
this._updateContentLengthMetrics(stats)
// Save updated Metrics
await Promise.all([
this.state.storage.put(
TOTAL_WINNER_RESPONSE_TIME_ID,
this.totalWinnerResponseTime
),
this.state.storage.put(
TOTAL_WINNER_SUCCESSFUL_REQUESTS_ID,
this.totalWinnerSuccessfulRequests
),
this.state.storage.put(
TOTAL_CONTENT_LENGTH_BYTES_ID,
this.totalContentLengthBytes
),
this.state.storage.put(
CONTENT_LENGTH_HISTOGRAM_ID,
this.contentLengthHistogram
),
])
}

/**
* @param {ContentLengthStats} stats
*/
_updateContentLengthMetrics(stats) {
this.totalContentLengthBytes += stats.contentLength

// Update histogram
const tmpHistogram = {
...this.contentLengthHistogram,
}

// Get all the histogram buckets where the content size is smaller
const histogramCandidates = contentLengthHistogram.filter(
(h) => stats.contentLength < h
)
histogramCandidates.forEach((candidate) => {
tmpHistogram[candidate] += 1
})

this.contentLengthHistogram = tmpHistogram
}
}

function createHistogramObject() {
const h = contentLengthHistogram.map((h) => [h, 0])
return Object.fromEntries(h)
}

// We will count occurences per bucket where content size is less or equal than bucket value
export const contentLengthHistogram = [
0.5, 1, 2, 5, 25, 50, 100, 500, 1000, 5000, 10000, 15000, 20000, 30000, 32000,
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I think these need to be converted to bytes...?

]
18 changes: 13 additions & 5 deletions packages/gateway/src/gateway.js
Original file line number Diff line number Diff line change
Expand Up @@ -27,7 +27,7 @@ export async function gatewayGet(request, env, ctx) {

if (res) {
// Update cache metrics in background
ctx.waitUntil(updateSummaryCacheMetrics(request, env))
ctx.waitUntil(updateSummaryCacheMetrics(request, env, res))
return res
}

Expand Down Expand Up @@ -150,13 +150,21 @@ async function _gatewayFetch(
/**
* @param {Request} request
* @param {import('./env').Env} env
* @param {Response} response
*/
async function updateSummaryCacheMetrics(request, env) {
async function updateSummaryCacheMetrics(request, env, response) {
// Get durable object for gateway
const id = env.summaryMetricsDurable.idFromName(SUMMARY_METRICS_ID)
const stub = env.summaryMetricsDurable.get(id)

await stub.fetch(_getDurableRequestUrl(request, 'metrics/cache'))
/** @type {import('./durable-objects/summary-metrics').ContentLengthStats} */
const contentLengthStats = {
contentLength: Number(response.headers.get('content-length')),
}

await stub.fetch(
_getDurableRequestUrl(request, 'metrics/cache', contentLengthStats)
)
}

/**
Expand All @@ -169,10 +177,10 @@ async function updateSummaryWinnerMetrics(request, env, gwResponse) {
const id = env.summaryMetricsDurable.idFromName(SUMMARY_METRICS_ID)
const stub = env.summaryMetricsDurable.get(id)

/** @type {import('./durable-objects/gateway-metrics').ResponseStats} */
/** @type {import('./durable-objects/summary-metrics').ResponseWinnerStats} */
const responseStats = {
ok: gwResponse.response.ok,
responseTime: gwResponse.responseTime,
contentLength: Number(gwResponse.response.headers.get('content-length')),
}

await stub.fetch(
Expand Down
2 changes: 1 addition & 1 deletion packages/gateway/src/index.js
Original file line number Diff line number Diff line change
Expand Up @@ -7,7 +7,7 @@ import { metricsGet } from './metrics.js'

// Export Durable Object namespace from the root module.
export { GatewayMetrics1 } from './durable-objects/gateway-metrics.js'
export { SummaryMetrics0 } from './durable-objects/summary-metrics.js'
export { SummaryMetrics1 } from './durable-objects/summary-metrics.js'
export { CidsTracker0 } from './durable-objects/cids.js'

import { addCorsHeaders, withCorsHeaders } from './cors.js'
Expand Down
14 changes: 14 additions & 0 deletions packages/gateway/src/metrics.js
Original file line number Diff line number Diff line change
Expand Up @@ -4,6 +4,7 @@ import pMap from 'p-map'

import { METRICS_CACHE_MAX_AGE, SUMMARY_METRICS_ID } from './constants.js'
import { histogram } from './durable-objects/gateway-metrics.js'
import { contentLengthHistogram } from './durable-objects/summary-metrics.js'

/**
* @typedef {import('./durable-objects/gateway-metrics').GatewayMetrics} GatewayMetrics
Expand Down Expand Up @@ -128,6 +129,19 @@ export async function metricsGet(request, env, ctx) {
(gw) =>
`nftgateway_requests_per_time_total{gateway="${gw}",le="+Inf",env="${env.ENV}"} ${metricsCollected.ipfsGateways[gw].totalSuccessfulRequests}`
),
`# HELP nftgateway_responses_content_length_total`,
`# TYPE nftgateway_responses_content_length_total content length delivered histogram`,
...contentLengthHistogram.map(
(t) =>
`nftgateway_responses_content_length_total{le="${t}",env="${env.ENV}"} ${metricsCollected.summaryMetrics.contentLengthHistogram[t]}`
),
`nftgateway_responses_content_length_total{le="+Inf",env="${env.ENV}"} ${metricsCollected.summaryMetrics.totalWinnerSuccessfulRequests}`,
`# HELP nftgateway_responses_content_length_bytes_total`,
`# TYPE nftgateway_responses_content_length_bytes_total content length of delivered cached responses`,
`nftgateway_responses_content_length_bytes_total{env="${env.ENV}"} ${metricsCollected.summaryMetrics.totalContentLengthBytes}`,
`# HELP nftgateway_cached_responses_content_length_bytes_total`,
`# TYPE nftgateway_cached_responses_content_length_bytes_total content length of delivered cached responses`,
`nftgateway_cached_responses_content_length_bytes_total{env="${env.ENV}"} ${metricsCollected.summaryMetrics.totalCachedContentLengthBytes}`,
].join('\n')

res = new Response(metrics, {
Expand Down
27 changes: 27 additions & 0 deletions packages/gateway/test/metrics.spec.js
Original file line number Diff line number Diff line change
Expand Up @@ -21,6 +21,13 @@ test('Gets Metrics content when empty state', async (t) => {
true
)
t.is(metricsResponse.includes('nftgateway_winner_requests_total'), true)
t.is(metricsResponse.includes(`_responses_content_length_total{le=`), true)
t.is(
metricsResponse.includes(
`_responses_content_length_bytes_total{env="test"} 0`
),
true
)
gateways.forEach((gw) => {
t.is(
metricsResponse.includes(`_requests_total{gateway="${gw}",env="test"} 0`),
Expand Down Expand Up @@ -70,6 +77,26 @@ test('Gets Metrics content', async (t) => {
const response = await mf.dispatchFetch('http://localhost:8787/metrics')
const metricsResponse = await response.text()

// content length of responses is 23
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

? is zero below

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

What do you mean below? Want to get the metrics before the requests and see if content length is zero?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

added that case

t.is(
metricsResponse.includes(
`_responses_content_length_total{le="5",env="test"} 0`
),
true
)
t.is(
metricsResponse.includes(
`_responses_content_length_total{le="25",env="test"} 2`
),
true
)
t.is(
metricsResponse.includes(
`_responses_content_length_bytes_total{env="test"} 46`
),
true
)

gateways.forEach((gw) => {
t.is(
metricsResponse.includes(`_requests_total{gateway="${gw}",env="test"} 2`),
Expand Down
6 changes: 5 additions & 1 deletion packages/gateway/wrangler.toml
Original file line number Diff line number Diff line change
Expand Up @@ -19,7 +19,7 @@ main = "index.mjs"
[durable_objects]
bindings = [
{name = "GATEWAYMETRICS", class_name = "GatewayMetrics1"},
{name = "SUMMARYMETRICS", class_name = "SummaryMetrics0"},
{name = "SUMMARYMETRICS", class_name = "SummaryMetrics1"},
{name = "CIDSTRACKER", class_name = "CidsTracker0"}
]

Expand Down Expand Up @@ -129,3 +129,7 @@ deleted_classes = ["GatewayMetrics0"]
tag = "v17" # Should be unique for each entry
new_classes = ["SummaryMetrics0"]
deleted_classes = ["GenericMetrics1"]
[[migrations]]
tag = "v18" # Should be unique for each entry
new_classes = ["SummaryMetrics1"]
deleted_classes = ["SummaryMetrics0"]