feat!: per-miner & per-client daily deal stats #336

bajtos · 2024-08-30T08:22:42Z

Rework the daily_deals table into the following schema:

For each (day, miner_id, client_id), we want to know the following numbers (counts):

tested: (NEW) total deals tested
index_majority_found: (NEW) deals where we found a majority agreeing on the same result of the IPNI query
indexed: deals announcing retrievals to IPNI (HTTP or Graphsync retrievals)
indexed_http: (NEW) deals announcing HTTP retrievals to IPNI
retrieval_majority_found: (NEW) deals where we found a majority agreeing on the same result of the retrieval request
retrievable: deals where the majority agrees the content can be retrieved

BREAKING CHANGE: spark-stats endpoints consuming this table need to change queries to use tested instead of total.

Links:

Remaining work:

add tests for the new columns
add index to speed up date-based lookups (INDEX ON daily_deals (day))

Rework the daily_deals table into the following schema: For each (day, miner_id, client_id), we want to know the following numbers (counts): - `tested`: (NEW) total deals tested - `indexed`: deals announcing retrievals to IPNI (HTTP or Graphsync retrievals) - `indexed_http`: (NEW) deals announcing HTTP retrievals to IPNI - `majority_found`: (NEW) deals where we found a majority agreeing on the same result - `retrievable`: deals where the majority agrees the content can be retrieved BREAKING CHANGE: spark-stats endpoints consuming this table need to change queries to use `tested` instead of `total`. Signed-off-by: Miroslav Bajtoš <oss@bajtos.net>

lib/typings.d.ts

juliangruber · 2024-08-31T16:38:43Z

BREAKING CHANGE: spark-stats endpoints consuming this table need to change queries to use tested instead of total.

Can we use this as an opportunity to fix this coupling, by adding an HTTP endpoint to spark-evaluate and letting spark-stats consume that, instead of querying its db?

juliangruber

Please see #336 (comment)

lib/public-stats.js

bajtos · 2024-09-02T09:19:56Z

BREAKING CHANGE: spark-stats endpoints consuming this table need to change queries to use tested instead of total.

Can we use this as an opportunity to fix this coupling, by adding an HTTP endpoint to spark-evaluate and letting spark-stats consume that, instead of querying its db?

spark-evaluate does not expose any HTTP interface right now. Changing that is way beyond the scope of this work, IMO.

IIRC, the direction we wanted to take is to improve spark-evaluate to publish the evaluation results to IPFS & commit the CID to the smart contract and then move all code building publicly-retrievable stats from spark-evaluate to spark-stats repository.

bajtos · 2024-09-03T12:39:24Z

@juliangruber I realised I need two values for majority_found - one regarding the indexer result and another regarding the retrieval result. It's possible for a deal to be correctly indexed but to not have majority agreement on whether it can be retrieved.

What would be good column names?

I feel that retrievable_majority_found and indexed_majority_found could be confusing, I read it as "somebody found an indexed majority" or "a majority that's retrievable".

How about has_result_indexed and has_result_retrievable?

The scores will be then calculated as follows:

deals indexed: indexed / has_result_indexed
deals offering HTTP retrievals: indexed_http / has_result_indexed
retrievable deals: retrievable / has_result_retrievable

juliangruber · 2024-09-04T13:07:23Z

What about these:

index_majority_found
retrieval_majority_found

I think these make sense because they stand for "majority found in the index process" and "majority found in the retrieval process". Strictly speaking it's "index resolution", so should be index_resolution_majority_found, but I don't think this level of precision is necessary

Co-authored-by: Julian Gruber <julian@juliangruber.com>

Signed-off-by: Miroslav Bajtoš <oss@bajtos.net>

bajtos · 2024-09-06T08:24:06Z

The PR is ready for final review & landing. @juliangruber PTAL 🙏🏻

lib/public-stats.js

Signed-off-by: Miroslav Bajtoš <oss@bajtos.net>

lib/public-stats.js

Signed-off-by: Miroslav Bajtoš <oss@bajtos.net>

test/public-stats.test.js

bajtos · 2024-09-09T09:32:59Z

@juliangruber LGTY now?

juliangruber

Great work, Miro!!

sentry-io · 2024-09-11T20:45:13Z

Suspect Issues

This pull request was deployed and Sentry observed the following issues:

‼️ Error: contract runner does not support sending transactions (operation="sendTransaction", code=UNSUPPOR... ?(platform-stats) View Issue

_{Did you find this useful? React with a 👍 or 👎}

bajtos requested a review from juliangruber August 30, 2024 08:22

bajtos commented Aug 30, 2024

View reviewed changes

lib/typings.d.ts Show resolved Hide resolved

juliangruber requested changes Aug 31, 2024

View reviewed changes

lib/public-stats.js Outdated Show resolved Hide resolved

lib/public-stats.js Outdated Show resolved Hide resolved

bajtos mentioned this pull request Sep 3, 2024

Retrieval status breakdown in public dashboard space-meridian/roadmap#123

Open

2 tasks

bajtos and others added 11 commits September 5, 2024 14:35

Update lib/public-stats.js

3cf30eb

Co-authored-by: Julian Gruber <julian@juliangruber.com>

Merge branch 'main' into deal-retrievability-score

d4c3ddf

fixup! INDEX ON daily_deals (day)

1943e84

Signed-off-by: Miroslav Bajtoš <oss@bajtos.net>

fixup! failing test after merge from main

75e2d45

Signed-off-by: Miroslav Bajtoš <oss@bajtos.net>

fixup! index_majority_found & retrieval_majority_found

1dc7fb2

Signed-off-by: Miroslav Bajtoš <oss@bajtos.net>

report task-with-no-clients to Sentry

6424760

Signed-off-by: Miroslav Bajtoš <oss@bajtos.net>

test: deal_id, client_id

e0c6d4c

Signed-off-by: Miroslav Bajtoš <oss@bajtos.net>

test index_majority_found, indexed, indexed_http

d76bcf7

Signed-off-by: Miroslav Bajtoš <oss@bajtos.net>

test: retrieval_majority_found, retrievable

8f9c930

Signed-off-by: Miroslav Bajtoš <oss@bajtos.net>

fixup! formatting

ef9af97

Signed-off-by: Miroslav Bajtoš <oss@bajtos.net>

Merge branch 'main' into deal-retrievability-score

bc40229

bajtos marked this pull request as ready for review September 6, 2024 08:23

bajtos requested a review from juliangruber September 6, 2024 08:23

juliangruber requested changes Sep 6, 2024

View reviewed changes

lib/public-stats.js Outdated Show resolved Hide resolved

lib/public-stats.js Outdated Show resolved Hide resolved

bajtos added 3 commits September 9, 2024 10:58

Merge branch 'main' into deal-retrievability-score

514480a

fixup! skip tasks/committees with no clients

7343a20

Signed-off-by: Miroslav Bajtoš <oss@bajtos.net>

bump DB migration script index after merge from main

4cb498f

Signed-off-by: Miroslav Bajtoš <oss@bajtos.net>

bajtos commented Sep 9, 2024

View reviewed changes

lib/public-stats.js Outdated Show resolved Hide resolved

refactor: add hasMajority props to CommitteeEvaluation

d675263

Signed-off-by: Miroslav Bajtoš <oss@bajtos.net>

bajtos commented Sep 9, 2024

View reviewed changes

test/public-stats.test.js Show resolved Hide resolved

bajtos commented Sep 9, 2024

View reviewed changes

test/public-stats.test.js Show resolved Hide resolved

bajtos requested a review from juliangruber September 9, 2024 09:33

juliangruber approved these changes Sep 9, 2024

View reviewed changes

bajtos merged commit d2fe8ca into main Sep 9, 2024
6 checks passed

bajtos deleted the deal-retrievability-score branch September 9, 2024 09:57

This was referenced Sep 9, 2024

deps: upgrade spark-evaluate to d2fe8ca7 filecoin-station/spark-stats#215

Merged

fix: sum deal stats over miners and clients filecoin-station/spark-stats#216

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat!: per-miner & per-client daily deal stats #336

feat!: per-miner & per-client daily deal stats #336

bajtos commented Aug 30, 2024 •

edited

Loading

juliangruber commented Aug 31, 2024

juliangruber left a comment

bajtos commented Sep 2, 2024

bajtos commented Sep 3, 2024

juliangruber commented Sep 4, 2024

bajtos commented Sep 6, 2024

bajtos commented Sep 9, 2024

juliangruber left a comment

sentry-io bot commented Sep 11, 2024

feat!: per-miner & per-client daily deal stats #336

feat!: per-miner & per-client daily deal stats #336

Conversation

bajtos commented Aug 30, 2024 • edited Loading

juliangruber commented Aug 31, 2024

juliangruber left a comment

Choose a reason for hiding this comment

bajtos commented Sep 2, 2024

bajtos commented Sep 3, 2024

juliangruber commented Sep 4, 2024

bajtos commented Sep 6, 2024

bajtos commented Sep 9, 2024

juliangruber left a comment

Choose a reason for hiding this comment

sentry-io bot commented Sep 11, 2024

Suspect Issues

bajtos commented Aug 30, 2024 •

edited

Loading