-
Notifications
You must be signed in to change notification settings - Fork 29
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Transition to eval runner #61
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
👍 Looks good to me! Reviewed everything up to e399a06 in 1 minute and 38 seconds
More details
- Looked at
1495
lines of code in23
files - Skipped
1
files when reviewing. - Skipped posting
1
drafted comments based on config settings.
1. continuous_eval/eval/__init__.py:9
- Draft comment:
The import ofEvaluationRunner
is correctly added to reflect the new changes in the system architecture. - Reason this comment was not posted:
Confidence changes required:0%
The PR description mentions the removal of the Evaluation Manager and the creation of a new Pipeline Logger class. The code changes reflect these updates, including the addition of the Evaluation Runner and updates to the documentation. The PR also includes other bug fixes and updates to the documentation to reflect the new changes in the system architecture.
Workflow ID: wflow_BKTc0VGtqrOzCZod
You can customize Ellipsis with 👍 / 👎 feedback, review rules, user-specific overrides, quiet
mode, and more.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
👍 Looks good to me! Incremental review on 068de1f in 1 minute and 50 seconds
More details
- Looked at
231
lines of code in7
files - Skipped
0
files when reviewing. - Skipped posting
1
drafted comments based on config settings.
1. docs/src/content/docs/pipeline/pipeline_logger.mdx:25
- Draft comment:
The usage of thevalue
parameter here is inconsistent with its definition earlier in the document. Thevalue
parameter is meant to store the output of a module, not the name of a tool. Iftool_name
is intended to be logged, it should be part of a different parameter or method designed for logging tool usage.
pipelog.log(uid=sample_uid, module="module_name", output="output_of_module", tool_name="tool_name", tool_args={"a": a, "b": b})
- Reason this comment was not posted:
Confidence of 0% on close inspection, compared to threshold of 50%.
Workflow ID: wflow_VLyCLCKwI51TlYET
You can customize Ellipsis with 👍 / 👎 feedback, review rules, user-specific overrides, quiet
mode, and more.
@pantonante I updated examples in the docs & readme in couple more places to reflect the latest eval runner & logger. The examples repo and full examples in the docs still uses eval_manager; we can update those once we merge these changes in. Please check if my doc changes are good before merging into main! Thanks! |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
❌ Changes requested. Incremental review on c90711a in 1 minute and 16 seconds
More details
- Looked at
77
lines of code in1
files - Skipped
0
files when reviewing. - Skipped posting
0
drafted comments based on config settings.
Workflow ID: wflow_BQMFBWtw3BCY7Njj
Want Ellipsis to fix these issues? Tag @ellipsis-dev
in a comment. You can customize Ellipsis with 👍 / 👎 feedback, review rules, user-specific overrides, quiet
mode, and more.
@@ -199,25 +200,24 @@ print(pipeline.graph_repr()) # optional: visualize the pipeline | |||
Now you can run the evaluation on your pipeline |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The example code in the README still references the old EvaluationManager
. Since the PR description mentions its removal, the documentation should be updated to reflect the new EvaluationRunner
usage consistently throughout.
Now you can run the evaluation on your pipeline | |
# Update the example code to use `EvaluationRunner` instead of the old `EvaluationManager`. |
Changelog:
Summary:
Transitioned from
EvaluationManager
toEvaluationRunner
, introducedPipelineLogger
, updated documentation includingREADME.md
, and refactored the codebase.Key points:
EvaluationManager
EvaluationRunner
andPipelineLogger
README.md
Generated with ❤️ by ellipsis.dev