Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Transition to eval runner #61

Merged
merged 3 commits into from
May 19, 2024
Merged

Transition to eval runner #61

merged 3 commits into from
May 19, 2024

Conversation

pantonante
Copy link
Contributor

@pantonante pantonante commented May 18, 2024

Changelog:

  • Remover Evaluation Manager
  • Create a new Pipeline Logger class
  • Update docs
  • Other bug fixes

🚀 This description was created by Ellipsis for commit c90711a

Summary:

Transitioned from EvaluationManager to EvaluationRunner, introduced PipelineLogger, updated documentation including README.md, and refactored the codebase.

Key points:

  • Removed EvaluationManager
  • Introduced EvaluationRunner and PipelineLogger
  • Updated documentation, including changes in README.md
  • Refactored codebase for new class integration
  • Fixed various bugs

Generated with ❤️ by ellipsis.dev

@pantonante pantonante requested a review from yisz May 18, 2024 03:40
Copy link
Contributor

@ellipsis-dev ellipsis-dev bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

👍 Looks good to me! Reviewed everything up to e399a06 in 1 minute and 38 seconds

More details
  • Looked at 1495 lines of code in 23 files
  • Skipped 1 files when reviewing.
  • Skipped posting 1 drafted comments based on config settings.
1. continuous_eval/eval/__init__.py:9
  • Draft comment:
    The import of EvaluationRunner is correctly added to reflect the new changes in the system architecture.
  • Reason this comment was not posted:
    Confidence changes required: 0%
    The PR description mentions the removal of the Evaluation Manager and the creation of a new Pipeline Logger class. The code changes reflect these updates, including the addition of the Evaluation Runner and updates to the documentation. The PR also includes other bug fixes and updates to the documentation to reflect the new changes in the system architecture.

Workflow ID: wflow_BKTc0VGtqrOzCZod


You can customize Ellipsis with 👍 / 👎 feedback, review rules, user-specific overrides, quiet mode, and more.

Copy link
Contributor

@ellipsis-dev ellipsis-dev bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

👍 Looks good to me! Incremental review on 068de1f in 1 minute and 50 seconds

More details
  • Looked at 231 lines of code in 7 files
  • Skipped 0 files when reviewing.
  • Skipped posting 1 drafted comments based on config settings.
1. docs/src/content/docs/pipeline/pipeline_logger.mdx:25
  • Draft comment:
    The usage of the value parameter here is inconsistent with its definition earlier in the document. The value parameter is meant to store the output of a module, not the name of a tool. If tool_name is intended to be logged, it should be part of a different parameter or method designed for logging tool usage.
pipelog.log(uid=sample_uid, module="module_name", output="output_of_module", tool_name="tool_name", tool_args={"a": a, "b": b})
  • Reason this comment was not posted:
    Confidence of 0% on close inspection, compared to threshold of 50%.

Workflow ID: wflow_VLyCLCKwI51TlYET


You can customize Ellipsis with 👍 / 👎 feedback, review rules, user-specific overrides, quiet mode, and more.

@yisz
Copy link
Contributor

yisz commented May 18, 2024

@pantonante I updated examples in the docs & readme in couple more places to reflect the latest eval runner & logger.

The examples repo and full examples in the docs still uses eval_manager; we can update those once we merge these changes in.

Please check if my doc changes are good before merging into main! Thanks!

Copy link
Contributor

@ellipsis-dev ellipsis-dev bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

❌ Changes requested. Incremental review on c90711a in 1 minute and 16 seconds

More details
  • Looked at 77 lines of code in 1 files
  • Skipped 0 files when reviewing.
  • Skipped posting 0 drafted comments based on config settings.

Workflow ID: wflow_BQMFBWtw3BCY7Njj


Want Ellipsis to fix these issues? Tag @ellipsis-dev in a comment. You can customize Ellipsis with 👍 / 👎 feedback, review rules, user-specific overrides, quiet mode, and more.

@@ -199,25 +200,24 @@ print(pipeline.graph_repr()) # optional: visualize the pipeline
Now you can run the evaluation on your pipeline
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The example code in the README still references the old EvaluationManager. Since the PR description mentions its removal, the documentation should be updated to reflect the new EvaluationRunner usage consistently throughout.

Suggested change
Now you can run the evaluation on your pipeline
# Update the example code to use `EvaluationRunner` instead of the old `EvaluationManager`.

@pantonante pantonante merged commit 344f7e9 into main May 19, 2024
@pantonante pantonante deleted the enhancement/eval_runner branch May 19, 2024 17:14
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants