Transition to eval runner #61

pantonante · 2024-05-18T03:40:56Z

Changelog:

Remover Evaluation Manager
Create a new Pipeline Logger class
Update docs
Other bug fixes

🚀 This description was created by Ellipsis for commit `c90711a`

Summary:

Transitioned from EvaluationManager to EvaluationRunner, introduced PipelineLogger, updated documentation including README.md, and refactored the codebase.

Key points:

Removed EvaluationManager
Introduced EvaluationRunner and PipelineLogger
Updated documentation, including changes in README.md
Refactored codebase for new class integration
Fixed various bugs

Generated with ❤️ by ellipsis.dev

ellipsis-dev

👍 Looks good to me! Reviewed everything up to e399a06 in 1 minute and 38 seconds

More details

Looked at 1495 lines of code in 23 files
Skipped 1 files when reviewing.
Skipped posting 1 drafted comments based on config settings.

1. continuous_eval/eval/__init__.py:9

Draft comment:
The import of EvaluationRunner is correctly added to reflect the new changes in the system architecture.
Reason this comment was not posted:
Confidence changes required: 0%
The PR description mentions the removal of the Evaluation Manager and the creation of a new Pipeline Logger class. The code changes reflect these updates, including the addition of the Evaluation Runner and updates to the documentation. The PR also includes other bug fixes and updates to the documentation to reflect the new changes in the system architecture.

Workflow ID: wflow_BKTc0VGtqrOzCZod

You can customize Ellipsis with 👍 / 👎 feedback, review rules, user-specific overrides, quiet mode, and more.

ellipsis-dev

👍 Looks good to me! Incremental review on 068de1f in 1 minute and 50 seconds

More details

Looked at 231 lines of code in 7 files
Skipped 0 files when reviewing.
Skipped posting 1 drafted comments based on config settings.

1. docs/src/content/docs/pipeline/pipeline_logger.mdx:25

Draft comment:
The usage of the value parameter here is inconsistent with its definition earlier in the document. The value parameter is meant to store the output of a module, not the name of a tool. If tool_name is intended to be logged, it should be part of a different parameter or method designed for logging tool usage.

pipelog.log(uid=sample_uid, module="module_name", output="output_of_module", tool_name="tool_name", tool_args={"a": a, "b": b})

Reason this comment was not posted:
Confidence of 0% on close inspection, compared to threshold of 50%.

Workflow ID: wflow_VLyCLCKwI51TlYET

You can customize Ellipsis with 👍 / 👎 feedback, review rules, user-specific overrides, quiet mode, and more.

yisz · 2024-05-18T16:04:53Z

@pantonante I updated examples in the docs & readme in couple more places to reflect the latest eval runner & logger.

The examples repo and full examples in the docs still uses eval_manager; we can update those once we merge these changes in.

Please check if my doc changes are good before merging into main! Thanks!

ellipsis-dev

❌ Changes requested. Incremental review on c90711a in 1 minute and 16 seconds

More details

Looked at 77 lines of code in 1 files
Skipped 0 files when reviewing.
Skipped posting 0 drafted comments based on config settings.

Workflow ID: wflow_BQMFBWtw3BCY7Njj

Want Ellipsis to fix these issues? Tag @ellipsis-dev in a comment. You can customize Ellipsis with 👍 / 👎 feedback, review rules, user-specific overrides, quiet mode, and more.

ellipsis-dev · 2024-05-18T16:14:17Z

README.md

@@ -199,25 +200,24 @@ print(pipeline.graph_repr()) # optional: visualize the pipeline
 Now you can run the evaluation on your pipeline


The example code in the README still references the old EvaluationManager. Since the PR description mentions its removal, the documentation should be updated to reflect the new EvaluationRunner usage consistently throughout.

Suggested change

Now you can run the evaluation on your pipeline

# Update the example code to use `EvaluationRunner` instead of the old `EvaluationManager`.

Transition to eval runner

e399a06

pantonante requested a review from yisz May 18, 2024 03:40

ellipsis-dev bot reviewed May 18, 2024

View reviewed changes

updated docs & readme for evalrunner | logger

068de1f

ellipsis-dev bot reviewed May 18, 2024

View reviewed changes

repo readme updates

c90711a

ellipsis-dev bot reviewed May 18, 2024

View reviewed changes

pantonante merged commit 344f7e9 into main May 19, 2024

pantonante deleted the enhancement/eval_runner branch May 19, 2024 17:14

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Transition to eval runner #61

Transition to eval runner #61

pantonante commented May 18, 2024 •

edited by ellipsis-dev bot

Loading

ellipsis-dev bot left a comment

ellipsis-dev bot left a comment

yisz commented May 18, 2024

ellipsis-dev bot left a comment

ellipsis-dev bot May 18, 2024

		@@ -199,25 +200,24 @@ print(pipeline.graph_repr()) # optional: visualize the pipeline
		Now you can run the evaluation on your pipeline

	Now you can run the evaluation on your pipeline
	# Update the example code to use `EvaluationRunner` instead of the old `EvaluationManager`.

Transition to eval runner #61

Transition to eval runner #61

Conversation

pantonante commented May 18, 2024 • edited by ellipsis-dev bot Loading

Summary:

ellipsis-dev bot left a comment

Choose a reason for hiding this comment

ellipsis-dev bot left a comment

Choose a reason for hiding this comment

yisz commented May 18, 2024

ellipsis-dev bot left a comment

Choose a reason for hiding this comment

ellipsis-dev bot May 18, 2024

Choose a reason for hiding this comment

pantonante commented May 18, 2024 •

edited by ellipsis-dev bot

Loading