chore: Misc basic evals improvements (eg: consistent activation heuristic, cli args) #324
@jbloomAus does this require us to rerun any evals? also, has the schema changed? (if so we need to update on np's side)
e2978ea to 81a7805
Codecov Report
Attention: Patch coverage is
Additional details and impacted files
@@            Coverage Diff             @@
##             main     #324      +/-   ##
==========================================
+ Coverage   63.97%   64.13%   +0.15%
==========================================
  Files          25       25
  Lines        3223     3318      +95
  Branches      408      421      +13
==========================================
+ Hits         2062     2128      +66
- Misses       1052     1074      +22
- Partials      109      116       +7
6bdaf7b to d1b4f5d
Description
Updates the evals code to compute feature-wise metrics, adds tests, and improves the CLI.