Releases: relari-ai/continuous-eval

v0.3.13

04 Aug 23:02
a9823db

What's Changed

Full Changelog: v0.3.11...v0.3.13

v0.3.11

18 Jun 18:44

What's Changed

New Contributors

Full Changelog: v0.3.10...v0.3.11

v0.3.10

02 Jun 16:12
2618ab6

What's Changed

  • Add max_tokens field to the LLM interface in #66
  • Fix progress bar always stopping one short; update the generator in examples to GPT-4o in #67
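
The idea behind a max_tokens field can be sketched as below. This is an illustrative stand-in, not continuous-eval's actual interface: the LLMConfig and build_request names are hypothetical, showing only how an optional token cap would flow into a provider request.

```python
from dataclasses import dataclass
from typing import Optional

# Hypothetical sketch of an LLM interface with a max_tokens field.
# These names are illustrative; continuous-eval's real classes may differ.
@dataclass
class LLMConfig:
    model: str
    temperature: float = 0.0
    max_tokens: Optional[int] = None  # cap on generated tokens; None = provider default

def build_request(config: LLMConfig, prompt: str) -> dict:
    """Translate the config into a provider-style request payload."""
    payload = {
        "model": config.model,
        "prompt": prompt,
        "temperature": config.temperature,
    }
    if config.max_tokens is not None:
        payload["max_tokens"] = config.max_tokens
    return payload

config = LLMConfig(model="gpt-4o", max_tokens=512)
print(build_request(config, "Summarize the changelog."))
```

Keeping the field optional means existing callers that never set a cap keep the provider's default behavior.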

Full Changelog: v0.3.9...v0.3.10

v0.3.9

23 May 19:53
8124a22

What's Changed

  • Fix contexts attribute name in examples documentation in #58
  • Transition to eval runner in #61
  • Add SQL metrics in #63
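
To illustrate the general idea of a syntactic SQL metric (a minimal sketch only; continuous-eval's SQL metrics in #63 are more involved than this, and sql_exact_match is a hypothetical name): normalize whitespace, case, and trailing semicolons, then compare the generated query to the ground truth.

```python
import re

def normalize_sql(query: str) -> str:
    """Collapse whitespace, lowercase, and drop a trailing semicolon."""
    return re.sub(r"\s+", " ", query.strip()).lower().rstrip(";")

def sql_exact_match(predicted: str, ground_truth: str) -> float:
    """1.0 if the two queries are identical after normalization, else 0.0."""
    return 1.0 if normalize_sql(predicted) == normalize_sql(ground_truth) else 0.0

print(sql_exact_match("SELECT * FROM users;", "select *  from users"))  # → 1.0
```

A purely syntactic check like this misses semantically equivalent queries (e.g. reordered columns), which is why richer SQL metrics typically parse the query rather than compare strings.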

New Contributors

Full Changelog: v0.3.7...v0.3.9

v0.3.7

25 Apr 01:42

What's Changed

  • Fix double-counting corner case for precision / average precision in #55
  • Fix required keyword for code string to ground_truth_answers in #56
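
The double-counting corner case in #55 can be illustrated with a small average-precision sketch (my own minimal implementation, not the library's): if a relevant document appears twice in the retrieved list, it must be credited at most once, which the matched set below enforces.

```python
def average_precision(retrieved: list, relevant: set) -> float:
    """Average precision over a ranked list.

    Each relevant item is counted at most once, so a duplicate in the
    retrieved list cannot inflate the score (the double-counting corner case).
    """
    matched = set()
    hits, score = 0, 0.0
    for rank, doc in enumerate(retrieved, start=1):
        if doc in relevant and doc not in matched:
            matched.add(doc)
            hits += 1
            score += hits / rank  # precision at this rank
    return score / len(relevant) if relevant else 0.0

# "a" appears twice but only its first occurrence counts:
print(average_precision(["a", "b", "a", "c"], {"a", "c"}))  # → 0.75
```

Without the matched set, the duplicate "a" at rank 3 would add another precision term and overstate the score.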

New Contributors

Full Changelog: v0.3.5...v0.3.7

v0.3.5

27 Mar 06:06
  • Add Bedrock LLM provider
  • Bug fixes

v0.3.4

20 Mar 05:26
6fca8ab

What's Changed

  • Add Cohere LLM in #46
  • Feature: LLM custom metric in #48
  • Feature: add vLLM OpenAI API endpoint in #49

New Contributors

Full Changelog: v0.3.2...v0.3.4

v0.3.2

08 Mar 09:00
  • Metric batch execution now uses threads by default
  • Bug fixes
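
Threaded batch execution can be sketched as follows (an illustrative stand-in built on the standard library, not continuous-eval's internals; exact_match and run_batch are hypothetical names). Threads suit metric batches because LLM-based metrics are I/O-bound API calls, so they overlap well even under the GIL.

```python
from concurrent.futures import ThreadPoolExecutor

def exact_match(item: dict) -> float:
    """Toy metric: 1.0 if the answer equals the ground truth."""
    return 1.0 if item["answer"] == item["ground_truth"] else 0.0

def run_batch(metric, items: list, max_workers: int = 8) -> list:
    """Evaluate a metric over a batch using a thread pool.

    pool.map preserves input order, so results line up with items.
    """
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        return list(pool.map(metric, items))

batch = [
    {"answer": "Paris", "ground_truth": "Paris"},
    {"answer": "Lyon", "ground_truth": "Paris"},
]
print(run_batch(exact_match, batch))  # → [1.0, 0.0]
```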

v0.3.1

29 Feb 19:38
1c088ed

Key points:

  • Added from_data class method to Dataset class
  • Fixed is_empty method in EvaluationResults, MetricsResults, and TestResults
  • Added error handling in LLM-based metrics
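
The shape of a from_data-style constructor and an is_empty check can be sketched as below. This is a hypothetical miniature, assuming only what the release notes state; the real Dataset, EvaluationResults, and related classes in continuous-eval may differ.

```python
from dataclasses import dataclass, field

# Hypothetical sketch of a Dataset with a from_data class method;
# the real continuous-eval API may differ.
@dataclass
class Dataset:
    data: list = field(default_factory=list)

    @classmethod
    def from_data(cls, *records: dict) -> "Dataset":
        """Build a dataset directly from in-memory records instead of a file."""
        return cls(data=list(records))

    def is_empty(self) -> bool:
        """True when the dataset holds no records."""
        return len(self.data) == 0

ds = Dataset.from_data({"question": "Q1", "answer": "A1"})
print(ds.is_empty())  # → False
```

A constructor like this avoids a round trip through the filesystem when the evaluation data is already in memory.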

v0.2.7

16 Feb 22:19
696f8d0

What's Changed

  • Added Code Evaluation Metrics in #29

Full Changelog: v0.2.6...v0.2.7