PromptFlow Experimentation Framework

This experiment framework is a utility library that uses the prompt flow SDK to conduct, track and analyze experiments.

Key Features of Experimentation Framework

Developer-Friendly Experiment Execution: Simplified APIs streamline the process of running experiments.
Flexible Execution Environment: Experimentation can be conducted on both local machines and on Azure Machine Learning (AML) compute, facilitating seamless switching between environments based on dataset sizes.
Versatile Experiment Flows: Enable the chaining of experiments, allowing easy passing of outputs from one experiment to another.
Efficient Experiment Tracking: Unique identifiers and tags help monitor and differentiate experiments, aiding in efficient tracking and management.
Variants and Connected Runs: Simplified APIs enable the creation of experiment runs with multiple variants and connected runs in a single step, this in turn will create multiple runs using PromptFlow automatically.
Output Management: Provides utility functions to retrieve experiment outputs in various formats (CSV or JSONL) and merge outputs from multiple runs for streamlined analysis.
Custom Python Tool for GPT4 with Vision or GPT4o model: Offers a custom Python tool that allows to make GPT4 vision/GPT4o with detail parameter to control the resolution. Read more on the implementation here.

Dev setup

Please follow this link for the dev setup details.

Directory Structure

docs : Contains the documentation.
common: Contains common python files required for experiment execution
keyword_correctness: Contains experiment and evaluation flow for keyword correctness use case.

Sample Experiment and Evaluation flows

Along with the framework, there is a sample use case for keyword correctness is implemented here with a two step experiment flows and also has an evaluation flow.

Experiment architecture - Read more on the details of the experiment architecture

Guidelines to execute the experiments - Follow this link to understand the experiment execution

Understand the metrics - The sample evaluation flow evaluates the results, here is the breakdown of the metrics evaluated in the evaluation flow

Walkthrough of Experimentation Framework

ExperimentationFramework.mp4

Name		Name	Last commit message	Last commit date
Latest commit History 17 Commits
common		common
docs		docs
keyword_correctness		keyword_correctness
.env_template		.env_template
.gitignore		.gitignore
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md
SECURITY.md		SECURITY.md
SUPPORT.md		SUPPORT.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

PromptFlow Experimentation Framework

Key Features of Experimentation Framework

Dev setup

Directory Structure

Sample Experiment and Evaluation flows

Walkthrough of Experimentation Framework

About

Releases

Packages

Contributors 3

Languages

License

microsoft/dstoolkit-pf-exp-framework

Folders and files

Latest commit

History

Repository files navigation

PromptFlow Experimentation Framework

Key Features of Experimentation Framework

Dev setup

Directory Structure

Sample Experiment and Evaluation flows

Walkthrough of Experimentation Framework

About

Resources

License

Code of conduct

Security policy

Stars

Watchers

Forks

Releases

Packages 0

Contributors 3

Languages

Packages