Feature/enhance safety tests with promptguard #1119

chakravarthik27 · 2024-09-18T17:07:13Z

No description provided.

This commit refactors the PromptGuard class in the modelhandler/promptguard.py module. The changes include: - Simplifying the initialization process by using a singleton pattern - Loading the model and tokenizer from Hugging Face - Preprocessing the input text to remove spaces and mitigate prompt injection tactics - Calculating class probabilities for a single or batch of texts - Adding methods to get jailbreak scores and indirect injection scores for a single input text or a batch of texts - Processing texts in batches to improve efficiency The commit also includes changes in the safety.py module: - Importing the PromptGuard class from the modelhandler/promptguard.py module - Replacing the pipeline usage with the PromptGuard class to get indirect injection scores Lastly, the commit includes changes in the output.py and sample.py modules: - Adding a greater than or equal to comparison method in the MaxScoreOutput class - Updating the comparison method in the QASample class to use the new comparison method in MaxScoreOutput

chakravarthik27 added 2 commits September 17, 2024 17:13

Refactor security.py to add new security checks

10aa4b3

Refactor typing imports in accuracy.py and safety.py

62b77b1

chakravarthik27 self-assigned this Sep 18, 2024

chakravarthik27 changed the title ~~Feature/enhance security tests with promptguard~~ Feature/enhance safety tests with promptguard Sep 18, 2024

chakravarthik27 added 3 commits September 18, 2024 23:29

Refactor test type in safety.py and add decimal formatting in output.py

7a58067

fixed: formatted issue

e9c54e9

chakravarthik27 merged commit d89477a into release/2.4.0 Sep 19, 2024
3 checks passed

chakravarthik27 linked an issue Sep 19, 2024 that may be closed by this pull request

Enhance Security Tests with PromptGuard #1117

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Feature/enhance safety tests with promptguard #1119

Feature/enhance safety tests with promptguard #1119

chakravarthik27 commented Sep 18, 2024

Feature/enhance safety tests with promptguard #1119

Feature/enhance safety tests with promptguard #1119

Conversation

chakravarthik27 commented Sep 18, 2024