Harrison/guarded output parser #1804

hwchase17 · 2023-03-20T01:10:49Z

No description provided.

langchain/guardrails/format_instructions.py

langchain/chains/llm.py

langchain/schema.py

langchain/guardrails/parsing.py

jerwelborn · 2023-03-20T23:29:38Z

langchain/output_parsers/pydantic.py

@@ -30,10 +30,11 @@ def get_format_instructions(self) -> str:
        schema = self.pydantic_object.schema()

        # Remove extraneous fields.


noting @hwchase17 : added this back to support more complex types, like an enum.. I tweaked the in-context example as a result, including a positive + negative case. Seems to be working reasonably well. I'd like to add some tests, but doesn't have to be in this diff

eyurtsev · 2023-03-21T05:57:05Z

langchain/chains/llm.py

+    def _get_final_output(
+        self, generations: List[Generation], prompt_value: PromptValue
+    ) -> Any:
+        """Get the final output from a list of generations for a prompt."""


Should we drop an assertion about shape of generations. It's locally (i.e., from the perspective of a person living inside the function) not obvious why it's okay to index into the list (and ignore any other generations)

eyurtsev · 2023-03-21T06:02:02Z

langchain/output_parsers/fix.py

+
+
+class FixOutputParser(BaseOutputParser):
+    """Wraps a parser and tries to fix parsing errors."""


Are the any benefits to trying multiple times?

eyurtsev · 2023-03-21T13:52:33Z

langchain/chains/llm.py

+        response, prompts = await self.agenerate(input_list)
+        return self.create_outputs(response, prompts)
+
+    def _get_final_output(


What do you think about renaming this method and update the doc-string to clarify what "final" means.

The doc-string can be a bit more verbose and indicate that the output will be parsed if a parser has been specified.

At this level of the code, it's surprising to a parser attached on a prompt_value object. (Developers will probably not know what PromptValue is either)

eyurtsev · 2023-03-21T13:53:39Z

langchain/chains/qa_generation/base.py

@@ -47,7 +47,7 @@ def output_keys(self) -> List[str]:

    def _call(self, inputs: Dict[str, str]) -> Dict[str, Any]:
        docs = self.text_splitter.create_documents([inputs[self.input_key]])


This is stylistic -- I like a convention of favoring non abbreviated names (e.g., docs -> documents) -- goal is to reduce number of names that appear in the code base since it removes one degree of freedom from naming.

eyurtsev · 2023-03-21T14:10:04Z

langchain/output_parsers/retry.py

+Details: {error}
+Please try again:"""
+
+NAIVE_RETRY_PROMPT = PromptTemplate.from_template(NAIVE_COMPLETION_RETRY)


How about inlining the template instead of using a proxy variable? We don't re-use the proxy variable anywhere right?

nope, but imo it makes it more readable to have it separate

eyurtsev · 2023-03-21T14:11:00Z

langchain/output_parsers/retry.py

+)
+
+
+class RetryOutputParser(BaseOutputParser):


What's the difference between this parser and the fix parser above?

What do you think about expanding the doc-string significantly to explain use case and how the retry works? e.g., 2-5 lines of documentation.

yup will good, good call

eyurtsev · 2023-03-21T14:15:41Z

langchain/output_parsers/retry.py

+    """Wraps a parser and tries to fix parsing errors."""
+
+    parser: BaseOutputParser
+    retry_chain: LLMChain


The type system doesn't seem to help us much here -- since it looks like the run api of the retry_chain interface is defined by the prompt (i.e., ability to accept prompt and completion)

have you thought of a way to surface that type information? I assume the challenge is maintaining things serializable

not sure what you mean, would love to discuss

eyurtsev · 2023-03-21T14:16:16Z

langchain/output_parsers/retry.py

+        return self.parser.get_format_instructions()
+
+
+class RetryWithErrorOutputParser(BaseOutputParser):


Why not combine this parser with the one above and add a variable to control behavior on error?

prompt inputs differ... its def possible but youd need to make sure the variable for whether to pass error in is aligned with the prompt which just feels like too many levers to make in sync

eyurtsev · 2023-03-21T14:17:55Z

langchain/schema.py

+        return output_parser_dict
+
+
+class OutputParserException(Exception):


1 line doc-string to explain what the exception is to be used for and in what cases it's expected to be raised / excepted?

The exception name looks like it could be raised by either logic issues in the parser or due to malformed data. Is this exception meant to catch only the latter?

hwchase17 added 3 commits March 19, 2023 17:57

guarded output parser

44d2492

cr

1af560c

cr

6898d83

jerwelborn reviewed Mar 20, 2023

View reviewed changes

langchain/guardrails/format_instructions.py Outdated Show resolved Hide resolved

jerwelborn reviewed Mar 20, 2023

View reviewed changes

langchain/guardrails/format_instructions.py Outdated Show resolved Hide resolved

jerwelborn reviewed Mar 20, 2023

View reviewed changes

langchain/chains/llm.py Outdated Show resolved Hide resolved

jerwelborn reviewed Mar 20, 2023

View reviewed changes

langchain/schema.py Outdated Show resolved Hide resolved

jerwelborn added 2 commits March 20, 2023 11:27

factor out 'naive' retry chain

fa2d98c

make parser and guarded parser roughly swappable

bfa858b

jerwelborn force-pushed the harrison/guarded-output-parser branch from c33874e to bfa858b Compare March 20, 2023 19:42

jerwelborn added 2 commits March 20, 2023 12:54

add example nb

325825d

try make guarded/retriable output parser an instance of parser

a0cde05

jerwelborn reviewed Mar 20, 2023

View reviewed changes

langchain/guardrails/parsing.py Outdated Show resolved Hide resolved

jerwelborn added 2 commits March 20, 2023 16:25

tweak pydantic parser

3ee7558

showcase guarded pydantic parsing

32a8507

jerwelborn reviewed Mar 20, 2023

View reviewed changes

hwchase17 added 3 commits March 20, 2023 19:10

cr

ccc1897

Merge branch 'master' into harrison/guarded-output-parser

5f41f07

cr

86085bc

eyurtsev reviewed Mar 21, 2023

View reviewed changes

hwchase17 added 5 commits March 21, 2023 12:48

Merge branch 'master' into harrison/guarded-output-parser

68e9b7f

cr

97b8724

cr

cbab13c

Merge branch 'master' into harrison/guarded-output-parser

1de0790

cr

e8f2ed4

hwchase17 merged commit ce5d97b into master Mar 22, 2023

hwchase17 deleted the harrison/guarded-output-parser branch March 22, 2023 05:07

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Harrison/guarded output parser #1804

Harrison/guarded output parser #1804

hwchase17 commented Mar 20, 2023

jerwelborn Mar 20, 2023 •

edited

Loading

eyurtsev Mar 21, 2023 •

edited

Loading

eyurtsev Mar 21, 2023

eyurtsev Mar 21, 2023

eyurtsev Mar 21, 2023

eyurtsev Mar 21, 2023

hwchase17 Mar 22, 2023

eyurtsev Mar 21, 2023

hwchase17 Mar 22, 2023

eyurtsev Mar 21, 2023

hwchase17 Mar 22, 2023

eyurtsev Mar 21, 2023

hwchase17 Mar 22, 2023

eyurtsev Mar 21, 2023

hwchase17 Mar 22, 2023

		@@ -30,10 +30,11 @@ def get_format_instructions(self) -> str:
		schema = self.pydantic_object.schema()

		# Remove extraneous fields.



		class FixOutputParser(BaseOutputParser):
		"""Wraps a parser and tries to fix parsing errors."""

		@@ -47,7 +47,7 @@ def output_keys(self) -> List[str]:

		def _call(self, inputs: Dict[str, str]) -> Dict[str, Any]:
		docs = self.text_splitter.create_documents([inputs[self.input_key]])

		return self.parser.get_format_instructions()


		class RetryWithErrorOutputParser(BaseOutputParser):

		return output_parser_dict


		class OutputParserException(Exception):

Harrison/guarded output parser #1804

Harrison/guarded output parser #1804

Conversation

hwchase17 commented Mar 20, 2023

jerwelborn Mar 20, 2023 • edited Loading

Choose a reason for hiding this comment

eyurtsev Mar 21, 2023 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

jerwelborn Mar 20, 2023 •

edited

Loading

eyurtsev Mar 21, 2023 •

edited

Loading