Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Geeartl/5973 title issue #194

Merged
merged 2 commits into from
Sep 6, 2023
Merged

Geeartl/5973 title issue #194

merged 2 commits into from
Sep 6, 2023

Conversation

georearl
Copy link
Contributor

@georearl georearl commented Sep 6, 2023

This PR addresses an issue where FR sometimes returns a piece of text identified as the title. This will then replace the title in subsequent chunks. The problem is these chunks would no longer have the context of the main document title. The PR addresses this by concatenating all items identified as title on the first page of a document, or the first item in the document identified as a title if it is not on page 1. This value is then stored in the title key-value across all subsequent chunks. Sections are still stored in the section key-value. To not lose any value provided by the subsequent pieces of text identified as a title, rightly or wrongly, these are stored in the subtitle key-value. They will be replaced when/if another title is subsequently identified by FR

@georearl georearl changed the base branch from main to vNext-Dev September 6, 2023 23:22
@georearl georearl merged commit c8a0353 into vNext-Dev Sep 6, 2023
2 checks passed
@georearl georearl deleted the geeartl/5973-title-issue branch September 6, 2023 23:41
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants