Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[formrecognizer] Add migration guide #20960

Merged
merged 17 commits into from
Oct 1, 2021
Merged
244 changes: 244 additions & 0 deletions sdk/formrecognizer/azure-ai-formrecognizer/MIGRATION_GUIDE.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,244 @@
# Guide for migrating azure-ai-formrecognizer to version 3.2.x from versions 3.1.x and below
catalinaperalta marked this conversation as resolved.
Show resolved Hide resolved

This guide is intended to assist in the migration to `azure-ai-formrecognizer (3.2.x)` from versions `3.1.x` and below. It will focus on side-by-side comparisons for similar operations between versions, please note that version `3.1.2` will be used for comparison with `3.2.0b1`.
catalinaperalta marked this conversation as resolved.
Show resolved Hide resolved
catalinaperalta marked this conversation as resolved.
Show resolved Hide resolved

Familiariy with `azure-ai-formrecognizer (3.1.x and below)` package is assumed. For those new to the Azure Form Recognizer SDK please refer to the [README](README) rather than this guide.
catalinaperalta marked this conversation as resolved.
Show resolved Hide resolved
catalinaperalta marked this conversation as resolved.
Show resolved Hide resolved

## Table of Contents
- [Migration benefits](#migration-benefits)
- [Important changes](#important-changes)
catalinaperalta marked this conversation as resolved.
Show resolved Hide resolved
- [Additional samples](#additional-samples)

## Migration benefits

A natural question to ask when considering whether or not to adopt a new version or library is what the benefits of doing so would be. As Azure Form Recognizer has matured and been embraced by a more diverse group of developers, we have been focused on learning the patterns and practices to best support developer productivity and add value to our customers.
catalinaperalta marked this conversation as resolved.
Show resolved Hide resolved

The benefit of the new design of the `azure-ai-formrecognizer (3.2.x)` library is that it introduces two new clients `DocumentAnalysisClient` and the `DocumentModelAdministrationClient` with unified methods for analyzing documents and provides support for the new features added by the service in version `2021-09-30-preview`.
Please refer to the [README](README) for more information on these new clients.
catalinaperalta marked this conversation as resolved.
Show resolved Hide resolved

## Important changes

### Instantiating a client
catalinaperalta marked this conversation as resolved.
Show resolved Hide resolved

We continue to support connection strings and AAD authentication methods when creating our clients. Below are the differences between the two versions:
catalinaperalta marked this conversation as resolved.
Show resolved Hide resolved

- In `3.2.x`, we have added `DocumentAnalysisClient` and `DocumentModelAdministrationClient` which support service version `2021-09-30-preview`.
- `FormRecognizerClient` and `FormTrainingClient` will return an error is called with an API version of `2021-09-30-preview` or above.
catalinaperalta marked this conversation as resolved.
Show resolved Hide resolved

Creating a new client in `3.1.x`:
```python
form_recognizer_client = FormRecognizerClient(
catalinaperalta marked this conversation as resolved.
Show resolved Hide resolved
endpoint=endpoint, credential=AzureKeyCredential(key)
)
catalinaperalta marked this conversation as resolved.
Show resolved Hide resolved
```

Creating a new client in `3.2.x`:
```python
document_analysis_client = DocumentAnalysisClient(
endpoint=endpoint, credential=AzureKeyCredential(key)
)
```

### Analyzing documents
maririos marked this conversation as resolved.
Show resolved Hide resolved

Differences between the versions:
- In `DocumentAnalysisClient` all prebuilt model methods along with custom model, layout and general prebuilt document analysis are unified into two methods called
catalinaperalta marked this conversation as resolved.
Show resolved Hide resolved
`begin_analyze_document` and `begin_analyze_document_from_url`.
catalinaperalta marked this conversation as resolved.
Show resolved Hide resolved
- In `FormRecognizerClient` there are two methods (a stream and URL method) for each of the prebuilt models supported by the service, this results in two methods for business card, receipt, identity document, and invoice models, along with a pair of methods for recognizing custom documents and for recognizing content/layout.
- Along with more consolidated analysis methods in the `DocumentAnalysisClient`, the return types have also been improved and remove the hierarchical dependencies between elements. An instance of the `AnalyzeResult` model is now returned which moves important document elements to the top level of the returned model.
catalinaperalta marked this conversation as resolved.
Show resolved Hide resolved
- The `include_field_elements` kwarg does not exist with the `DocumentAnalysisClient`, text details are automatically included with service version `2021-09-30-preview`.
Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This is repeated down below with custom models, I dont like that it's repetitive but I also felt it was important to let the user know in both places 😆 let me know which one I should keep

catalinaperalta marked this conversation as resolved.
Show resolved Hide resolved
catalinaperalta marked this conversation as resolved.
Show resolved Hide resolved
catalinaperalta marked this conversation as resolved.
Show resolved Hide resolved
catalinaperalta marked this conversation as resolved.
Show resolved Hide resolved

Analyzing prebuilt models like business cards, identity documents, invoices and receipts with `3.1.x`:
```python
with open(path_to_sample_forms, "rb") as f:
poller = form_recognizer_client.begin_recognize_receipts(receipt=f, locale="en-US")
receipts = poller.result()

for idx, receipt in enumerate(receipts):
print("--------Recognizing receipt #{}--------".format(idx+1))
# process receipt
catalinaperalta marked this conversation as resolved.
Show resolved Hide resolved
```

Analyzing prebuilt models like business cards, identity documents, invoices and receipts with `3.2.x`:
```python
with open(path_to_sample_documents, "rb") as f:
poller = document_analysis_client.begin_analyze_document(
"prebuilt-receipt", document=f, locale="en-US"
)
receipts = poller.result()

for idx, receipt in enumerate(receipts.documents):
print("--------Recognizing receipt #{}--------".format(idx + 1))
# process receipt
```

Analyzing document content with `3.1.x`:
```python
with open(path_to_sample_forms, "rb") as f:
poller = form_recognizer_client.begin_recognize_content(form=f)
kristapratico marked this conversation as resolved.
Show resolved Hide resolved
form_pages = poller.result()

for idx, content in enumerate(form_pages):
print("----Recognizing content from page #{}----".format(idx+1))
# process document layout by parsing content in pages
for table_idx, table in enumerate(content.tables):
catalinaperalta marked this conversation as resolved.
Show resolved Hide resolved
# process document tables

for line_idx, line in enumerate(content.lines):
# process lines
for word in line.words:
# process words in a line

for selection_mark in content.selection_marks:
# process selection marks that were found
```


Analyzing document layout with `3.2.x`:
```python
with open(path_to_sample_documents, "rb") as f:
poller = document_analysis_client.begin_analyze_document(
"prebuilt-layout", document=f
)
result = poller.result()

for idx, style in enumerate(result.styles):
# process document styles

for page in result.pages:
# process pages
for line in page.lines:
# process lines
for word in page.words:
# process words

for table_idx, table in enumerate(result.tables):
# process tables found in document
```

Analyzing general document types with `3.2.x`:
```python
with open(path_to_sample_documents, "rb") as f:
poller = document_analysis_client.begin_analyze_document(
"prebuilt-document", document=f
)
result = poller.result()

for entity in result.entities:
# process entities found in document

for kv_pair in result.key_value_pairs:
# process key-value pairs found in document

for page in result.pages:
# process pages
for line in page.lines:
# process lines
for word in page.words:
# process words

for table_idx, table in enumerate(result.tables):
# process tables found in document
```

NOTE: All of these samples also work with `begin_analyze_document_from_url` when providing a valid URL to the document.
catalinaperalta marked this conversation as resolved.
Show resolved Hide resolved

### Analyzing a custom model

Differences between the versions:
- Analyzing a custom model with `DocumentAnalysisClient` uses the general `begin_analyze_document` and `begin_analyze_document_from_url` methods.
- In order to analyze a custom model with `FormRecognizerClient` the `begin_recognize_custom_models` and its corresponding URL methods are used.
catalinaperalta marked this conversation as resolved.
Show resolved Hide resolved
- The `include_field_elements` kwarg does not exist with the `DocumentAnalysisClient`, text details are automatically included with service version `2021-09-30-preview`.
catalinaperalta marked this conversation as resolved.
Show resolved Hide resolved

Analyze custom document with `3.1.x`:
```python
with open(path_to_sample_forms, "rb") as f:
poller = form_recognizer_client.begin_recognize_custom_forms(
model_id=model_id, form=f, include_field_elements=True
)
forms = poller.result()
```

Analyze custom document with `3.2.x`:
```python
with open(path_to_sample_documents, "rb") as f:
poller = document_analysis_client.begin_analyze_document(
model=model_id, document=f
)
result = poller.result()
catalinaperalta marked this conversation as resolved.
Show resolved Hide resolved
```

### Training a custom model
Copy link
Member Author

@catalinaperalta catalinaperalta Sep 30, 2021

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

FYI that I used training here since this guide is for users who are using the old version and moving to the new one. Within the text I do mention build, but this was on purpose.

Also, I'm aware that I used analyzing above (I felt that one was not too far a leap from recognizing), but let me know if I should unify terms to those used in a certain version


Differences between the versions:
- Files for building a new model for version `3.2.x` can be created using the new labeling tool found [here](TODO).
catalinaperalta marked this conversation as resolved.
Show resolved Hide resolved
- In version `3.1.x` the `use_training_labels` kwarg was used to indicate whether custom labeling was when creating the new models.
catalinaperalta marked this conversation as resolved.
Show resolved Hide resolved
- In version `3.2.x` the `use_training_labels` kwargs is not supported since training must be carried out with labeled training documents. In order to extract key-value pairs from a document, please refer to the prebuilt model "prebuilt-document" which extracts entities, key-value pairs, and layout from a document.
catalinaperalta marked this conversation as resolved.
Show resolved Hide resolved

catalinaperalta marked this conversation as resolved.
Show resolved Hide resolved
Train a custom model with `3.1.x`:
```python
form_training_client = FormTrainingClient(endpoint, AzureKeyCredential(key), api_version="2.1")
catalinaperalta marked this conversation as resolved.
Show resolved Hide resolved
poller = form_training_client.begin_training(
container_sas_url, use_training_labels=True, model_name="mymodel"
)
model = poller.result()

# Custom model information
print("Model ID: {}".format(model.model_id))
print("Status: {}".format(model.status))
print("Model name: {}".format(model.model_name))
print("Is this a composed model?: {}".format(model.properties.is_composed_model))
print("Training started on: {}".format(model.training_started_on))
print("Training completed on: {}".format(model.training_completed_on))

print("Recognized fields:")
# looping through the submodels, which contains the fields they were trained on
# The labels are based on the ones you gave the training document.
catalinaperalta marked this conversation as resolved.
Show resolved Hide resolved
for submodel in model.submodels:
print("...The submodel has model ID: {}".format(submodel.model_id))
print("...The submodel with form type {} has an average accuracy '{}'".format(
submodel.form_type, submodel.accuracy
))
for name, field in submodel.fields.items():
print("...The model found the field '{}' with an accuracy of {}".format(
name, field.accuracy
))

# Training result information
for doc in model.training_documents:
print("Document name: {}".format(doc.name))
print("Document status: {}".format(doc.status))
print("Document page count: {}".format(doc.page_count))
print("Document errors: {}".format(doc.errors))
```

Train a custom model with `3.2.x`:
```python
document_model_admin_client = DocumentModelAdministrationClient(endpoint, AzureKeyCredential(key))
poller = document_model_admin_client.begin_build_model(
container_sas_url, description="my model description"
catalinaperalta marked this conversation as resolved.
Show resolved Hide resolved
)
model = poller.result()

print("Model ID: {}".format(model.model_id))
print("Description: {}".format(model.description))
print("Model created on: {}\n".format(model.created_on))
print("Doc types the model can recognize:")
for name, doc_type in model.doc_types.items():
print("\nDoc Type: '{}' which has the following fields:".format(name))
for field_name, field in doc_type.field_schema.items():
print("Field: '{}' has type '{}' and confidence score {}".format(
field_name, field["type"], doc_type.field_confidence[field_name]
))
catalinaperalta marked this conversation as resolved.
Show resolved Hide resolved
```

catalinaperalta marked this conversation as resolved.
Show resolved Hide resolved
### Other differences
catalinaperalta marked this conversation as resolved.
Show resolved Hide resolved

- The service has introduced the concept of a `BoundingRegion` which groups elements or documents through the bounding boxes in which they appear per page.
catalinaperalta marked this conversation as resolved.
Show resolved Hide resolved

## Additional samples

For additional samples please take a look at the [Form Recognizer Samples](Samples-README) for more guidance.

[README]: https://github.com/Azure/azure-sdk-for-python/blob/main/sdk/formrecognizer/azure-ai-formrecognizer/README.md
[Samples-README]: https://github.com/Azure/azure-sdk-for-python/blob/main/sdk/formrecognizer/azure-ai-formrecognizer/samples/README.md