Package version:

Class DocumentAnalysisClient

A client for interacting with the Form Recognizer service's analysis features.

Constructors

constructor

Methods

beginAnalyzeDocument beginAnalyzeDocumentFromUrl beginClassifyDocument beginClassifyDocumentFromUrl

Constructors

constructor

new DocumentAnalysisClient(endpoint: string, credential: TokenCredential, options?: DocumentAnalysisClientOptions): DocumentAnalysisClient
Create a DocumentAnalysisClient instance from a resource endpoint and a an Azure Identity TokenCredential.

See the @azure/identity package for more information about authenticating with Azure Active Directory.

Example:
```
import { DocumentAnalysisClient } from "@azure/ai-form-recognizer";
import { DefaultAzureCredential } from "@azure/identity";

const endpoint = "https://<resource name>.cognitiveservices.azure.com";
const credential = new DefaultAzureCredential();

const client = new DocumentAnalysisClient(endpoint, credential);
```
Parameters
- endpoint: string
  
  the endpoint URL of an Azure Cognitive Services instance
- credential: TokenCredential
  
  a TokenCredential instance from the @azure/identity package
- Optional options: DocumentAnalysisClientOptions
  
  optional settings for configuring all methods in the client
Returns DocumentAnalysisClient
- Defined in documentAnalysisClient.ts:88
new DocumentAnalysisClient(endpoint: string, credential: KeyCredential, options?: DocumentAnalysisClientOptions): DocumentAnalysisClient
Create a DocumentAnalysisClient instance from a resource endpoint and a static API key (KeyCredential),

Example:
```
import { DocumentAnalysisClient, AzureKeyCredential } from "@azure/ai-form-recognizer";

const endpoint = "https://<resource name>.cognitiveservices.azure.com";
const credential = new AzureKeyCredential("<api key>");

const client = new DocumentAnalysisClient(endpoint, credential);
```
Parameters
- endpoint: string
  
  the endpoint URL of an Azure Cognitive Services instance
- credential: KeyCredential
  
  a KeyCredential containing the Cognitive Services instance subscription key
- Optional options: DocumentAnalysisClientOptions
  
  optional settings for configuring all methods in the client
Returns DocumentAnalysisClient
- Defined in documentAnalysisClient.ts:111

Methods

beginAnalyzeDocument

beginAnalyzeDocument(modelId: string, document: any, options?: AnalyzeDocumentOptions<AnalyzeResult<AnalyzedDocument>>): Promise<PollerLike<DocumentAnalysisPollOperationState<Result>, Result>>

Extract data from an input using a model given by its unique ID.

This operation supports custom as well as prebuilt models. For example, to use the prebuilt invoice model, provide the model ID "prebuilt-invoice", or to use the simpler prebuilt layout model, provide the model ID "prebuilt-layout".

The fields produced in the AnalyzeResult depend on the model that is used for analysis, and the values in any extracted documents' fields depend on the document types in the model (if any) and their corresponding field schemas.

Examples

This method supports streamable request bodies (FormRecognizerRequestBody) such as Node.JS ReadableStream objects, browser Blobs, and ArrayBuffers. The contents of the body will be uploaded to the service for analysis.

import * as fs from "fs";

const file = fs.createReadStream("path/to/receipt.pdf");

// The model that is passed to the following function call determines the type of the eventual result. In the
// example, we will use the prebuilt receipt model, but you could use a custom model ID/name instead.
const poller = await client.beginAnalyzeDocument("prebuilt-receipt", file);

// The result is a long-running operation (poller), which must itself be polled until the operation completes
const {
  pages, // pages extracted from the document, which contain lines and words
  tables, // extracted tables, organized into cells that contain their contents
  styles, // text styles (ex. handwriting) that were observed in the document
  keyValuePairs, // extracted pairs of elements  (directed associations from one element in the input to another)
  entities, // extracted entities in the input's content, which are categorized (ex. "Location" or "Organization")
  documents // extracted documents (instances of one of the model's document types and its field schema)
} = await poller.pollUntilDone();

// Extract the fields of the first document. These fields constitute a receipt, because we used the receipt model
const [{ fields: receipt }] = documents;

// The fields correspond to the model's document types and their field schemas. Refer to the Form Recognizer
// documentation for information about the document types and field schemas within a model, or use the `getModel`
// operation to view this information programmatically.
console.log("The type of this receipt is:", receipt?.["ReceiptType"]?.value);

Returns

a long-running operation (poller) that will eventually produce an AnalyzeResult

Parameters

modelId: string

the unique ID (name) of the model within this client's resource
document: any

a FormRecognizerRequestBody that will be uploaded with the request
Optional options: AnalyzeDocumentOptions<AnalyzeResult<AnalyzedDocument>>

optional settings for the analysis operation and poller

Returns Promise<PollerLike<DocumentAnalysisPollOperationState<Result>, Result>>

beginAnalyzeDocument<Result>(model: DocumentModel<Result>, document: any, options?: AnalyzeDocumentOptions<Result>): Promise<PollerLike<DocumentAnalysisPollOperationState<Result>, Result>>

Extract data from an input using a model that has a known, strongly-typed document schema (a DocumentModel).

The fields produced in the AnalyzeResult depend on the model that is used for analysis. In TypeScript, the type of the result for this method overload is inferred from the type of the input DocumentModel.

Examples

If the input provided is a string, it will be treated as a URL to the location of a document to be analyzed. See the beginAnalyzeDocumentFromUrl method for more information. Use of that method is preferred when using URLs, and URL support is only provided in this method for backwards compatibility.

import * as fs from "fs";

// See the `prebuilt` folder in the SDK samples (http://aka.ms/azsdk/formrecognizer/js/samples) for examples of
// DocumentModels for known prebuilts.
import { PrebuiltReceiptModel } from "./prebuilt-receipt.ts";

const file = fs.createReadStream("path/to/receipt.pdf");

// The model that is passed to the following function call determines the type of the eventual result. In the
// example, we will use the prebuilt receipt model.
const poller = await client.beginAnalyzeDocument(PrebuiltReceiptModel, file);

// The result is a long-running operation (poller), which must itself be polled until the operation completes
const {
  pages, // pages extracted from the document, which contain lines and words
  tables, // extracted tables, organized into cells that contain their contents
  styles, // text styles (ex. handwriting) that were observed in the document
  keyValuePairs, // extracted pairs of elements  (directed associations from one element in the input to another)

  documents // extracted documents (instances of one of the model's document types and its field schema)
} = await poller.pollUntilDone();

// Extract the fields of the first document. These fields constitute a receipt, because we used the receipt model
const [{ fields: receipt }] = documents;

// Since we used the strongly-typed PrebuiltReceiptModel object instead of the "prebuilt-receipt" model ID
// string, the fields of the receipt are strongly-typed and have camelCase names (as opposed to PascalCase).
console.log("The type of this receipt is:", receipt.receiptType?.value);

Returns

a long-running operation (poller) that will eventually produce an AnalyzeResult with documents that have the result type associated with the input model

Type Parameters

Result

Parameters

model: DocumentModel<Result>

a DocumentModel representing the model to use for analysis and the expected output type
document: any

a FormRecognizerRequestBody that will be uploaded with the request
Optional options: AnalyzeDocumentOptions<Result>

optional settings for the analysis operation and poller

Returns Promise<PollerLike<DocumentAnalysisPollOperationState<Result>, Result>>

beginAnalyzeDocumentFromUrl

beginAnalyzeDocumentFromUrl(modelId: string, documentUrl: string, options?: AnalyzeDocumentOptions<AnalyzeResult<AnalyzedDocument>>): Promise<PollerLike<DocumentAnalysisPollOperationState<Result>, Result>>

Extract data from an input using a model given by its unique ID.

Examples

This method supports extracting data from a file at a given URL. The Form Recognizer service will attempt to download a file using the submitted URL, so the URL must be accessible from the public internet. For example, a SAS token can be used to grant read access to a blob in Azure Storage, and the service will use the SAS-encoded URL to request the file.

// the URL must be publicly accessible
const url = "<receipt document url>";

// The model that is passed to the following function call determines the type of the eventual result. In the
// example, we will use the prebuilt receipt model, but you could use a custom model ID/name instead.
const poller = await client.beginAnalyzeDocument("prebuilt-receipt", url);

// The result is a long-running operation (poller), which must itself be polled until the operation completes
const {
  pages, // pages extracted from the document, which contain lines and words
  tables, // extracted tables, organized into cells that contain their contents
  styles, // text styles (ex. handwriting) that were observed in the document
  keyValuePairs, // extracted pairs of elements  (directed associations from one element in the input to another)

  documents // extracted documents (instances of one of the model's document types and its field schema)
} = await poller.pollUntilDone();

// Extract the fields of the first document. These fields constitute a receipt, because we used the receipt model
const [{ fields: receipt }] = documents;

// The fields correspond to the model's document types and their field schemas. Refer to the Form Recognizer
// documentation for information about the document types and field schemas within a model, or use the `getModel`
// operation to view this information programmatically.
console.log("The type of this receipt is:", receipt?.["ReceiptType"]?.value);

Returns

a long-running operation (poller) that will eventually produce an AnalyzeResult

Parameters

modelId: string

the unique ID (name) of the model within this client's resource
documentUrl: string

a URL (string) to an input document accessible from the public internet
Optional options: AnalyzeDocumentOptions<AnalyzeResult<AnalyzedDocument>>

optional settings for the analysis operation and poller

Returns Promise<PollerLike<DocumentAnalysisPollOperationState<Result>, Result>>

beginAnalyzeDocumentFromUrl<Result>(model: DocumentModel<Result>, documentUrl: string, options?: AnalyzeDocumentOptions<Result>): Promise<PollerLike<DocumentAnalysisPollOperationState<Result>, Result>>

Extract data from an input using a model that has a known, strongly-typed document schema (a DocumentModel).

Examples

// See the `prebuilt` folder in the SDK samples (http://aka.ms/azsdk/formrecognizer/js/samples) for examples of
// DocumentModels for known prebuilts.
import { PrebuiltReceiptModel } from "./prebuilt-receipt.ts";

// the URL must be publicly accessible
const url = "<receipt document url>";

// The model that is passed to the following function call determines the type of the eventual result. In the
// example, we will use the prebuilt receipt model.
const poller = await client.beginAnalyzeDocument(PrebuiltReceiptModel, url);

// The result is a long-running operation (poller), which must itself be polled until the operation completes
const {
  pages, // pages extracted from the document, which contain lines and words
  tables, // extracted tables, organized into cells that contain their contents
  styles, // text styles (ex. handwriting) that were observed in the document
  keyValuePairs, // extracted pairs of elements  (directed associations from one element in the input to another)

  documents // extracted documents (instances of one of the model's document types and its field schema)
} = await poller.pollUntilDone();

// Extract the fields of the first document. These fields constitute a receipt, because we used the receipt model
const [{ fields: receipt }] = documents;

// Since we used the strongly-typed PrebuiltReceiptModel object instead of the "prebuilt-receipt" model ID
// string, the fields of the receipt are strongly-typed and have camelCase names (as opposed to PascalCase).
console.log("The type of this receipt is:", receipt.receiptType?.value);

Returns

a long-running operation (poller) that will eventually produce an AnalyzeResult

Type Parameters

Result

Parameters

model: DocumentModel<Result>

a DocumentModel representing the model to use for analysis and the expected output type
documentUrl: string

a URL (string) to an input document accessible from the public internet
Optional options: AnalyzeDocumentOptions<Result>

optional settings for the analysis operation and poller

Returns Promise<PollerLike<DocumentAnalysisPollOperationState<Result>, Result>>

beginClassifyDocument

beginClassifyDocument(classifierId: string, document: any, options?: ClassifyDocumentOptions): Promise<PollerLike<DocumentAnalysisPollOperationState<Result>, Result>>
Classify a document using a custom classifier given by its ID.

This method produces a long-running operation (poller) that will eventually produce an AnalyzeResult. This is the same type as beginAnalyzeDocument and beginAnalyzeDocumentFromUrl, but the result will only contain a small subset of its fields. Only the documents field and pages field will be populated, and only minimal page information will be returned. The documents field will contain information about all the identified documents and the docType that they were classified as.

Example

This method supports streamable request bodies (FormRecognizerRequestBody) such as Node.JS ReadableStream objects, browser Blobs, and ArrayBuffers. The contents of the body will be uploaded to the service for analysis.
```
import * as fs from "fs";

const file = fs.createReadStream("path/to/file.pdf");

const poller = await client.beginClassifyDocument("<classifier ID>", file);

// The result is a long-running operation (poller), which must itself be polled until the operation completes
const {
  pages, // pages extracted from the document, which contain only basic information for classifiers
  documents // extracted documents and their types
} = await poller.pollUntilDone();

// We'll print the documents and their types
for (const { docType } of documents) {
  console.log("The type of this document is:", docType);
}
```
Returns
a long-running operation (poller) that will eventually produce an AnalyzeResult
Parameters
- classifierId: string
  
  the ID of the custom classifier to use for analysis
- document: any
  
  the document to classify
- options: ClassifyDocumentOptions = {}
  
  options for the classification operation
Returns Promise<PollerLike<DocumentAnalysisPollOperationState<Result>, Result>>
- Defined in documentAnalysisClient.ts:480

beginClassifyDocumentFromUrl

beginClassifyDocumentFromUrl(classifierId: string, documentUrl: string, options?: ClassifyDocumentOptions): Promise<PollerLike<DocumentAnalysisPollOperationState<Result>, Result>>
Classify a document from a URL using a custom classifier given by its ID.

This method produces a long-running operation (poller) that will eventually produce an AnalyzeResult. This is the same type as beginAnalyzeDocument and beginAnalyzeDocumentFromUrl, but the result will only contain a small subset of its fields. Only the documents field and pages field will be populated, and only minimal page information will be returned. The documents field will contain information about all the identified documents and the docType that they were classified as.

Example

This method supports extracting data from a file at a given URL. The Form Recognizer service will attempt to download a file using the submitted URL, so the URL must be accessible from the public internet. For example, a SAS token can be used to grant read access to a blob in Azure Storage, and the service will use the SAS-encoded URL to request the file.
```
// the URL must be publicly accessible
const url = "<file url>";

const poller = await client.beginClassifyDocument("<classifier ID>", url);

// The result is a long-running operation (poller), which must itself be polled until the operation completes
const {
  pages, // pages extracted from the document, which contain only basic information for classifiers
  documents // extracted documents and their types
} = await poller.pollUntilDone();

// We'll print the documents and their types
for (const { docType } of documents) {
  console.log("The type of this document is:", docType);
}
```
Returns
Parameters
- classifierId: string
  
  the ID of the custom classifier to use for analysis
- documentUrl: string
  
  the URL of the document to classify
- options: ClassifyDocumentOptions = {}
Returns Promise<PollerLike<DocumentAnalysisPollOperationState<Result>, Result>>
- Defined in documentAnalysisClient.ts:531

Class DocumentAnalysisClient

Examples:

Azure Active Directory

API Key (Subscription Key)

Hierarchy

Index

Constructors

Methods

Constructors

constructor

Example:

Parameters

endpoint: string

credential: TokenCredential

Optional options: DocumentAnalysisClientOptions

Returns DocumentAnalysisClient

Example:

Parameters

endpoint: string

credential: KeyCredential

Optional options: DocumentAnalysisClientOptions

Returns DocumentAnalysisClient

Methods

beginAnalyzeDocument

Examples

Returns

Parameters

modelId: string

document: any

Optional options: AnalyzeDocumentOptions<AnalyzeResult<AnalyzedDocument>>

Returns Promise<PollerLike<DocumentAnalysisPollOperationState<Result>, Result>>

Examples

Returns

Type Parameters

Result

Parameters

model: DocumentModel<Result>

document: any

Optional options: AnalyzeDocumentOptions<Result>

Returns Promise<PollerLike<DocumentAnalysisPollOperationState<Result>, Result>>

beginAnalyzeDocumentFromUrl

Examples

Returns

Parameters

modelId: string

documentUrl: string

Optional options: AnalyzeDocumentOptions<AnalyzeResult<AnalyzedDocument>>

Returns Promise<PollerLike<DocumentAnalysisPollOperationState<Result>, Result>>

Examples

Returns

Type Parameters

Result

Parameters

model: DocumentModel<Result>

documentUrl: string

Optional options: AnalyzeDocumentOptions<Result>

Returns Promise<PollerLike<DocumentAnalysisPollOperationState<Result>, Result>>

beginClassifyDocument

Example

Returns

Parameters

classifierId: string

document: any

options: ClassifyDocumentOptions = {}

Returns Promise<PollerLike<DocumentAnalysisPollOperationState<Result>, Result>>

beginClassifyDocumentFromUrl

Example

Returns

Parameters

classifierId: string

documentUrl: string

options: ClassifyDocumentOptions = {}

Returns Promise<PollerLike<DocumentAnalysisPollOperationState<Result>, Result>>

Settings

Member Visibility

Theme

`Optional` options: DocumentAnalysisClientOptions

`Optional` options: DocumentAnalysisClientOptions

`Optional` options: AnalyzeDocumentOptions<AnalyzeResult<AnalyzedDocument>>

`Optional` options: AnalyzeDocumentOptions<Result>

`Optional` options: AnalyzeDocumentOptions<AnalyzeResult<AnalyzedDocument>>

`Optional` options: AnalyzeDocumentOptions<Result>