Azure Cognitive Search client library for Python ================================================ Azure Cognitive Search is a fully managed cloud search service that provides a rich search experience to custom applications. `Source code `_ | `Package (PyPI) `_ | `API reference documentation `_ | `Product documentation `_ | `Samples `_ Getting started --------------- Install the package ^^^^^^^^^^^^^^^^^^^ Install the Azure Cognitive Search client library for Python with `pip `_\ : .. code-block:: bash pip install azure-search-documents --pre Prerequisites ^^^^^^^^^^^^^ * Python 2.7, or 3.5 or later is required to use this package. * You must have an `Azure subscription `_ and an existing `Azure Cognitive Search service `_ to use this package. If you need to create the resource, you can use the `Azure portal `_\ , `Azure PowerShell `_\ , or the `Azure CLI `_. If you use the Azure CLI, replace ```` and ```` with your own unique names: .. code-block:: PowerShell az search service create --resource-group --name --sku Standard The above creates a resource with the "Standard" pricing tier. See `choosing a pricing tier `_ for more information. Authenticate the client ^^^^^^^^^^^^^^^^^^^^^^^ In order to interact with the Cognitive Search service you'll need to create an instance of the Search Client class. To make this possible you will need an `api-key of the Cognitive Search service `_. The SDK provides three clients. #. SearchClient for all document operations. #. SearchIndexClient for all CRUD operations on index resources. #. SearchIndexerClient for all CRUD operations on indexer resources. Create a SearchClient ~~~~~~~~~~~~~~~~~~~~~ To create a SearchClient, you will need an existing index name as well as the values of the Cognitive Search Service `service endpoint `_ and `api key `_. Note that you will need an admin key to index documents (query keys only work for queries). .. code-block:: python from azure.core.credentials import AzureKeyCredential from azure.search.documents import SearchClient credential = AzureKeyCredential("") client = SearchClient(endpoint="", index_name="", credential=credential) Create a SearchIndexClient ~~~~~~~~~~~~~~~~~~~~~~~~~~ Once you have the values of the Cognitive Search Service `service endpoint `_ and `api key `_ you can create the Search Index client: .. code-block:: python from azure.core.credentials import AzureKeyCredential from azure.search.documents.indexes import SearchIndexClient credential = AzureKeyCredential("") client = SearchIndexClient(endpoint="", credential=credential) Create a SearchIndexerClient ~~~~~~~~~~~~~~~~~~~~~~~~~~~~ Once you have the values of the Cognitive Search Service `service endpoint `_ and `api key `_ you can create the Search Indexer client: .. code-block:: python from azure.core.credentials import AzureKeyCredential from azure.search.documents.indexes import SearchIndexerClient credential = AzureKeyCredential("") client = SearchIndexerClient(endpoint="", credential=credential) Send your first search request ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ You can use the ``SearchClient`` you created in the first section above to make a basic search request: .. code-block:: python results = client.search(search_text="spa") print("Hotels containing 'spa' in the name (or other fields):") for result in results: print(" Name: {} (rating {})".format(result["HotelName"], result["Rating"])) Key concepts ------------ Azure Cognitive Search has the concepts of search services and indexes and documents, where a search service contains one or more indexes that provides persistent storage of searchable data, and data is loaded in the form of JSON documents. Data can be pushed to an index from an external data source, but if you use an indexer, it's possible to crawl a data source to extract and load data into an index. There are several types of operations that can be executed against the service: * **Index management operations** Create, delete, update, or configure a search index. (\ `API Reference `_\ , `Service Docs `_\ ) * **Document operations** Add, update, or delete documents in the index, query the index, or look up specific documents by ID. (\ `API Reference `_\ , `Service Docs `_\ ) * **Datasource operations** Create, delete, update, or configure data sources for Search Indexers (\ `API Reference `_\ , `Service Docs `_\ ) * **Indexer operations** Automate aspects of an indexing operation by configuring a data source and an indexer that you can schedule or run on demand. This feature is supported for a limited number of data source types. (\ `API Reference `_\ , `Service Docs `_\ ) * **Skillset operations** Part of a cognitive search workload, a skillset defines a series of a series of enrichment processing steps. A skillset is consumed by an indexer. (\ `API Reference `_\ , `Service Docs `_\ ) * **Synonym map operations** A synonym map is a service-level resource that contains user-defined synonyms. This resource is maintained independently from search indexes. Once uploaded, you can point any searchable field to the synonym map (one per field). (\ `API Reference `_\ , `Service Docs `_\ ) Examples -------- The following sections contain snippets for some common operations: * `Perform a simple text search <#perform-a-simple-text-search-on-documents>`_ * `Retrieve a specific document <#retrieve-a-specific-document-from-an-index>`_ * `Get search suggestions <#get-search-suggestions>`_ * `Create an index <#create-an-index>`_ * `Upload documents to an index <#upload-documents-to-an-index>`_ More examples, covering topics such as indexers, skillets, and synonym maps can be found in the `Samples directory `_. Perform a simple text search on documents ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ Search the entire index or documents matching a simple search text, e.g. find hotels with the text "spa": .. code-block:: python from azure.core.credentials import AzureKeyCredential from azure.search.documents import SearchClient client = SearchClient("", "", AzureKeyCredential("")) results = client.search(search_text="spa") print("Hotels containing 'spa' in the name (or other fields):") for result in results: print(" Name: {} (rating {})".format(result["HotelName"], result["Rating"])) Retrieve a specific document from an index ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ Get a specific document from the index, e.f. obtain the document for hotel "23": .. code-block:: python from azure.core.credentials import AzureKeyCredential from azure.search.documents import SearchClient client = SearchClient("", "", AzureKeyCredential("")) result = client.get_document(key="23") print("Details for hotel '23' are:") print(" Name: {}".format(result["HotelName"])) print(" Rating: {}".format(result["Rating"])) print(" Category: {}".format(result["Category"])) Get search suggestions ^^^^^^^^^^^^^^^^^^^^^^ Get search suggestions for related terms, e.g. find search suggestions for the term "coffee": .. code-block:: python from azure.core.credentials import AzureKeyCredential from azure.search.documents import SearchClient client = SearchClient("", "", AzureKeyCredential("")) results = client.suggest(search_text="coffee", suggester_name="sg") print("Search suggestions for 'coffee'") for result in results: hotel = client.get_document(key=result["HotelId"]) print(" Text: {} for Hotel: {}".format(repr(result["text"]), hotel["HotelName"])) Create an index ^^^^^^^^^^^^^^^ .. code-block:: python from azure.core.credentials import AzureKeyCredential from azure.search.documents.indexes import SearchIndexClient, CorsOptions, SearchIndex, ScoringProfile client = SearchIndexClient("", AzureKeyCredential("")) name = "hotels" fields = [ SimpleField(name="hotelId", type=SearchFieldDataType.String, key=True), SimpleField(name="baseRate", type=SearchFieldDataType.Double), SearchableField(name="description", type=SearchFieldDataType.String), ComplexField(name="address", fields=[ SimpleField(name="streetAddress", type=SearchFieldDataType.String), SimpleField(name="city", type=SearchFieldDataType.String), ]) ] cors_options = CorsOptions(allowed_origins=["*"], max_age_in_seconds=60) scoring_profiles = [] index = SearchIndex( name=name, fields=fields, scoring_profiles=scoring_profiles, cors_options=cors_options) result = client.create_index(index) Upload documents to an index ^^^^^^^^^^^^^^^^^^^^^^^^^^^^ Add documents (or update existing ones), e.g add a new document for a new hotel: .. code-block:: python from azure.core.credentials import AzureKeyCredential from azure.search.documents import SearchClient client = SearchClient("", "", AzureKeyCredential("")) DOCUMENT = { 'Category': 'Hotel', 'HotelId': '1000', 'Rating': 4.0, 'Rooms': [], 'HotelName': 'Azure Inn', } result = client.upload_documents(documents=[DOCUMENT]) print("Upload of new document succeeded: {}".format(result[0].succeeded)) Troubleshooting --------------- General ^^^^^^^ The Azure Cognitive Search client will raise exceptions defined in `Azure Core `_. Logging ^^^^^^^ This library uses the standard `logging `_ library for logging. Basic information about HTTP sessions (URLs, headers, etc.) is logged at INFO level. Detailed DEBUG level logging, including request/response bodies and unredacted headers, can be enabled on a client with the ``logging_enable`` keyword argument: .. code-block:: python import sys import logging from azure.core.credentials import AzureKeyCredential from azure.search.documents import SearchClient # Create a logger for the 'azure' SDK logger = logging.getLogger('azure') logger.setLevel(logging.DEBUG) # Configure a console output handler = logging.StreamHandler(stream=sys.stdout) logger.addHandler(handler) # This client will log detailed information about its HTTP sessions, at DEBUG level client = SearchClient("", "", AzureKeyCredential(""), logging_enable=True) Similarly, ``logging_enable`` can enable detailed logging for a single operation, even when it isn't enabled for the client: .. code-block:: python result = client.search(search_text="spa", logging_enable=True) Next steps ---------- Additional documentation ^^^^^^^^^^^^^^^^^^^^^^^^ For more extensive documentation on Cognitive Search, see the `Azure Cognitive Search documentation `_ on docs.microsoft.com. Contributing ------------ This project welcomes contributions and suggestions. Most contributions require you to agree to a Contributor License Agreement (CLA) declaring that you have the right to, and actually do, grant us the rights to use your contribution. For details, visit `cla.microsoft.com `_. When you submit a pull request, a CLA-bot will automatically determine whether you need to provide a CLA and decorate the PR appropriately (e.g., label, comment). Simply follow the instructions provided by the bot. You will only need to do this once across all repos using our CLA. This project has adopted the `Microsoft Open Source Code of Conduct `_. For more information see the `Code of Conduct FAQ `_ or contact `opencode@microsoft.com `_ with any additional questions or comments. Related projects ---------------- * `Microsoft Azure SDK for Python `_ .. raw:: html .. image:: https://azure-sdk-impressions.azurewebsites.net/api/impressions/azure-sdk-for-python%2Fsdk%2Fsearch%2Fazure-search-documents%2FREADME.png :target: https://azure-sdk-impressions.azurewebsites.net/api/impressions/azure-sdk-for-python%2Fsdk%2Fsearch%2Fazure-search-documents%2FREADME.png :alt: Impressions Indices and tables ------------------ * :ref:`genindex` * :ref:`modindex` * :ref:`search` .. toctree:: :maxdepth: 5 :glob: :caption: Developer Documentation azure.search.documents.rst