Azure Cognitive Search indexers allow you to ingest data from many new data sources

Published 05-25-2021 10:42 AM 1,997 Views
Microsoft

An indexer in Azure Cognitive Search is a crawler that extracts searchable text and metadata from a data source and populates a search index using field-to-field mappings between source data and your index. This approach is sometimes referred to as a 'pull model' because the service pulls data in without you having to write any code that adds data to an index. Indexers also drive the AI enrichment capabilities of Cognitive Search, integrating external processing of content en route to an index. Previously, indexers mostly just supported Azure data sources like Azure blobs and Azure SQL.

 

Today we’re excited to announce the following updates related to data source support!

 

New preview indexers

  • Amazon Redshift (Powered by Power Query)
  • Cosmos DB Gremlin API
  • Elasticsearch (Powered by Power Query)
  • MySQL
  • PostgreSQL (Powered by Power Query)
  • Salesforce Objects (Powered by Power Query)
  • Salesforce Reports (Powered by Power Query)
  • SharePoint Online
  • Smartsheet (Powered by Power Query)
  • Snowflake (Powered by Power Query)

GA indexers

  • Azure Data Lake Storage Gen2

 

Power Query Connectors

Power Query is a data transformation and data preparation engine with the ability to pull data from many different data sources. Power Query connectors are used in products like Power BI and Excel. Azure Cognitive Search has added support for select Power Query data connectors so that you can pull data from more data sources using the familiar indexer pipeline.

 

You can use the select Power Query connectors just like you would use any other indexer. The Power Query connectors integrated into Azure Cognitive Search support change tracking, skillsets, field mappings, and many of the other features that indexers provide. They also support transformations.

 

These optional transformations can be used to manipulate your data before pulling it into an Azure Cognitive Search index. They can be as simple as removing a column or filtering rows or as advanced as adding your own M script.

 

Mark_Heffernan_0-1621889796443.png

 

To learn more about how to pull data from your data source using one of the new Power Query indexers, view the following tutorial:

 


 

SharePoint Online Indexer

The SharePoint Online indexer allows you to pull content from one or more SharePoint Online document libraries and index that content into an Azure Cognitive Search index. It supports many different file formats including the Office file formats. It also supports change detection that will by default identify which documents in your document library have been updated, added, or deleted. This means that after the initial ingestion of content from your document library, the indexer will only process content that has been updated, added, or deleted from your document library.

 

To learn more about how to pull data from your SharePoint Online document library, view the following tutorial:

 

 

Getting started

To get started with the new preview indexers, sign up using the below form:

https://aka.ms/azure-cognitive-search/indexer-preview

 

For more information, see our documentation at:

 

%3CLINGO-SUB%20id%3D%22lingo-sub-2381988%22%20slang%3D%22en-US%22%3EAzure%20Cognitive%20Search%20indexers%20allow%20you%20to%20ingest%20data%20from%20many%20new%20data%20sources%3C%2FLINGO-SUB%3E%3CLINGO-BODY%20id%3D%22lingo-body-2381988%22%20slang%3D%22en-US%22%3E%3CP%20data-unlink%3D%22true%22%3EAn%26nbsp%3B%3CA%20href%3D%22https%3A%2F%2Fdocs.microsoft.com%2Fazure%2Fsearch%2Fsearch-indexer-overview%22%20target%3D%22_self%22%20rel%3D%22noopener%20noreferrer%22%3Eindexer%3C%2FA%3E%26nbsp%3Bin%20%3CA%20href%3D%22https%3A%2F%2Fdocs.microsoft.com%2Fazure%2Fsearch%2Fsearch-what-is-azure-search%22%20target%3D%22_self%22%20rel%3D%22noopener%20noreferrer%22%3EAzure%20Cognitive%20Search%3C%2FA%3E%20is%20a%20crawler%20that%20extracts%20searchable%20text%20and%20metadata%20from%20a%20data%20source%20and%20populates%20a%20search%20index%20using%20field-to-field%20mappings%20between%20source%20data%20and%20your%20index.%20This%20approach%20is%20sometimes%20referred%20to%20as%20a%20'pull%20model'%20because%20the%20service%20pulls%20data%20in%20without%20you%20having%20to%20write%20any%20code%20that%20adds%20data%20to%20an%20index.%20Indexers%20also%20drive%20the%26nbsp%3B%3CA%20href%3D%22https%3A%2F%2Fdocs.microsoft.com%2Fazure%2Fsearch%2Fcognitive-search-concept-intro%22%20target%3D%22_blank%22%20rel%3D%22noopener%20noreferrer%22%3EAI%20enrichment%26nbsp%3Bcapabilities%3C%2FA%3E%20of%20Cognitive%20Search%2C%20integrating%20external%20processing%20of%20content%20en%20route%20to%20an%20index.%20Previously%2C%20indexers%20mostly%20just%20supported%20Azure%20data%20sources%20like%20Azure%20blobs%20and%20Azure%20SQL.%3C%2FP%3E%0A%3CP%3E%26nbsp%3B%3C%2FP%3E%0A%3CP%3E%3CSTRONG%3EToday%20we%E2%80%99re%20excited%20to%20announce%20the%20following%20updates%20related%20to%20data%20source%20support!%3C%2FSTRONG%3E%3C%2FP%3E%0A%3CP%3E%26nbsp%3B%3C%2FP%3E%0A%3CP%3ENew%20preview%20indexers%3C%2FP%3E%0A%3CUL%3E%0A%3CLI%3EAmazon%20Redshift%20(Powered%20by%20Power%20Query)%3C%2FLI%3E%0A%3CLI%3ECosmos%20DB%20Gremlin%20API%3C%2FLI%3E%0A%3CLI%3EElasticsearch%20(Powered%20by%20Power%20Query)%3C%2FLI%3E%0A%3CLI%3EMySQL%3C%2FLI%3E%0A%3CLI%3EPostgreSQL%20(Powered%20by%20Power%20Query)%3C%2FLI%3E%0A%3CLI%3ESalesforce%20Objects%20(Powered%20by%20Power%20Query)%3C%2FLI%3E%0A%3CLI%3ESalesforce%20Reports%20(Powered%20by%20Power%20Query)%3C%2FLI%3E%0A%3CLI%3ESharePoint%20Online%3C%2FLI%3E%0A%3CLI%3ESmartsheet%20(Powered%20by%20Power%20Query)%3C%2FLI%3E%0A%3CLI%3ESnowflake%20(Powered%20by%20Power%20Query)%3C%2FLI%3E%0A%3C%2FUL%3E%0A%3CP%3EGA%20indexers%3C%2FP%3E%0A%3CUL%3E%0A%3CLI%3EAzure%20Data%20Lake%20Storage%20Gen2%3C%2FLI%3E%0A%3C%2FUL%3E%0A%3CP%3E%26nbsp%3B%3C%2FP%3E%0A%3CH1%20id%3D%22toc-hId-1332762972%22%20id%3D%22toc-hId-1332673822%22%3EPower%20Query%20Connectors%3C%2FH1%3E%0A%3CP%20data-unlink%3D%22true%22%3EPower%20Query%26nbsp%3Bis%20a%20data%20transformation%20and%20data%20preparation%20engine%20with%20the%20ability%20to%20pull%20data%20from%20many%20different%20data%20sources.%20Power%20Query%20connectors%20are%20used%20in%20products%20like%20Power%20BI%20and%20Excel.%20Azure%20Cognitive%20Search%20has%20%3CA%20href%3D%22https%3A%2F%2Fdocs.microsoft.com%2Fazure%2Fsearch%2Fsearch-how-to-index-power-query-data-sources%22%20target%3D%22_self%22%20rel%3D%22noopener%20noreferrer%22%3Eadded%20support%20for%20select%20Power%20Query%20data%20connectors%3C%2FA%3E%20so%20that%20you%20can%20pull%20data%20from%20more%20data%20sources%20using%20the%20familiar%20indexer%20pipeline.%3C%2FP%3E%0A%3CP%3E%26nbsp%3B%3C%2FP%3E%0A%3CP%3EYou%20can%20use%20the%20select%20Power%20Query%20connectors%20just%20like%20you%20would%20use%20any%20other%20indexer.%20The%20Power%20Query%20connectors%20integrated%20into%20Azure%20Cognitive%20Search%20support%20change%20tracking%2C%20skillsets%2C%20field%20mappings%2C%20and%20many%20of%20the%20other%20features%20that%20indexers%20provide.%20They%20also%20support%20transformations.%3C%2FP%3E%0A%3CP%3E%26nbsp%3B%3C%2FP%3E%0A%3CP%3EThese%20optional%20transformations%20can%20be%20used%20to%20manipulate%20your%20data%20before%20pulling%20it%20into%20an%20Azure%20Cognitive%20Search%20index.%20They%20can%20be%20as%20simple%20as%20removing%20a%20column%20or%20filtering%20rows%20or%20as%20advanced%20as%20adding%20your%20own%20M%20script.%3C%2FP%3E%0A%3CP%3E%26nbsp%3B%3C%2FP%3E%0A%3CP%3E%3CSPAN%20class%3D%22lia-inline-image-display-wrapper%20lia-image-align-inline%22%20image-alt%3D%22Mark_Heffernan_0-1621889796443.png%22%20style%3D%22width%3A%20932px%3B%22%3E%3CIMG%20src%3D%22https%3A%2F%2Ftechcommunity.microsoft.com%2Ft5%2Fimage%2Fserverpage%2Fimage-id%2F283222i5804E8CCD321A28D%2Fimage-dimensions%2F932x436%3Fv%3Dv2%22%20width%3D%22932%22%20height%3D%22436%22%20role%3D%22button%22%20title%3D%22Mark_Heffernan_0-1621889796443.png%22%20alt%3D%22Mark_Heffernan_0-1621889796443.png%22%20%2F%3E%3C%2FSPAN%3E%3C%2FP%3E%0A%3CP%3E%26nbsp%3B%3C%2FP%3E%0A%3CP%3ETo%20learn%20more%20about%20how%20to%20pull%20data%20from%20your%20data%20source%20using%20one%20of%20the%20new%20Power%20Query%20indexers%2C%20view%20the%20following%20tutorial%3A%3C%2FP%3E%0A%3CP%3E%26nbsp%3B%3C%2FP%3E%0A%3CP%3E%3CEM%3E%3C%2FEM%3E%3C%2FP%3E%3CDIV%20class%3D%22video-embed-center%20video-embed%22%3E%3CIFRAME%20class%3D%22embedly-embed%22%20src%3D%22https%3A%2F%2Fcdn.embedly.com%2Fwidgets%2Fmedia.html%3Fsrc%3Dhttps%253A%252F%252Fwww.youtube.com%252Fembed%252Fuy-l4xFX1EE%253Ffeature%253Doembed%26amp%3Bdisplay_name%3DYouTube%26amp%3Burl%3Dhttps%253A%252F%252Fwww.youtube.com%252Fwatch%253Fv%253Duy-l4xFX1EE%26amp%3Bimage%3Dhttps%253A%252F%252Fi.ytimg.com%252Fvi%252Fuy-l4xFX1EE%252Fhqdefault.jpg%26amp%3Bkey%3Db0d40caa4f094c68be7c29880b16f56e%26amp%3Btype%3Dtext%252Fhtml%26amp%3Bschema%3Dyoutube%22%20width%3D%22400%22%20height%3D%22225%22%20scrolling%3D%22no%22%20title%3D%22YouTube%20embed%22%20frameborder%3D%220%22%20allow%3D%22autoplay%3B%20fullscreen%22%20allowfullscreen%3D%22true%22%3E%3C%2FIFRAME%3E%3C%2FDIV%3E%3CBR%20%2F%3E%3CP%3E%3C%2FP%3E%0A%3CP%3E%26nbsp%3B%3C%2FP%3E%0A%3CH1%20id%3D%22toc-hId--474691491%22%20id%3D%22toc-hId--474780641%22%3ESharePoint%20Online%20Indexer%3C%2FH1%3E%0A%3CP%3EThe%20%3CA%20href%3D%22https%3A%2F%2Fdocs.microsoft.com%2Fazure%2Fsearch%2Fsearch-howto-index-sharepoint-online%22%20target%3D%22_self%22%20rel%3D%22noopener%20noreferrer%22%3ESharePoint%20Online%20indexer%3C%2FA%3E%20allows%20you%20to%20pull%20content%20from%20one%20or%20more%20SharePoint%20Online%20document%20libraries%20and%20index%20that%20content%20into%20an%20Azure%20Cognitive%20Search%20index.%20It%20supports%20many%20different%20file%20formats%20including%20the%20Office%20file%20formats.%20It%20also%20supports%20change%20detection%20that%20will%20by%20default%20identify%20which%20documents%20in%20your%20document%20library%20have%20been%20updated%2C%20added%2C%20or%20deleted.%20This%20means%20that%20after%20the%20initial%20ingestion%20of%20content%20from%20your%20document%20library%2C%20the%20indexer%20will%20only%20process%20content%20that%20has%20been%20updated%2C%20added%2C%20or%20deleted%20from%20your%20document%20library.%3C%2FP%3E%0A%3CP%3E%26nbsp%3B%3C%2FP%3E%0A%3CP%3ETo%20learn%20more%20about%20how%20to%20pull%20data%20from%20your%20SharePoint%20Online%20document%20library%2C%20view%20the%20following%20tutorial%3A%3C%2FP%3E%0A%3CP%3E%26nbsp%3B%3C%2FP%3E%0A%3CP%3E%3C%2FP%3E%3CDIV%20class%3D%22video-embed-center%20video-embed%22%3E%3CIFRAME%20class%3D%22embedly-embed%22%20src%3D%22https%3A%2F%2Fcdn.embedly.com%2Fwidgets%2Fmedia.html%3Fsrc%3Dhttps%253A%252F%252Fwww.youtube.com%252Fembed%252FQmG65Vgl0JI%253Ffeature%253Doembed%26amp%3Bdisplay_name%3DYouTube%26amp%3Burl%3Dhttps%253A%252F%252Fwww.youtube.com%252Fwatch%253Fv%253DQmG65Vgl0JI%26amp%3Bimage%3Dhttps%253A%252F%252Fi.ytimg.com%252Fvi%252FQmG65Vgl0JI%252Fhqdefault.jpg%26amp%3Bkey%3Dfad07bfa4bd747d3bdea27e17b533c0e%26amp%3Btype%3Dtext%252Fhtml%26amp%3Bschema%3Dyoutube%22%20width%3D%22400%22%20height%3D%22225%22%20scrolling%3D%22no%22%20title%3D%22YouTube%20embed%22%20frameborder%3D%220%22%20allow%3D%22autoplay%3B%20fullscreen%22%20allowfullscreen%3D%22true%22%3E%3C%2FIFRAME%3E%3C%2FDIV%3E%3CP%3E%3C%2FP%3E%0A%3CP%3E%3CEM%3E%26nbsp%3B%3C%2FEM%3E%3C%2FP%3E%0A%3CH1%20id%3D%22toc-hId-2012821342%22%20id%3D%22toc-hId-2012732192%22%3EGetting%20started%3C%2FH1%3E%0A%3CP%3ETo%20get%20started%20with%20the%20new%20preview%20indexers%2C%20sign%20up%20using%20the%20below%20form%3A%3C%2FP%3E%0A%3CP%3E%3CA%20href%3D%22https%3A%2F%2Faka.ms%2Fazure-cognitive-search%2Findexer-preview%22%20target%3D%22_blank%22%20rel%3D%22noopener%20noreferrer%22%3Ehttps%3A%2F%2Faka.ms%2Fazure-cognitive-search%2Findexer-preview%3C%2FA%3E%3C%2FP%3E%0A%3CP%3E%26nbsp%3B%3C%2FP%3E%0A%3CP%3EFor%20more%20information%2C%20see%20our%20documentation%20at%3A%3C%2FP%3E%0A%3CUL%3E%0A%3CLI%3EPower%20Query%20connectors%3A%20%3CA%20href%3D%22https%3A%2F%2Faka.ms%2Fazs%2Fpowerqueryconnectors%22%20target%3D%22_blank%22%20rel%3D%22noopener%20noreferrer%22%3Ehttps%3A%2F%2Faka.ms%2Fazs%2Fpowerqueryconnectors%3C%2FA%3E%3C%2FLI%3E%0A%3CLI%3ESharePoint%20Online%20indexer%3A%20%3CA%20href%3D%22https%3A%2F%2Faka.ms%2Fazs%2Fsharepointindexer%22%20target%3D%22_blank%22%20rel%3D%22noopener%20noreferrer%22%3Ehttps%3A%2F%2Faka.ms%2Fazs%2Fsharepointindexer%3C%2FA%3E%3C%2FLI%3E%0A%3CLI%3ECosmos%20DB%20Gremlin%20API%3A%20%3CA%20href%3D%22https%3A%2F%2Faka.ms%2Fazs%2Fcosmosdbgremlinindexer%22%20target%3D%22_blank%22%20rel%3D%22noopener%20noreferrer%22%3Ehttps%3A%2F%2Faka.ms%2Fazs%2Fcosmosdbgremlinindexer%3C%2FA%3E%3C%2FLI%3E%0A%3CLI%3EMySQL%20indexer%3A%20%3CA%20href%3D%22https%3A%2F%2Faka.ms%2Fazs%2Fmysqlindexer%22%20target%3D%22_blank%22%20rel%3D%22noopener%20noreferrer%22%3Ehttps%3A%2F%2Faka.ms%2Fazs%2Fmysqlindexer%3C%2FA%3E%3C%2FLI%3E%0A%3CLI%3EAzure%20Data%20Lake%20Storage%20Gen2%20indexer%3A%20%3CA%20href%3D%22https%3A%2F%2Faka.ms%2Fazs%2Fadlsgen2indexer%22%20target%3D%22_blank%22%20rel%3D%22noopener%20noreferrer%22%3Ehttps%3A%2F%2Faka.ms%2Fazs%2Fadlsgen2indexer%3C%2FA%3E%3C%2FLI%3E%0A%3C%2FUL%3E%0A%3CP%3E%26nbsp%3B%3C%2FP%3E%3C%2FLINGO-BODY%3E%3CLINGO-TEASER%20id%3D%22lingo-teaser-2381988%22%20slang%3D%22en-US%22%3E%3CP%3EIndexers%20now%20support%20new%20data%20sources%20including%20SharePoint%20Online%2C%20Salesforce%2C%20Elasticsearch%2C%20and%20many%20more.%3C%2FP%3E%3C%2FLINGO-TEASER%3E%3CLINGO-LABS%20id%3D%22lingo-labs-2381988%22%20slang%3D%22en-US%22%3E%3CLINGO-LABEL%3EAzure%20Cognitive%20Search%3C%2FLINGO-LABEL%3E%3CLINGO-LABEL%3EMicrosoft%20Build%202021%3C%2FLINGO-LABEL%3E%3C%2FLINGO-LABS%3E
Co-Authors
Version history
Last update:
‎May 26 2021 04:26 PM
Updated by: