Netherlands: Software

Introduction to Microsoft SQL Server 2016

Issue link: http://hub-nl.insight.com/i/692679

Chapter 9 | Introducing Azure SQL Data Warehouse

Using Azure Data Factory

Azure Data Factory (ADF) is a cloud-based data integration service that manages the movement and transformation of data elements between various Azure services on a scheduled basis. ADF pricing is based on the frequency of the activities that you run and whether an activity runs on-premises or in the cloud. In addition, when you use the Copy Activity, the cost depends on the location of the data source and the amount of time required to copy the data.

Note: You can find current information about ADF pricing at "Data Factory Pricing," https://azure.microsoft.com/en-us/pricing/details/data-factory/.

ADF includes the following components, shown in Figure 9-3:

Datasets: A dataset is any type of data that you reference in an activity, such as a table or a document.

Activities: An activity is an action that affects data, using zero or more datasets as input and creating one or more datasets as output. We describe the Copy and Stored Procedure activities later in this section, but several other activities are available. For example, you can use the Hive or Pig activity to execute Hive or Pig queries on an HDInsight cluster, or use the Batch Execution activity to invoke the Azure Machine Learning web service to apply a predictive model to a batch of data.

Pipelines: A pipeline is a logical grouping of activities that collectively perform a task. You can manage and schedule this group of activities as a single item rather than work with each activity individually.

Linked services: A linked service is a connection to an external service, such as a data store hosting a dataset or a compute resource required by an activity.

Figure 9-3: Components and relationships between components in ADF.

Note: To use Azure Data Factory with your SQL Data Warehouse, the firewall settings for the SQL Server instance must be configured to allow access to Azure services, as described in Chapter 8.
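The relationships among the four components described above can be made concrete with a small model. The following Python sketch is illustrative only: it is not the ADF SDK or the ADF JSON schema, and every class, field, and object name in it (LinkedService, Dataset, Activity, Pipeline, "CopySales", and so on) is a hypothetical name chosen for this example. It only encodes the rules stated in the text: a dataset is reached through a linked service, an activity takes zero or more input datasets and produces one or more output datasets, and a pipeline groups activities so they can be handled as a single item.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass(frozen=True)
class LinkedService:
    """Connection to an external data store or compute resource."""
    name: str
    kind: str  # hypothetical label, e.g. "AzureStorage" or "AzureSqlDW"


@dataclass(frozen=True)
class Dataset:
    """Data referenced by an activity, reached through a linked service."""
    name: str
    linked_service: LinkedService


@dataclass
class Activity:
    """An action that reads zero or more datasets and writes one or more."""
    name: str
    outputs: List[Dataset]
    inputs: List[Dataset] = field(default_factory=list)

    def __post_init__(self) -> None:
        # The text requires at least one output dataset per activity.
        if not self.outputs:
            raise ValueError(f"activity {self.name!r} needs an output dataset")


@dataclass
class Pipeline:
    """A logical group of activities managed and scheduled as one item."""
    name: str
    activities: List[Activity]

    def linked_services(self) -> List[str]:
        """Sorted names of every linked service the pipeline depends on."""
        names = {
            ds.linked_service.name
            for act in self.activities
            for ds in act.inputs + act.outputs
        }
        return sorted(names)


# Hypothetical example: copy sales data from blob storage into a warehouse.
blob = LinkedService("StorageLinkedService", "AzureStorage")
dw = LinkedService("SqlDwLinkedService", "AzureSqlDW")
source = Dataset("SalesBlob", blob)
target = Dataset("SalesStaging", dw)
copy = Activity("CopySales", inputs=[source], outputs=[target])
pipeline = Pipeline("LoadSales", [copy])

print(pipeline.linked_services())
# ['SqlDwLinkedService', 'StorageLinkedService']
```

Modeling the linked service as a field of the dataset, rather than of the activity, mirrors the description of a linked service as the data store "hosting a dataset"; an activity that needs a compute resource would carry its own linked service reference, which this sketch omits for brevity.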
You can also use Visual Studio with the Azure Software Development Kit (SDK) to create objects for ADF, as described at "Tutorial: Create a pipeline with Copy Activity using Visual Studio," https://azure.microsoft.com/en-us/documentation/articles/data-factory-get-started-using-vs/. Another option is to use PowerShell, as described at "Tutorial: Create a pipeline with Copy Activity using Azure PowerShell," https://azure.microsoft.com/en-us/documentation/articles/data-factory-monitor-manage-using-powershell/. As an alternative to writing code, you can use the Copy Wizard to create the necessary linked services, datasets, and pipeline. Learn more at "Tutorial: Create a pipeline with Copy Activity using
