I wrote this in 2010, before Clustered Column Store Indexes, SQL Pools (so rule 3 looks a bit odd), and before date data type was mainstream, but the rest of these still look quite Thou Shalt use something that exists in your query as a partition key Thou shalt not make up a surrogate partition … Continue reading Top 10 SQL Partition Commandments
Blast from the past. I wrote this is 2008 and at one point we had it made into a poster for the office rule whereby on code review we not say a word but just point to the poster. Not sure how well this stood the test of time now it is 2021 so many … Continue reading Top 10 SSIS/DW Commandments
Synapse SQL Dedidcated Pools (aka SQLDW) does not support comments in views or procs in the same was as standalone SQL. This is annoying as comments can be very useful with tracking changes and lineage of objects, especially when used in conjunction with Schema Compare tools in Visual Studio. Everyone has seen that incident where … Continue reading Synpase SQL Tip 2 – Comments in Views/Procs
When loading a Star Schema Datawarehouse it is very common to need to insert rows into the a dimension based on exceptions. For example: Inferred Members (aka early arriving facts) Unknown Members (missing data) A Typical Query may look something like this for inferred members On a SQL Dedicated pool (SQLDW) this can take 2-4 … Continue reading Synapse TSQL Tip 1 – Use Double Defensive Inserts
While a lot of projects in Azure may be using SQLDB or SQLDW (aka Dedicated Pools) we do still have a lot of customers running SQL on a VM (aka IaaS). One common task is setting up data disks. Typically we may need anything from 4-20 data disks and combine them with storage spaces to … Continue reading Adding Disks to Azure VM with Powershell
In CI/CD Azure Synapse Analytics – part 1 we have covered: Setting up git source control in Synapse Studio The difference in main collaboration and workspace publish branch In this blog we are going to cover: How to create build pipelines Deploy to Synapse production workspace using DevOps release pipelines Build Pipeline in DevOps Go … Continue reading CI/CD For Azure Synapse Analytics – Part 2
Do you also wonder how to do continuous integration (CI) and continuous deployment (CD) for Synapse Analytics? But first, lets talk about basic, what is CI/CD is simple terms. CI/CD is one of the best practices of agile methodology and it enables development teams to deploy stable and tested code changes more frequently. This practice … Continue reading CI/CD for Azure Synapse Analytics – Part 1
Now that MSFT have released Synapse as an Integrated framework of tools for Data Engineering we have two different ways to provision our enterprise data warehouse in a dedicated sql pool. Which is the best one to use ? Firstly we can use the new Integrated experience with the dedicated pool owned by Synapse Workspace. … Continue reading Standalone or Integrated flavour of your Dedicated Pool (sqldw)
Data Lake Storage to Synapse Analytics using Managed Service Identity (MSI) – COPY INTO and PolyBase
Why to use MSI? Easy to authenticate any Azure Active Directory supported service Provides limited visibility of the credentials More secured than SQL authentication and less susceptible to hacking Configuration steps Create a storage account and enable Hierarchical namespace for Data Lake Storage Gen2. Within storage account, create a container for file system . Navigate … Continue reading Data Lake Storage to Synapse Analytics using Managed Service Identity (MSI) – COPY INTO and PolyBase
Prerequisites: Synapase Analytics workspace is created along with Data Lake Storage gen2 Step 1: Create new linked Service for a workspace Log in to Synapase analytics workspace from Azure portal and Create new Linked services and select Azure Synapse Analytics (SQL DW) Set up Authentication method for linked service as Managed Identity. Once a linked … Continue reading Configure Managed Identity in Synapse Analytics Workspace using PowerShell
Configuration Steps Create a Azure Data Lake Storage Gen 2 storage account, make sure to enable Hierarchical namespace. Read Microsoft KB here. Create a Synpase workspace and select newly created data lake storage as Account name and blob container as File System name. Assign Storage Blob Data Contributor permissions to Synpase Workspace in the Storage … Continue reading Synapse workspace: Load data using Polybase and Managed Identity
In data engineering a common challenge is to securely establish communication between different services. By providing an Azure resource identity in Azure AD and using it to obtain Azure Active Directory (Azure AD) tokens, Managed identities (formerly known as Managed Service Identity) remove the need for developers to manage credentials. In this blog, we will … Continue reading How to configure Managed Identity in Synapse Analytics Workspace?
Something went wrong. Please refresh the page and/or try again.
always on Azure backups Cloud consolidation Cube DW EDW ERP Events Featured Finance Financial Reporting Irish Economic Crisis Licencing Load Test Maestro Many to Many MasterClass MCM mdx Modelling MOLAP OLAP Partitioning Performance Personal Power BI PowerPivot Power View scom SQL SQLBits sqldw SQL IO SQL Saturday SQL Server SSAS SSIS storage Syanpse Synapse Tabular TSQL Virtualization
Subscribe to Prodata Blog
Get new content delivered directly to your inbox.