Posts

Showing posts from 2019

How to get Azure Storage metrics through REST API?

Image
I have spent sometime on getting Storage Metrics from Azure REST APIs. Though Azure documentation is a bit tricky at first instance. I was able to figure out Storage space utilized under each of the storage account. Note: Below storage metrics are at Storage Account/Blog/Table/Queue level. AVAILABILITY : The following example shows how to read metric data at account level: GET "/subscriptions/{subscriptionId}/resourceGroups/{resourceGroupName}/providers/Microsoft.Storage/storageAccounts/{storageAccountName}/providers/microsoft.insights/metrics?metricnames=Availability&api-version=2018-01-01&aggregation=Average" Response: { "cost": 0, "timespan": "2017-09-07T17:27:41Z/2017-09-07T18:27:41Z", "interval": "PT1H", "value": [ { "id": "/subscriptions/{subscriptionId}/resourceGroups/{resourceGroupName}/providers/Microsoft.Stor

How to remove a String in python list

Below is the snippet to remove a string from python list of strings a = ['fname_encr','lname_encr','address_encr']  # encr = encrypted value column a = list(map(lambda x: x.replace('_encr',''),a) print(a) #python 3 > ['fname','lname','address']

Measuring the Data Quality

Image
In today’s information-driven world, implementing an effective data quality management or DQM strategy cannot be overlooked. DQM refers to a business principle that requires a combination of the right people, processes and technologies all with the common goal of improving the measures of data quality. The subject is the single most important concept in the modern data quality approach. The subject is the entity which will be the target of the data quality investigation at the most granular level. Before we begin any data quality initiative we must discover what the subject of the study is. Like most concepts in our approach, the subject is a concept reflected in the data but not attached to any Technical object. For ex: Employee Status, Hours, Earnings belongs to subject "Employee". If we implement a Telecom Data warehouse, subject areas can be Subscriber, Finance, Marketing.Once identified, the subject becomes more than a concept and will define the

How to Connect to Databricks Delta table using JDBC?

Image
Connect Databricks Delta tables using JDBC  (Microsoft Azure) This post covers Databricks Delta JDBC connection cobnfiguration. Step 1 Download Databricks Spark JDBC driver from below location. This might require to fill in basic information before the download. After you fill in the required field you would receive Driver download links to the email you provided in the form. keep the Jar into C:\Downloads (or any location. this jar need to be added to the classpath) location. Step 2 -> Open Databricks URL. -> Navigate to Cluster tab and click on cluster. -> In the below page copy any one of the JDBC URLs. (you may need to click on advanced settings under configuration tab in cluster config page) Step 3 Navigate to top right Corner to the profile page and click on profile. Generate Token and keep it safely in local machine. Step 4 Below is the scala program to connect to Databricks Delta from Outside Azure/Cloud environment. I have us