SQLServerCentral - www.sqlservercentral.com

A community of more than 1,600,000 database professionals and growing

Featured Contents

The Voice of the DBA

Pseudonymisation

Today we have a guest editorial from Phil Factor as Steve is out of the office.

It is an awful word, but ‘Pseudonymisation’ is the process of partial 'data masking' that transforms personal data in such a way that the resulting data cannot be easily attributed to a specific data subject without the use of additional information.  It is used mostly in medical research but also in reporting, training and testing. Unlike full ‘anonymisation’, a record that has ‘Pseudonymised’ data can still be linked back to the original record via a key. It is one of the ‘appropriate technical and organisational measures’ that are required to comply with the EU general data protection regulation 2016/679 (GDPR) where data needs to be used for research or testing purposes.

Pseudonymisation  isn’t a good solution to the problem of protecting privacy, and preventing unauthorised access. Even encryption is a very poor substitute for effective access-control, because there must be no unauthorised access even to the encrypted data and even encryption is best considered as an extreme delaying tactic.  

Partial data masking or other Pseudonymisation techniques tend to fail where they can be subject to inference attacks. It is often easy to take such data and work out, tho a small but useful extent, who it belongs to: You can soon work out who is using that dating site unless all the data is anonymised. Completely anonymised data is fine, especially for training, but if you are using it for database testing you need to check  that the 'statistics' distributions of the data haven't changed.

Pseudonymisation is a variety of data masking. The task of masking sensitive data within a database is always fraught. RDBMSs are designed to make it pretty easy to work out what the data was before it was obfuscated. Even if you aren’t always sure what goes on in the log, you can be confident that the villains know. In fact, there are plenty of other artefacts around within a SQL Server database to guide the curious. You are likely to need to shut down all devices and traces that track changes in the data before you obfuscate data, or export all the data to a different copy of the database.

I suspect that we want to use real production data where we have a bug that can only be repeated reliably on Production data. This tends to be produced by someone like Mr Null or Mr O’Brien, who between them have caused many a NAD system to fail. (Test data tends not to reflect the full variety that is  met in the real world.) With database systems, we also need to understand why certain queries fail when data conforms to particular distributions, and these distributions of data are difficult to achieve by generating data.

We are getting close to the point where we can no longer use live personal data to maintain and enhance databases. It will require considerable skill to find effective ways of partially or fully anymising the data to ensure that the organisations we work for comply with the law, because it may not be as easy as it first seems.

Phil Factor from SQLServerCentral.com

Join the debate, and respond to today's editorial on the forums

ADVERTISEMENT
Foundry

Could you help with some research?

The Foundry team at Redgate want to hear about your experience with SQL Server asset management. Fill in this short survey to help out, and also to enter their prize draw for a $100 Amazon gift card.  Complete survey.

SQL Clone

NEW SQL Clone - version 1 available now!

Create copies of production databases and SQL backups in seconds and save up to 99% of disk space using SQL Clone. Redgate’s new tool removes much of the time and resource needed to create and manage database copies, allowing teams to work on local environments to develop, test and diagnose issues faster. Try it free.

Featured Contents

 

Forecasting with SQL

Mark Wojciechowicz from SQLServerCentral.com

Can you use SQL Server to extrapolate out trends from your data? Lots of companies would have you buy a BI solution instead, but new author Mark Wojciechowicz brings us a great example of how you could forecast something in T-SQL. More »


 

Registration Now Open for SQL Cruise Alaska in August 2017

Tim Ford

More »


 

Installing SQL Server Agent on Ubuntu Server

Additional Articles from MSSQLTips.com

Manvendra Singh explains how to install SQL Server Agent on an Ubuntu server, so that you can create SQL Server Agent Jobs to schedule repetitive tasks. More »


 

Do you use Visual Studio Code?

Additional Articles from Redgate

Visual Studio Code is rapidly gaining in popularity, but is it all it could be, or is there room for improvement? Redgate is embarking on some research to better understand how you are using this lightweight editor, and where it can be improved. Have your say by filling in this short survey! More »


 

From the SQLServerCentral Blogs - GroupBy conference recording of Introducing the SQL Server 2016 Query Store available now!

Enrico van de Laar from SQLServerCentral Blogs

Good news! The recording of my session “Introducing the SQL Server 2016 Query Store” at the GroupBy conference 2 weeks ago... More »


 

From the SQLServerCentral Blogs - GroupBy–Bringing DevOps to the Database

Steve Jones from SQLServerCentral Blogs

Today was my presentation in the April GroupBy conference lineup. I presented on DevOps and changing your database development to... More »

Question of the Day

Today's Question (by Stanley Kapfunde):

In SQL Server an extent is ____ contiguous pages, using  _____ of disk storage.

Think you know the answer? Click here, and find out if you are right.


We keep track of your score to give you bragging rights against your peers.
This question is worth 1 point in this category: Extent.

We'd love to give you credit for your own question and answer.
To submit a QOTD, simply log in to the Contribution Center.

ADVERTISEMENT

Exam Ref 70-762 Developing SQL Databases

Prepare for Microsoft Exam 70-762, Developing SQL Databases –and help demonstrate your real-world mastery of skills for building and implementing databases across organizations. Designed for database professionals who build and implement databases across organizations and who ensure high levels of data availability, Exam Ref focuses on the critical-thinking and decision-making acumen needed for success at the MCSA level. Get your copy from Amazon today.

Yesterday's Question of the Day

Yesterday's Question (by Steve Jones):

I have a server with 16 CPUs seen by SQL Server 2016. This server has 4 NUMA nodes, each having 4 CPUs inside. I want to set processor affinity to just the CPUs in the first and last NUMA node. What command do I use?

Answer: ALTER SERVER CONFIGURATION SET PROCESS AFFINITY CPU=0 TO 3, 12 TO 15;

Explanation:

You can set affinity to certain nodes or CPUs. The way to do this is to list the groups of CPUs with the TO command and a comma between groups. Since CPUs are 0-based, we would use the command: 

 ALTER SERVER CONFIGURATION SET PROCESS AFFINITY CPU=0 TO 3, 12 TO 15; 

Ref: ALTER SERVER CONFIGURATION - click here


» Discuss this question and answer on the forums

Database Pros Who Need Your Help

Here's a few of the new posts today on the forums. To see more, visit the forums.

SQL Server 2014 : Administration - SQL Server 2014

tempdb: filegroup is full while space is available both in filegroup & transaction log - SqlServer 2014 sp2cu2: having troubles with tempdb (350Gb full size: 8x25Gb primary group + 150 Gb log). Error is Msg 1105, Level...


SQL Server 2014 : Development - SQL Server 2014

Performance issue with EXCEPT - I am rewriting some user-defined functions and I want to test if they give the exact same result as the...

Dynamic Statement with Cursor loop, help - Hi , I am trying to build a dynamic query which will create a select. I am trying to use a...


SQL Server 2012 : SQL 2012 - General

Atomic Transaction.? - Not Quite...... - Hi All, Can I pick your brains please. Have a 3rd party application (Java) calling stored procs to utilise our DB. To say...

Delete nologging - Hi, How to ran a delete statement without entering log into .ldf file. Thank & Regards Krishna.

Best practice for alerting via 3rd party tool - Hi all, New in my environment.  We use Nagios to log in and monitor our DB servers, but from what I've...

To Delete / Truncate / Drop & Create - Good Morning All, Can I Have some opinions please. Have 2 large tables I need to 'clean', 1 with 265263333 rows and one...

SSIS ForEach Loop Container recursion question - I have a weird one today. Because my destination system can only accept files of a certain size, I need...

Oh double hop, you're so fun ... Login failed for user 'NT AUTHORITY\ANONYMOUS LOGON'. - I've fought this issue so many times over the years and perhaps I'm old and rusty, but I'm having a...


SQL Server 2012 : SQL Server 2012 - T-SQL

Understanding why variable is NULL - In the sql code below, I'm trying to understand why @idMin is NULL and not equal to 0.  I'm probably...

Changing if statement to sql code - I wanted to change the code in the below if statement to more like sql code.  Can I simplify it...

Heap vs Clustered Wildcard Search - I am working on converting a heap table to a clustered table by adding a primary key to the identity...

Help On Calculating Aging - create


SQL Server 2008 : SQL Server 2008 - General

SQL Server Inventory - I have been given with a big task - to make an inventory of SQL Server installations across 25 worldwide locations....


SQL Server 2008 : T-SQL (SS2K8)

ServerSizes - My table data. date column values sample : 2017-05-16 03:14:59.647 Server1    100        1/1/2016 Server2    100        1/1/2016 Server3    100        1/1/2016 Server4    100 


Data Warehousing : Integration Services

SSIS inserting data into a destination table using table names in a variable - I have table which will is having the table names in it. I have to now import those table in...

Grief importing date from Excel 2010 to SQL Server 2012, SSIS, Derived column task. - Hello, First of all, I'm an SSIS Noob, and not great with Excel either, someone else created and imported the data...


SQL Server 2005 : SQL Server 2005 Integration Services

upload using flat file source - my file is like this TH*4.2*857463*01**20091015*1045*P**~~IS*7564*ORACLE~ i have two table table : TH ID TH1 TH2 TH3 TH4 TH5 TH6 TH7 TH8 TH9 TH 4.2...


Career : Events

The SQL Saturday Thread - As popular as SQL Saturday is, I'm surprised that nobody created a thread dedicated to SQL Saturday, so I created...


Career : Presentations and Speaking

Potential presentation idea: SQL Server for absolute beginners - So, I took today off from work to prepare and travel to SQL Saturday #517, and something occurred to me...

This email has been sent to [email protected]. To be removed from this list, please click here.
If you have any problems leaving the list, please contact the [email protected].
This newsletter was sent to you because you signed up at SQLServerCentral.com.
Feel free to forward this to any colleagues that you think might be interested.
If you have received this email from a colleague, you can register to receive it here.
This transmission is ©2017 Redgate Software Ltd, Newnham House, Cambridge Business Park, Cambridge, CB4 0WZ, United Kingdom. All rights reserved.
Contact: [email protected]