Introduction to SQL Server Multi-Tenancy (Part 2)

solarwinds_worldwide_llc over 3 years ago 10 minute read time

In Introduction to SQL Server Multi-Tenancy (Part 1), I talked about some of the key considerations around designing a multi-tenant system using SQL Server. There are several ways to implement multi-tenancy, and, as is often the case, there is no single "best" way but rather a range of options that each offer different trade-offs. The approach that is right for you depends on your objectives and needs for your specific environment. It's important to consider which of these approaches best suit your requirements and goals based on the 3 core considerations from Introduction to SQL Server Multi-Tenancy (Part 1): security, maintainability (manageability), and scalability.

The following are the 4 approaches I will cover in this blog post:

Single database, shared schema
Single database, separate schema
Database per tenant
Multiple databases, multiple tenants per database, shared schema

Approach #1: Single Database, Shared Schema

One database to hold the data for all tenants
Every tenant's data is stored in the same set of tables
Tables that contain tenant-specific data include a column to identify which tenant each row belongs to

Security

Risk of exposing one tenant's data to another tenant or updating the wrong tenant's data (e.g., if a developer misses a WHERE clause to filter on the tenant id)

Mitigation: Row-Level Security (RLS) can be used to control access to rows in a table. Create an inline table-valued function to apply a filter on the tenant id and then create a security policy to apply that filter predicate automatically on the target tables. As long as you maintain that security policy with the full set of tables, queries/updates on those tables will then be automatically enforced. Developers don't need to remember to manually add the filter clause to every SQL statement.

No tenant isolation

Maintainability

️ One database schema to maintain and a simple schema update rollout process—it only needs to be applied once

️ Manage the High Availability/Disaster Recovery/maintenance operation/monitoring strategy for just one database

️ Limited development/application code complexity—single schema, single database to connect to

️ Adding new tenants is easy—no processes needed around database/schema provisioning or connection determination

Any query or data modification includes a predicate to restrict the operation to a specific tenant id

Mitigation: Can use RLS policy

Must remember to update the RLS policy as new tables are added over time

Can't easily restore a single tenant's data

Scalability

Limited to scaling-up hardware, rather than scaling out

Risk of "noisy neighbors"—tenants can impact the performance of the system for all others due to a lack of isolation and all competing for the same resources

One-size-fits-all performance tuning and stability—tenants' data volumes and usage can vary dramatically, impacting things such as execution plans making it more difficult to optimize performance across every tenant

As the number of tenants and data per tenant grows, maintenance activities take longer, potentially impacting all tenants