Quick Answer

HADR_BACKUP_BULK_LOCK occurs when an Always On primary replica processes a backup request from a secondary replica and waits for background threads to acquire or release the BulkOp lock. This wait typically appears during log backup operations on secondary replicas and indicates normal Always On synchronization activity, though excessive waits can signal network or storage bottlenecks.

Root Cause Analysis

When a secondary replica initiates a backup operation (typically transaction log backups), it sends a backup request to the primary replica. The primary must coordinate this operation through its background threads that manage the BulkOp lock, which ensures consistency during bulk operations like backups.

The wait manifests in the primary's log writer thread when it needs to coordinate with the backup operation happening on the secondary. SQL Server's Always On architecture requires the primary to maintain awareness of backup operations on secondaries to ensure transaction log consistency and proper LSN sequencing.

In SQL Server 2016 and later, Microsoft improved the efficiency of this coordination mechanism, reducing the frequency and duration of these waits. SQL Server 2019 introduced further optimizations to the log block flushing algorithm that can minimize contention on the BulkOp lock during concurrent backup operations.

The wait specifically occurs in the primary's log manager when it must serialize access to log blocks that are being backed up on a secondary replica. This serialization ensures that log records remain consistent across all replicas during backup operations.

SQL Server 2022 enhanced the parallel redo mechanism which can indirectly reduce these waits by improving overall Always On performance, though the core BulkOp lock mechanism remains unchanged.

Diagnostic Queries

-- Check current HADR_BACKUP_BULK_LOCK waits across all sessions
SELECT 
    session_id,
    wait_type,
    wait_time_ms,
    blocking_session_id,
    resource_description
FROM sys.dm_exec_requests 
WHERE wait_type = 'HADR_BACKUP_BULK_LOCK'
    AND session_id > 50;

-- Analyze historical wait statistics for this wait type
SELECT 
    wait_type,
    waiting_tasks_count,
    wait_time_ms,
    max_wait_time_ms,
    signal_wait_time_ms,
    wait_time_ms / waiting_tasks_count AS avg_wait_time_ms
FROM sys.dm_os_wait_stats 
WHERE wait_type = 'HADR_BACKUP_BULK_LOCK';

-- Examine Always On backup preferences and current backup activity
SELECT 
    ag.name AS availability_group,
    ar.replica_server_name,
    ar.backup_priority,
    ar.secondary_role_allow_connections,
    ars.role_desc,
    ars.synchronization_health_desc
FROM sys.availability_groups ag
JOIN sys.availability_replicas ar ON ag.group_id = ar.group_id
JOIN sys.dm_hadr_availability_replica_states ars ON ar.replica_id = ars.replica_id;

-- Monitor log send queue and backup operations on secondaries
SELECT 
    db.name AS database_name,
    drs.synchronization_state_desc,
    drs.log_send_queue_size,
    drs.log_send_rate,
    drs.redo_queue_size,
    drs.redo_rate,
    drs.last_sent_time,
    drs.last_received_time
FROM sys.dm_hadr_database_replica_states drs
JOIN sys.databases db ON drs.database_id = db.database_id
WHERE drs.is_primary_replica = 1;

-- Check for active backup operations across the availability group
SELECT 
    s.session_id,
    r.command,
    r.percent_complete,
    r.estimated_completion_time,
    s.login_name,
    r.database_id,
    db.name AS database_name
FROM sys.dm_exec_sessions s
JOIN sys.dm_exec_requests r ON s.session_id = r.session_id
JOIN sys.databases db ON r.database_id = db.database_id
WHERE r.command LIKE '%BACKUP%'
    AND db.name IN (SELECT database_name FROM sys.dm_hadr_availability_replica_states);

Fix Scripts

Optimize backup scheduling to reduce contention

-- Create a staggered backup schedule for secondary replicas
-- This reduces simultaneous backup operations that can cause lock contention
EXEC sp_add_job 
    @job_name = 'Staggered_Log_Backup_Secondary',
    @enabled = 1;

EXEC sp_add_jobstep
    @job_name = 'Staggered_Log_Backup_Secondary',
    @step_name = 'Backup_with_delay',
    @command = '
    -- Add random delay between 0-300 seconds to stagger backups
    DECLARE @delay INT = ABS(CHECKSUM(NEWID())) % 300;
    WAITFOR DELAY @delay;
    BACKUP LOG [YourDatabase] TO DISK = ''backup_path'' WITH COMPRESSION;',
    @database_name = 'master';

Test thoroughly in development. Adjust delay ranges based on your backup frequency and RPO requirements.

Configure backup preferences to minimize primary impact

-- Set backup preference to secondary replicas only
-- This reduces backup-related waits on the primary
ALTER AVAILABILITY GROUP [YourAGName]
MODIFY REPLICA ON 'PrimaryServer' 
WITH (BACKUP_PRIORITY = 10);

ALTER AVAILABILITY GROUP [YourAGName]
MODIFY REPLICA ON 'SecondaryServer1' 
WITH (BACKUP_PRIORITY = 90);

ALTER AVAILABILITY GROUP [YourAGName]
MODIFY REPLICA ON 'SecondaryServer2' 
WITH (BACKUP_PRIORITY = 85);

Verify backup jobs respect these preferences using PREFER_SECONDARY or SECONDARY_ONLY options.

Enable backup compression to reduce I/O duration

-- Enable backup compression server-wide to reduce backup duration
-- Shorter backup operations mean shorter lock hold times
EXEC sp_configure 'backup compression default', 1;
RECONFIGURE WITH OVERRIDE;

-- For specific backup operations, always use compression
-- BACKUP LOG [YourDatabase] TO DISK = 'path' WITH COMPRESSION, INIT;

Monitor CPU usage after enabling compression. Some workloads may see CPU pressure increase.

AutoDBA generates fix scripts like these automatically, with impact estimates and rollback SQL included.

HADR_BACKUP_BULK_LOCK Wait Type Explained

Quick Answer

Root Cause Analysis

Diagnostic Queries

Fix Scripts

Prevention

Need hands-on help?

Related Pages

HADR_AG_MUTEX

HADR_AR_CRITICAL_SECTION_ENTRY

HADR_BACKUP_QUEUE

Always On AG Latency

sys.dm_hadr_availability_replica_states