Quick Answer

HADR_PARTNER_SYNC waits occur when SQL Server Always On availability groups are synchronizing metadata changes across replicas, specifically when updating the partner list during configuration changes. This wait typically appears during replica additions, removals, or role changes and is generally not concerning unless it persists for extended periods during normal operations.

Root Cause Analysis

This wait type surfaces within SQL Server's Always On availability group infrastructure when the primary replica needs to synchronize partner metadata with secondary replicas. The wait occurs in the availability group state machine when threads are blocked waiting for exclusive access to update the internal partner list structure.

The partner list is a critical metadata structure that tracks all replicas participating in an availability group, including their endpoints, roles, and synchronization states. When configuration changes occur (adding/removing replicas, changing synchronization modes, or during automatic failover scenarios), SQL Server must acquire exclusive locks on this structure to ensure consistency across all replicas.

The wait manifests in the SQLOS scheduler when worker threads executing availability group management operations are suspended while waiting for the partner list lock. This coordination happens through the availability group's internal synchronization primitives, which use Windows synchronization objects underneath.

In SQL Server 2016 and later versions, Microsoft improved the efficiency of partner list updates by implementing more granular locking mechanisms, reducing the frequency and duration of these waits. SQL Server 2019 introduced additional optimizations for scenarios with multiple availability groups on the same instance, further minimizing contention on shared metadata structures.

The wait becomes problematic when it persists during steady-state operations, indicating potential issues with network connectivity between replicas, slow secondary replicas preventing metadata synchronization, or excessive configuration changes happening simultaneously.

Diagnostic Queries

-- Check current HADR_PARTNER_SYNC wait statistics
SELECT 
    wait_type,
    waiting_tasks_count,
    wait_time_ms,
    max_wait_time_ms,
    signal_wait_time_ms,
    wait_time_ms - signal_wait_time_ms AS resource_wait_time_ms
FROM sys.dm_os_wait_stats 
WHERE wait_type = 'HADR_PARTNER_SYNC'
    AND waiting_tasks_count > 0;

-- Identify active HADR_PARTNER_SYNC waits and associated sessions
SELECT 
    s.session_id,
    r.blocking_session_id,
    r.wait_type,
    r.wait_time,
    r.wait_resource,
    t.text AS current_statement,
    s.program_name,
    s.host_name
FROM sys.dm_exec_requests r
INNER JOIN sys.dm_exec_sessions s ON r.session_id = s.session_id
CROSS APPLY sys.dm_exec_sql_text(r.sql_handle) t
WHERE r.wait_type = 'HADR_PARTNER_SYNC'
ORDER BY r.wait_time DESC;

-- Check availability group replica states and synchronization health
SELECT 
    ag.name AS availability_group_name,
    ar.replica_server_name,
    ars.role_desc,
    ars.synchronization_state_desc,
    ars.synchronization_health_desc,
    ars.connected_state_desc,
    ars.last_connect_error_number,
    ars.last_connect_error_description
FROM sys.availability_groups ag
INNER JOIN sys.availability_replicas ar ON ag.group_id = ar.group_id
INNER JOIN sys.dm_hadr_availability_replica_states ars ON ar.replica_id = ars.replica_id
ORDER BY ag.name, ar.replica_server_name;

-- Monitor partner sync operations in the error log
EXEC xp_readerrorlog 0, 1, N'partner', N'sync', NULL, NULL, N'DESC';

-- Check for recent availability group configuration changes
SELECT 
    ag.name AS availability_group_name,
    ar.replica_server_name,
    ar.availability_mode_desc,
    ar.failover_mode_desc,
    ar.create_date,
    ar.modify_date
FROM sys.availability_groups ag
INNER JOIN sys.availability_replicas ar ON ag.group_id = ar.group_id
WHERE ar.modify_date >= DATEADD(hour, -24, GETDATE())
ORDER BY ar.modify_date DESC;

Fix Scripts

Restart availability group to clear stuck partner synchronization

-- Use with extreme caution - will cause brief unavailability
-- Test in development environment first
USE [master]
GO

DECLARE @AGName NVARCHAR(128) = 'YourAvailabilityGroupName'
DECLARE @SQL NVARCHAR(MAX)

-- Stop the availability group
SET @SQL = 'ALTER AVAILABILITY GROUP [' + @AGName + '] OFFLINE'
EXEC sp_executesql @SQL

WAITFOR DELAY '00:00:05' -- Allow time for cleanup

-- Bring the availability group back online
SET @SQL = 'ALTER AVAILABILITY GROUP [' + @AGName + '] ONLINE'
EXEC sp_executesql @SQL

PRINT 'Availability group restart completed'

This forces a clean restart of the availability group state machine, clearing any stuck partner synchronization operations. Only use during maintenance windows as it causes temporary unavailability.

Force partner list refresh by toggling replica session timeout

-- Run on the primary replica to force a partner metadata refresh
-- Safe to run during production hours
-- Temporarily change session timeout to trigger state refresh
ALTER AVAILABILITY GROUP [YourAvailabilityGroupName]
MODIFY REPLICA ON N'SecondaryServerName'
WITH (SESSION_TIMEOUT = 15);

WAITFOR DELAY '00:00:05';

-- Reset to your standard timeout value
ALTER AVAILABILITY GROUP [YourAvailabilityGroupName]
MODIFY REPLICA ON N'SecondaryServerName'
WITH (SESSION_TIMEOUT = 10);

This forces a partner metadata refresh by toggling the session timeout. Verify replica states afterward using sys.dm_hadr_availability_replica_states.

Clear wait statistics to get fresh baseline

-- Clear historical wait stats to focus on current issues
-- Safe to run in production
DBCC SQLPERF('sys.dm_os_wait_stats', CLEAR);

PRINT 'Wait statistics cleared. Monitor for 15-30 minutes for fresh data.'

Resets wait statistics counters to establish a new baseline for monitoring partner sync waits.

AutoDBA generates fix scripts like these automatically, with impact estimates and rollback SQL included.

HADR_PARTNER_SYNC Wait Type Explained

Quick Answer

Root Cause Analysis

Diagnostic Queries

Fix Scripts

Prevention

Need hands-on help?

Related Pages

HADR_AG_MUTEX

HADR_AR_CRITICAL_SECTION_ENTRY

HADR_BACKUP_BULK_LOCK

Always On AG Latency

sys.dm_hadr_availability_replica_states