Active Directory Migration: Principles, Risks, and Production Checklists - WordPad

Active Directory Migration: Principles, Risks, and Production Checklists

Last reviewed: June 2026

Scope

This article covers Active Directory migration as a production infrastructure project. It applies to domain consolidation, forest migration, AD DS modernization, hybrid identity cleanup, and migration planning where Microsoft Entra ID is part of the target state. It does not assume that every workload can move directly to cloud-only identity — in practice, very few can.

Migration is a continuity project, not a copy job

Active Directory migration is not a copy operation. It is an identity, application, access, ownership, and operational-continuity project, and the technical move is the smallest part of it. The real work is discovering dependencies, proving application compatibility, controlling privileged access, planning rollback, and making sure operations can support the new state on the Monday after cutover.

I learned the shape of this on a large multi-domain consolidation. The Active Directory part was never the hard part. The hard part was everything hanging off it: applications nobody owned, service accounts nobody could explain, delegation that had been copied between domains for a decade, and every site convinced its own setup was "standard." What made that program work wasn't a better migration tool. It was disciplined discovery and a standardized wave model that let us move faster with less hands-on effort per wave — in our case a markedly faster migration path with far less technical-staff involvement than the original per-site estimates assumed. The lesson stuck: the tool moves objects; the process moves the organisation.

So I work to a simple rule: clean what you can, document what you cannot clean, pilot with real users, and never cut over anything that has no owner, no validation method, and no rollback path.

Why AD migrations fail

Almost always for the same reasons, and almost never the ones in the project plan:

  • Unknown service accounts and scheduled tasks.
  • Stale groups that nobody owns.
  • Delegation copied between domains without review.
  • DNS, time sync, or certificate assumptions.
  • Applications with hardcoded LDAP paths to the retiring domain.
  • Legacy Kerberos, NTLM, or LDAP bind dependencies.
  • Hybrid identity assumptions around Microsoft Entra Connect or Cloud Sync.
  • No pilot group that represents real production usage.
  • No rollback plan that has actually been tested.
  • Poor communication to support teams and users.

On that program, the one that nearly bit us was the hardcoded-LDAP-path case: a finance application bound to the old domain through a service account nobody could name. It would have authenticated fine in testing and failed the moment the source domain went read-only. We found it in discovery, not in production — which is the entire argument for discovery.

My AD migration principles

  1. Clean before migration where the risk is low and ownership is clear.
  2. Do not migrate unknown delegation blindly.
  3. Treat DNS, time sync, certificates, and application LDAP dependencies as first-class risks.
  4. Validate with pilot users and real applications, not only test accounts.
  5. Define rollback before production change windows.
  6. Keep communication simple and frequent.
  7. Document ownership after cutover — the migration isn't done when the objects move.

Discovery checklist

Discovery is where the program is won. The inventory I insist on before any wave planning:

  • Domains, forests, trusts, and functional levels.
  • Sites, subnets, replication topology, and domain controller placement.
  • DNS zones, conditional forwarders, application records, and stale records.
  • GPO inventory, WMI filters, logon scripts, drive mappings, and printer dependencies.
  • OU structure, delegation, inherited permissions, and privileged groups.
  • Service accounts, gMSA usage, scheduled tasks, IIS app pools, SQL services, and automation jobs.
  • Application authentication protocols: Kerberos, LDAP, NTLM, SAML, OpenID Connect, OAuth2.
  • Microsoft Entra ID integration, Entra Connect or Cloud Sync, hybrid join, and Conditional Access dependencies.
  • File, print, Citrix, VPN, VDI, certificate, and monitoring dependencies.

Migration strategy matrix

Situation Recommended approach Reason
Clean small domain Direct migration or staged upgrade may be possible Lower dependency risk
Large enterprise Phased migration waves Controls business impact and support load
Unknown dependencies Assessment-first Prevents hidden failures during cutover
Multiple forests Trust, coexistence, and consolidation plan Identity, access, and ownership must be mapped
Hybrid identity Microsoft Entra planning required Sync, UPN, licensing, device state, and Conditional Access matter
Legacy applications Hybrid or exception model first Some workloads need redesign or replacement before cloud identity

Coexistence is where the risk concentrates

Most enterprise migrations spend more time in coexistence than in cutover. For weeks or months, old and new identity states both exist: trusts, sync rules, duplicate group structures, SID history or access translation choices, temporary admin paths, exception applications, and users moving at different speeds. That period is where risk concentrates, because the environment is deliberately more complex than either the source or target end state.

I want a coexistence design that is explicit about:

  • Which domains and forests trust each other, and for how long.
  • Which authentication paths remain valid during migration.
  • Which groups are authoritative in the source and which are authoritative in the target.
  • Whether SID history, ACL translation, or re-permissioning is the chosen access strategy.
  • Which applications are allowed to keep binding to the source domain temporarily.
  • Which admin groups, break-glass accounts, and service accounts exist only for migration.
  • How Microsoft Entra sync behaves while source and target identities coexist.

The danger is not coexistence itself. The danger is accidental permanence. Temporary trusts, exception groups, and duplicated access paths have a habit of becoming the new normal if nobody owns their retirement. Every coexistence mechanism should have an owner and an expiry condition.

Service accounts and delegated access

Service accounts are the practical test of whether discovery was real. User accounts are visible. Service accounts hide inside scheduled tasks, IIS app pools, SQL jobs, integration services, monitoring tools, backup software, scripts, and appliances. They also tend to have weak documentation and excessive rights because they were created to fix a problem quickly.

For each service account I want:

Field Why it matters
Owner Someone can confirm whether the account is still needed.
Purpose The migration team understands what breaks if it fails.
Authentication method LDAP, Kerberos, NTLM, certificate, API token, or hybrid path.
Host or workload Where the credential is configured.
Privileges Whether the account can be reduced before or after migration.
Rotation plan How the credential changes without causing an outage.
Validation method The specific test proving the account works in the target state.

Delegated access needs the same discipline. I do not like copying OU delegation blindly from source to target. Delegation is often the fossil record of old support models, mergers, temporary projects, and one-off fixes. Migration is a rare chance to review it. If the business cannot explain why a helpdesk group needs a right, that right should not move automatically.

Access strategy: translate, re-permission, or redesign

The access model is usually more important than the account move. File shares, applications, SQL permissions, Citrix access, VPN policy, and local admin rights may all depend on source-domain groups. There are three broad strategies:

Strategy When it fits Risk
Translate access Short migration window, many ACLs, known group mapping Preserves old complexity and may carry stale rights forward.
Re-permission Strong ownership and clear target access model Requires more business validation before cutover.
Redesign Legacy access model is not acceptable in target state Slower, but often the only safe path for privileged or regulated access.

The wrong answer is to let the tool decide for you. The strategy should come from business ownership, security posture, and rollback requirements. For low-risk file shares, translation may be practical. For privileged groups, application admin roles, or regulated data, re-permissioning or redesign may be the safer choice even if it takes longer.

RACI example

Migrations fail in the gaps between teams, so ownership has to be explicit:

Workstream Responsible Accountable Consulted Informed
Identity design Identity engineer Infrastructure lead Security, application owners Service desk
Application validation Application owner Business owner Identity, network Project team
DNS and network readiness Network engineer Infrastructure lead Identity, security Application owners
Communication Project lead Service owner Support, business stakeholders Users
Go/no-go Migration lead Change owner All workstream leads Support teams

In a coordinator role on Windows migration programs, this table is the artifact I spend the most time defending, because every unowned row is a future incident with no one assigned to it.

Migration wave example

Wave Scope Success criteria Rollback trigger
Pilot IT users and one low-risk app group Login, GPO, file access, app access, and support path validated Authentication failures or unresolved support blocker
Wave 1 Low-risk departments No critical incidents during hypercare Repeated application failures
Wave 2 Business-critical departments Application-owner sign-off complete Failed business validation
Final Remaining users and cleanup Legacy dependencies documented or retired Unowned dependency discovered

The wave model is also what made that program efficient: each wave taught the next, the runbook got tighter, and by the middle waves the per-site effort had dropped sharply because the surprises had already been found and documented.

Post-migration validation checklist

Validation area Method Owner
User login Test pilot and migrated account login Identity team
GPO application gpresult /h report.html EUC team
DNS resolution nslookup and application testing Network team
Microsoft Entra sync Admin center, sync logs, and sign-in logs Identity team
Application LDAP bind Application-owner validation Application team
File and print access User acceptance and access checks Operations

Rollback is not one action

Rollback in an AD migration is a set of decisions, not a single button. The rollback action depends on what failed:

Failure Possible rollback
User login failure Move user back to source authentication path or restore old profile path.
Application bind failure Repoint application to source LDAP path or restore service-account credential.
Group access failure Restore source group membership, translated ACL, or temporary access group.
Entra sync issue Stop the affected sync change and restore known-good connector configuration.
DNS or name resolution failure Restore previous resolver path or conditional forwarder.
GPO issue Unlink target GPO, restore previous link order, or move user/computer out of wave OU.

That table should exist before the change window. The team does not need to rehearse every rollback in production, but it does need to know which rollback type matches which symptom and who has authority to trigger it. Otherwise the first thirty minutes of an incident are spent debating what "rollback" means.

The controls behind a go/no-go decision

These are the standing controls that make a go/no-go meeting meaningful rather than a rubber stamp:

  • Assign owners for every dependency, not only every technical task.
  • Keep a visible RAID log with risks, assumptions, issues, and decisions.
  • Require application-owner sign-off before each wave.
  • Use CAB approval for production cutover and rollback windows.
  • Brief the service desk with symptoms, known issues, and escalation paths.
  • Run hypercare with daily review until incident volume is stable.

Go/no-go: the binary checklist

And these are the yes/no questions answered in the room, the night before a wave:

  • Pilot users validated.
  • Critical applications tested and signed off.
  • DNS, replication, and time sync validated.
  • Authentication and authorization tested.
  • GPOs and login scripts validated.
  • Rollback prepared and understood.
  • Stakeholders informed.
  • Support model ready.

Final recommendation

Treat AD migration as a business-continuity project. Tools can move objects, but only discovery, ownership, validation, communication, and rollback planning make the migration safe. If the organisation cannot explain who owns an application, how it authenticates, and how it will be validated after cutover, that workload is not ready to migrate — no matter what the project plan says. That consolidation didn't succeed because of any single tool; it succeeded because nothing crossed into a production wave until those four questions had answers.

References

Related infrastructure guides

For Help, press F1 1972 words Ln 1, Col 1