Frequently Asked Questions: Systems Management Server Recovery

Published: June 25, 2001
**
**

On This Page
SMS RecoverySMS Recovery
Recovery ExpertRecovery Expert
Supported ScenariosSupported Scenarios
BackupBackup
Secondary SitesSecondary Sites
OtherOther

SMS Recovery

Q.What is an SMS site recovery?
A.

A Systems Management Server (SMS) site recovery occurs whenever an SMS site is installed with a site code or site server name that was previously used in that SMS hierarchy. The core tasks of a site recovery are to repair and resynchronize data, because these taska are required to prevent the interruption of operations and data corruption. When you re-install a site, the most critical task is to resynchronize serial numbers.

Q.Must I run a recovery if I am just re-installing an SMS site?
A.

Yes. The other SMS sites in the hierarchy require a re-installed site to be in its former state. Therefore, it's vital that you repair and resynchronize the re-installed site with the rest of the hierarchy. You must treat a re-installation as a recovery to prevent operational problems in the hierarchy.

Note: A lack of understanding of these recovery requirements is the most common cause of unsuccessful site recoveries.

Q.Will disconnecting a child site from a failed site protect the child site from damage or corruption?
A.

No, it won’t. In fact:

Disconnecting won’t protect the child site from, or clean up, any damage that might have already occurred.

Disconnecting won’t protect the child site if mistakes are made during recovery and the child site is later reconnected to any site below the failed site in the hierarchy.

To protect the child site, before you connect it to a new parent site, always check for failed site recoveries at the potential parent site and at all sites above it. Performing this check helps avoid corrupting software distribution objects at the connecting site and all of its child sites.

Note: Connecting to sites that weren’t correctly recovered is the most common way for software distribution data to become corrupted for the whole hierarchy.

Q.What levels of skill are needed to recover an SMS site?
A.

To complete SMS maintenance and recovery procedures successfully, an administrator should be comfortable performing detailed manual tasks on local and remote computers. Sometimes these tasks use Resource Kit tools for the operating system, SMS, and Microsoft SQL Server™. In many cases, the administrator must manipulate critical data on live sites, such as when:

Configuring the operating system, working with the file system, and editing the registry.

Configuring operating system security, including accounts, services, shares, trusts, permissions, rights, organizational units, domains, forests, and policy templates.

Configuring SQL Server, restoring databases, and updating SQL Server tables with queries.

note-iconWarning: Failing to complete the procedures successfully can cause site operations to fail, corrupt site data, or even corrupt data at multiple sites. If there is any doubt that the skill level of the SMS administrator is sufficient, a more experienced administrator should supervise the maintenance and recovery procedures. If a skilled administrator is unavailable, the administrator should call Microsoft Product Support Services, who will guide the administrator through the procedures until the administrator is proficient enough to manage the operation alone.

Q.What does a recovery consist of?
A.

The figure below shows each phase of a recovery.

Recovery cycle
Q.
A.

Recovery Expert

Q.What is the difference between the SMS Recovery Expert and the SMS Recovery Wizard?
A.

The SMS Recovery Expert generates a complete list of required tasks to recover a failed SMS site based on your specific scenario and configuration. This list is broken into phases (prepare, rebuild, restore, and repair). Some of the tasks generated by the Recovery Expert might advise you to run the Recovery Wizard as well.

The SMS Recovery Wizard automates some of the repair and resynchronization tasks required to complete a recovery. It also performs some repairs that are impossible to perform manually. Running the Recovery Wizard is optional, but if you do use it, you must run it after you use the Recovery Expert.

Q.Why are recovery tasks only in the SMS Recovery Expert?
A.

The expert format was used, because it is too cumbersome to explain all possible recovery scenarios in a single document. However, if you want to see each possible recovery task, you can: Print the All SMS Recovery Tasks page.

Q.Who is the target audience for the SMS Recovery Expert?
A.

Recovery information is intended for SMS administrators, Microsoft Consulting Services, Solution Providers, Product Support Services, and anyone else with a deep technical knowledge of SMS 2.0 and the environment in which it is being used. The intended audience must be familiar with the backup policy, security policy, and operational requirements of the customer environment.

Q.Why is recovery information available only on the Web?
A.

With the Web, we can update recovery information frequently. Our customers regularly find many new and surprising ways to use SMS 2.0 to solve problems, and so we're always adding to the list of situtations to which failure recovery applies. The Web is the fastest way to deliver the most current recovery information to you.

Q.What can I do if my failed SMS site is on a secure network that doesn't have access to the recovery page on the Microsoft.com Web site?
A.

1.

Download the recovery tools and documents from the Web page to removable media, print out the recovery tasks, and move the media to the secure network.

2.

Call Microsoft Product Support Services for assistance.

Microsoft is considering options for handling this scenario better in the future.

Q.
A.

Supported Scenarios

Q.Is there SMS backup and recovery support for Microsoft Windows 2000?
A.

There is backup and recovery support to the extent that the general SMS product supports Windows® 2000 features. For details about the level of support, refer to the release notes for your version of SMS.

Q.Does SMS support operating system backup tools and third party backup tools?
A.

Yes, this support is available if the third party backup tools perform the same steps in the same order as those listed in the SMS backup control file.

Note: A site backup must be a snapshot of all data, and it is critical that you stop all processes that would be accessing that data. If the processes are not stopped, the backup can include partially completed tasks, which can cause problems after site recovery. For example, the data in the registry, files, and database would not be synchronized with each other. Therefore, the only supported site backup is one that is made as a snapshot of all data, at a time when all processes that might access data are stopped.

Q.Does SMS support the backup and restore operation for an SMS client?
A.

Unfortunately, no. SMS 2.0 does not stop and restart client processes at this time. If client data were backed up while the processes were active, both the data on disk and in the backup could become corrupted. Although the operation is not supported now, Microsoft hopes to provide support in a future release.

Q.
A.

Backup

Q.Can I back up only the SMS site database?
A.

No. SMS data is stored in several places, and SMS data from all the following locations must be restored for an SMS site to function correctly:

SQL Server databases for the site.

Site server files from the site directory (from share SMS_<site code>).

The SMS and NAL registry keys.

Configuration information on computers running Windows NT® and SQL Server.

Q.Can I perform a backup while the SMS site is running?
A.

No. To avoid relational integrity problems in the site backup, you must stop site services before running backup. (The Backup SMS Site Server task does this automatically.) You can then run the backup.

Note: When backing up a site, always stop the SMS site services and back up all four components of a site as a snapshot so that they can be restored as a snapshot. Failing to follow this procedure is the most common cause of corrupted backups, and trying to restore these corrupted backups is the second most common cause of unsuccessful site recoveries.

Q.Must I use the Backup SMS Site Server task?
A.

No. However, to have a valid backup, you must carry out the same tasks in the same order as in the SMS backup control file. The simplest and most reliable plan is to run the Backup SMS Site Server task and use other backup applications to save the backups to tape.

Q.What if I didn't back up the CAPs or logon points, and then they fail?
A.

In such a case, the simplest way to recover is to delete the client access points (CAPs) and logon points, and let the site rebuild them from data stored on the site server. This process ensures that the site is synchronized after the site server is recovered. However, if the CAP contains many files or there are many CAPs and the network is slow, it might take a while to rebuild them all.

Q.Do I have to back up the whole hierarchy at the same time?
A.

There is no advantage to backing up the whole hierarchy at the same time. Each site can be backed up and restored independently of the others.

Q.Can I recover a site without a valid backup?
A.

Yes, but you will experience more data loss, and it will take more time to re-establish normal operations.

Q.
A.

Secondary Sites

Q.Does the Backup SMS Site Server task support secondary sites?
A.

No. However, support for secondary site backup is being considered for a future release.

Q.Do I have to back up secondary sites?
A.

Depends. You need to evaluate the needs of each secondary site based on:

The availability of backup support at the site server location.

How mission-critical the site is.

Cost tradeoffs.

For example, it's probably worth backing up a mission-critical site with 100 local users, including a part-time server administrator, where the site has a tape backup unit on the local network. On the other hand, it's probably not worth backing up a non-critical remote office with only four permanent staff and no local tape backup unit.

Remember that backup is a regular overhead cost, and with reliable hardware and a secure server location, failure happens rarely. The combined goal is to reduce the cost of desktop management and maintain reasonable uptime.

If a site has more than one client access point (CAP), logon point, or distribution point, even if the site server goes down, the client still continues to install software and pass inventory up the hierarchy. Rather than run a regular backup on a site server, you might find it more cost-effective to put a second CAP, logon point, and distribution point on an extra server to provide better support for clients at that site.

You can perform a secondary site recovery either with a backup or without as follows.

To recover a secondary site when you have a backup:

1.

Back up the registry keys and files.

2.

Copy the backup to tape, and store the tape.

3.

After failure, restore the backup and resynchronize the site.

To recover a secondary site when you do not have a backup:

1.

After failure, resynchronize and repair the site.

2.

Redistribute the packages and advertisements to the secondary site.

Q.Does the SMS Recovery Expert support secondary sites?
A.

Yes, it does.

Q.Will the first version of the SMS Recovery Wizard support secondary sites?
A.

No, but this feature is being considered for future releases.

Q.What do I need to know about backup and recovery on secondary sites?
A.

Here are the key points:

Secondary sites can be backed up and restored.

Secondary sites can be recovered without a backup using the same site code as the failed site, but more data loss occurs than is the case when data is recovered from a backup.

There are procedures to recover from the data loss after a site recovery.

The more often you back up data, the less loss a site failure causes. However, you always lose some data. A site's administrators must evaluate whether it's more cost-effective to spend resources on backup or commit resources to dealing with the partial data loss that occurs after a site failure. Actually, this analysis applies to all SMS site systems.

Q.Can I recover a site without a valid backup?
A.

Yes, but there will be more data loss, and a longer time is required to re-establish normal operations.

Q.
A.

Other

Q.What is the status of the Recovery Wizard?
A.

To improve the backup and recovery experience today with SMS 2.0, Microsoft developed the Recovery Expert so that existing SMS customers can experience greater degrees of success backing up and recovering SMS sites in cases of site system migration and upgrades, as well as system failures.

To clearly distinguish between the Recovery Wizard and the Recovery Expert, the Recovery Wizard will be renamed to the Repair Wizard. To provide built-in recovery support and improve the quality of this support, Microsoft will incorporate the Repair Wizard into the Topaz release along with other backup and recovery improvements.

Q.Will any of the recovery tools be localized?
A.

Not at first. Microsoft hopes to localize the second release of the tools.

Q.
A.
Top of pageTop of page