Checklist for node replication
A successful implementation of node replication relies on sufficient, dedicated hardware resources. Increased amounts of memory and processor cores are required. The database and its logs must be appropriately sized to ensure that transactions can complete. A dedicated network, with enough bandwidth to handle the amount of data you intend to replicate, is required.
Use the checklist to verify that hardware and your IBM Spectrum Protect™ configuration have characteristics that are key to good performance.
Question | Tasks, characteristics, options, or settings | More information |
---|---|---|
Are you using high-performance disk for the IBM Spectrum Protect database? | Ensure that the disks that are configured for the IBM
Spectrum Protect database have a minimum capability of 3,000 I/O
operations per second (IOPS). For each TB of data that is backed up daily (before data
deduplication), add 1,000 IOPS to this minimum. For example, an IBM
Spectrum Protect server that is ingesting 3 TB of data per day would
need 6,000 IOPS for the database disks:
|
Checklist for server database disks |
Are you using enough processor cores and memory for node replication and optionally, data deduplication? | If you are using node replication without
deduplication, use a minimum of 4 processor cores and 64 GB of RAM
for both the source and the target servers. For any server that is configured for node replication and data deduplication, use a minimum of 8 processor cores and 128 GB of RAM. |
|
Have you properly sized your disk space for the database, logs, and storage pools? | To determine whether your database can handle
the additional space requirements, you must first estimate how much
more database space node replication uses. For the active log, use a minimum size of 64 GB for node replication. Use the maximum allowed size for the active log, which is 128 GB if you are also using data deduplication. Make the archive log space at least as large as the space defined for the active log. Also, specify a directory for the archive failover log in case it is needed. |
Determining server database requirements for node replication (V7.1.1) |
Is your network capable of handling the additional traffic for the amount of data that you intend to replicate between source and target servers? | For node replication, network bandwidth
must be greater than the planned maximum throughput. You can estimate network bandwidth that is based on the amount of data that you are replicating. |
Estimating network bandwidth for node replication (V7.1.1) |
If your IBM Spectrum Protect server replicates nodes or protects storage pools to a remote server, did you determine whether Aspera® Fast Adaptive Secure Protocol (FASP®) technology can improve data throughput? | Restrictions:
|
See Determining whether Aspera FASP technology can optimize data transfer in your system environment. |
Are you using data deduplication with node replication? | By using data deduplication with node replication, you reduce the bandwidth that is required for replication operations. Data deduplication reduces the amount of data that is sent to the target of the replication operation. | Measuring effects of data deduplication on node replication processing (V7.1.1) |
Have you scheduled node replication in the optimum order for the daily schedule? | Ensure that you are running node replication after client backup. Complete data deduplication processing before replication processing. Schedule compression after replication. |
For more information, see the following topics: |
Have you optimized the number of sessions that are used for sending data to the target replication server? | You can improve replication performance
by using the MAXSESSIONS parameter on the REPLICATE
NODE command to specify data sessions. The number of sessions that are used for replication depends on the amount of data that you are replicating. |
Managing the number of replication sessions (V7.1.1) |
Do you have enough mount points to avoid stalled replication servers and other server processes? | Determine the number of logical and physical drives that can be dedicated to the replication process. For example, if a library has 10 tape drives and four of the drives are used for another task, there are six available drives for node replication. Specify the number of mount points you require and ensure that there are drives available to complete node replication. |
Typically tape is not used for node replication except for the initial replication. |
Do the node replication processes complete replicating all newly ingested data before the beginning of the next backup cycle? | If replication processes cannot finish before
the start of the next backup cycle, consider the following actions:
|
|
If you are using data deduplication with node replication, do the processes for identifying duplicates complete before the start of node replication processing so that data deduplication is used to its full advantage? | If the process completes, or goes into an idle state before node replication begins, then all new data is being processed. |