FINDKB: Scale Two Reflection Datasets
Authors: Keith Watenpaugh, Debra Holland, Stephan Freer & Jim
Stewart
Contact: Keith Watenpaugh, Physical and Analytical Chemistry,
The Upjohn Company, Kalamazoo, MI 49001, USA;
FINDKB provides two algorithms for determining the linear scale factor k and
exponential thermal scale factor ΔB, which will place the observed
structure factor data for two different derivatives on the same scale. The
first derivative dataset is usually that of the 'parent' or 'native' protein
while the second data set is that of a heavy atom isomorphous derivative. The
first algorithm is a least-squares process using all the data in one group. The
second algorithm uses the method of Singh and Ramaseshan (1966) in which a
Wilson-like plot of a function of FP, F of the parent, FD, average FD(+) and
FD(-) of the derivative against is used to obtain the scales k
and ΔB between dataset 1 and dataset 2.
Calculation Algorithms
The First Algorithm
If a sandr line is not entered, the simple least-squares fit
(ignoring the anomalous scattering signal) is used. The linear scale k, and the
exponential scale ΔB are calculated by least-squares. These place the
structure factor data of dataset 2 data on the same relative scale as dataset 1
data using the following expression:
F(1) = k exp(Δ) F(2)
The data may be specified either by dataset ID's or by lrrefl: item
numbers (see the BDF section at the back of the manual). Data for both sets
must be specified in the same way, i.e., ID's only or item numbers only. If
data are specified by dataset IDs, F = 0.5 *[F(+) + F(-)] will be used if both
F(+) and F(-) have been measured for a particular reflection, otherwise F is
set to the member of the (+)/(-) pair that was measured. If data are specified
by item number, this specifies the single quantity used for F (no averaging is
possible). A reflection is skipped if data for either data set were not
measured.
The Second Algorithm
Matthews (1966) and Singh and Ramaseshan (1966) have suggested a more elegant
method for determining k and ΔB for the scaling between the native, or
parent, dataset and the derivative dataset. In the following summary, these
definitions will be used:
average F of the Friedel related pair, FP(+) and FP(-), for an
(h,k,l) reflection.
average F of the Friedel related pair for the derivative
crystal.
k linear F(rel) scale factor to place the derivative F's on the proper scale
of the parent F's.
ΔB exponential scale factor to force the derivative F's to decay as a
function of s in the same way as the parent F's.
In this method, the scale factors and anomalous scattering contribution are
estimated from the equation
where
scale for the parent F
scale for the isomorph
ratio of the real to anomalous scattering signal
2|
parent structure factor amplitude
] / 2
ΔI []
The reflections are divided into small intervals of sinθ and the mean
values used to generate Wilson-like plots. Least-squares fit of the
intercept and its slope, as well as the scale factor k =
/ and ΔB are calculated.
Several passes through the scaling process my be necessary. A first pass using
the simple least-squares method may be used first to get an approximate fit
followed by the sandr method. Sometimes the very low order
reflections should by ignored by setting a minimum sinθ/λ
cutoff.
File Assignments
Reads reflection data from the input archive bdf
Optionally writes corrected data to the output archive bdf
Examples
FINDKB c2 c2pt bdf
In this typical case, the mean of F(+) and F(-) for data set C2PT will be
scaled to the mean of F(+) and F(-) for data set C2. An output bdf will be
produced.
FINDKB 2304 2314
F(-) will be scaled to F(+) for the second data set on the bdf. No output bdf
will be written.
FINDKB 4304 4314 squared
F(-)2 will be scaled to F(+)2 for the fourth data set on
the bdf. No output bdf will be written.
References
Matthews, B.W. 1966. The Determination of the Protein and Anomalously
Scattering Heavy Atom Groups in Protein Crystals. Acta Cryst. 20,
230-239.
Singh, A.K., and Ramaseshan, S. 1966. The Determination of Heavy Atom
Positions in Protein Derivatives. Acta Cryst. 21, 279-280.