PROTIN: Set Constraints for Refinement of Parameters
disulb1 identify disulphide bridges
firstp1 identify atoms with poulation
parameters to be refined
spcpep1 identify the C and N terminus groups
for each chain
altchn1 identify alternate chains
restrm2,3 identify and idealize terminal and
main residues
reslnk3 identify and idealize linkage
residues
residu3 identify and idealize standard chain
residues and other groups
reside3 identify and idealize the last
standard residue at ends of chains
distrm4 characterize terminal chain residue
interatomic distances
dislnk4 characterize linking chain residue
interatomic distances
disres4 characterize normal chain residue
interatomic distances
plares5 identify planar groups in standard
residues
plapro5 identify multi-planar prosthetic
planar groups
platrm5 identify terminal planar groups
plalnk5 set up the four linking planar
groups
plalnl5 specify restraining pairs for linking
planar groups
chiral6 characterize ideal chiral centres
vdwdis7 set van der Waals diameters by atomic
species
vdwtrm7 number of non-bonded contacts for
specified terminal residue
vdwlnk7 number of non-bonded contacts for
specified linking residue
vdwres7 number of non-bonded contacts for
specified residue
vdwcon7 van der Waals contact codes for
vdwtrm, vdwlnk, or vdwres
tortrm8 identify atoms defining ideal torsion
angles in terminal residues
neighb8 identify neighbours in torsion
groups
chidis8 set neighbour distance
identification
torres8 identify atoms defining ideal torsion
angles in chain residues
chiwgt8 set weighting codes for torsion
angles
secstr8 characterize secondary structure
features by ψ and φ values
elltrm9 set ellipsoid specifications for
terminal groups
ellres9 set ellipsoid specifications for main
chain groups
interd10,11 set up blocks of inter chain
special distances
intrad11 set up blocks of intra chain special
distances
spcdis11 set up special inter-group
distances
spcpla12 specify special planar groups
excon13 specify distance identifiers for
contact exclusion
secsel14 specify elements of secondary
structure for backbone
spcsym15 identify chains related by
non-crystallographic symmetry
symop15 rotation and translation of a
non-crystallographic symmetry
symwgt15 set non-crystallographic symmetry
restraint weighting specs
1 Group 1 input lines set up standard groups dictionary.
2 Note that, if at this point all the lines from
restrm through ellres are left out the values
will be taken from the prores file. This is the default file of
idealized groups. A prores file is supplied with the Xtal system which
contains all the usual groups encountered in protein structures.
3 Group 2 input lines set up ideal coordinates of residues.
4 Group 3 input lines set up distance restraints.
5 Group 4 input lines set up plane restraints.
6 Group 5 input lines set up chiral centre restraints.
7 Group 6 input lines set up contact restraints.
8 Group 7 input lines set up torsion angle restraints.
9 Group 8 input lines set up anisotropic thermal restraints.
10 Note that the following lines are optional. All references to
chain identities and user residue numbers are in terms of the names and numbers
supplied through the PROATM program. These names and numbers are translated to
pointers for use in the calculations. Some error messages may display these
pointers. In this case the chain pointers will be 1, 2, ... In order of chain
identification encountered in the atom or proat
lines loaded in PROATM. The residue pointers will be the sequence number for
the residue in order of loading by PROATM.
11 Group 9 input lines set up special distances restraints.
12 Group 10 input lines set up special plane restraints.
13 Group 11 input lines set up contact exclusion restraints.
14 Group 12 input lines set up backbone torsion angle
restraints.
15 Group 13 input lines set up non-crystallographic symmetry
restraints.
PROTIN Option Code
print atom coordinates atl no
neutron data neu no
number of unit cells used in search cel n 3
number of main chains in a plane nap n 5
limit for van der Waals distances vdw d
5
expect hydrogen atoms hat no
expect individual aniso t.f. ani no
atl
dataset number dset n 1
disulb 1 chain identifier of CYS residue on one side
of the S-S bridge
2 residue sequence number on that side
3 chain identifier of CYS residue on other side of the S-S
bridge
4 residue sequence number on the other side
firstp 1 name of the first atom, in the notation of PROATM on
loading, to have the population parameter refined. This atom and all atoms to
the end of the atoms loaded will have their population parameters refined in
the subsequent PROLSQ run
spcpep 1 chain identifier symbol e.g. A2
2 number of cis peptide bonds in chain
3 residue sequence number of cis peptide
altchn 1 primary chain identifier for mothers of alternate
chains
2 secondary chain identifier for mothers of alternate chains
p s
if there is 1 chain and 1 alternate chain: 1 0
if there is 1 chain and 2 alternate chains: 1 1
if there are 2 chains and each has 1 alternate chain: 1 2
if there are 2 chains and only chain 1 has 1 alternate chain: 1 0
if there are 2 chains and only chain 2 has 1 alternate chain: 2 0
Alternate conformations of discretely disordered residues are assigned
during atom loading a different chain number from the mother chain for a given
residue. Only the atoms which are modelled in different positions should be
included among the input atoms.
restrm 1 group name for residue e.g. NAMINO ;
ASP ; HIS
2 name of atom in the group e.g. CA for α carbon
3 x Cartesian coordinate in Ĺ relative to Cα as origin in each
group
4 y Cartesian coordinate
5 z cartesian coordinate
6 order number of atom within the group, e.g. 1 for N, 2 for Cα, etc.
For all standard groups, save glycine, this number starts at 5 because the
coordinates of restrm main are automatically
taken for the first four atoms of every group. This number may also be negative
in reslnk lines to designate atoms belonging to a previous
residue
7 single letter code which is to be used as a synonym for the group name e.g.
G for GLY. Used only for standard groups
reslnk fields as for restrm
residu fields as for restrm
reside fields as for restrm
distrm 1 group name as defined in res*** input
lines e.g. NAMINO
2 order number of origin atom for corresponding distance
3 order number of terminus atom for corresponding distance
4 code to define weight that is to be used to restrain distance
1 bonded pair 2 angle pair
3 planar 1-4 distance 4 XH bond
distance
5 Y-X-H angle distance 6 H-X-H angle
distance
5 code to define weight that is to be used to restrain B values
0 any distance for which B restraint is not required
1 bond distance between two main chain atoms
2 angle distance between two main chain atoms
3 bond distance involving at least one side chain atom
4 angle distance involving at least one side chain atom
5 bond distance involving a hydrogen
6 angle distance involving a hydrogen
Fields 6-9, 10-13, etc. are as for fields 2-5. Note that lines may be
continued by placing a "c" as the last field on a line. The next line
must then have the same ID for both line type and group name. Fields continue
in groups of four to complete the required restraints.
dislnk fields as for distrm
disres fields as for distrm
plares 1 group name
2- the order number of the atoms of the group which are to be restrained to
planarity. If a group has no planes fields 2- are left blank
plapro fields as for plares
platrm fields as for plares
plalnk fields as for plares
plalnl 1- codes specifing all the possible "restraining pairs"
among the atoms specified in the reslnk input lines
Note that only the number of atoms specified to be in planes and given in
the PROTIN line under directive nap will be
considered to restrain the plane.
chiral 1 group name
2 descriptor code for how the chirality of the group is defined
1 for groups intrinsically chiral
0 for chirality related to nomenclature, e.g. LEU
VAL
vdwdis 1 name of atom species e.g. HYDROGEN (limit 12
characters)
2 atomic number of species or atomic scattering factor pointer (carbonyl is
arbitrarily set to 3)
3 van der Waals diameter of the species or distance between centres of two
non-bonded atoms of the species
vdwtrm 1 group name
2 number of non-bonded contacts for group
vdwres fields as for vdwtrm
vdwlnk 1 group name
2 number of non-bonded contacts for group
3 number of non-bonded contacts specified for prolyl link
vdwcon 1 group name
2 order number of origin atom corresponding to a possible non-bonded
contact
3 order number of terminus atom for corresponding possible non-bonded
contact
4 non-bonded distance restraint class:
1 relative position of the given atoms is determined by
only one torsion angle
2 relative position of the given atoms is determined by two
or more torsion angles
These lines follow their corresponding vdwres,
vdwtrm, or vdwlnk lines. Fields 5-7, 8-10, etc.
are as fields 3-4, continuing onto additional lines as required to satisfy the
counts given in the preceding vdwres, vdwtrm,
or vdwlnk lines.
tortrm 1 group
2 atom order number for C(i-1)
3 atom order number for N(i)
4 atom order number for Cα(i)
5 atom order number for C(i)
6 atom order number for N(i+1)
7 atom order number for Cα(i+1)
Fields after 7 are ignored in tortrm lines
8 atom order number for Cβ(i)
9 atom order number for Cγ(i)
10 atom order number for Cδ(i)
11 atom order number for C∊(i)
12 atom order number for Cζ(i)
13 atom order number for N(i)
C(i-1)-N(i)-Cα(i)-C(i) specifies φ
N(i)-Cα(i)-C(i)-N(i+1) specifies ψ
Cα(i)-C(i)-N(i+1)-Cα(i+1) specifies ω
N(i)-Cα(i)-Cβ(i)-Cγ(i) specifies χ(1)
Cα(i)-Cβ(i)-Cγ(i)-Cδ(i) specifies χ(2)
torres fields as for tortrm
neighb 1 group
2- neighbour identification codes
-1 for atoms from residue i-1
0 for atoms from residue i
1 for atoms from residue i+1
5 for atoms from terminal groups e.g. OT of carboxyl
terminus
chidis 1 group
2 distance identification codes (as specified in field 4 of the
distrm line)
chiwgt 1 group
2- weighting code for side chain chi angles
0 for no specifications
2 for planar e.g. χ(5) of ARG
3 for staggered e.g. aliphatics
4 for orthonormal e.g. χ(2) of aromatics
secstr 1 name for element of structure up to 20 characters
maximum
2 code for secondary structure element
1 α helix
2 3(10) helix
3 π helix
4 collagen type helix
5 parallel β sheet
6 anti-parallel β sheet
7 classic β bulge position 1
8 classic β bulge position 2
9 β bend [or turn] type I position 2
10 β bend [or turn] type I position 3
11 β bend [or turn] type II position 2
12 β bend [or turn] type II position 3
13 γ turn position 1
14 γ turn position 2
15 γ turn position 3
3 characteristic φ value
4 characteristic ψ value
5 literature reference of up to 40 characters
elltrm 1 group name
2 order number for atom for which ellipsoid is being specified
3 order number for atom p(0)
4 order number for atom p(1)
5 order number for atom s(0)
6 order number for atom s(1)
Note that negative numbers are used to point to atoms in preceding
residues.
Fields 7-11, 12-16, etc. for additional ellipsoid atoms; new lines to
continue.
ellres fields as for elltrm
intrad 1 origin chain identifier
interd 1 origin chain identifier
2 target chain identifier
Note: There must be spcdis lines following each
intrad or interd line supplied. The number
required will depend upon the number of special inter-group distances for this
chain block to be restrained. e.g. Sγ - Sγ from two different CYS
groups that form a bridge; distance from solvent to protein; etc. One line per
pair of atoms for which distance is to be restrained.
spcdis 1 residue sequence number of group to which origin atom
belongs
2 order number of origin atom within group
3 residue sequence number of group to which target atom belongs
4 order number of target atom
5 restrain atoms to this distance (in Ĺ)
6 code to determine weight that should be used to restrain distances. Same as
those shown for disres lines
7 code to determine weight that should be used to restrain temperature
factors. Same as those shown for disres lines
spcpla 1 chain identifier of first atom in special plane
2 residue sequence number of first atom in special plane
3 order number of first atom in special plane
4-6 the same quantities as in 1-3 for the second,
7-9 third, ... atoms which define the special plane
etc
excon 1 chain identifier of first atom for which contacts are
to be excluded
2 residue sequence number of first atom
3 order number of first atom
4-6 same as in 1-3 for the second atom which defines the contact exclusion
secsel 1 chain identity; all chains of this type will be set
2 residue sequence number of the initial group for this region of structure
3 residue sequence number of the final group for this region of structure
4 code for structure type
-1 restrain φ & ψ values to those of initial
structure
0 no restraints on φ & ψ
n restrain φ & ψ values specified in
secstr line for this n
spcsym 1-n chain identities in this special
non-crystallographic symmetry group
Each spcsym line must be followed by one or more
symop lines.
symop 1-12
symwgt 1 initial residue sequence number to which the
symop applies
2 final residue sequence number to which the symop applies
3 weighting code
implication for
Code main chain side chain
1 1 1
2 1 2
3 1 3
4 2 2
5 2 3
6 3 3
where 1=tight; 2=medium; 3=loose
4-6 as in 1-3 for all group spans specified. etc