http://server.ccl.net/cca/documents/molecular-modeling/node4.shtml |
![]() |
CCL node4 |
![]() ![]() ![]() Next: Molecular surface and volume Up: Molecular Modeling Previous: Computer representation of chemical
Computer representation of geometry
Molecules are not static entities. Even at absolute zero temperature
atoms in a molecule are actively vibrating. The molecular geometry
represented by a static picture on the computer screen or a Dreiding model
is therefore only an approximation. The term atom position is usually
understood as a position of the atom nucleus, or rather as some kind of average
position of the vibrating nucleus.
Luckily, the dimensions of an atom
nucleus are negligible compared to average bond lengths, and since
its mass is thousands of times larger than the mass of surrounding electrons,
the nucleus is the true center of gravity of an atom. The major conceptual
difficulty
is to decide what is an average position of a nucleus. Nuclear vibrations are
anharmonic, and hence,
the time average position of a nucleus is not located half
way between its extreme positions. Moreover, in molecules containing more
than two atoms, nuclei vibrate not only along chemical bonds but also
in directions perpendicular to them. That is why, depending on the method used
to interpret experimental results, slightly different values of bond
lengths and angles may be calculated.
Also, different
experimental methods measure different physical quantities. For example,
X-ray crystallography measures relations between ``electron clouds'' of
atoms,
while electron diffraction or neutron diffraction are based on scattering
from atomic nuclei.
Especially for hydrogen atoms, the nucleus is not located in the center
of the ``atomic cloud'' surrounding the proton. Bonds involving
hydrogen are substantially polarized and X-ray measurements will
underestimate them by as much as few tenths of an Ångström.
In other cases the differences are not as drastic, but one needs
to understand their origin in order to make the best use of experimentally
derived geometries. As you can see, ``molecular geometry'' may mean
different things depending upon the way in which it was derived or
measured.
Interatomic distances
are usually expressed in Ångströms, since distances between chemically
bonded atoms are of the order of 1 Å= The simplest way to specify molecular geometry to the computer is to list cartesian coordinates for each atom. In most cases the right-handed coordinate system is used, whose axes are perpendicular to each other (i.e., orthogonal), as represented in Fig. 6.6.
Cartesian coordinates are usually listed in 3-column format, X, Y, and
Z coordinates for each atom. Sometime the coordinates are listed
in natural crystal axes, called notional axes, which refer to the shape
and dimensions of the unit cell.
The notional axes are not generally perpendicular,
and the coordinates are
scaled by lengths of the unit cell edges. For the general case
of a triclinic system,
represented in Fig. 6.7, the edges of the unit cell
along oblique axes, x, y and z,
are a, b and c, respectively, and the interaxial angles:
NOTE: There was an error in the original text and the formula was given as:
Thanks to Egon Willighagen (egonw@sci.kun.nl) and
Geoff Hutchison (hutchisn@chem.northwestern.edu) it was corrected
on 2002.04.18.
Cartesian coordinates are an efficient representation of molecular geometry for the computer, and have the advantage of including actual spatial orientation of the molecule. However, they lack the chemical contents for chemists. Chemists prefer to specify and analyze molecular geometry in terms of internal coordinates, i.e., bond lengths and bond angles. The most popular internal coordinates are shown in Fig. 6.10, but before explaining how values of internal coordinates are calculated from cartesian coordinates of atoms, it is necessary to explain some of the simplest operations on vectors. The reader is encouraged to refer to the college calculus books for the review of vector analysis.
A scalar quantity is just a number, e.g., molecular weight.
A vector
The length of a vector is the distance between its beginning and its end.
It is always positive (or zero, if the beginning and the end of a vector
are in the same place). Formally, the vector
length v (frequently written also as
As with scalars (i.e., ordinary numbers), certain operations are defined for vectors. Adding two vectors means forming a new vector whose components are the sums of the respective components of the vectors being added:
Subtracting two vectors is analogous, only here the components are subtracted. You can multiply vector by a scalar by multiplying each of its components by the scalar:
Similarly, dividing a vector by a scalar results in a vector whose components are divided by this scalar, however, you obviously cannot divide by zero. Multiplying/dividing a vector by a scalar results in multiplying/dividing its length by this scalar, while preserving its direction. The unit vector is a vector whose length is equal to 1. You may obtain the unit vector from any vector by dividing it by its own length. Such an operation is called normalization of the vector and is usually denoted as:
Note that adding a scalar to a vector does not make any sense and is not among the defined operations. There are two different modes for multiplying a vector by another vector. The scalar product of two vectors, also called the dot product, results in a scalar. It is the product of the vector lengths multiplied by the value of the cosine of the angle between them. It can also be calculated as the sum of the products of corresponding components:
The dot product of two unit vectors is equal to the cosine of the angle
between them. Hence, the cosine of an angle formed by two vectors is
usually found by first normalizing the vectors and then calculating
their dot product(by summing up the products of their components). Note also
that the dot product does not depend upon the order in which the vectors are
multiplied (i.e.
The vector product of two vectors
Alternatively, the components of the resulting vector
where the
You may frequently see the term base vectors, written as:
and
Internal coordinates are efficiently calculated by
the computer
from cartesian coordinates using the vector operations described above.
The bond length ,
The valence angle called also bond
angle,
The valence angle is always positive and not larger than 180
The torsional angle (Fig. 6.10c), is a dihedral angle,
where
It is important to realize that the torsional angle is undefined if either atoms i, j and k, or j, k and l are collinear (i.e., a straight line is passing through three consecutive atoms) because an infinite number of planes can pass through three collinear points. For this reason, it does not make sense to talk about a torsional angle in acetylene.
An important property of torsional angles is that they do not depend
upon the end from which they are measured, i.e.,
To fully specify molecular geometry to the computer in cartesian coordinates for a molecule containing N atoms, 3N values must be entered (i.e., X, Y and Z for each atom). The 3N coordinates specify not only intramolecular distances and angles but also the orientation of the molecule in space. Internal coordinates, on the other hand, specify only intramolecular distances and angles and the spatial orientation of the molecule is usually assumed. The popular way of specyfying molecular geometry using internal coordinates is a Z-matrix convention (see Fig. 6.12). Each line of the Z-matrix, with the exception of the first 3 lines, has the following format:
(i),
![]() ![]() ![]() ![]()
where i is the number of the atom whose position is being defined.
Since atoms are numbered consecutively, this number is equal to the current
Z-matrix row number and this entry is
often used for some mnemonic symbol for an atom or simply omitted as
being redundant. The next entry, A Z-matrix requires only 3N-6 parameters (internal coordinates) for a full specification of molecular geometry (0 in first line, 1 in the second line, 2 in the third line, and 3(N-3) in the next N-3 lines). This is because the orientation of the molecule described by the Z-matrix is predefined, otherwise 6 additional parameters are needed to describe the orientation of a non-linear object in space (3 translational and 3 rotational degrees of freedom). In specifying internal geometry with a Z-matrix, dummy atoms are frequently used. Dummy atoms allow the specification of orientation other than the one dictated by the current convention for Z-matrix. They also must be used sometimes to account for collinear atoms in the molecule to avoid undefined torsional angles.
Cartesian coordinates are used as an input for many molecular modeling
software systems. The particular format depends on the system being used.
The most popular format used to describe the structure of a macromolecule
is a PDB
In most cases the hydrogen atoms are not listed in the PDB file, since they are usually below the resolution of X-ray crystallography. Molecular modeling systems find the approximate positions of hydrogen atoms based on positions of heavy atoms, and therefore, PDB files processed by the modeling software may have these atoms appended.
![]() ![]() ![]() Next: Molecular surface and volume Up: Molecular Modeling Previous: Computer representation of chemical Computational Chemistry Wed Dec 4 17:47:07 EST 1996 |
[ CCL Home Page ]
[ About CCL ]
[ Resources ]
[ Search CCL ]
[ Announcements ]
[ Links ]
[ E-mail us ]
[ Raw Version of this page ]
Modified: Fri Apr 19 01:31:27 2002 GMT |
Page accessed 12135 times since Wed Apr 21 00:19:21 1999 GMT |