CCL Home Page
Up Directory CCL dog.readme
................ SHORT DOC .............................................

DOG: Docking Geometrically two molecules.

Reference:

M. Petitjean,
Solving the Geometric Docking Problem for Planar and Spatial Sets,
Internet Electron. J. Mol. Des. 2002,1[4],185-192.
http://www.biochempress.com
 
Author email: petitjean@itodys.jussieu.fr

DOG reads the cartesian coordinates of two molecules, then translates
and rotates the molecule 2 for docking with the molecule 1.
A pairwise correspondence between two equal length ranges of atoms is
input.
When each of the two molecules is a single nucleotidic strand in
PDB/NDB format, the pairwise correspondence is defined from the
atom-pairs enabling potential hydrogen bonds.

The two input molecules should be concatenated into a single file
prior execution.


Input data and parameters:
-------------------------

INPUT  FORMAT:
  CAS : Reserved for internal purposes
  HIN : Hyperchem-type files
  MDL : Cambridge Crystallographic Model files
  ML2 : SYBYL Mol2 files
  PDB : Protein Data Bank or Nucleic Acid Data Bank files
        (only HEADER, ATOM, ENDMDL and END records are recognized)
  BIO : Biosym (MSI) files
  ISU : Reserved for internal purposes

INPUT  MOLEC FILE NAME: name of the input file containing both molecules

OUTPUT MOLEC FILE NAME: name of the output file containing the optimally
  translated and rotated molecule 2

IMOL1: sequential position number of molecule 1 in the input molecules file

IMOL2: sequential position number of molecule 2 in the input molecules file

INI1: first atom in molecule 1 to be paired (if the NUCLEO flag is set to N)
  or: first nucleotidic base to be analyzed (if the NUCLEO flag is set to Y)

INI2: first atom in molecule 2 to be paired (if the NUCLEO flag is set to N)
  or: first nucleotidic base to be analyzed (if the NUCLEO flag is set to Y)

LENGTH: number of input atom pairs or number of base pairs to be analyzed.
  Atoms INI1 to INI1-1+LENGTH are paired with atoms INI2 to INI2-1+LENGTH,
  or,
  bases INI1 to INI1-1+LENGTH are paired with bases INI2 to INI2-1+LENGTH
  for atom labeling analysis.

NUCLEO:
 Entering Y activates the automatic detection of potential hydrogen bonds
            from atom labeling analysis, provided that the format is PDB.
            3 atom pairs are included in the pairwise correspondence
            for the CG or GC base pairs, and 2 atom pairs are included
            for the AT, TA, AU or UA base pairs.
            When the K-th base pair {INI1-1+K, INI2-1+K} is mismatched,
            no atom-pair is included in the pairwise atomic correspondence.
 Entering N desactivates the option.
            The default option is Y when the selected format is PDB.
            The option is set to N when a format other than PDB is selected.


Output results:
--------------

The variance of the population of the squared distances defined by
the input pairs, and the optimal translation and rotation.
(translation is done prior rotation).

The coordinates of the optimally docked atom-pairs.

The new coordinates of the optimally translated and rotated molecule 2
are written in a separate file.

Remarks:
-------

The number of atoms is currently limited to 15000 for each molecule.
The source has to be recompiled to read larger molecules.

Connectivity and charges are not used.
No relevant initial orientation is needed before docking.

The geometric docking algorithm looks like the RMS algorithm used
for spatial alignments, except that the minimized quantity is the
variance of the squared distances rather than the mean.

When less than 7 atom-pairs are read, the problem is underdetermined,
and numerical instabilities appear.

The generated file containing the output moved molecule 2 is empty for
CAS, MDL and BIO formats, and the message "EERCO2 = 1" is displayed.

................ END SHORT DOC .........................................
Modified: Sat Jan 4 06:14:14 2003 GMT
Page accessed 7070 times since Wed Jan 15 08:50:15 2003 GMT