From chemistry-request@server.ccl.net Thu May  3 08:45:38 2001
Received: from gate.syntem-sa.fr ([146.19.248.65])
	by server.ccl.net (8.11.0/8.11.0) with ESMTP id f43CjbI06794
	for <chemistry@ccl.net>; Thu, 3 May 2001 08:45:37 -0400
Received: (from uucp@localhost) by gate.syntem-sa.fr (980427.SGI.8.8.8/980728.SGI.AUTOCF) id OAA22106 for <@firewall.syntem-sa.fr:chemistry@ccl.net>; Thu, 3 May 2001 14:44:30 +0200 (MDT)
Received: from syntem-sa.fr(10.0.0.2) by gate.syntem-sa.fr via smap (4.1)
	id xma022064; Thu, 3 May 01 14:44:17 +0200
Received: from victoria.syntem-sa.fr (victoria.syntem-sa.fr [10.0.0.44]) by laetitia.syntem-sa.fr (980427.SGI.8.8.8/980728.SGI.AUTOCF) via ESMTP id OAA03258 for <@laetitia.syntem-sa.fr:chemistry@ccl.net>; Thu, 3 May 2001 14:41:35 +0200 (MDT)
Received: from syntem-sa.fr (localhost [127.0.0.1]) by victoria.syntem-sa.fr (980427.SGI.8.8.8/980728.SGI.AUTOCF) via ESMTP id OAA13918 for <chemistry@ccl.net>; Thu, 3 May 2001 14:47:42 +0200 (MDT)
Sender: dturner@syntem-sa.fr
Message-ID: <3AF1536D.7A50E80A@syntem-sa.fr>
Date: Thu, 03 May 2001 14:47:41 +0200
From: David Turner <dturner@syntem-sa.fr>
Reply-To: dturner@syntem.com
Organization: Computational Drug Design, Syntem
X-Mailer: Mozilla 4.7C-SGI [en] (X11; I; IRIX 6.5 IP32)
X-Accept-Language: en
MIME-Version: 1.0
To: "'chemistry@ccl.net'" <chemistry@ccl.net>
Subject: Re: CCL:Critical T scores
References: <Pine.SGI.4.21.0105021459570.59346-100000@gandalf.cber.nih.gov>
Content-Type: text/plain; charset=us-ascii
Content-Transfer-Encoding: 7bit


Thanks to all who replied. I hadn't realised there is a simple
formula for determining critical points so there is no need for
look-up tables etc. I wanted this to use within some C code I
will be writing for various dofs and probability levels so
packages (Excel etc) were not an option ...

David

Wayne Steinmetz wrote:
>
> I am on sabbatical leave and do not have access to my files which are
> locked up.  Years ago I found an algorithm for calculating the
> Students t in Nature.  The algorithm is quite accurate when the
> number of degrees of freedom is greater than 2.
>
> Here is the algorithm at the 95% confidence level:
>
>  t = 1.95996 + 1/(-0.423+0.413DF)
>
>  where DF = the number of degrees of freedom =
>         = (number of data points) - (number of parameters)
>
> The two cases where the algorithm does not work are handled by a
> simple, short lookup table, i.e. t(1) = 12.706, t(2) = 4.303         

===========================================================================
Rick Venable wrote:
> 
> On Wed, 2 May 2001, David Turner wrote:
> > Does anyone know where I can get a table of critical student's
> > t-scores in electronic format?
> 
> I have a set of Fortran routines which can do both the F test (for the
> equivalence of 2 variances) and the T test (equivalence of 2 means).
> 
> =+=+=+=+=+=+=+=+=+=+=+=+=+=+=+=+=+=+=
> Rick Venable
> FDA/CBER/OVRR Biophysics Lab
> 1401 Rockville Pike    HFM-419
> Rockville, MD  20852-1448  U.S.A.
> (301) 496-1905   Rick_Venable@nih.gov
> ALT email:  rvenable@speakeasy.org
> =+=+=+=+=+=+=+=+=+=+=+=+=+=+=+=+=+=+=
> 
> -= This is automatically added to each message by mailing script =-
> CHEMISTRY@ccl.net -- To Everybody  | CHEMISTRY-REQUEST@ccl.net -- To Admins
> MAILSERV@ccl.net -- HELP CHEMISTRY or HELP SEARCH
> CHEMISTRY-SEARCH@ccl.net -- archive search    |    Gopher: gopher.ccl.net 70
> Ftp: ftp.ccl.net  |  WWW: http://www.ccl.net/chemistry/   | Jan: jkl@ccl.net

-- 

________________________________________________________________

David Turner                                             Synt:em
Senior Modeling Scientist              Parc Scientifique G.Besse
Computational Drug Discovery                         30000 Nimes
email: dturner@syntem.com                                 France

Tel: 				       
Switchboard: +33 (0)4 66 04 86 66      Fax: +33 (0)4 66 04 86 67
Direct Line: +33 (0)4 66 04 22 85
________________________________________________________________

          Discover New Drugs, Discover Synt:em
                 http://www.syntem.com
________________________________________________________________

From chemistry-request@server.ccl.net Thu May  3 05:15:11 2001
Received: from chaba.itq.upv.es ([158.42.92.184])
	by server.ccl.net (8.11.0/8.11.0) with SMTP id f439FAI04285
	for <CHEMISTRY@ccl.net>; Thu, 3 May 2001 05:15:10 -0400
Received: from localhost (german@localhost) by chaba.itq.upv.es (950413.SGI.8.6.12/950213.SGI.AUTOCF) via SMTP id LAA00223; Thu, 3 May 2001 11:12:47 -0700
Date: Thu, 3 May 2001 11:12:47 -0700 (PDT)
From: "German I. Sastre Navarro" <german@itq.upv.es>
Reply-To: german <gsastre@itq.upv.es>
To: "Martin D. Foster" <martinf@ri.ac.uk>
cc: CHEMISTRY@ccl.net
Subject: Re: CCL:Calculating the Vertex Symbols
In-Reply-To: <Pine.SGI.3.96.1010501111421.9897401A-100000@faraday>
Message-ID: <Pine.SGI.3.96.1010503110430.204A-100000@chaba.itq.upv.es>
MIME-Version: 1.0
Content-Type: TEXT/PLAIN; charset=US-ASCII

  Dear Martin,

On Tue, 1 May 2001, Martin D. Foster wrote:

> could anyone point me to a piece of code that calculates the vertex
> symbols of a crystal structure?
> 
> TIA
> 
> cheers, martin.
> 
> --------------------------------------------------------------------------
> Martin D. Foster       
> Royal Institution of Great Britain       	Phone: +44 (0)207-409-2992
> Registered Charity Number 227938		Fax:   +44 (0)207-629-3569
> Davy Faraday Research Laboratory		email: martinf@ri.ac.uk
> 21 Albemarle Street
> LONDON W1S 4BS, UK
> __________________________________________________________________________
> 

  The fortran code "zeoTsites" can make it for three-dimensional
  four-connected nets. It also has some capabilities for higher
  connectivities but has not been properly tested.
  The code is free for academic and can be easily obtained from
  the authors. There is a recent publication which shows the program
  features:
                        
   journal title: Microporous and Mesoporous Materials
   article title: ZeoTsites: a code for topological and
                  crystallographic tetrahedral sites
                  analysis in zeolites and zeotypes
         authors: G. Sastre and J.D. Gale
            year: 2001
    volume/issue: 43/1
           pages: 27-40


   Hope this helps

   Regards
   German
                                              \|/
                                             (o o)
------------------------------------------oOO-(_)-OOo----------------
 German Sastre                            http://www.upv.es/~gsastre
 Instituto de Tecnologia Quimica         e-mail:  gsastre@itq.upv.es
 Universidad Politecnica de Valencia          Phone: +34-96-387-7803
 Av. Los Naranjos s/n, 46022 Valencia (Spain)  Fax:  +34-96-387-7809
---------------------------------------------------------------------
                                            oo0 0oo



From chemistry-request@server.ccl.net Thu May  3 05:23:09 2001
Received: from mdl.ipc.pku.edu.cn (IDENT:postfix@[162.105.177.40])
	by server.ccl.net (8.11.0/8.11.0) with ESMTP id f439N7I04355
	for <CHEMISTRY@server.ccl.net>; Thu, 3 May 2001 05:23:08 -0400
Received: by mdl.ipc.pku.edu.cn (Postfix, from userid 3200)
	id 5025326845; Fri,  4 May 2001 09:23:19 +0800 (CST)
Received: from localhost (localhost [127.0.0.1])
	by mdl.ipc.pku.edu.cn (Postfix) with ESMTP id 46EF826445
	for <CHEMISTRY@www.ccl.net>; Fri,  4 May 2001 09:23:19 +0800 (CST)
Date: Fri, 4 May 2001 09:23:19 +0800 (CST)
From: Fenglou Mao <mao@linux2.ipc.pku.edu.cn>
To: "CHEMISTRY@www.ccl.net" <CHEMISTRY@server.ccl.net>
Subject: Database summary?
Message-ID: <Pine.LNX.4.21.0105040921260.7376-100000@mdl.ipc.pku.edu.cn>
MIME-Version: 1.0
Content-Type: TEXT/PLAIN; charset=US-ASCII

Dear cclers,

do anyone know a website or a paper summarized current
biological database?

-- 
Sincerely Yours,

FengLou Mao
*******************************
ADD:Mr. FengLou Mao
    Institute of Physical Chemistry
    Peking University
    BeiJing
    P.R.China
Tel:86-10-62756833
Fax:86-10-62751725
E-mail:mao@mdl.ipc.pku.edu.cn
Homepage:http://mdl.ipc.pku.edu.cn/~mao



From chemistry-request@server.ccl.net Thu May  3 14:13:29 2001
Received: from femail2.sdc1.sfba.home.com ([24.0.95.82])
	by server.ccl.net (8.11.0/8.11.0) with ESMTP id f43IDSI10329
	for <chemistry@ccl.net>; Thu, 3 May 2001 14:13:29 -0400
Received: from [24.11.177.239] by femail2.sdc1.sfba.home.com
          (InterMail vM.4.01.03.20 201-229-121-120-20010223) with ESMTP
          id <20010503181254.KTJM28160.femail2.sdc1.sfba.home.com@[24.11.177.239]>;
          Thu, 3 May 2001 11:12:54 -0700
Mime-Version: 1.0
X-Sender: arichon@awod.com
Message-Id: <a05010402b7174fceeacf@[24.11.177.239]>
Date: Thu, 3 May 2001 14:12:21 -0400
To: chemistry@ccl.net
From: Network Science <netsci@awod.com>
Subject: Mergers and Alliances within Computational Chemistry
Content-Type: text/plain; charset="us-ascii" ; format="flowed"

An updated version of Mergers and Alliances within Computational 
Chemistry has been posted on NetSci 
(http://www.netsci.org/Science/Compchem/feature17.html).

If anyone subscribing to the list has copies of the missing logos 
(BioCad, Intelligenetics, etc.) and can forward them in jpeg or gif 
format to editors@netsci.org, we will add them to the article.


From chemistry-request@server.ccl.net Thu May  3 14:19:55 2001
Received: from mailrelay1.lrz-muenchen.de ([129.187.254.101])
	by server.ccl.net (8.11.0/8.11.0) with ESMTP id f43IJtI10402
	for <chemistry@ccl.net>; Thu, 3 May 2001 14:19:55 -0400
Received: from [129.187.26.53] by mailout.lrz-muenchen.de with ESMTP; Thu, 3 May 2001 20:19:52 +0200
Sender: Eugene.Leitl@lrz.uni-muenchen.de
Message-Id: <3AF1BCB5.CEE31FDC@lrz.uni-muenchen.de>
Date: Thu, 03 May 2001 22:16:53 +0200
From: Eugene.Leitl@lrz.uni-muenchen.de
X-Mailer: Mozilla 4.75 [en] (X11; U; Linux 2.2.17-21mdksecure i586)
X-Accept-Language: en, de-DE, ru
MIME-Version: 1.0
To: chemistry@ccl.net
Subject: [Fwd: Re: Athlon SDR/DDR stats for *specific* gaussian98 jobs]
Content-Type: text/plain; charset=us-ascii
Content-Transfer-Encoding: 7bit



-------- Original Message --------
From: Velocet <math@velocet.ca>
Subject: Re: Athlon SDR/DDR stats for *specific* gaussian98 jobs
To: "Robert G. Brown" <rgb@phy.duke.edu>
CC: beowulf@beowulf.org

On Wed, May 02, 2001 at 06:16:51PM -0400, Robert G. Brown's all...

> IIRC, somebody on the list (Josip Loncaric?) inserted prefetching into
> at least parts of ATLAS for use with athlons back when they were first
> released.  It apparently made a quite significant difference in
> performance.

Oh, btw, I did include Athlon3DNow!2 stuff in my ATLAS just for a laugh.
I then compared my results to the original non ATLAS g98 and everything
matched up for all 4 jobs I chose. I dont think its actually used
for G98 since its single precision stuff only IIRC. 

Also, note, we may start using MPQC in a bit - Graydon Hoare (ex Berlin
project, now at Redhat) is hacking on it to speed it up a bunch if
possible. I havent had a chance to benchmark that (it may demand
TB1333's on DDR boards and Ill be sunk! :)

In fact my stats indicated that a Duron 750 was slightly better for the money
in most cases, but the 900 and 700 are so close, I went with the faster chip
in the hopes that other situations that I cant benchmark now but will
encounter later will favour the faster CPU (also the supply of D700s is
waning).

> in the marginal area where it makes sense to go single.  It may not be
> very easy to tell which >>one<< is truly cost optimal, though, without
> benchmarking your particular code and doing a careful cost comparison
> including hidden costs (e.g. the fact that electricity costs and space
> costs may be 60% higher for lots of singles, each single requires a
> case, its own memory and copy of the OS and network card, and so forth).
> In many cases a dual is only 0.7-0.8 the cost of two singles, although
> the high cost of DDR makes that unlikely in this case.  Still, it is
> under $1/MB, which isn't all THAT bad -- PC133 cost that much only
> months ago.  Months from now DDR may cost little more than SDRAM in
> equivalent amounts.

I started forming my anti SMP bias when I ran a bunch of old G94 jobs
on a dual Celeron board. It *SUCKED* :) 100 Mhz ram was probably
the bottleneck, and the resource locking in Linux 2.2.(early) was not
as nice as it is now in 2.4 (so I hear). I have stats for it actually:

two jobs
""""""""
(Mhz ratio = 1.00 for 550Mhz, times for 2 jobs to be run)

#
C                                                 
P                                         speed    MHZ  efficiency
U CPU           MHz/bus RAM       total   ratio   ratio (mhz/speed ratios)
------------------------------------------------------------------------------
2 C366A         550/100 128M      4293.6  1.00    2.00	   0.50
1 P2 400        400/100 128M      6105.3  0.70    0.73	   0.96
1 P3 450        450/100 256M      5675.0  0.76    0.83	   0.92
1 C300A         450/100 64M       7253.5  0.59    0.83	   0.71

For g94 I saw almost the exact same performance out of a C450 as
a P3-450 (I guess the CPU's SSE extensions were not used by it).
So seeing that my heavily overclocked 550MHz Celeron's were only some
25% faster overall for 2 CPUs vs 1 at 450, thats pretty bad. :)
I think that this is the software fighting with the locking and the
shared bus (Abit BP6 IIRC was the board, which probably didnt have
the fastest architecture either. IIRC, dual celeron was a bad hack).
(As well, the stats above are not that fair as I ran a single job
instead of the 4 different types Im using now as my base set, not to
mention its G94 (outdated) and non-ATLAS.)

So its probably not optimal to even consider stats for SMP from a dual
Celerons considering they're so not designed for it. :) Nonetheless,
my benchmarking of various jobs showed me that memory bandwidth is really
important for most gaussian jobs, and you'd really need a speedy memory
bus to keep up with this. DDR will probably really help SMP for this
kind of thing.

> Sounds like you are dead right on all of this.  Embarrassingly parallel
> jobs, few to no communications, purely CPU bound -- Durons (or whatever
> currently delivers the most raw flops for the least money) are likely to
> be perfect for you.  And for many others, actually.  For a long time I
> like Celerons (or even dual Celerons) for the same reasons, although at
> this point I've converted to AMD-based systems as their cost-benefit has
> overwhelmed Intel's whole product line for my code.

Its interesting however, to note that the improvements per cycle especially
for DDR boards.  I actually removed this stat from my charts, but I did
compare all the Athlons to a Celeron 450 G89-atlas setup. 

These values are MUCH better than the non atlas values - ie atlas
really improves the efficiency of the jobs, especially for newer CPUs -
I assume by making better use of large L1/L2 caches and filling the longer
pipelines more optimally.

efficiency/cycle, atlas g98:
(each column is normalized to 1.00 for C450 w.r.t that column's data)

		   job 1	   job 2	   job 3	   job 4
			      |		      |		      |
		non	atlas |	non	atlas |	non	atlas |	non	atlas
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~|~~~~~~~~~~~~~~~|~~~~~~~~~~~~~~~|~~~~~~~~~~~~~~
C450		1.00	1.00  |	1.00	1.00  |	1.00	1.00  |	1.00	1.00
A700			1.05  |	0.88	1.05  |	1.26	1.31  |	1.34	1.39
D750		0.86	1.30  |	0.70	0.77  |	1.05	1.10  |	1.25	1.24
T900		0.89	1.31  |	0.81	0.86  |	1.05	1.08  |	1.23	1.20
T1200DDR		1.08  |		1.04  |		1.30  |		1.42

I think we see these patterns because without optimization for caches
and pipeline, the disparate speeds between the CPU and RAM for the non
DDR machines is very large (up to 4 or 5 CPU cycles/ram cycle). The cache
helps a fair bit (the Tbird and Athlon fare much better than the duron),
but when we get ATLAS involved, the improvements are quite noticeable
over the baseline C450.
 
> Not yet, but maybe soon.  The fast Tbirds do require a big "certified"
> power supply, but I'm guessing they draw a lot less than they "require"
> except maybe in bursts.  I'm betting they draw around 100-150W running,
> a number that recently got some support on the list.

Bursts of CPU usage? Arent all our clusters all hammering our CPUs as much as
possible? And if ATLAS is really doing its job, arent we hammering all parts
of the CPU as much as possible? :)

> Not a preset package, but I'm trying to start a collection of sorts:
> 
>   http://www.phy.duke.edu/brahma/dual_athlon/tests.html

Will check it out.

> Hope this all helps or is interesting.  I'm very interested in Athlon
> performance profiles as they seem to be the current
> most-CPU-for-the-least-money winners, and when one buys in bulk (as
> beowulf humans tend to do) this sort of optimization really matters.


What kind of deals can you get in bulk? From AMD themselves? do you
need to be a big university and have a big press release event to
get these deals from them? How many do you need to get a batch deal?
How deep is the discount?

/kc

> 
>    rgb
> 
> -- 
> Robert G. Brown	                       http://www.phy.duke.edu/~rgb/
> Duke University Dept. of Physics, Box 90305
> Durham, N.C. 27708-0305
> Phone: 1-919-660-2567  Fax: 919-660-2525     email:rgb@phy.duke.edu
> 
> 
> 

-- 
Ken Chase, math@velocet.ca  *  Velocet Communications Inc.  *  Toronto, CANADA 

_______________________________________________
Beowulf mailing list, Beowulf@beowulf.org
To change your subscription (digest mode or unsubscribe) visit
http://www.beowulf.org/mailman/listinfo/beowulf


From chemistry-request@server.ccl.net Thu May  3 18:37:12 2001
Received: from kafka.net.nih.gov ([165.112.130.10])
	by server.ccl.net (8.11.0/8.11.0) with ESMTP id f43MbBI13448
	for <chemistry@ccl.net>; Thu, 3 May 2001 18:37:11 -0400
Received: from I.T.S.ME ([129.43.27.170])
	by kafka.net.nih.gov (8.11.3/8.10.1) with ESMTP id f43MZV900501;
	Thu, 3 May 2001 18:35:31 -0400 (EDT)
Date: Thu, 3 May 2001 18:29:09 -0400 (Eastern Daylight Time)
From: "M. Nicklaus" <mn1@helix.nih.gov>
To: chemistry@ccl.net
cc: mn1@helix.nih.gov
Subject: CCL:[Fwd: Athlon SDR/DDR stats for *specific* gaussian98 jobs]
In-Reply-To: <3AF1BCB5.CEE31FDC@lrz.uni-muenchen.de>
Message-ID: <Pine.WNT.4.21.0105031817090.-7955335-100000@lmchcaddpc1.nci.nih.gov>
X-X-Sender: mn1@helix.nih.gov
MIME-Version: 1.0
Content-Type: TEXT/PLAIN; charset=US-ASCII

For what it's worth:  Here's a recent small series of benchmarks we've run
on various Linux systems, plus a Cray and SGI Origin added for comparison.

Program:        Gaussian 98 Rev. A.7.  All executables exactly identical
                for the Linux systems (copied between machines), compiled
                w/ PGI v. 3.2 (on Linux), G.98 standard compilation (no
                tuning/hacking).

Jobs run:       G.98 test jobs # 1, 28, 94, 155, 194, 296, 302, aggregate
                time, as reported in output, all runs single-CPU.


CPU   Speed  Chipset  Kernel   Distro   glibc    Memory    HD/Contr.  time
                                               min. 256MB  MHz, RPM  (sec)
                                                         (all: U-ATA/)

P 4  1.5GHz Int. 850  2.4.2-2  RH 7.1   2.2.2-10 RDRAM-800 100, 7200   278
Athlon 1.33 VIA 686B  2.4.0   SuSE 7.1  2.2-7    DDR 266    66, 7200   299
P III 1 GHz Int. 815  2.4.3   (RH 6.1)  2.1.2    SDRAM 133 100, 7200   497
P III 1 GHz Int. 815  2.4.3   (RH 6.1)  2.1.2    SDRAM 133 100, 5400   497
P III 1 GHz Int. 815  2.2.12   RH 6.1   2.1.2    SDRAM 133 100, 7200   646
P III 1 GHz Int. 815  2.2.12   RH 6.1   2.1.2    SDRAM 133 100, 5400   646
P III 866MHz S/Works  2.4.3   (RH 6.2)  2.1.3-15 SDRAM 133  33, 7200   453
P III 866MHz S/Works  2.2.14   RH 6.2   2.1.3-15 SDRAM 133  33, 7200   526
SGI Origin 2000                                                        921
Cray SV-1                                                             2400

"(...)" in the Distro column denotes newer kernel than out-of-the-box.

Going from a 2.2 kernel to a 2.4 kernel seems to speed things up
considerably, at least on these systems.  Hard drive and IDE
controller speeds appear to be near irrelevant for these jobs.

Marc

------------------------------------------------------------------------
 Marc C. Nicklaus, Ph.D.                 NIH/NCI at Frederick
 E-mail: mn1@helix.nih.gov               Bldg 376, Rm 207
 Phone:  (301) 846-5903                  376 Boyles Street
 Fax:    (301) 846-6033                  FREDERICK, MD 21702      USA
          Head, Computer-Aided Drug Design MiniCore Facility
     Laboratory of Medicinal Chemistry, Center for Cancer Research,
 National Cancer Institute at Frederick, National Institutes of Health
       http://rex.nci.nih.gov/RESEARCH/basic/medchem/mcnbio.htm
------------------------------------------------------------------------


