FYI: [Bioinformatics] Open Source Petition FAQ (fwd)



---------- Forwarded message ----------
 Date: Tue, 04 Dec 2001 13:51:37 -0800
 From: Harry Mangalam <mangalam |-at-| home.com>
 To: bioinformatics |-at-| labs.oreilly.com
 Cc: Andrew Dalke <dalke |-at-| dalkescientific.com>
 Subject: [Bioinformatics] Open Source Petition FAQ
 Q   What is this petition about?
 A   The aim of this petition is to educate researchers and the agencies that
     fund them about the advantages of using Open Source Software (see below).
     We think that software developed under the grants from public agencies -
     those funded by your tax dollars - should be made Open Source in order to
     accelerate the process of science while reducing the cost of further
     investigation as well as making the software assisting such investigations
     more robust and widely useful.  Software developed under grants from private
     agencies would of course be exempt from such a requirement, but they might
     also consider requiring or advocating this for the reasons we provide below.
 Q:  Why even consider Open Source?  And why is Open Source for Scientific
     applications any different than for anything else?
 A:  We liken Open Source as peer review.  In a scientific endeavor, we make a
     hypothesis and provide evidence to support or discredit the hypothesis.  In
     reporting the evidence, we publish the data in as raw a form as possible so
     that others can examine and critique it.  This is how Science tends to move
     forward. We submit that like other forms of materials and methods, the
     source code of software involved in arriving at a decision should also be
     published.
   - This peer review of software allows bugs or exceptions that the original
     authors did not account for to be found and fixed by others.
   - It provides for a mechanism by which original features can be incrementally
     improved and additional features can be added without the underlying
     infrastructure (initialization, input/output, data structures, help &
     documentation files) being rebuilt from scratch.
   - Because of the wider use, it encourages more standardization of more
     efficient file or exchange formats, decreasing the time spent on converting
     one format to another.
   - It increases the speed of code development thru the methods noted above,
     can make research cheaper, and therefore increase the return on investment
     in  research, especially important these days as public support for
     research is  under pressure.
 Q   Why Open Source, why not Free Software?
 A   Open Source Software is a generic term for software that is available
     in source code form.  There is an organization that advocates Open Source
     Software in various forms and makes various cases for its use at:
     http://www.opensource.org
     Free Software (in its capitalized form anyway) is software that is
     distributed under the Free Software Foundation's General Public License
     (http://www.gnu.org/copyleft/gpl.html), which is one form of
 Open Source.
     The GPL is a relatively strict form of Open Source which, while
     guaranteeing your freedom to modify and redistribute code, also has
     restrictions which decrease some freedoms to do other things with the
     code.  These restrictions bother some people more than others.
        We are not fanatic about which kind of licensing under which  the code is
     made available; only that the code is made available without restraint for
     examination and use by the public.  We also suggest a licensing mechanism
     that we believe addresses the concerns of the inventor, copyright holder,
     technology transfer office, and private companies that wish to
     substantially increase the utility of such code and charge money for such
     proprietary extensions.
 Q   Doesn't OSS/FS require me to give my code away for free?
 A   OK - this is where the rubber meets road.  It is our opinion that if you
     take Public taxpayer money to develop an idea at an academic or
     non-profit institution, you have a responsibility to allow the Public to use
     it.  'The Public' includes both for-profit companies (which also pay taxes)
     as well as individuals who might be interested in the results of the
     research.  Note that we are strongly against the privatization of public
     resources.  If a private corporation wants to use the code, they are bound
     by the same requirements as an individual.  The company should be able to
     use and modify the code for internal purposes as would an individual.
     beyond this, things get less clear.  Whether and how the company should
     be able to redistribute the code is debatable although it is our contention
     that the best way of addressing the competing claims and requirements
     (especially the Bayh-Dole Act - see below) is a DUAL LICENSE, defined below:
     The DUAL LICENSE approach.
     We suggest that the best way to satisfy both our request for release of the
     software for public use and the requirement for institutions to be able to
     license their IP (a la Bayh-Dole) is to use a DUAL license approach. In
     this scheme, the code would be available to all for Open Source use and
     iterative development, but companies who required exemption from the
     covering Open Source license could be granted it from the copyright holder
     or their institution.
       This arrangement would not only allow the code to be examined, stressed,
     and debugged by the user community who would be getting a return on their
     tax dollar, but that *improved* code could also be licensed to external
     companies who needed to incorporate it into their products without the
     requirement to make their changes or adjunct code Open Source in return -
     essentially paying for permission to privatize code developed with public
     funds.  The idea is that the code base itself would improve over time and
     become more valuable to more companies.
       This arangement would require that developers who contributed to this
     effort were aware of and agreed to the provision that their efforts
     were being 'harvested' by the copyright holder.
 Q   Doesn't OSS/FS violate Bayh-Dole?
 A   The Bayh-Dole act of 1984 (more precisely, P.L. 96-517, The Patent and
     Trademark Law Amendments Act, with further amendments described in  P.L.
     98-620) was a landmark law that dramatically changed the way Intellectual
     Property was handled.  Rather than try to squeeze it into this FAQ, I refer
     you to the Universities' POV: [ http://www.ucop.edu/ott/bayh.html ]
     a generally supportive series of articles from Columbia University (one of
     the largest beneficiaries of the Law):
     [ http://www.columbia.edu/cu/21stC/issue-3.1 ],
     as well as an extremely harsh criticism of the Law:
     [ http://www.alternet.org/story.html?StoryID=9290 ], and a
 critical
     piece in the Atlantic describing the implications and history of it:
     [ http://www.theatlantic.com/issues/2000/03/press.htm ]
     We contend that our suggestion does not violate the Bayh-Dole Act, and in
     fact may make it MORE remunerative to the licensing agencies rather than
     less.  We are not asking that the Universities give away their Intellectual
     Property, but that they make it available under different processes than
     they do now.  Remember 2 things: 1) we are talking only about software and
     2) software is an implementation of an idea or algorithm.  Once an
     algorithm is published, the implementation of the algorithm as running code
     is less of an issue than the original research leaading up to the
     publication.
 Q   Why do some researchers oppose this petition?
 A   There have been several objections to this petition by people who are
     in the position to be impacted by it.
       Phil Green (Washington University) has indicated that while he thinks all
     research results should be open, he objects to software being specifically
     singled out because of the perception that when things are freely
     available, they are deemed to be worth less or valued less.  There are
     certainly several stories about how a low priced but competetive product
     did not begin to sell well until its price was RAISED to be more in line
     with the competition.
       Steve Brenner (UC Berkeley) has objected to the REQUIREMENT that the code
 be made
     available as Open Source, indicating that his feeling is that investigators
     should have the option to do so and come to an amicable agreement with
     their home institution and funding agencies.  He also supports the dual
     licensing approach.  Here we have no problem with this except if the grant
     was performed with public funds, the public has a right to be able to use
     the results of that funding.  If the funding was private, then the funding
     agency has the right to insist on whatever they consider correct.
       Andrew Dalke has argued that forcing Open Source may in fact harm the
     scientific process in the following ways:
     - it will decrease the heterogeneity and quality of research by preventing
       researchers from taking advantage of more advanced proprietary software
       or databases. Since proprietary code is often better maintained and
       readable, requiring OSS is a serious detriment to good research.
     - relatedly, it will harm those doing current research with proprietary
       software.
     - source code is not required to validate an algorithm or computational
       approach as long as it is well-described in a publication and therefore
       may in fact decrease the quality of publication and research.
     - even if source code is made available, there is so much variability in
       hardware, compilers, libraries and non-deterministic variables
       (especially in network operations) that source code will not help that
       much anyway.
     - much of academic source code is so poorly written that it's not worth
       looking at anyway.  It's easier to write from scratch.
     - much current academic software is a conglomeration of bits from so many
       sources that it would be impossible to unravel the ownership and
       enforce distribution under a particular license.
     - seeing the source code will 'taint' researchers and decrease innovation
       by encouraging tunnel vision. 'The way that was is the only way.'  Also,
       easy access to source code will lead to lawsuits over re-implementations
       (at what point does re-implementation/reverse engineering meld with
       copyright infringement?)
     - making source code available without redistribution rights (you can view,
       modify, and use the source code but you can't redistribute the source
       code), addresses all valid concerns.  Why go further?
     - the definition of public funding is fuzzy. Who decides what proportion of
       public funding triggers your requirement?
     - if your petition succeeds, who will verify that recipients are complying
       with its requirements?  And how?  A law without teeth hardly better than
 no
       law at all.
     - even supposedly OSS can come with such silly or onerous requirements that
       it effectively ceases to be Open Source (required notifications, pages of
       required header comments or attribution, etc).
     - why Open Source?  Why not require release of such software into the
       Public Domain such as many government agencies already do?
     - changing the 'funding ecology' this drastically will hurt science.  We
       should take more time adn smaller steps.
     - the goverment already has too much influence on our lives; this is yet
       another potentially intrusive and onerous requirement to do research.
       [Andrew - what did I miss?]
       While there has been no official statement on this from any of the
     federal funding agencies, many have unofficially been enthusiastic about
     making more more research software Open Source.
 Q   How can I add resources to the site?
 A   If you have an Open Source resource, or a resource that's useful
     for developers of Open Source software (such as a WWW site or an
     article), then you can add it to the list of resources
     (http://www.openinformatics.org/dsp_ResListing.php). You must
     first register as a user
     (http://www.openinformatics.org/frm_EditUserAccount.php). If
     you've already signed the petition, you are already registered.
 Q's currently without A's
 =========================
 Q:  What are some of the downsides of this petition if it succeeds?
 Q:  What about existing software that has multiple sources including some that
     are non-OSS? For example, SW that includes commercial libraries (Rogue
     Wave's 'tools.h', Numerical Recipe's implementation of standard algorithms?)
 Q:  If it succeeds, who will oversee that the requirements are met?   Who will
     ensure that groups so funded do in fact make their source code available?
 Q:  Does this require that certain coding standards are met?  Who ensures that
     the code is readable and not the result of running it through an obfuscator?
 Q:  Does the Scientific method really require making the source code available?
     Aren't there other ways to verify an algorithm or claim?
 Q:  I'm funded by a variety of grants, some public, some private.  What is the
     percentage cutoff for the definition of 'Public Funding'?
 Q:  I'm currently funded by a private foundation that has no such OSS
     requirement, but I want to publich in a reputable journal.  Will this
     petition reflect badly on my NOT publishing the Source Code?
 Q:  What if my research currently is based on or depends on Proprietary
     software which cannot be made Open Source?
 Q:  I'm currently using OSS for my project but the license under which it is
     distributed is incompatible with other (also acceptable) OSS licenses.  How
     will this be resolved?
 Q:  Which Open Source license do you require?  I don't like that one.  Why don't
     you use ___________, which is much more intelligently phrased and
     acceptable to all clear-thinking people?
 Q:  Why Open Source as opposed to Source Code Availability ('Source Under
     Glass') where I will will allow you to view and modify the source code but
     not redistribute it?  Wouldn't this enable people to evaluate the code to
     the extent required for scientific validation without the overbearing
     requirement for Open Source release?
 Q:  Why not require something like a Government 'SourceForge' an 'active
     repository' of publicly funded codes?
 Q:  Will such software be OSS in perpetuity or will it 'lapse' to Public Domain
     in X years?  Define and justify X.
 --
 Cheers, Harry
 Harry J Mangalam  --  (949) 856 2847 (v&f)  --  mangalam |-at-| home.com
 (primary)
 hjmangalam |-at-| yahoo.com, harrymangalam |-at-| netscape.net (if  |-at-| home
 service ends)
                     <<plain text appreciated>>
 _______________________________________________
 bioinformatics mailing list
 bioinformatics |-at-| labs.oreilly.com
 http://labs.oreilly.com/mailman/listinfo/bioinformatics