<?xml version="1.0" encoding="UTF-8"?>
<?xml-stylesheet type="text/css" href="/stylesheets/rss.css"?>
<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:trackback="http://madskills.com/public/xml/rss/module/trackback/">
  <channel>
    <title>Depth-First: Tag opensource</title>
    <link>http://depth-first.com/articles/tag/opensource</link>
    <language>en-us</language>
    <ttl>40</ttl>
    <description>Walking the Web of Chemical Informatics</description>
    <item>
      <title>PerlMol: A Case Study in Open Source Cheminformatics Software</title>
      <description>&lt;p&gt;&lt;a href="http://www.perlmol.org/"&gt;&lt;img src="http://depth-first.com/demo/20071115/perlmol.png" align="right"&gt;&lt;/img&gt;&lt;/a&gt;How does open source software happen? Although many factors come into play, the majority of answers seem to revolve around a simple theme: developers building solutions to fill their own needs. Yet only a fraction of these solutions end up becoming open source software. And only a fraction of those end up being used by a wider audience. What's the key ingredient? There's still a lot to learn from studying individual cases.&lt;/p&gt;

&lt;p&gt;Readable discussions about the origins of specific open source projects are pretty rare, but those dealing with the origins of open source cheminformatics software are more uncommon still. So it was with great interest that I came across &lt;a href="http://www.ddj.com/web-development/184416118"&gt;Ivan Tubert-Brohman's account&lt;/a&gt; of how &lt;a href="http://www.perlmol.org/"&gt;PerlMol&lt;/a&gt; was created.&lt;/p&gt;

&lt;p&gt;PerlMol is an open source "collection of Perl modules for cheminformatics and computational chemistry." Many software packages fit into this category, and some of them are open source, so why write another? For Tubert-Brohman, the deciding factor was being able to work in his preferred environment, Perl:&lt;/p&gt;

&lt;blockquote&gt;
    &lt;p&gt;I was surprised that CPAN [&lt;a href="http://www.cpan.org/"&gt;The Comprehensive Perl Archive Network&lt;/a&gt;] was sorely lacking in terms of modules for chemistry. The only available modules were Chemistry::Element, which allows you to convert between atomic number, element symbol, and element name and store other elemental information; and Chemistry::MolecularMass, which calculates the mass from the molecular formula. There were no modules that actually dealt with the structure of molecules. While some of the options in other languages are not bad, I was looking for something with the simplicity and conciseness of Perl that could allow me to write "chemical one-liners" to solve small problems very quickly, without having to compile anything. Hence, PerlMol was born.&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;The elimination of the need to compile, and relaxed syntaxes that promote succinct code are two of the biggest reasons to try a &lt;a href="http://depth-first.com/articles/2006/11/14/eleven-free-cheminformatics-scripting-environments"&gt;cheminformatics scripting environment&lt;/a&gt;.&lt;/p&gt;

&lt;p&gt;There's a lot of great software still to be written in cheminformatics, and some of it will be open source. Although open sourcing that side project you've been working on may not be the best option for your career or your company, studying case studies like that of PerlMol gives plenty of food for thought.&lt;/p&gt;</description>
      <pubDate>Thu, 15 Nov 2007 09:49:00 -0500</pubDate>
      <guid isPermaLink="false">urn:uuid:0ed89b69-2a1f-4495-b8b1-ae7c8c8eb31c</guid>
      <author>Rich Apodaca</author>
      <link>http://depth-first.com/articles/2007/11/15/perlmol-a-case-study-in-open-source-cheminformatics-software</link>
      <category>Open X</category>
      <category>perlmol</category>
      <category>scripting</category>
      <category>opensource</category>
    </item>
    <item>
      <title>The Business Case for Open Source and the Small Company</title>
      <description>&lt;p&gt;Few would argue against small companies &lt;em&gt;using&lt;/em&gt; open source software - indeed many owe their very existence to it. But what real, tangible good can come from a small company &lt;em&gt;releasing&lt;/em&gt; open source software?&lt;/p&gt;

&lt;p&gt;&lt;a href="http://www.37signals.com/svn/posts/693-ask-37signals-how-has-open-source-helped-or-hindered"&gt;Signal to Noise&lt;/a&gt;, the company blog of &lt;a href="http://37signals.com"&gt;37signals&lt;/a&gt;, offers a worthwhile perspective on this issue. To summarize the business case:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;&lt;p&gt;Certain kinds of software, like infrastructure software, take vast amounts of time and resources to get right - something that few small companies can afford. Open sourcing can accelerate the process.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;Open sourcing provides a public arena in which your own company's developers can learn from other great developers.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;That public arena provides unique access to a pool of smart, motivated developers - and offers a way to evaluate their work before even deciding to interview them.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;Open source generates press attention and goodwill from potential customers.&lt;/p&gt;&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;And about the elephant in the room:&lt;/p&gt;

&lt;blockquote&gt;
    &lt;p&gt;A big fear that a lot of people have is that they&#8217;ll somehow be giving away their secret sauce. Unless your actual product is what you&#8217;re open sourcing, it really doesn't matter (and there are even plenty of examples of that working well). It&#8217;s unlikely that the piece of code that&#8217;s only seen internal development is such a silver bullet that you&#8217;re going to outshine your competition by its use alone.&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;The &lt;a href="http://depth-first.com/articles/2006/11/06/stone-soup"&gt;distinction&lt;/a&gt; between infrastructure software and a company's secret sauce is particularly important.&lt;/p&gt;

&lt;p&gt;By just about any standard, 37signals is a leader in the deliberate use of open source software to achieve business objectives. We can all learn from them.&lt;/p&gt;</description>
      <pubDate>Fri, 09 Nov 2007 09:37:00 -0500</pubDate>
      <guid isPermaLink="false">urn:uuid:f62bbf43-7f0a-42b7-b44f-2340cbbf58b8</guid>
      <author>Rich Apodaca</author>
      <link>http://depth-first.com/articles/2007/11/09/the-business-case-for-open-source-and-the-small-company</link>
      <category>Open X</category>
      <category>opensource</category>
      <category>37signals</category>
      <category>infrastructure</category>
      <category>secretsauce</category>
    </item>
    <item>
      <title>Building Rubidium: Creating a RubyForge Project Space</title>
      <description>&lt;p&gt;&lt;img src="http://depth-first.com/demo/20071015/rubidium.png" align="right" border="0"&gt;&lt;/img&gt;Recent articles have discussed &lt;a href="http://depth-first.com/articles/tag/rubidium"&gt;Rubidium&lt;/a&gt;, the cheminformatics toolkit for Ruby. In this article, the first in a series, I'll go beyond the Ruby code to discuss the technical aspects of taking an Open Source idea from concept to release.&lt;/p&gt;

&lt;h4&gt;Finding a Home&lt;/h4&gt;

&lt;p&gt;Before setting up your Open Source project, you'll need to decide on how to host it. Project hosting can be as simple or elaborate as you wish, but the basic services include: a website; a mailing list; a discussion forum; a source code repository (typically CVS or Subversion); a bug tracking system; and a file release system.&lt;/p&gt;

&lt;p&gt;The multitude of choices can be broken down into two basic options: host the project yourself or use a free hosting service. Fortunately, Ruby-based projects enjoy two excellent free hosting options: &lt;a href="http://sourceforge.net"&gt;SourceForge&lt;/a&gt; and &lt;a href="http://rubyforge.org"&gt;RubyForge&lt;/a&gt;. Although SourceForge could certainly be used for a Ruby project, RubyForge is a more popular option. One of the reasons is that any RubyGem your project releases automatically becomes installable through the &lt;a href="http://rubygems.org/"&gt;RubyGems&lt;/a&gt; package management system with a simple one-line incantation:&lt;/p&gt;

&lt;div class="console"&gt;
&lt;pre&gt;
$ sudo gem install &amp;lt;yourprojectname&amp;gt;
&lt;/pre&gt;
&lt;/div&gt;

&lt;p&gt;Another reason to use RubyForge is discoverability. RubyForge only hosts projects related in some way to Ruby. So, your project will stand out a lot more in its category than with a much larger site like SourceForge.&lt;/p&gt;

&lt;p&gt;Given RubyForge's advantages, and my own interest in minimizing the work needed to maintain an Open Source project, Rubidium will be hosted on RubyForge.&lt;/p&gt;

&lt;h4&gt;Requesting a Project Space&lt;/h4&gt;

&lt;p&gt;Having decided on RubyForge as Rubidium's host, all that's left is to ask for free services. You'll need to &lt;a href="http://rubyforge.org/account/register.php"&gt;register for a user account&lt;/a&gt; if you haven't done so already. Then, simply &lt;a href="http://rubyforge.org/register/projectinfo.php"&gt;apply for project space&lt;/a&gt;. After about three business days, you should be notified whether your project was accepted.&lt;/p&gt;

&lt;p&gt;Several days ago, I completed this process for Rubidium. Its new home on RubyForge will be:&lt;/p&gt;

&lt;p&gt;&lt;a href="http://rubyforge.org/projects/rbtk"&gt;http://rubyforge.org/projects/rbtk&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;The Rubidium home page can be found at:&lt;/p&gt;

&lt;p&gt;&lt;a href="http://rbtk.rubyforge.org"&gt;http://rbtk.rubyforge.org&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;There's nothing useful there yet, a situation that will hopefully be fixed in a few weeks.&lt;/p&gt;

&lt;h4&gt;Next Steps&lt;/h4&gt;

&lt;p&gt;With powerful free services now available for the Rubidium project, we'll want to start taking advantage of them. The next articles in this series will discuss some ways of doing so.&lt;/p&gt;</description>
      <pubDate>Fri, 26 Oct 2007 10:21:00 -0400</pubDate>
      <guid isPermaLink="false">urn:uuid:cc895dbc-1a02-4b7a-9fd9-11431a3e9cc3</guid>
      <author>Rich Apodaca</author>
      <link>http://depth-first.com/articles/2007/10/26/building-rubidium-creating-a-rubyforge-project-space</link>
      <category>Tools</category>
      <category>rubidium</category>
      <category>rubyforge</category>
      <category>ruby</category>
      <category>opensource</category>
    </item>
    <item>
      <title>Cheminformatics in Korean: An Example of Scientific Self-Organization</title>
      <description>&lt;p&gt;&lt;a href="http://veimages.gsfc.nasa.gov//2429/globe_east_2048.jpg"&gt;&lt;img src="http://depth-first.com/demo/20070831/earth.jpg" align="right" border="0"&gt;&lt;/img&gt;&lt;/a&gt;&lt;a href="http://www.lordmiss.com/journal/"&gt;Hanjo Kim&lt;/a&gt; of the &lt;a href="http://bmdrc.org/"&gt;Bioinformatics &amp;amp; Molecular Design Research Center&lt;/a&gt; (BMDRC) in Seoul, South Korea writes in to tell about his Korean-language cheminformatics blog &lt;a href="http://agile2robust.com/tt/"&gt;Agile2Robust&lt;/a&gt;. Some of his articles, such as &lt;a href="http://agile2robust.com/tt/20"&gt;this one on PubChem&lt;/a&gt; have been &lt;a href="http://depth-first.com/articles/2007/08/16/five-ways-to-use-pubchem-right-now"&gt;translated&lt;/a&gt; from Depth-First; others are based on Depth-First articles.&lt;/p&gt;

&lt;p&gt;This completes a circle. It was Kim's &lt;a href="http://www.ruby-forum.com/topic/49337"&gt;post to the Ruby mailing list&lt;/a&gt; in late 2005 and the responses to it that helped me in the early days of developing cheminformatics software in Ruby (and writing articles about it here), although Kim probably didn't know it at the time.&lt;/p&gt;

&lt;p&gt;During this process, none of the usual scientific protocols have been observed: no journal articles were written or published; no grants to do any of the work were prepared or funded; no presentations were made; and no central authority was involved (at least on my end). In fact, if any of these conventions had been observed, the work would have never seen the light of day. Free tools, free services, and open standards enabled one scientist to connect with another in a way that's impossible to achieve through conventional means.&lt;/p&gt;</description>
      <pubDate>Fri, 31 Aug 2007 09:24:00 -0400</pubDate>
      <guid isPermaLink="false">urn:uuid:8c8e6159-c54f-4b6d-ab0e-be584fe20050</guid>
      <author>Rich Apodaca</author>
      <link>http://depth-first.com/articles/2007/08/31/cheminformatics-in-korean-an-example-of-scientific-self-organization</link>
      <category>Meta</category>
      <category>korean</category>
      <category>openscience</category>
      <category>opensource</category>
      <category>ruby</category>
      <category>hanjokim</category>
    </item>
    <item>
      <title>Golden Rules for Open Source</title>
      <description>&lt;p&gt;&lt;a href="http://greg.chiaraquartet.net/"&gt;Greg Beaver&lt;/a&gt; has written an interesting set of &lt;a href="http://greg.chiaraquartet.net/archives/171-10-golden-rules-for-running-an-open-source-project.html"&gt;ten golden rules for running open source projects&lt;/a&gt;. Many of these rules apply to running (or working on) anything. My favorite rule is "3. Despite the evidence, it's probably your fault." Greg develops the popular Open Source project &lt;a href="http://pear.php.net/"&gt;PEAR&lt;/a&gt;, a packaging tool for PHP that looks similar in concept to &lt;a href="http://rubygems.org/"&gt;RubyGems&lt;/a&gt;.&lt;/p&gt;

&lt;p&gt;Cheminformatics, like all vertical markets, has is own peculiarities when it comes to Open Source. Most importantly, cheminformatics software has a limited, but captive audience. When done well, it is both highly profitable and expensive to develop. For insightful perspectives on Open Source software as it applies to cheminformatics and vertical markets, see &lt;a href="http://dx.doi.org/10.1016/S1359-6446(04)03364-1"&gt;Matt Stahl&lt;/a&gt;, &lt;a href="http://dx.doi.org/10.1016/S1359-6446(04)03363-X"&gt;Warren DeLano&lt;/a&gt;, &lt;a href="http://www.moonviewscientific.com/essays/software_lifecycle.htm"&gt;Craig James&lt;/a&gt;, and &lt;a href="http://www.catb.org/~esr/writings/magic-cauldron/magic-cauldron-10.html"&gt;Eric S. Raymond&lt;/a&gt;.&lt;/p&gt;</description>
      <pubDate>Mon, 30 Apr 2007 09:27:00 -0400</pubDate>
      <guid isPermaLink="false">urn:uuid:72ed74cd-0bc7-4853-849f-4f3c49069444</guid>
      <author>Rich Apodaca</author>
      <link>http://depth-first.com/articles/2007/04/30/golden-rules-for-open-source</link>
      <category>Open X</category>
      <category>opensource</category>
      <category>verticalmarket</category>
    </item>
    <item>
      <title>Roll Your Own Chemical Database With Free Components</title>
      <description>&lt;p&gt;Are you thinking of building a &lt;a href="http://depth-first.com/articles/2007/01/24/thirty-two-free-chemistry-databases"&gt;free chemical database&lt;/a&gt; but would rather not rent and maintain a bunch of proprietary software components? &lt;a href="http://merian.pch.univie.ac.at/pch/nh_info.html"&gt;Norbert Haider&lt;/a&gt; has thought a lot about this problem and offers some helpful resources to get you started:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;&lt;p&gt;&lt;a href="http://merian.pch.univie.ac.at/%7Enhaider/cheminf/moldb.html"&gt;Creating a web-based, searchable molecular structure database using free software&lt;/a&gt; Step-by step case study&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;a href="http://merian.pch.univie.ac.at/%7Enhaider/cheminf/moldb.pdf"&gt;How to create a web-based molecular structure database with free software&lt;/a&gt; A presentation&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;a href="http://merian.pch.univie.ac.at/%7Enhaider/cheminf/cmmm.html"&gt;checkmol/matchmol&lt;/a&gt; Open source command-line utility for 2D (sub)structure matching&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;a href="http://merian.pch.univie.ac.at/%7Enhaider/cheminf/mol2ps.html"&gt;mol2ps&lt;/a&gt; Command-line utility for converting molfiles into Postscript files&lt;/p&gt;&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Haider's system can be deployed on commodity hardware running open source operating systems. In other words, the cost of setting up a system like the one he describes is practically zero.&lt;/p&gt;

&lt;p&gt;Creating and open sourcing your own custom components is one way to go. Building on top of existing open source tools like &lt;a href="http://cdk.sf.net"&gt;CDK&lt;/a&gt;, &lt;a href="http://openbabel.sf.net"&gt;Open Babel&lt;/a&gt;, &lt;a href="http://depth-first.com/articles/tag/octet"&gt;Octet&lt;/a&gt; and &lt;a href="http://joelib.sf.net"&gt;JOELib&lt;/a&gt; is another.&lt;/p&gt;

&lt;p&gt;Haider's work raises an interesting question. Has anyone assembled a complete, ready to install general purpose chemical database package built from open source components? It for no other reason, such an exercise would give an excellent idea of what &lt;a href="http://depth-first.com/articles/2007/01/03/open-source-and-open-data-why-we-should-eat-our-own-dogfood"&gt;the dogfood tastes like&lt;/a&gt;.&lt;/p&gt;</description>
      <pubDate>Fri, 13 Apr 2007 10:27:00 -0400</pubDate>
      <guid isPermaLink="false">urn:uuid:a017a4e0-d8a0-48c2-87a3-5554b99b7373</guid>
      <author>Rich Apodaca</author>
      <link>http://depth-first.com/articles/2007/04/13/roll-your-own-chemical-database-with-free-components</link>
      <category>Tools</category>
      <category>database</category>
      <category>2d</category>
      <category>web</category>
      <category>cdk</category>
      <category>openbabel</category>
      <category>opensource</category>
      <category>joelib</category>
    </item>
    <item>
      <title>Source Code, Science, and Advertising</title>
      <description>&lt;blockquote&gt;
    &lt;p&gt;The minimum level of disclosure that &lt;em&gt;Nature Methods&lt;/em&gt; requires depends on how central the software is to the paper. If a software program is the focus of the report, we expect the programming code to be made available. Without this code, the software - and thus the paper - would become a black box of little use to the scientific community. In many papers, however, the software is only an ancillary part of the method, and the focus is on the methodological approach or an insight gained from it.&lt;/p&gt;
    
    &lt;p&gt;-&lt;cite&gt;&lt;a href="http://dx.doi.org/10.1038/nmeth0307-189"&gt;Nature Methods Editorial&lt;/a&gt;, March 2007&lt;/cite&gt;&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;It's refreshing to see &lt;em&gt;Nature Methods&lt;/em&gt; publicly acknowledge the importance of software availability in the sciences. On the other hand, the editorial still leaves wriggle room for an author determined to keep his or her source code private.&lt;/p&gt;

&lt;p&gt;For example, consider the term "programming code." Does this mean "source code", or does it also include "object code"? The rest of the editorial makes it clear that both are acceptable to &lt;em&gt;Nature Methods&lt;/em&gt;, although source code is preferred. Releasing binaries only "... may be acceptable if the operations performed by the software are sufficiently clear." Can any amount of description ever substitute for source code itself?&lt;/p&gt;

&lt;p&gt;Over twenty years ago, John Figueras &lt;a href="http://depth-first.com/articles/2006/08/23/readily-available-without-infringements-or-restrictions"&gt;made the case&lt;/a&gt; that authors wishing to keep their source code private fundamentally misunderstood the difference between science and advertising. This may be a distinction lost on certain authors, but journals play along at their own risk.&lt;/p&gt;</description>
      <pubDate>Mon, 26 Mar 2007 11:05:00 -0400</pubDate>
      <guid isPermaLink="false">urn:uuid:24902a88-a06a-4a83-8abf-9469bee6d7a7</guid>
      <author>Rich Apodaca</author>
      <link>http://depth-first.com/articles/2007/03/26/source-code-science-and-advertising</link>
      <category>Open X</category>
      <category>opensource</category>
      <category>figueras</category>
      <category>naturemethods</category>
    </item>
    <item>
      <title>Web 2.0 and Chemistry</title>
      <description>&lt;blockquote&gt;
    &lt;p&gt;Chemistry on the World Wide Web is picking up speed like a runaway train. A remarkable number of groups are devoting considerable time, effort, and money to a wide variety of chemical web applications.&lt;/p&gt;
    
    &lt;p&gt;-&lt;cite&gt;Stu Borman, &lt;a href="http://pubs.acs.org/hotartcl/cenear/960916/explode.html"&gt;Chemical &amp;amp; Engineering News, September 16, 1996&lt;/a&gt;&lt;/cite&gt;&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;Whatever happened to chemistry and the Web? Stu Borman's article is a wonderful read, if for no other reason than to illustrate what makes technology predictions so tricky. Borman cites these developments, among others, as evidence of the rise of chemistry on the Web circa 1996:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;&lt;p&gt;An &lt;a href="http://altavista.com"&gt;AltaVista&lt;/a&gt; search for the word "chemistry" returned 400,000 documents. (Remember AltaVista? Google now lists over &lt;em&gt;115 million&lt;/em&gt; documents containing the word "chemistry").&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;Popular Web browsers such as Navigator 3.0 and Explorer 3.0 support Greek characters, an important characteristic of chemical information. (Support for chemistry in Web browsers has barely improved in the meantime.)&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;A Java browser for &lt;a href="http://www.xml-cml.org/"&gt;Chemical Markup Language&lt;/a&gt; (CML) was soon to be released. (CML is still in use and supported by many software packages, although it has not been widely adopted. For example, the builders of &lt;a href="http://pubchem.ncbi.nlm.nih.gov/"&gt;PubChem&lt;/a&gt; opted for a custom XML format over CML.)&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;a href="http://www.ch.ic.ac.uk/rzepa/vrml/"&gt;Virtual Reality Markup Language&lt;/a&gt; (VRML) could be used display molecules in three-dimensions. (&lt;a href="http://en.wikipedia.org/wiki/VRML"&gt;This article&lt;/a&gt; gives a brief overview of the rise and fall of VRML.)&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;a href="http://www.mdl.com/products/framework/chime/"&gt;Chemscape Chime&lt;/a&gt;, a Navigator plugin that interprets chemical information, could be freely downloaded. (&lt;del&gt;MDL has apparently since discontinued free distribution of Chime.&lt;/del&gt; The &lt;a href="http://depth-first.com/articles/2006/09/27/hacking-pubchem-free-speech-or-free-beer"&gt;free as in beer&lt;/a&gt; plugin is &lt;a href="http://www.orgsyn.org/"&gt;rarely&lt;/a&gt; seen in use on the public Web.)&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;Chemically oriented Java applets such as "WebSketch" are proliferating. (Java has had a difficult time making it as a browser technology. Security has had little to do with it, though.)&lt;/p&gt;&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Stu Borman's article serves as a clear reminder that chemistry on the Web is almost as awkward today as it was at the dawn of the Internet age. The problem isn't lack of content; it's the lack of robust, widely-adopted, open standards that take the &lt;a href="http://depth-first.com/articles/2006/09/03/peculiarities-of-chemical-information"&gt;peculiarities of chemical information&lt;/a&gt; into account, and the free software to support them. Coming to terms with past failures in this area is one way to increase the chances of future success.&lt;/p&gt;</description>
      <pubDate>Mon, 12 Mar 2007 10:23:00 -0400</pubDate>
      <guid isPermaLink="false">urn:uuid:8c8362ba-993e-438a-bb75-8d2627aeefe6</guid>
      <author>Rich Apodaca</author>
      <link>http://depth-first.com/articles/2007/03/12/web-2-0-and-chemistry</link>
      <category>Meta</category>
      <category>web20</category>
      <category>opensource</category>
      <category>openstandards</category>
      <category>cml</category>
      <category>java</category>
      <category>web</category>
    </item>
    <item>
      <title>Making the Case: Flux-2</title>
      <description>&lt;blockquote&gt;
    &lt;p&gt;&lt;a href="http://cdk.sf.net"&gt;&lt;img src="http://depth-first.com/files/cdk_logo.png" align="right" border="0"&gt;&lt;/img&gt;&lt;/a&gt;... The Flux software makes
    extensive use of the Chemistry Development Toolkit [sic] (CDK) as a cheminformatics library. Every descriptor that implements the Java interface for CDK descriptors can be utilized for the chemical similarity calculations; this includes the 69 descriptors available in CDK at the time of writing. ...&lt;/p&gt;
    
    &lt;p&gt;...&lt;/p&gt;
    
    &lt;p&gt;... CDK is employed as a cheminformatics toolkit (&lt;a href="http://cdk.sf.net"&gt;http://cdk.sf.net&lt;/a&gt;). The input and output of chemical structures in SDF format and the basic functionality for all structure manipulation are provided by CDK. Compounds are tested for uniqueness by means of canonical SMILES generated by CDK. Our ligand-based fitness function and molecule filter are based on descriptors that implement the Java interface for descriptors defined by CDK.&lt;/p&gt;
    
    &lt;p&gt;-&lt;cite&gt;Uli Fechner and Gisbert Schneider, &lt;a href="http://dx.doi.org/10.1021/ci6005307"&gt;J. Chem. Inf. Model. ASAP Articles&lt;/a&gt;&lt;/cite&gt;&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;The number of peer-reviewed publications using the Open Source Java library &lt;a href="http://cdk.sf.net"&gt;Chemistry Development Kit&lt;/a&gt; (CDK) just keeps growing. The latest addition comes by way of a paper by Fechner and Schneider on ligand-based de novo design. In many areas, Open Source software is used not because it's free, but because it's perceived as superior to alternatives. How long will it be before this is true in cheminformatics?&lt;/p&gt;</description>
      <pubDate>Mon, 26 Feb 2007 09:41:00 -0500</pubDate>
      <guid isPermaLink="false">urn:uuid:c5e39684-33cd-49e4-b622-aecf31b842ea</guid>
      <author>Rich Apodaca</author>
      <link>http://depth-first.com/articles/2007/02/26/making-the-case-flux-2</link>
      <category>Open X</category>
      <category>cdk</category>
      <category>liganddesign</category>
      <category>opensource</category>
      <category>java</category>
    </item>
    <item>
      <title>Google for Molecules with InChIMatic</title>
      <description>&lt;p&gt;&lt;center&gt;&lt;a href="http://inchimatic.com"&gt;&lt;img src="http://depth-first.com/demo/20070219/inchimatic_logo.png" border="0"&gt;&lt;/img&gt;&lt;/a&gt;&lt;/center&gt;&lt;/p&gt;

&lt;p&gt;&lt;a href="http://inchimatic.com"&gt;InChIMatic&lt;/a&gt; is a simple Web application that uses Google to perform exact structure searches on the Web. After drawing your structure in the editor window, click the "InChI!" button to get a link. This link takes you to a Google query that displays matches for your molecule. You'll need both Java and JavaScript enabled in your browser to use InChIMatic.&lt;/p&gt;

&lt;h4&gt;The Technical Details&lt;/h4&gt;

&lt;p&gt;&lt;a href="http://iupac.org/dhtml_home.html"&gt;&lt;img src="http://depth-first.com/demo/20070126/iupac_logo.png" align="right" border="0"&gt;&lt;/img&gt;&lt;/a&gt;The technology at the heart of InChIMatic is the &lt;a href="http://www.iupac.org/inchi/"&gt;IUPAC International Chemical Identifier&lt;/a&gt; (InChI). An InChI is an alphanumeric string that uniquely identifies a molecular structure. By converting molecular structures to text, InChI makes it easy to use standard Internet tools to do exact structure searches.&lt;/p&gt;

&lt;p&gt;The earliest reference in the peer-reviewed literature to using Google for searching InChIs is contained in a &lt;a href="http://dx.doi.org/10.1039/b502828k"&gt;2005 paper&lt;/a&gt;. More recently, a service called &lt;a href="http://querychem.com"&gt;QueryChem&lt;/a&gt; has taken this idea one step further by using the &lt;a href="http://code.google.com/"&gt;Google API&lt;/a&gt; to perform substructure searches based on InChI.&lt;/p&gt;

&lt;p&gt;InChIMatic works differently. Unlike a raw Google search, InChIMatic builds a Google query link for you. Unlike QueryChem, InChIMatic doesn't use the Google API and so has none of its restrictions. This does result in a limitation: InChIMatic can only currently be used to for exact structure queries.&lt;/p&gt;

&lt;p&gt;&lt;a href="http://rubyonrails.org"&gt;&lt;img src="http://depth-first.com/files/rails_logo.png" align="right" border="0"&gt;&lt;/img&gt;&lt;/a&gt;The InChIMatic Web application has been discussed in greater technical detail in a &lt;a href="http://depth-first.com/articles/2006/12/15/anatomy-of-a-cheminformatics-web-application-inchimatic"&gt;previous article&lt;/a&gt;. The rapid Web application development framework &lt;a href="http://rubyonrails.com"&gt;Ruby on Rails&lt;/a&gt; made building InChIMatic a snap. InChIMatic is served by the Ruby application container &lt;a href="http://depth-first.com/articles/2007/02/05/mongrel-and-rails-its-just-not-fair"&gt;Mongrel&lt;/a&gt;, which is hosted on a Linux server running Apache. &lt;a href="http://depth-first.com/articles/tag/rino"&gt;Rino&lt;/a&gt; provided the Ruby interface to the &lt;a href="http://www.iupac.org/inchi/"&gt;IUPAC/NIST InChI toolkit&lt;/a&gt;. The 2-D structure editor is &lt;a href="http://www.molinspiration.com/jme/"&gt;Java Molecular Editor&lt;/a&gt; (JME) by Peter Ertl, which is used with his kind permission.&lt;/p&gt;

&lt;p&gt;&lt;a href="http://www.opensource.org/docs/definition.php"&gt;&lt;img src="http://www.opensource.org/trademarks/opensource/web/opensource-110x95.png" align="right" alt="Open Source (OSI) Logo" border="0" width="110" height="95"&gt;&lt;/img&gt;&lt;/a&gt;Aside from JME, all components of InChIMatic, from the operating system it runs on to the InChI system itself, are &lt;a href="http://opensource.org"&gt;Open Source&lt;/a&gt; software.&lt;/p&gt;

&lt;h4&gt;Using InChI to Raise the Visibility of Your Content&lt;/h4&gt;

&lt;p&gt;InChIMatic returns many Google results for common molecules. But less common, known molecules return no hits at all. Three factors are responsible: (1) Google doesn't index all InChIs on the Internet; (2) few content providers currently use InChI; and (3) there is no standard and convenient mechanism to embed InChIs into Web pages for indexing by Google.&lt;/p&gt;

&lt;p&gt;For these reasons, I consider InChI to be bleeding edge technology. Some will find it useful, most will not. Unfortunately, this state of affairs will persist until problems (1) and (3) are solved.&lt;/p&gt;

&lt;p&gt;Nevertheless, if you're technically adventurous, InChIMatic offers a relatively painless way to begin incorporating InChIs into your content and verifying that they get indexed. There's no software to download, install, or upgrade. Forget about operating system incompatibilities (hopefully!). Just point your Java-enabled browser to &lt;a href="http://inchimatic.com"&gt;inchimatic.com&lt;/a&gt;.&lt;/p&gt;

&lt;p&gt;Although there's no standard method to encode InChIs in Web pages, some interesting ideas have been put forward. &lt;a href="http://chem-bla-ics.blogspot.com/"&gt;Egon Willighagen&lt;/a&gt; has proposed &lt;a href="http://chem-bla-ics.blogspot.com/2006/12/including-smiles-cml-and-inchi-in.html"&gt;a system&lt;/a&gt; based on &lt;a href="http://www.w3.org/TR/xhtml-rdfa-primer/"&gt;RDFa&lt;/a&gt;. Future iterations of InChIMatic may include support for generating scripts and/or markup for including InChIs into blogs and other online content.&lt;/p&gt;

&lt;h4&gt;Conclusions&lt;/h4&gt;

&lt;p&gt;InChI is a complex new technology in need of easy-to-use tools. InChIMatic is one such tool that makes it possible to perform exact structure queries using Google.&lt;/p&gt;

&lt;p&gt;One of the exciting things about Web applications is how quickly they can evolve. If in trying out InChIMatic you find something you'd like changed or added, please feel free to &lt;a href="mailto:r_apodaca@users.sf.net"&gt;write me&lt;/a&gt;.&lt;/p&gt;</description>
      <pubDate>Mon, 19 Feb 2007 10:18:00 -0500</pubDate>
      <guid isPermaLink="false">urn:uuid:eb531bca-c3b0-4f2d-8053-4272baa8bbfb</guid>
      <author>Rich Apodaca</author>
      <link>http://depth-first.com/articles/2007/02/19/google-for-molecules-with-inchimatic</link>
      <category>Tools</category>
      <category>inchimatic</category>
      <category>inchi</category>
      <category>google</category>
      <category>webapp</category>
      <category>opensource</category>
      <category>rails</category>
      <category>iupac</category>
    </item>
  </channel>
</rss>
