<?xml version="1.0" encoding="UTF-8"?>
<?xml-stylesheet type="text/css" href="/stylesheets/rss.css"?>
<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:trackback="http://madskills.com/public/xml/rss/module/trackback/">
  <channel>
    <title>Depth-First: Tag joelib</title>
    <link>http://depth-first.com/articles/tag/joelib</link>
    <language>en-us</language>
    <ttl>40</ttl>
    <description>Walking the Web of Chemical Informatics</description>
    <item>
      <title>Roll Your Own Chemical Database With Free Components</title>
      <description>&lt;p&gt;Are you thinking of building a &lt;a href="http://depth-first.com/articles/2007/01/24/thirty-two-free-chemistry-databases"&gt;free chemical database&lt;/a&gt; but would rather not rent and maintain a bunch of proprietary software components? &lt;a href="http://merian.pch.univie.ac.at/pch/nh_info.html"&gt;Norbert Haider&lt;/a&gt; has thought a lot about this problem and offers some helpful resources to get you started:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;&lt;p&gt;&lt;a href="http://merian.pch.univie.ac.at/%7Enhaider/cheminf/moldb.html"&gt;Creating a web-based, searchable molecular structure database using free software&lt;/a&gt; Step-by step case study&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;a href="http://merian.pch.univie.ac.at/%7Enhaider/cheminf/moldb.pdf"&gt;How to create a web-based molecular structure database with free software&lt;/a&gt; A presentation&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;a href="http://merian.pch.univie.ac.at/%7Enhaider/cheminf/cmmm.html"&gt;checkmol/matchmol&lt;/a&gt; Open source command-line utility for 2D (sub)structure matching&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;a href="http://merian.pch.univie.ac.at/%7Enhaider/cheminf/mol2ps.html"&gt;mol2ps&lt;/a&gt; Command-line utility for converting molfiles into Postscript files&lt;/p&gt;&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Haider's system can be deployed on commodity hardware running open source operating systems. In other words, the cost of setting up a system like the one he describes is practically zero.&lt;/p&gt;

&lt;p&gt;Creating and open sourcing your own custom components is one way to go. Building on top of existing open source tools like &lt;a href="http://cdk.sf.net"&gt;CDK&lt;/a&gt;, &lt;a href="http://openbabel.sf.net"&gt;Open Babel&lt;/a&gt;, &lt;a href="http://depth-first.com/articles/tag/octet"&gt;Octet&lt;/a&gt; and &lt;a href="http://joelib.sf.net"&gt;JOELib&lt;/a&gt; is another.&lt;/p&gt;

&lt;p&gt;Haider's work raises an interesting question. Has anyone assembled a complete, ready to install general purpose chemical database package built from open source components? It for no other reason, such an exercise would give an excellent idea of what &lt;a href="http://depth-first.com/articles/2007/01/03/open-source-and-open-data-why-we-should-eat-our-own-dogfood"&gt;the dogfood tastes like&lt;/a&gt;.&lt;/p&gt;</description>
      <pubDate>Fri, 13 Apr 2007 10:27:00 +0000</pubDate>
      <guid isPermaLink="false">urn:uuid:a017a4e0-d8a0-48c2-87a3-5554b99b7373</guid>
      <author>Rich Apodaca</author>
      <link>http://depth-first.com/articles/2007/04/13/roll-your-own-chemical-database-with-free-components</link>
      <category>Tools</category>
      <category>database</category>
      <category>2d</category>
      <category>web</category>
      <category>cdk</category>
      <category>openbabel</category>
      <category>opensource</category>
      <category>joelib</category>
    </item>
    <item>
      <title>Making the Case: Similarity by Compression</title>
      <description>&lt;p&gt;&lt;a href="http://www-ra.informatik.uni-tuebingen.de/software/joelib/"&gt;&lt;img src="http://depth-first.com/files/joelib2_logo.gif" align="right" border="0"&gt;&lt;/img&gt;&lt;/a&gt;&lt;/p&gt;

&lt;blockquote&gt;
    &lt;p&gt;...The structures were converted to SMILES format and canonicalized using a program written with the open-source Java cheminformatics library JOELib2. ... To conclude, we have demonstrated that SMILES strings and compression programs are a simple, yet powerful method for similarity searching, competitive with state-of-the-art-techniques. The Ruby scripts used to carry out the experiments described in this paper are available for download from &lt;a href="http://comp.chem.nottingham.ac.uk/download/zippity/"&gt;http://comp.chem.nottingham.ac.uk/download/zippity/&lt;/a&gt;.&lt;/p&gt;

    &lt;p&gt;&lt;cite&gt;James Melville, Jenna Riley, and Johathan Hirst, &lt;a href="http://dx.doi.org/10.1021/ci600384z"&gt;J. Chem Inf. Model.&lt;/a&gt;&lt;/cite&gt;&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;Yet another appearance of Open Source software in the literature comes by way of a paper from Melville, Riley, and Hirst. This work takes advantage of the alphabet-like nature of SMILES strings and widely-available compression algorithms to perform molecular similarity analyses. Not only does this work use the Open Source &lt;a href="http://www-ra.informatik.uni-tuebingen.de/software/joelib/"&gt;JOELib&lt;/a&gt; library but the authors have made the Ruby scripts that perform the similarity analysis &lt;a href="http://comp.chem.nottingham.ac.uk/download/zippity/"&gt;freely available&lt;/a&gt; under the same terms as Ruby (&lt;a href="http://www.ruby-lang.org/en/LICENSE.txt"&gt;Ruby's license&lt;/a&gt; or the &lt;a href="http://opensource.org/licenses/gpl-license.php"&gt;GPL&lt;/a&gt;).&lt;/p&gt;

&lt;p&gt;The times they are &lt;a href="http://depth-first.com/articles/2006/10/13/making-the-case"&gt;a-changein'&lt;/a&gt;.&lt;/p&gt;</description>
      <pubDate>Wed, 13 Dec 2006 14:49:00 +0000</pubDate>
      <guid isPermaLink="false">urn:uuid:8ed0f4bf-2bde-4d9b-8953-fe2e0f1b32fd</guid>
      <author>Rich Apodaca</author>
      <link>http://depth-first.com/articles/2006/12/13/making-the-case-similarity-by-compression</link>
      <category>Open X</category>
      <category>joelib</category>
      <category>smiles</category>
      <category>similarity</category>
      <category>ruby</category>
      <category>zippity</category>
      <category>compression</category>
    </item>
  </channel>
</rss>
