<?xml version="1.0" encoding="UTF-8"?>
<?xml-stylesheet type="text/css" href="/stylesheets/rss.css"?>
<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:trackback="http://madskills.com/public/xml/rss/module/trackback/">
  <channel>
    <title>Depth-First: Roll Your Own Chemical Database With Free Components</title>
    <link>http://depth-first.com/articles/2007/04/13/roll-your-own-chemical-database-with-free-components</link>
    <language>en-us</language>
    <ttl>40</ttl>
    <description>Walking the Web of Chemical Informatics</description>
    <item>
      <title>Roll Your Own Chemical Database With Free Components</title>
      <description>&lt;p&gt;Are you thinking of building a &lt;a href="http://depth-first.com/articles/2007/01/24/thirty-two-free-chemistry-databases"&gt;free chemical database&lt;/a&gt; but would rather not rent and maintain a bunch of proprietary software components? &lt;a href="http://merian.pch.univie.ac.at/pch/nh_info.html"&gt;Norbert Haider&lt;/a&gt; has thought a lot about this problem and offers some helpful resources to get you started:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;&lt;p&gt;&lt;a href="http://merian.pch.univie.ac.at/%7Enhaider/cheminf/moldb.html"&gt;Creating a web-based, searchable molecular structure database using free software&lt;/a&gt; Step-by step case study&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;a href="http://merian.pch.univie.ac.at/%7Enhaider/cheminf/moldb.pdf"&gt;How to create a web-based molecular structure database with free software&lt;/a&gt; A presentation&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;a href="http://merian.pch.univie.ac.at/%7Enhaider/cheminf/cmmm.html"&gt;checkmol/matchmol&lt;/a&gt; Open source command-line utility for 2D (sub)structure matching&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;a href="http://merian.pch.univie.ac.at/%7Enhaider/cheminf/mol2ps.html"&gt;mol2ps&lt;/a&gt; Command-line utility for converting molfiles into Postscript files&lt;/p&gt;&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Haider's system can be deployed on commodity hardware running open source operating systems. In other words, the cost of setting up a system like the one he describes is practically zero.&lt;/p&gt;

&lt;p&gt;Creating and open sourcing your own custom components is one way to go. Building on top of existing open source tools like &lt;a href="http://cdk.sf.net"&gt;CDK&lt;/a&gt;, &lt;a href="http://openbabel.sf.net"&gt;Open Babel&lt;/a&gt;, &lt;a href="http://depth-first.com/articles/tag/octet"&gt;Octet&lt;/a&gt; and &lt;a href="http://joelib.sf.net"&gt;JOELib&lt;/a&gt; is another.&lt;/p&gt;

&lt;p&gt;Haider's work raises an interesting question. Has anyone assembled a complete, ready to install general purpose chemical database package built from open source components? It for no other reason, such an exercise would give an excellent idea of what &lt;a href="http://depth-first.com/articles/2007/01/03/open-source-and-open-data-why-we-should-eat-our-own-dogfood"&gt;the dogfood tastes like&lt;/a&gt;.&lt;/p&gt;</description>
      <pubDate>Fri, 13 Apr 2007 10:27:00 +0000</pubDate>
      <guid isPermaLink="false">urn:uuid:a017a4e0-d8a0-48c2-87a3-5554b99b7373</guid>
      <author>Rich Apodaca</author>
      <link>http://depth-first.com/articles/2007/04/13/roll-your-own-chemical-database-with-free-components</link>
      <category>Tools</category>
      <category>database</category>
      <category>2d</category>
      <category>web</category>
      <category>cdk</category>
      <category>openbabel</category>
      <category>opensource</category>
      <category>joelib</category>
    </item>
    <item>
      <title>"Roll Your Own Chemical Database With Free Components" by Geoff</title>
      <description>&lt;p&gt;For small applications, you probably don't even need the database. Mac, Windows, and Linux all have mechanisms to index chemistry data -- I showed this with ChemSpotlight. Leave the data files in place and use something like Rails to allow a query interface and return the results.&lt;/p&gt;

&lt;p&gt;(Hmm, sounds like I need to add another section before submitting the ChemSpotlight paper. Rich, if you're interested, let me know.)&lt;/p&gt;</description>
      <pubDate>Mon, 16 Apr 2007 15:28:10 +0000</pubDate>
      <guid isPermaLink="false">urn:uuid:1927cf30-37ce-43be-a69a-ebb37cab16ed</guid>
      <link>http://depth-first.com/articles/2007/04/13/roll-your-own-chemical-database-with-free-components#comment-20</link>
    </item>
    <item>
      <title>"Roll Your Own Chemical Database With Free Components" by Rajarshi Guha</title>
      <description>&lt;p&gt;You could also consider chemical backends for databases. Two OSS examples include Tigress (for PostgreSQL) and SMDC (for MySQL). Both use Openbabel internally, though Tigress also makes use of checkmol/matchmol.&lt;/p&gt;

&lt;p&gt;I like these approaches though I haven't used these specific packages myself (we're using gNova which follows the same idea).&lt;/p&gt;

&lt;p&gt;But the fact that the DB now has cheminformatics functionality makes writing applications based on DB backends much easier and a lot of our projects are based on this approach.&lt;/p&gt;

&lt;p&gt;Coupled with some simple scripts to say load SDF's into a DB, evaluate some properties and load them in etc, it's pretty easy to make a LAMP'esque application. Obviously DB's are general purpose tools, so in some cases we need to get our hands dirty (say indexing schemes for 3D search).&lt;/p&gt;

&lt;p&gt;But I can envisage a small, but relatively general distribution  starts at SDF's and ends at a simply to use web page for queries&lt;/p&gt;</description>
      <pubDate>Sat, 14 Apr 2007 14:50:46 +0000</pubDate>
      <guid isPermaLink="false">urn:uuid:06a429dd-5b61-45ed-bd56-a0a165b6cf57</guid>
      <link>http://depth-first.com/articles/2007/04/13/roll-your-own-chemical-database-with-free-components#comment-19</link>
    </item>
    <item>
      <title>"Roll Your Own Chemical Database With Free Components" by Egon Willighagen</title>
      <description>&lt;p&gt;Oh, forgot about AMBIT: check that out. Less spectrum, more molecule:&lt;/p&gt;

&lt;p&gt;&lt;a href="http://ambit.acad.bg/" rel="nofollow"&gt;http://ambit.acad.bg/&lt;/a&gt;&lt;/p&gt;</description>
      <pubDate>Sat, 14 Apr 2007 12:48:17 +0000</pubDate>
      <guid isPermaLink="false">urn:uuid:9d92c3ff-7fdd-4ad2-b22a-499821e0299f</guid>
      <link>http://depth-first.com/articles/2007/04/13/roll-your-own-chemical-database-with-free-components#comment-18</link>
    </item>
    <item>
      <title>"Roll Your Own Chemical Database With Free Components" by Rich Apodaca</title>
      <description>&lt;p&gt;I was thinking of &lt;a href="http://sf.net/projects/nmrshiftdb" rel="nofollow"&gt;NMRShiftDB&lt;/a&gt; more as a specific kind of chemical database. Still, it would be interesting to see how it works as a general-purpose chemical database.&lt;/p&gt;</description>
      <pubDate>Sat, 14 Apr 2007 10:47:45 +0000</pubDate>
      <guid isPermaLink="false">urn:uuid:1ef23ee3-a3c9-4ad1-b604-c36be96e681c</guid>
      <link>http://depth-first.com/articles/2007/04/13/roll-your-own-chemical-database-with-free-components#comment-16</link>
    </item>
    <item>
      <title>"Roll Your Own Chemical Database With Free Components" by Egon Willighagen</title>
      <description>&lt;p&gt;Stefan build the NMRShiftDB software, which in its core is a molecular compound database system. You can download it from the corresponding SourceForge webpage. The spectrum part is just an extra.&lt;/p&gt;</description>
      <pubDate>Sat, 14 Apr 2007 02:52:07 +0000</pubDate>
      <guid isPermaLink="false">urn:uuid:8c31f5e4-863b-414b-82ea-a0cfcc321954</guid>
      <link>http://depth-first.com/articles/2007/04/13/roll-your-own-chemical-database-with-free-components#comment-15</link>
    </item>
  </channel>
</rss>
