<?xml version="1.0" encoding="UTF-8"?>
<?xml-stylesheet type="text/css" href="/stylesheets/rss.css"?>
<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:trackback="http://madskills.com/public/xml/rss/module/trackback/">
  <channel>
    <title>Depth-First: Structure Diagram Generation</title>
    <link>http://depth-first.com/articles/2007/04/11/structure-diagram-generation</link>
    <language>en-us</language>
    <ttl>40</ttl>
    <description>Walking the Web of Chemical Informatics</description>
    <item>
      <title>Structure Diagram Generation</title>
      <description>&lt;p&gt;&lt;img src="http://depth-first.com/demo/20070411/difficult.png" align="right"&gt;&lt;/img&gt;Given a molecule with no 2D coordinates, how would you render a human-readable view? This problem arises in many situations, but most commonly in the context of interpreting &lt;a href="http://depth-first.com/articles/tag/linenotation"&gt;line notations&lt;/a&gt; such as IUPAC nomenclature, SMILES, or InChI. Whatever the solution you come up with, you'll come face-to-face with the structure diagram generation (SDG) problem.&lt;/p&gt;

&lt;p&gt;Generating 2D molecular coordinates is a fundamental (and remarkably difficult) problem in cheminformatics. Discussions in the primary literature &lt;a href="http://dx.doi.org/10.1021/ci60011a015"&gt;date back&lt;/a&gt; to at least the 1970s with Chemical Abstract Service's pioneering large-scale efforts. A &lt;a href="http://dx.doi.org/10.1021/ci050550m"&gt;recent article&lt;/a&gt; from Chemical Computing Group (CCG) described the design and implementation of an advanced SDG system. To my knowledge, the only open source implementation of an SDG system is found in the &lt;a href="http://wiki.cubic.uni-koeln.de/cdknews/index.php/CDKNews/article/view/5"&gt;Chemistry Development Kit&lt;/a&gt;, and by extension &lt;a href="http://depth-first.com/articles/tag/rubycdk"&gt;Ruby CDK&lt;/a&gt;.&lt;/p&gt;

&lt;p&gt;The SDG problem plays an important role in the &lt;a href="http://depth-first.com/articles/2007/03/30/the-aesthetics-of-chemical-structure-diagrams"&gt;aesthetics of chemical structure diagrams&lt;/a&gt;, as mentioned by two readers. To render a molecule aesthetically, 2D coordinates must minimize confusing atom overlaps, unconventional orientations, and unusual bond angles.&lt;/p&gt;

&lt;p&gt;The role of SDG in cheminformatics can only continue to increase in importance, especially as more and more structures are automatically generated through mining the primary literature, the Internet, &lt;a href="http://wwmm.ch.cam.ac.uk/blogs/murrayrust/?p=28"&gt;old PDFs&lt;/a&gt;, and other sources. With all of these new computer-generated structures will come the need to make them readily understandable to a chemist through SDG.&lt;/p&gt;</description>
      <pubDate>Wed, 11 Apr 2007 10:20:00 +0000</pubDate>
      <guid isPermaLink="false">urn:uuid:9212c666-5a2f-414a-82c3-2a63ea07d845</guid>
      <author>Rich Apodaca</author>
      <link>http://depth-first.com/articles/2007/04/11/structure-diagram-generation</link>
      <category>Tools</category>
      <category>sdg</category>
      <category>rubycdk</category>
      <category>aesthetics</category>
      <category>2d</category>
      <category>smiles</category>
      <category>inchi</category>
    </item>
    <item>
      <title>"Structure Diagram Generation" by Rich Apodaca</title>
      <description>&lt;p&gt;Thanks, Geoff - I had missed that. Looks like the MCDL editor does do structure diagram generation. As a bonus the MCDL source is public domain.&lt;/p&gt;</description>
      <pubDate>Sun, 22 Apr 2007 01:03:24 +0000</pubDate>
      <guid isPermaLink="false">urn:uuid:cffa7ee0-8cb8-42d1-9017-c5266f2d3cfc</guid>
      <link>http://depth-first.com/articles/2007/04/11/structure-diagram-generation#comment-25</link>
    </item>
    <item>
      <title>"Structure Diagram Generation" by Geoff</title>
      <description>&lt;p&gt;Wasn't there another open source 2D layout. I remember it was at the fall 2006 ACS meeting in San Francisco in the CINF general session.&lt;/p&gt;

&lt;p&gt;Wait, I found it: &lt;a href="http://sourceforge.net/projects/mcdl" rel="nofollow"&gt;http://sourceforge.net/projects/mcdl&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;It's public domain. Their line notation is a little strange, but it might be interesting to look at their source.&lt;/p&gt;</description>
      <pubDate>Mon, 16 Apr 2007 15:35:11 +0000</pubDate>
      <guid isPermaLink="false">urn:uuid:e4e4c367-c4f0-49d6-a92c-d4f21e57d5ba</guid>
      <link>http://depth-first.com/articles/2007/04/11/structure-diagram-generation#comment-21</link>
    </item>
    <item>
      <title>"Structure Diagram Generation" by Rich Apodaca</title>
      <description>&lt;p&gt;Ah yes, MDRAW. I also remember the pains I went to many years ago to track down that source code on microfiche. Although the source code is there to read, if I'm not mistaken it's copyrighted by J. Chem. Res. The code itself has not, to my knowledge, been released under an open source license. So it's &lt;a href="http://depth-first.com/articles/2006/09/27/hacking-pubchem-free-speech-or-free-beer" rel="nofollow"&gt;free beer&lt;/a&gt;, but not free speech.&lt;/p&gt;</description>
      <pubDate>Sat, 14 Apr 2007 10:56:39 +0000</pubDate>
      <guid isPermaLink="false">urn:uuid:eda89bba-b459-4901-96b9-33bf02e87b70</guid>
      <link>http://depth-first.com/articles/2007/04/11/structure-diagram-generation#comment-17</link>
    </item>
    <item>
      <title>"Structure Diagram Generation" by Christoph Steinbeck</title>
      <description>&lt;p&gt;In fact, there is one more open source implementation:&lt;/p&gt;

&lt;p&gt;K Bley, J Brandt, A Dengler, R Frank, and I Ugi  (1991)
Constitutional Formulae generated from Connectivity Information: the Program MDRAW. Journal of Chemical Research (M):2601-2689.&lt;/p&gt;

&lt;p&gt;This is a funny one. The whole source code what published in print form in the microfiche edition of the Journal of Chemical Research. The print edition had only the article. In 1995, I got very bad print copy of the micro fiche by interlibrary loan and my daddy and I sat down and typed in the MDRAW C code. It was horrible. I never got it to run and later managed to contacted Alf Dengler, who sent me the MDRAW C sources. Those formed the basis of what I used in LUCY to display results. MDRAW was designed to produce diagrams with all carbons symbols drawn and therefore produced linear carbon chains.
In 2002 I wrote the CDK structure diagram generation completely from scratch - no concepts from MDRAW where used. &lt;/p&gt;</description>
      <pubDate>Thu, 12 Apr 2007 05:20:24 +0000</pubDate>
      <guid isPermaLink="false">urn:uuid:5e879a21-f4a8-4e21-9ec1-919f8f9ff8ca</guid>
      <link>http://depth-first.com/articles/2007/04/11/structure-diagram-generation#comment-14</link>
    </item>
  </channel>
</rss>
