<?xml version="1.0" encoding="UTF-8"?>
<?xml-stylesheet type="text/css" href="/stylesheets/rss.css"?>
<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:trackback="http://madskills.com/public/xml/rss/module/trackback/">
  <channel>
    <title>Depth-First: Smarter Cheminformatics: From SD File to Image Collection with ChemPhoto</title>
    <link>http://depth-first.com/articles/2008/09/08/smarter-cheminformatics-from-sd-file-to-image-collection-with-chemphoto</link>
    <language>en-us</language>
    <ttl>40</ttl>
    <description>Walking the Web of Chemical Informatics</description>
    <item>
      <title>Smarter Cheminformatics: From SD File to Image Collection with ChemPhoto</title>
      <description>&lt;p&gt;&lt;a href="http://metamolecular.com/chemphoto"&gt;&lt;img src="http://depth-first.com/demo/20080908/chemphoto.png" align="right"&gt;&lt;/img&gt;&lt;/a&gt;The old adage says time is money. Unfortunately, working chemists are often forced to spend a remarkable amount of valuable time and mental effort on menial chemical information processing tasks. These are things that could be done faster and with better quality by the right software, if it were available. Most importantly, these tasks take resources away from much more valuable work that &lt;em&gt;can't&lt;/em&gt; be automated.&lt;/p&gt;

&lt;h4&gt;The Problem in a Nutshell&lt;/h4&gt;

&lt;p&gt;As a case in point, consider the creation of 2D chemical structure images. If you maintain a compound collection of any kind, sooner or later you may end up asking yourself how you can create a set of images depicting the chemical structures in your collection.&lt;/p&gt;

&lt;h4&gt;A Specific Example: Chemical Suppliers&lt;/h4&gt;

&lt;p&gt;For example, you might work for a chemical supplier that maintains a Web-based eCommerce site, one or more PDF catalogs, or printed brochures. Your customers are chemists and they expect to see chemical structures in your product listings. How can you make this happen?&lt;/p&gt;

&lt;p&gt;If you look around for software that automates this job, you'll more likely than not come up empty-handed. The software that solves this problem well simply doesn't exist yet.&lt;/p&gt;

&lt;h4&gt;Doing it the Hard Way&lt;/h4&gt;

&lt;p&gt;In the absence of software to solve the problem, the only way to get the job done is to buckle down and do it manually. Most chemical structure editors allow you to save output as a raster image. Provided that this output matches your requirements, your system might consist of the following steps:&lt;/p&gt;

&lt;p&gt;(1) For every product in your catalog, create a single molfile or its machine-readable equivalent.&lt;/p&gt;

&lt;p&gt;(2) Load one file into your editor.&lt;/p&gt;

&lt;p&gt;(3) Save the file as a raster image, being careful to make sure all of the drawing settings and image size parameters are identical to the rest of your images.&lt;/p&gt;

&lt;p&gt;(4) Repeat Steps (2)-(3) until you have all of your images.&lt;/p&gt;

&lt;p&gt;There are many problems with this approach. For example, if your images ever need to be made larger (or smaller), you'll have to create all of your images over again (which can easily number in the thousands). Similarly, if for some reason you want to change the appearance of the images such as background, atom label coloring, or line thicknesses, you'll be forced into a lot of manual work. Finally, this system requires you to keep track of structures that have been imaged and those that haven't, which can in itself be nontrivial and error-prone, especially for thousands of products.&lt;/p&gt;

&lt;p&gt;With the right software, this problem would disappear.&lt;/p&gt;

&lt;h4&gt;One Solution: Customized Imaging Service&lt;/h4&gt;

&lt;p&gt;My company, &lt;a href="http://metamolecular.com"&gt;Metamolecular&lt;/a&gt;, has recently provided custom imaging services to a few chemical suppliers wanting thousands of good-looking structure images rendered automatically. The service made use of the versatile &lt;a href="http://metamolecular.com/chemwriter"&gt;ChemWriter&lt;/a&gt; rendering engine together with some custom code written in &lt;a href="http://depth-first.com/articles/tag/ruby"&gt;Ruby&lt;/a&gt;.&lt;/p&gt;

&lt;p&gt;Although the imaging service works very well as a one-off solution, it's less than optimal in the longer term. Any changes to the image collection must be processed by Metamolecular, and then sent back to the client. A cheaper and faster solution would be to offer software that implements the functionality of the service.&lt;/p&gt;

&lt;h4&gt;A Better Solution: Chemical Structure Imaging Software&lt;/h4&gt;

&lt;p&gt;Wouldn't it be great if easy-to-use software existed that could automatically generate thousands of chemical structure images with the press of a button?&lt;/p&gt;

&lt;p&gt;In particular, the software should:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;&lt;p&gt;Run on any modern platform (Windows, Mac OS X, Linux).&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;Read industry-standard Structure Data Files (SD Files).&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;Be capable of working with tens of thousands of chemical structures at a time even on older machines.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;Store fully-customizable drawing settings in a format that could be used over and over again for a consistent and professional look.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;Allow the output to be previewed exactly as it will appear in the generated images ("what you see is what you get").&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;Output to a variety of image formats, including: Portable Network Graphics (PNG image); JPG image; &lt;a href="http://depth-first.com/articles/2008/06/10/adobe-flash-for-cheminformatics-fast-scalable-and-attractive-2d-depiction-of-chemical-structures-with-vector-graphics"&gt;Flash&lt;/a&gt; (SWF file); &lt;a href="http://depth-first.com/articles/2006/09/09/generating-and-serving-2-d-molecular-svgs"&gt;Scalable Vector Graphics&lt;/a&gt; (SVG); and &lt;a href="http://depth-first.com/articles/2008/08/07/encapsulated-postscript-for-cheminformatics"&gt;Encapsulated PostScript&lt;/a&gt; (EPS file).&lt;/p&gt;&lt;/li&gt;
&lt;/ul&gt;

&lt;h4&gt;Introducing ChemPhoto&lt;/h4&gt;

&lt;p&gt;ChemPhoto is designed to solve the problem of consistently creating large numbers of high-quality 2D chemical structure images. Currently in development, the first versions of ChemPhoto will be available for review within the next two weeks.&lt;/p&gt;

&lt;p&gt;ChemPhoto consists of a lightweight and intuitive user interface layer built on top of the ChemWriter rendering engine. ChemPhoto focuses on doing one thing very well, so it wouldn't be useful for creating or editing SD Files (a task for which many tools already exist). The software is specifically designed to work well with large SD Files, such as the 25,000-structure sets that can be obtained from &lt;a href="http://pubchem.ncbi.nlm.nih.gov/"&gt;PubChem&lt;/a&gt;. Written in Java, ChemPhoto runs on Windows, Mac OS X, and Linux. Future articles will discuss ChemPhoto's design and implementation.&lt;/p&gt;

&lt;p&gt;If you're interested in evaluating ChemPhoto, feel free to &lt;a href="http://mailhide.recaptcha.net/d?k=01R9bxyP6XNdc0duoUCzBBHA==&amp;amp;c=vZ7R0VDctRzIRzbSs1-LZwDzjTjAnfCS4KONqGHxY9I=" onclick="window.open('http://mailhide.recaptcha.net/d?k=01R9bxyP6XNdc0duoUCzBBHA==&amp;amp;c=vZ7R0VDctRzIRzbSs1-LZwDzjTjAnfCS4KONqGHxY9I=', '', 'toolbar=0,scrollbars=0,location=0,statusbar=0,menubar=0,resizable=0,width=500,height=300'); return false;" title="Reveal this e-mail address"&gt;drop me a line&lt;/a&gt;.&lt;/p&gt;</description>
      <pubDate>Mon, 08 Sep 2008 19:04:00 +0000</pubDate>
      <guid isPermaLink="false">urn:uuid:e69d3005-c83f-463b-9826-e08b4339f91a</guid>
      <author>Rich Apodaca</author>
      <link>http://depth-first.com/articles/2008/09/08/smarter-cheminformatics-from-sd-file-to-image-collection-with-chemphoto</link>
      <category>Tools</category>
      <category>chemphoto</category>
      <category>image</category>
      <category>png</category>
      <category>jpg</category>
      <category>swf</category>
      <category>svg</category>
      <category>eps</category>
      <category>sdfile</category>
      <category>automation</category>
    </item>
    <item>
      <title>"Smarter Cheminformatics: From SD File to Image Collection with ChemPhoto" by Rich Apodaca</title>
      <description>&lt;p&gt;Wolf, thanks for the link. &lt;a href="http://81.169.131.69/Documentation/csib_man.pdf" rel="nofollow"&gt;csib&lt;/a&gt; looks like a very full-featured command-line utility.&lt;/p&gt;

&lt;p&gt;I'm also aware of other command-line utilities that can accomplish a similar end result as ChemPhoto. For example, &lt;a href="http://merian.pch.univie.ac.at/~nhaider/cheminf/mol2ps.html" rel="nofollow"&gt;mol2ps&lt;/a&gt; by &lt;a href="http://merian.pch.univie.ac.at/pch/nh_info.html" rel="nofollow"&gt;Norbert Haider&lt;/a&gt; can convert SD files into PostScript files.&lt;/p&gt;

&lt;p&gt;In the area of developer toolkits, &lt;a href="http://baoilleach.blogspot.com/" rel="nofollow"&gt;Noel O'Boyle&lt;/a&gt; has summarized some &lt;a href="http://baoilleach.blogspot.com/2008/05/cheminformatics-toolkit-face-off.html" rel="nofollow"&gt;open source depiction and SDG libraries&lt;/a&gt;.&lt;/p&gt;

&lt;p&gt;The key difference between ChemPhoto and all of these other tools is that ChemPhoto is built around a graphical user interface.&lt;/p&gt;

&lt;p&gt;The difference between a command-line utility and a desktop application may seem subtle, but it's really the difference between software that can't be used by the average chemist and software that can.&lt;/p&gt;

&lt;p&gt;For example, all changes to drawing settings can be viewed in real-time with ChemPhoto, without having to generate any images. The entire image collection can be viewed exactly as it will appear once images are created.&lt;/p&gt;

&lt;p&gt;This makes the process of arriving at just the right appearance very fast. This is in addition to more standard advantages such as browsing to files graphically as opposed to using file paths, and being able to select the last few most recently used files.&lt;/p&gt;

&lt;p&gt;Developers have many tools at their disposal for generating 2D chemical structure images. ChemPhoto will be a tool that non-developers can use.&lt;/p&gt;</description>
      <pubDate>Sat, 11 Oct 2008 15:01:41 +0000</pubDate>
      <guid isPermaLink="false">urn:uuid:e73c417f-db8c-4e59-b2ae-f482b9d95894</guid>
      <link>http://depth-first.com/articles/2008/09/08/smarter-cheminformatics-from-sd-file-to-image-collection-with-chemphoto#comment-817</link>
    </item>
    <item>
      <title>"Smarter Cheminformatics: From SD File to Image Collection with ChemPhoto" by Wolf Ihlenfeldt</title>
      <description>&lt;p&gt;&lt;a href="http://81.169.131.69/Documentation/csib_man.pdf" rel="nofollow"&gt;http://81.169.131.69/Documentation/csib_man.pdf&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;Has all been done, available for many years, supports all planned features of application above plus many more, free for academic use, uses same (but continuously updated) image generation algorithms as in PubChem.&lt;/p&gt;

&lt;p&gt;The program does not currently generate flash, but writes for example output Windows vector formats (WMF, EMF) on all platforms, and supports a couple of niche Formats (FrameMaker MIF for professional documentation).&lt;/p&gt;

&lt;p&gt;Also: It reads dozens of file formats in addition to SDF, including 3D formats with automatic connectivity regeneration. It can be used to re-code existing ChemDraw or ISISDraw images - the software knows how to extract embedded hidden connectivity and coordinate information from WMF/EMF images written by these tools. Of course it also reads the native formats of these applications. In the latest versions you can even use RTF, DOCX or XLS files with embedded ChemDraw or ISISDRaw OLE objects as input - on any platform.&lt;/p&gt;</description>
      <pubDate>Sat, 11 Oct 2008 11:45:45 +0000</pubDate>
      <guid isPermaLink="false">urn:uuid:5ebb7065-6a6f-4127-b4d4-4b6e88744971</guid>
      <link>http://depth-first.com/articles/2008/09/08/smarter-cheminformatics-from-sd-file-to-image-collection-with-chemphoto#comment-816</link>
    </item>
  </channel>
</rss>
