<?xml version="1.0" encoding="UTF-8"?>
<?xml-stylesheet type="text/css" href="/stylesheets/rss.css"?>
<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:trackback="http://madskills.com/public/xml/rss/module/trackback/">
  <channel>
    <title>Depth-First: Tag chempedia</title>
    <link>http://depth-first.com/articles/tag/chempedia</link>
    <language>en-us</language>
    <ttl>40</ttl>
    <description>Walking the Web of Chemical Informatics</description>
    <item>
      <title>SciFinder Web, Greasemonkey, and REST: Embracing Divergence in Chemical Information Systems</title>
      <description>&lt;p&gt;&lt;a href="http://flickr.com/photos/trashd/754113710/"&gt;&lt;img src="http://depth-first.com/demo/20081119/diverge.jpg" align="right"&gt;&lt;/img&gt;&lt;/a&gt;Recently, &lt;a href="http://personnes.epfl.ch/alain.borel"&gt;Alain Borel&lt;/a&gt; posted a message to the &lt;a href="https://listserv.indiana.edu/cgi-bin/wa-iub.exe?A0=CHMINF-L"&gt;CHMINF-L list&lt;/a&gt; describing his successful attempt to get links to external datasources to show up in SciFinder Web:&lt;/p&gt;

&lt;blockquote&gt;
    &lt;p&gt;I'm dreaming of a world where chemical data is hyperlinked as thoroughly
    as text data is today (or even more)... here's my small contribution to
    this goal.&lt;/p&gt;
    
    &lt;p&gt;...&lt;/p&gt;
    
    &lt;p&gt;Basically, Greasemonkey allows you to rewrite the HTML content of a web
    page before it is rendered in the browser window. Thanks to this, I've
    been able to write a script that adds links to external databases
    through registry numbers. Currently, &lt;a href="http://chemspider.com"&gt;ChemSpider&lt;/a&gt;, &lt;a href="http://chempedia.com"&gt;Chempedia&lt;/a&gt; and &lt;a href="http://chemexper.com"&gt;Chemexper&lt;/a&gt; [links added]
    are supported - I also have a private version that links to our Intranet
    chemical stocks application. The pleasant side for those who worry about
    intellectual property is that neither side of the link needs to know
    what's on the other side, and even the plugin doesn't know what's inside
    the database records.&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;&lt;a href="http://personnes.epfl.ch/alain.borel"&gt;Greasemonkey&lt;/a&gt; is the client-side script plugin engine that enables users to change the appearance and content of any site. Although originally designed as a Firefox extension, as least some Greasemonkey scripts can be run in &lt;a href="http://www.simplehelp.net/2007/11/14/how-to-run-greasemonkey-scripts-in-safari/"&gt;Safari&lt;/a&gt; and &lt;a href="http://www.ghacks.net/2008/10/18/google-chrome-adds-greasemonkey-support/"&gt;Chrome&lt;/a&gt;. A &lt;a href="http://dx.doi.org/10.1186/1471-2105-8-487"&gt;recent paper&lt;/a&gt; describes some of the potential for this form of scripting in the life sciences.&lt;/p&gt;

&lt;p&gt;Alain's script, which can be &lt;a href="http://biscom.epfl.ch/scifinder_links.user.js"&gt;freely downloaded&lt;/a&gt;, uses &lt;a href="http://depth-first.com/articles/2008/05/26/simple-cas-number-lookup-and-more-with-chempedia"&gt;CAS numbers&lt;/a&gt; to link SciFinder records to the external databases. Although I don't have access to SciFinder Web, Alain's description makes it sound like each entry for a specific substance in SciFinder Web is given an additional set of links out to external datasources.&lt;/p&gt;

&lt;h4&gt;What's REST Got To Do With It?&lt;/h4&gt;

&lt;p&gt;One of Alain's external datasources is &lt;a href="http://chempedia.com"&gt;Chempedia&lt;/a&gt;. A unique feature of Chempedia is the way it exposes a &lt;a href="http://depth-first.com/articles/2008/05/26/simple-cas-number-lookup-and-more-with-chempedia"&gt;an electronic paper trail for CAS numbers&lt;/a&gt;. Rather than just reporting a CAS registry number, it fully discloses which organization is asserting that a particular CAS number belongs with a structure.&lt;/p&gt;

&lt;p&gt;For example, see &lt;a href="http://chempedia.com/registry_numbers/525-66-6"&gt;this entry on [525-66-6]&lt;/a&gt;.&lt;/p&gt;

&lt;p&gt;Chempedia was designed from the start to apply the principles of &lt;a href="http://depth-first.com/articles/2007/05/30/restful-cheminformatics"&gt;REST&lt;/a&gt;. The big idea behind REST is that every resource on the Web, such as a CAS number, can be manipulated by exactly four methods: GET; PUT; POST; and DELETE.&lt;/p&gt;

&lt;p&gt;The highly-desirable side-effect of designing Web sites around the concept of resources being acted on by exactly four methods is that sites applying this philosophy become orders of magnitude easier to &lt;a href="http://depth-first.com/articles/2006/09/23/mashups-for-fun-and-profit"&gt;mash up&lt;/a&gt;.&lt;/p&gt;

&lt;p&gt;Alain's Greasemonkey script is an example of a client-side mashup. But how did he do it?&lt;/p&gt;

&lt;h4&gt;CAS Numbers are First-Class Citizens&lt;/h4&gt;

&lt;p&gt;Each CAS number on Chempedia is a resource that can be accessed by a URL taking the form:&lt;/p&gt;

&lt;p&gt;&lt;tt&gt;http://chempedia.com/registry_numbers/REGISTRY_NUMBER&lt;/tt&gt;&lt;/p&gt;

&lt;p&gt;where &lt;tt&gt;REGISTRY_NUMBER&lt;/tt&gt; is the CAS number of interest. For example, acetaminophen has the registry number [103-90-2] and it can be accessed with this URL:&lt;/p&gt;

&lt;p&gt;&lt;a href="http://chempedia.com/registry_numbers/103-90-2"&gt;http://chempedia.com/registry_numbers/103-90-2&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;If you request a CAS number that doesn't exist, you should receive a 404 error, although a bug in Chempedia is currently preventing that from happening.&lt;/p&gt;

&lt;p&gt;To link SciFinder Web to Chempedia, Alain's user script simply looks for which CAS number the SciFinder page is talking about and constructs the RESTful URL. It doesn't get much simpler than that.&lt;/p&gt;

&lt;h4&gt;Conclusions&lt;/h4&gt;

&lt;p&gt;Chemistry is a large, established field. Not surprisingly, &lt;a href="http://depth-first.com/articles/2008/05/07/1908-and-all-that-the-long-tail-and-chemistry"&gt;specialization is an essential part of being a chemist&lt;/a&gt;. It's therefore to be expected that chemical databases will diverge into a variety of specialized forms. One size will almost certainly not fit all.&lt;/p&gt;

&lt;p&gt;We can deny this simple fact and build ever more complex and unusable chemical information systems. Or we can accept it and custom-build our services for the job at hand.&lt;/p&gt;

&lt;p&gt;RESTful server architectures and mashups offer a powerful way to accomplish this goal.&lt;/p&gt;

&lt;p&gt;&lt;em&gt;Image Credit: &lt;a href="http://flickr.com/photos/trashd/"&gt;&lt;/em&gt;betenoir&lt;em&gt;&lt;/a&gt;&lt;/em&gt;&lt;/p&gt;</description>
      <pubDate>Wed, 19 Nov 2008 17:41:00 +0000</pubDate>
      <guid isPermaLink="false">urn:uuid:8381b2d7-1d61-4266-ab87-522da8b3e178</guid>
      <author>Rich Apodaca</author>
      <link>http://depth-first.com/articles/2008/11/19/scifinder-web-greasemonkey-and-rest-embracing-divergence-in-chemical-information-systems</link>
      <category>Meta</category>
      <category>chempedia</category>
      <category>rest</category>
      <category>mashup</category>
      <category>greasemonkey</category>
      <category>userscript</category>
      <category>scifinder</category>
      <category>scifinderweb</category>
      <category>divergence</category>
    </item>
    <item>
      <title>Building Chempedia: Learning About Contributors</title>
      <description>&lt;p&gt;&lt;a href="http://chempedia.com"&gt;&lt;img src="http://depth-first.com/demo/20080513/chempedia.png" align="right"&gt;&lt;/img&gt;&lt;/a&gt;&lt;a href="http://chempedia.com/"&gt;Chempedia&lt;/a&gt; is a free online chemical encyclopedia similar in concept to the Merck Index, but &lt;a href="http://depth-first.com/articles/2008/04/28/building-chempedia-indexing-wikipedias-6-411-compound-monographs"&gt;radically different&lt;/a&gt; in implementation. One key difference: the Merck Index is compiled by a small number of paid professionals while Chempedia is compiled by thousands of unpaid volunteers. Although this distinction raises a host of intriguing questions, one of the most basic revolves around what can be said about these volunteers in the aggregate. This article, the first in a series, explores this issue with some statistics compiled from Chempedia.&lt;/p&gt;

&lt;h4&gt;Learning About Contributors&lt;/h4&gt;

&lt;p&gt;Chempedia works in part by aggregating content from Wikipedia dealing with single molecular entities, or "Compound Monographs." This content is created by the now &lt;a href="http://en.wikipedia.org/wiki/Wikipedia:Introduction"&gt;famous process&lt;/a&gt; of individuals taking upon themselves the responsibility of fixing what's broken in Wikipedia. (Some take it upon themselves to &lt;a href="http://en.wikipedia.org/wiki/Wikipedia:Vandalism"&gt;break what's working&lt;/a&gt;, but that's another topic.)&lt;/p&gt;

&lt;p&gt;Chempedia associates each of its Compound Monographs with the last Wikipedia user to edit it. The current interface to these relationships is available on the &lt;a href="http://chempedia.com/contributors"&gt;Chempedia contributors page&lt;/a&gt;.&lt;/p&gt;

&lt;p&gt;The interface to this page is currently limited. The analyses reported here were made for the most part by querying the Chempedia database directly.&lt;/p&gt;

&lt;p&gt;Each contributor is linked to a contributor summary page containing links to that user's Wikipedia homepage and talk page, as well as a complete listing of all active contributions. For example, you can view the contributor page for one of Chempedia's most active contributors, &lt;a href="http://chempedia.com/contributors/40"&gt;Arcadian&lt;/a&gt;.&lt;/p&gt;

&lt;p&gt;The data model is also limited. Because Chempedia only records the last Contributor to edit a Monograph, when another Contributor edits a Monograph, the link between the previous Contributor is lost. As a result, many Contributors have no associated Monographs.&lt;/p&gt;

&lt;h4&gt;How Many Monographs?&lt;/h4&gt;

&lt;p&gt;Chempedia currently hosts 6,308 Compound Monographs.&lt;/p&gt;

&lt;h4&gt;How Many Contributors?&lt;/h4&gt;

&lt;p&gt;Chempedia currently lists &lt;a href="http://chempedia.com/contributors"&gt;2,516 Contributors&lt;/a&gt;. Of these, 1,046, or 42% are associated with one or more Monographs, meaning that they were the last to edit. The remainder are associated with no Monographs for which they were the last to edit.&lt;/p&gt;

&lt;p&gt;Here is a list of the top 20 Contributors and the number of Monographs they were the last to edit:&lt;/p&gt;

&lt;table&gt;
&lt;tr&gt;&lt;td&gt;&lt;a href="http://chempedia.com/contributors/2"&gt;anonymous&lt;/a&gt;&lt;/td&gt;&lt;td&gt;1022&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td&gt;&lt;a href="http://chempedia.com/contributors/2"&gt;DOI bot&lt;/a&gt;&lt;/td&gt;&lt;td&gt;904&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td&gt;&lt;a href="http://chempedia.com/contributors/1"&gt;Edgar181&lt;/a&gt;&lt;/td&gt;&lt;td&gt;378&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td&gt;&lt;a href="http://chempedia.com/contributors/66"&gt;Fvasconcellos&lt;/a&gt;&lt;/td&gt;&lt;td&gt;170&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td&gt;&lt;a href="http://chempedia.com/contributors/31"&gt;Meodipt&lt;/a&gt;&lt;/td&gt;&lt;td&gt;151&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td&gt;&lt;a href="http://chempedia.com/contributors/40"&gt;Arcadian&lt;/a&gt;&lt;/td&gt;&lt;td&gt;144&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td&gt;&lt;a href="http://chempedia.com/contributors/59"&gt;Chem-awb&lt;/a&gt;&lt;/td&gt;&lt;td&gt;133&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td&gt;&lt;a href="http://chempedia.com/contributors/22"&gt;Chowbok&lt;/a&gt;&lt;/td&gt;&lt;td&gt;122&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td&gt;&lt;a href="http://chempedia.com/contributors/2"&gt;Rifleman 82&lt;/a&gt;&lt;/td&gt;&lt;td&gt;114&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td&gt;&lt;a href="http://chempedia.com/contributors/10"&gt;SmackBot&lt;/a&gt;&lt;/td&gt;&lt;td&gt;105&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td&gt;&lt;a href="http://chempedia.com/contributors/19"&gt;Thijs!bot&lt;/a&gt;&lt;/td&gt;&lt;td&gt;99&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td&gt;&lt;a href="http://chempedia.com/contributors/1236"&gt;ChemNerd&lt;/a&gt;&lt;/td&gt;&lt;td&gt;85&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td&gt;&lt;a href="http://chempedia.com/contributors/127"&gt;Puppy8800&lt;/a&gt;&lt;/td&gt;&lt;td&gt;80&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td&gt;&lt;a href="http://chempedia.com/contributors/48"&gt;DumZiBoT&lt;/a&gt;&lt;/td&gt;&lt;td&gt;78&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td&gt;&lt;a href="http://chempedia.com/contributors/182"&gt;Axiosaurus&lt;/a&gt;&lt;/td&gt;&lt;td&gt;63&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td&gt;&lt;a href="http://chempedia.com/contributors/6"&gt;Chempedia&lt;/a&gt;&lt;/td&gt;&lt;td&gt;63&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td&gt;&lt;a href="http://chempedia.com/contributors/174"&gt;Carlo Banez&lt;/a&gt;&lt;/td&gt;&lt;td&gt;55&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td&gt;&lt;a href="http://chempedia.com/contributors/13"&gt;Benjah-bmm27&lt;/a&gt;&lt;/td&gt;&lt;td&gt;52&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td&gt;&lt;a href="http://chempedia.com/contributors/93"&gt;OKBot&lt;/a&gt;&lt;/td&gt;&lt;td&gt;51&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td&gt;&lt;a href="http://chempedia.com/contributors/45"&gt;Cacycle&lt;/a&gt;&lt;/td&gt;&lt;td&gt;50&lt;/td&gt;&lt;/tr&gt;
&lt;/table&gt;

&lt;p&gt;These Contributors represent 1.9% of all active Contributors and collectively are responsible for being the last to edit 62% of all Monographs. Although not performed here, a histogram plotting number of contributions would be expected to follow a &lt;a href="http://en.wikipedia.org/wiki/Power_law"&gt;power law&lt;/a&gt;.&lt;/p&gt;

&lt;p&gt;'Anonymous' is an aggregation of all users who edited a Monograph without a Wikipedia account. 16% of all Monographs were last edited by an anonymous user. Leaving out the aggregated 'anonymous' users indicates that roughly half of all Monographs were last edited by the top 19 Contributors.&lt;/p&gt;

&lt;h4&gt;What is a Contributor?&lt;/h4&gt;

&lt;p&gt;Although it's difficult to say a lot about individual Contributors, most appear to have some training in science, although that training may not have involved chemistry or biology. Still others (for example, &lt;a href="http://chempedia.com/contributors/2404"&gt;SJP&lt;/a&gt;) appear to have been drawn to contribute to a Monograph based on their nonscientific experience with the title compound or in an effort to fight vandalism or otherwise improve the nonscientific content of the Monograph. The ability of services like Wikipedia (and by extension Chempedia) to provide a platform for those without formal training in a particular area to make useful contributions is without question one of its most useful (and controversial) features.&lt;/p&gt;

&lt;p&gt;Some Contributors are not even human, but rather robots designed to improve the quality of Wikipedia articles in general. For example, &lt;a href="http://chempedia.com/contributors/10"&gt;SmackBot&lt;/a&gt; performs an array of tedious quality control jobs such as fixing bad checksum ISBNs (&lt;a href="http://www.cas.org/expertise/cascontent/registry/checkdig.html"&gt;CAS Numbers, anyone?&lt;/a&gt;) and capitalization errors.&lt;/p&gt;

&lt;h4&gt;Conclusions&lt;/h4&gt;

&lt;p&gt;Wikipedia's collaboration model has made the creation of a free and continuously-updated chemical encyclopedia feasible. Applying chemistry-specific user interfaces and data models exposes this hidden treasure. Although it's tempting to think of this process as mainly being the work of a handful of trained scientists, the numbers suggest a much broader base of contributors. Future articles will explore this idea.&lt;/p&gt;

&lt;p&gt;Related Article: &lt;a href="http://depth-first.com/articles/2008/05/21/building-chempedia-social-networking-applied-to-chemistry"&gt;&lt;em&gt;Building Chempedia: Social Networking Applied to Chemistry&lt;/em&gt;&lt;/a&gt;.&lt;/p&gt;</description>
      <pubDate>Wed, 02 Jul 2008 11:50:00 +0000</pubDate>
      <guid isPermaLink="false">urn:uuid:cc2cc82d-b3d9-4bba-89de-69f685033389</guid>
      <author>Rich Apodaca</author>
      <link>http://depth-first.com/articles/2008/07/02/building-chempedia-learning-about-contributors</link>
      <category>Tools</category>
      <category>chempedia</category>
      <category>wikipedia</category>
      <category>collectiveintelligence</category>
      <category>socialnetworking</category>
      <category>merckindex</category>
    </item>
    <item>
      <title>The Other Vector Graphics Markup Language</title>
      <description>&lt;p&gt;&lt;img src="http://depth-first.com/demo/20080606/toucan.png" align="right"&gt;&lt;/img&gt;&lt;a href="http://en.wikipedia.org/wiki/Scalable_Vector_Graphics"&gt;Scalable Vector Graphics&lt;/a&gt; (SVG) is a technology that enables the creation and publication of high quality images that can be scaled to any resolution. SVG is ideally suited for the Web, and all major browsers now support it - except Internet Explorer (IE). This poses a problem: vector graphics are by far superior to raster images for many applications, but the lack of native IE support makes SVG a non-starter for most developers. This article discusses a little known IE capability that might provide a solution.&lt;/p&gt;

&lt;h4&gt;Oh Brother, Where Art Thou?&lt;/h4&gt;

&lt;p&gt;Way back in 1998 a group of companies including Microsoft submitted a proposal for a vector graphics language called &lt;a href="http://www.w3.org/TR/1998/NOTE-VML-19980513"&gt;Vector Markup Language&lt;/a&gt; (VML) to the W3C. This set in motion a series of events that culminated in the development of what we know today as SVG. But while use of SVG quickly expanded, VML remained almost exclusively limited to Microsoft products.&lt;/p&gt;

&lt;p&gt;Soon after, IE 5 &lt;a href="http://www.infoloom.com/gcaconfs/WEB/granada99/wu.HTM"&gt;introduced the ability&lt;/a&gt; to decode and display VML - a capability that exists today in IE 7.&lt;/p&gt;

&lt;p&gt;SVG and VML are two vector graphics languages, each designed to do essentially the same thing. For basic shape rendering, their similarities outweigh their differences.&lt;/p&gt;

&lt;h4&gt;About VML&lt;/h4&gt;

&lt;p&gt;To understand why VML never caught on, you need look no further than the documentation - or the lack thereof. The &lt;a href="http://www.w3.org/TR/1998/NOTE-VML-19980513"&gt;original VML submission&lt;/a&gt; is a decade old and has not been updated.&lt;/p&gt;

&lt;p&gt;For the most part, VML documentation is scattered and incomplete. Nevertheless, there are some useful resources. Here, in no particular order are some of them:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;&lt;p&gt;&lt;a href="http://msdn.microsoft.com/en-us/library/bb250524(VS.85).aspx"&gt;Microsoft Documentation&lt;/a&gt; Authoritative, but lacking in examples.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;a href="http://weborama.blogspot.com/2006/01/vml-svg-and-canvas.html"&gt;VML, SVG, and Canvas&lt;/a&gt; Discusses some of the differences between VML and SVG.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;a href="http://www.robweir.com/blog/2006/07/cum-mortuis-in-lingua-mortua.html"&gt;Cum mortuis in lingua mortua&lt;/a&gt; Good history of VML.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;a href="http://www.sjsu.edu/faculty/watkins/vml.htm"&gt;Examples of the Vector Markup Lanugauge&lt;/a&gt; There are far too few of this kind of site.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;a href="http://vitali.web.cs.unibo.it/Progetti/VectorConverter"&gt;VectorConverer&lt;/a&gt;
A PHP library that uses XSLT to interconvert SVG and VML. Unfortunately, the stylesheet didn't work in my hands under &lt;a href="http://vitali.web.cs.unibo.it/Progetti/VectorConverter"&gt;Xalan&lt;/a&gt; or Ruby/xslt - and I know almost nothing about PHP.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;a href="http://www.cs.sjsu.edu/faculty/pollett/masters/Semesters/Fall03/JulieNabong/index.shtml?cs298proposal.html"&gt;Julie Nabong's Masters Thesis&lt;/a&gt; Julie wrote and documented an SVG/VML XSLT for interconverting the two languages.&lt;/p&gt;&lt;/li&gt;
&lt;/ul&gt;

&lt;h4&gt;JSDrawing: Interconverting Vector Languages on the Fly&lt;/h4&gt;

&lt;p&gt;One VML resource deserves special note - &lt;a href="http://www.kevlindev.com/projects/jsdrawing/index.htm"&gt;JSDrawing&lt;/a&gt;. This library seems to be capable of generating Flash, VML, or SVG from a common vector graphics language precursor. I'm not sure how practical this approach would be, but it does provide some food for thought.&lt;/p&gt;

&lt;h4&gt;Why It Matters&lt;/h4&gt;

&lt;p&gt;Chemistry is in a good position to take advantage of vector graphics. Chemical structures, being closely based on graph theoretical constructs, would seem to be a perfect match for vector languages like SVG and VML, especially on the Web. So far it hasn't happened, primarily for the reasons outlined above.&lt;/p&gt;

&lt;p&gt;Currently, if you want to display 2D chemical structures in Web pages you're faced with some tradeoffs:&lt;/p&gt;

&lt;ol&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Raster Images.&lt;/strong&gt; This is by far the most common practice. This option unfortunately makes it very difficult to redesign the layout of a site or support multiple views of the same structure, especially with databases of one million plus compounds becoming commonplace. Even if images are never regenerated, they need to be stored and retrieved, adding to cost and complexity. Images could be dynamically generated, but at the expense of substantial memory and CPU requirements.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Applets.&lt;/strong&gt; This is the approach currently taken by &lt;a href="http://chempedia.com"&gt;Chempedia&lt;/a&gt;, the free chemical encyclopedia, and gives complete flexibility in page layout and structure appearance. Changing the dimensions of a structure is as simple as changing the size of a div. Unfortunately, some browsers handle multiple applets better than others. Firefox on OS X is very slow at refreshing applets while scrolling, and IE requires a &lt;a href="http://depth-first.com/articles/2007/11/02/eolas-and-jactivating-working-around-a-workaround"&gt;Javascript trick&lt;/a&gt; to remove the 'click to active' message that causes some flashing when in progress.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Vector Graphics Through Plugins&lt;/strong&gt; There are at least two SVG plugins for IE (&lt;a href="http://www.adobe.com/svg/viewer/install/main.html"&gt;one by Adobe&lt;/a&gt; and &lt;a href="http://www.examotion.com/index.php?id=product_player"&gt;the other from Examotion&lt;/a&gt;). Will all of your users be able to find and install them? Unless the answer to both questions is 'yes', this option is probably best left as a last resort. Another option is to render SVG on IE through the Flash or &lt;a href="http://silverlight.net/"&gt;Silverlight&lt;/a&gt; plugins. But as far as I can tell, neither approach is ready for prime-time.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Native Vector Graphics&lt;/strong&gt; Available on all major browsers including Internet Explorer 5/6/7, Firefox 1/2, and Opera 8/9. Combines the flexibility, lossless depiction, inlineability and low data storage/retrieval overhead of applets with the speed of images. Interactivity and other special effects can be achieved through DOM manipulation. All of this depends, of course, on the vector graphics format being compatible with the rendering engine.&lt;/p&gt;&lt;/li&gt;
&lt;/ol&gt;

&lt;p&gt;In some circumstances, serving VML to IE clients and SVG to everyone else would be a viable option - if it were possible to generate VML.&lt;/p&gt;

&lt;h4&gt;Conclusions&lt;/h4&gt;

&lt;p&gt;Vector graphics have a lot to offer chemistry, especially when used with Web applications. The combination of VML and SVG offers a proven technology platform that's ready today, but only if you can generate VML.&lt;/p&gt;</description>
      <pubDate>Fri, 06 Jun 2008 14:39:00 +0000</pubDate>
      <guid isPermaLink="false">urn:uuid:48c29183-75ae-4e3f-b4f7-e99de2a52048</guid>
      <author>Rich Apodaca</author>
      <link>http://depth-first.com/articles/2008/06/06/the-other-vector-graphics-markup-language</link>
      <category>Tools</category>
      <category>svg</category>
      <category>vml</category>
      <category>vectorgraphics</category>
      <category>silverlight</category>
      <category>flash</category>
      <category>chemicalstructure</category>
      <category>2d</category>
      <category>chemwriter</category>
      <category>chempedia</category>
    </item>
    <item>
      <title>A Simple and Portable Ruby Interface to InChI - Part 2: Silencing Console Output</title>
      <description>&lt;p&gt;&lt;a href="http://ruby-lang.org/"&gt;&lt;img src="http://depth-first.com/files/ruby_logo_new.gif" align="right"&gt;&lt;/img&gt;&lt;/a&gt;The previous article in this series described a &lt;a href="http://depth-first.com/articles/2008/05/29/a-simple-and-portable-ruby-interface-to-inchi"&gt;simple and portable method&lt;/a&gt; for interfacing Ruby to the cInChI-1 binary. One disadvantage was noisy console output. This article offers a minor modification to disable it.&lt;/p&gt;

&lt;h4&gt;The Code&lt;/h4&gt;

&lt;div class="typocode"&gt;&lt;pre&gt;&lt;code class="typocode_ruby "&gt;&lt;span class="keyword"&gt;module &lt;/span&gt;&lt;span class="module"&gt;InChI&lt;/span&gt;
  &lt;span class="keyword"&gt;def &lt;/span&gt;&lt;span class="method"&gt;inchi_for&lt;/span&gt; &lt;span class="ident"&gt;molfile&lt;/span&gt;
    &lt;span class="ident"&gt;output&lt;/span&gt; &lt;span class="punct"&gt;=&lt;/span&gt; &lt;span class="punct"&gt;%x[&lt;/span&gt;&lt;span class="string"&gt;echo &amp;quot;&lt;span class="expr"&gt;#{molfile}&lt;/span&gt;&amp;quot; | cInChI-1 -STDIO 2&amp;gt;/dev/null&lt;/span&gt;&lt;span class="punct"&gt;]&lt;/span&gt;

    &lt;span class="ident"&gt;output&lt;/span&gt;&lt;span class="punct"&gt;.&lt;/span&gt;&lt;span class="ident"&gt;eql?&lt;/span&gt;&lt;span class="punct"&gt;(&amp;quot;&lt;/span&gt;&lt;span class="string"&gt;&lt;/span&gt;&lt;span class="punct"&gt;&amp;quot;)&lt;/span&gt; &lt;span class="punct"&gt;?&lt;/span&gt; &lt;span class="punct"&gt;&amp;quot;&lt;/span&gt;&lt;span class="string"&gt;&lt;/span&gt;&lt;span class="punct"&gt;&amp;quot;&lt;/span&gt; &lt;span class="punct"&gt;:&lt;/span&gt; &lt;span class="ident"&gt;output&lt;/span&gt;&lt;span class="punct"&gt;.&lt;/span&gt;&lt;span class="ident"&gt;split&lt;/span&gt;&lt;span class="punct"&gt;(/&lt;/span&gt;&lt;span class="regex"&gt;&lt;span class="escape"&gt;\n&lt;/span&gt;&lt;/span&gt;&lt;span class="punct"&gt;/)[&lt;/span&gt;&lt;span class="number"&gt;1&lt;/span&gt;&lt;span class="punct"&gt;]&lt;/span&gt;
  &lt;span class="keyword"&gt;end&lt;/span&gt;
&lt;span class="keyword"&gt;end&lt;/span&gt;&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;

&lt;p&gt;Here, we're taking advantage of the ability to redirect certain output streams to &lt;tt&gt;/dev/null&lt;/tt&gt;.&lt;/p&gt;

&lt;h4&gt;Testing the Code&lt;/h4&gt;

&lt;p&gt;Saving the above in a file called &lt;strong&gt;inchi.rb&lt;/strong&gt;, we can test it from IRB. To make things interesting, let's pull a molfile from &lt;a href="http://chempedia.com"&gt;Chempedia&lt;/a&gt;:&lt;/p&gt;

&lt;div class="console"&gt;
&lt;pre&gt;
$ irb
irb(main):001:0&amp;gt; require 'open-uri'
=&amp;gt; true
irb(main):002:0&amp;gt; require 'inchi'
=&amp;gt; true
irb(main):003:0&amp;gt; include InChI
=&amp;gt; Object
irb(main):004:0&amp;gt; open 'http://chempedia.com/compounds/83490.mol' do |f|
irb(main):005:1*   puts inchi_for(f.read)
irb(main):006:1&amp;gt; end
InChI=1/C15H15NO3S/c17-14(16-18)11-20(19)15(12-7-3-1-4-8-12)13-9-5-2-6-10-13/h1-10,15,18H,11H2,(H,16,17)
=&amp;gt; nil
&lt;/pre&gt;
&lt;/div&gt;

&lt;p&gt;We should be able to run this code unmodified on any UNIX-like system in which the &lt;strong&gt;cInChI-1&lt;/strong&gt; binary is on the path. And of course we could take this one step further by allowing &lt;a href="http://depth-first.com/articles/2007/03/19/customize-inchi-output-with-rino"&gt;command line options&lt;/a&gt; to be passed in as parameters to the &lt;tt&gt;inchi_for&lt;/tt&gt; method.&lt;/p&gt;

&lt;p&gt;Simplicity has its advantages.&lt;/p&gt;</description>
      <pubDate>Fri, 30 May 2008 10:04:00 +0000</pubDate>
      <guid isPermaLink="false">urn:uuid:86f5938d-6519-4d2a-87e1-14b281f1323b</guid>
      <author>Rich Apodaca</author>
      <link>http://depth-first.com/articles/2008/05/30/a-simple-and-portable-ruby-interface-to-inchi-part-2-silencing-console-output</link>
      <category>Tools</category>
      <category>inchi</category>
      <category>ruby</category>
      <category>designingtheobvious</category>
      <category>chempedia</category>
      <category>console</category>
      <category>unix</category>
    </item>
    <item>
      <title>Simple CAS Number Lookup (and More) with Chempedia</title>
      <description>&lt;p&gt;&lt;a href="http://chempedia.com"&gt;&lt;img src="http://depth-first.com/demo/20080513/chempedia.png" align="right"&gt;&lt;/img&gt;&lt;/a&gt;Despite many ingenious and energetic attempts, CAS Registry Numbers&amp;reg; remain chemistry's only universal method for referencing chemical structures and substances. They're so woven into the fabric of chemistry and trade that the &lt;a href="http://www.uspto.gov/main/profiles/otherid.htm"&gt;US Patent and Trademark Office discusses them&lt;/a&gt; in the same context as Domain Names, Drivers License Numbers, ZIP Codes, and UPC Barcodes. But for all of the system's advantages, it suffers from a significant limitation: without access to the CAS Registry database, it can be difficult if not impossible to link a chemical structure with a CAS Number. This article discusses a unique approach to this problem.&lt;/p&gt;

&lt;h4&gt;Finding CAS Numbers on Chempedia&lt;/h4&gt;

&lt;p&gt;Let's say you've found the CAS Number &lt;strong&gt;[58-08-2]&lt;/strong&gt; and needed to look up the chemical structure it refers to. How would you do it?&lt;/p&gt;

&lt;p&gt;We can use &lt;a href="http://chempedia.com"&gt;Chempedia&lt;/a&gt; to find the answer. Entering "58-08-2" into the &lt;a href="http://chempedia.com/queries/new"&gt;text query&lt;/a&gt; box takes us to the corresponding &lt;a href="http://chempedia.com/registry_numbers/58-08-2"&gt;Registry Number Summary&lt;/a&gt;.&lt;/p&gt;

&lt;p&gt;Under the heading "Compound Monographs", this page tells us that one &lt;a href="http://chempedia.com/monographs/caffeine"&gt;Compound Monograph&lt;/a&gt; referencing the CAS Number &lt;strong&gt;[58-08-2]&lt;/strong&gt; exists. We can easily see that both the name and chemical structure are consistent with &lt;a href="http://chempedia.com/monographs/caffeine"&gt;Caffeine&lt;/a&gt;.&lt;/p&gt;

&lt;p&gt;&lt;center&gt;&lt;a href="http://chempedia.com/registry_numbers/58-08-2"&gt;&lt;img src="http://depth-first.com/demo/20080526/caffeine.png"&gt;&lt;/img&gt;&lt;/a&gt;&lt;/center&gt;&lt;/p&gt;

&lt;p&gt;However, the section under the heading "Compounds" gives us something unique. Rather than simply telling us that the structure of Caffeine is linked to CAS Number &lt;strong&gt;[58-08-2]&lt;/strong&gt;, Chempedia tells us how it arrived at this conclusion. As you can see, there are over a dozen references matching CAS Number &lt;strong&gt;[58-08-2]&lt;/strong&gt; with a single chemical structure.&lt;/p&gt;

&lt;p&gt;More than that, Chempedia give us links to the organizations making these assertions and the actual Web pages recording them.&lt;/p&gt;

&lt;p&gt;Rather than just giving the answer, Chempedia says how it found the answer.&lt;/p&gt;

&lt;h4&gt;Authority, Confidence, and the Electronic Paper Trail&lt;/h4&gt;

&lt;p&gt;By definition, the only authority on CAS Registry Numbers is &lt;a href="http://www.cas.org/"&gt;Chemical Abstracts Service&lt;/a&gt; itself. But for many, many organizations, full-time access to the CAS Registry database is hopelessly out of reach, and access of the form required to incorporate the CAS Registry system into third-party products is a non-starter.&lt;/p&gt;

&lt;p&gt;In other words, there is a widespread need to work with CAS Numbers independently of the CAS Registry System, but any such attempt is inherently non-authoritative. How can we work within this constraint?&lt;/p&gt;

&lt;p&gt;What's lacking is the concept of confidence.&lt;/p&gt;

&lt;p&gt;To illustrate, let's try to find the structure associated with the CAS Number &lt;strong&gt;[480-41-1]&lt;/strong&gt;. In contrast to our earlier search, this one takes us to a &lt;a href="http://chempedia.com/registry_numbers/480-41-1"&gt;summary page&lt;/a&gt; with three different structures! (see below)&lt;/p&gt;

&lt;p&gt;&lt;center&gt;&lt;a href="http://chempedia.com/registry_numbers/480-41-1"&gt;&lt;img src="http://depth-first.com/demo/20080526/naringenin.png"&gt;&lt;/img&gt;&lt;/a&gt;&lt;/center&gt;&lt;/p&gt;

&lt;p&gt;Each of these three structures share the same connectivity, but different stereochemistry. The first structure (presumably) represents a racemate, the second represents the (&lt;em&gt;S&lt;/em&gt;) enantiomer, and the third represents the (&lt;em&gt;R&lt;/em&gt;) enantiomer.&lt;/p&gt;

&lt;p&gt;At this point, we need to decide whether we've really found the structure for the CAS Number &lt;strong&gt;[480-41-1]&lt;/strong&gt;. And we could use Chempedia's electronic paper trail to guide our thinking. Both the racemate and (&lt;em&gt;S&lt;/em&gt;) enantiomer have five references linking structure and CAS number, whereas the (&lt;em&gt;R&lt;/em&gt;) enantiomer only lists one.&lt;/p&gt;

&lt;p&gt;We can also see that the racemate and and pure (&lt;em&gt;S&lt;/em&gt;) enantiomer each are associated with yet another CAS Registry Number, &lt;a href="http://chempedia.com/registry_numbers/67604-48-2"&gt;[67604-48-2]&lt;/a&gt;. Examination of this record shows that two structures are cited, the same two structures we were originally considering.&lt;/p&gt;

&lt;p&gt;Clearly, there's some confusion regarding the exact identity of the structure represented by CAS Number &lt;strong&gt;[480-41-1]&lt;/strong&gt;. Nevertheless, we can guess that the Registry Numbers &lt;a href="http://chempedia.com/registry_numbers/67604-48-2"&gt;[67604-48-2]&lt;/a&gt; and &lt;a href="http://chempedia.com/registry_numbers/480-41-1"&gt;[480-41-1]&lt;/a&gt; refer to the racemate and (&lt;em&gt;S&lt;/em&gt;) enantiomer of the flavinoid &lt;a href="http://chempedia.com/monographs/naringenin"&gt;Naringenin&lt;/a&gt;, although we don't know which is which.&lt;/p&gt;

&lt;p&gt;For some applications this answer would be sufficient. For others, however, it wouldn't. The key point is that Chempedia has enabled us to arrive at this conclusion by exposing the electronic paper trail of third-party CAS Registry Number assignments.&lt;/p&gt;

&lt;p&gt;Chempedia offers a way to debug CAS Registry Numbers.&lt;/p&gt;

&lt;p&gt;Chempedia currently contains just over 380,000 unique CAS Numbers. To browse through the entire set, ten at a time, you can &lt;a href="http://chempedia.com/registry_numbers"&gt;begin with this page&lt;/a&gt;. Notice how &lt;a href="http://depth-first.com/articles/2007/05/30/restful-cheminformatics"&gt;RESTful URLs&lt;/a&gt; are used throughout.&lt;/p&gt;

&lt;h4&gt;Web 2.0 and All That&lt;/h4&gt;

&lt;p&gt;Those who have spent time using or developing "Web 2.0" applications may recognize a potentially powerful analogy between CAS Registry Numbers and the concept of &lt;a href="http://en.wikipedia.org/wiki/Tag_%28metadata%29"&gt;tagging&lt;/a&gt;. A tag is an alphanumeric string associated with some resource of interest, for example, &lt;a href="http://flickr.com/photos/tags/chemistry/clusters/"&gt;a photo&lt;/a&gt;, &lt;a href="http://www.connotea.org/tag/chemistry"&gt;a scientific paper&lt;/a&gt;, &lt;a href="http://del.icio.us/tag/chemistry"&gt;a URL&lt;/a&gt;, or &lt;a href="http://depth-first.com/articles/tag/chemistry"&gt;a blog post&lt;/a&gt;.&lt;/p&gt;

&lt;p&gt;Although originally designed to uniquely identify a chemical substance or structure, when used in the wild, CAS Registry Numbers sometimes more closely resemble the fuzzy semantics of tagging.&lt;/p&gt;

&lt;p&gt;Chemical information system that use CAS Numbers processed by third parties need to take this reality into account or run the risk of misleading users. Chempedia offers one method for doing so.&lt;/p&gt;

&lt;h4&gt;Conclusions&lt;/h4&gt;

&lt;p&gt;Chempedia currently contains just over 380,000 CAS registry numbers. Although this is a minuscule fraction of the total CAS Registry, Chempedia's collection comprises some of the most widely used and most important substances known. More importantly, Chempedia now offers a tool for understanding the often complex associations between chemical structure and CAS Registry numbers that exist in real-world chemical information sources.&lt;/p&gt;

&lt;p&gt;In this sense, Chempedia could be a useful tool for small organizations to double check their CAS Number assignments and for individuals to quickly look up the chemical structure of a given CAS number and understand ambiguities.&lt;/p&gt;

&lt;p&gt;Chempedia also lays bare both the confusion and consensus around CAS Registry Numbers used in the real world. If CAS Numbers in the wild are more like tags than unique identifiers, what can we do with this insight? Future articles will describe some possibilities.&lt;/p&gt;</description>
      <pubDate>Mon, 26 May 2008 13:06:00 +0000</pubDate>
      <guid isPermaLink="false">urn:uuid:f0fedea9-3156-42d9-a0ce-5e0fdb3d3f8b</guid>
      <author>Rich Apodaca</author>
      <link>http://depth-first.com/articles/2008/05/26/simple-cas-number-lookup-and-more-with-chempedia</link>
      <category>Tools</category>
      <category>cas</category>
      <category>registrynumber</category>
      <category>tagging</category>
      <category>tags</category>
      <category>chempedia</category>
      <category>manytomany</category>
    </item>
    <item>
      <title>Small Molecule 3D Coordinates From PubChem</title>
      <description>&lt;p&gt;&lt;a href="http://pubchem.ncbi.nlm.nih.gov/"&gt;&lt;img src="http://depth-first.com/files/pubchemlogo.gif" align="right"&gt;&lt;/img&gt;&lt;/a&gt;The PubChem team has quietly introduced a new feature - 3D coordinates for many of the small molecules in its compound collection. To my knowledge, these coordinates are only currently &lt;a href="ftp://ftp.ncbi.nlm.nih.gov/pubchem/Compound_3D/"&gt;available via FTP&lt;/a&gt;. From the &lt;a href="ftp://ftp.ncbi.nlm.nih.gov/pubchem/Compound_3D/README"&gt;README&lt;/a&gt;:&lt;/p&gt;

&lt;blockquote&gt;
    &lt;p&gt;The data contained here consists of a theoretical 3D description of PubChem Compound records computed using the MMFF94s force field without coulombic terms, including MMFF charges.  Each provided theoretical 3D conformer is not a stationary point on the hyper-potential surface (i.e., is not at a minimum energy).  Rather, the theoretical 3D description is a low energy conformer selected from a conformer model (a theoretical description of the conformational flexibility of a chemical structure consisting of multiple 3D representations or poses sampled using an RMSD {root mean squared distance} threshold) describing energetically-accessible and (potentially) biologically relevant coformations of a chemical structure.&lt;/p&gt;
    
    &lt;p&gt;Not every PubChem Compound record will have a theoretical 3D description. Structures considered too large (containing more than 50 non-hydrogen atoms) or too flexible (containing more than 15 rotatable bonds) are excluded.  Furthermore, chemical structures containing elements other than H, C, N, O, F, P, S, Cl, Br, and I are also excluded.&lt;/p&gt;
    
    &lt;p&gt;Generation of theoretical 3D descriptions of small molecules is computationally intensive.  As such, some PubChem Compound records may be added at a later time.&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;(A few open source packages for &lt;a href="http://depth-first.com/articles/2007/12/12/simple-3d-conformer-generation-with-smi23d"&gt;generating 3D conformers&lt;/a&gt; are also available.)&lt;/p&gt;

&lt;p&gt;Recently, &lt;a href="http://hutchison.chem.pitt.edu/"&gt;Geoff Hutchison&lt;/a&gt; wrote in &lt;a href="http://depth-first.com/articles/2008/05/14/the-daily-molecule-the-wonders-of-chemistry-one-molecule-at-a-time#comment-556"&gt;to suggest&lt;/a&gt; that a potentially useful new feature of &lt;a href="http://chempedia.com"&gt;Chempedia&lt;/a&gt; could be the ability to directly obtain 3D coordinates for a molecule of interest.&lt;/p&gt;

&lt;p&gt;One very economical way to do that would be to use PubChem's 3D dataset. It would also be trivial to display these coordinates as a resizable &lt;a href="http://jmol.sourceforge.net/"&gt;Jmol applet&lt;/a&gt;, in analogy to &lt;a href="http://depth-first.com/articles/2008/05/19/building-chempedia-resizable-structures-with-chemwriter"&gt;Chempedia's recently-added 2D molecule resizing feature&lt;/a&gt;.&lt;/p&gt;

&lt;p&gt;Of course, there are many other potential uses for the PubChem conformer dataset, especially when applied to Web applications.&lt;/p&gt;</description>
      <pubDate>Fri, 23 May 2008 10:53:00 +0000</pubDate>
      <guid isPermaLink="false">urn:uuid:1fdc7fbf-3af8-4928-9770-668ad24d8df2</guid>
      <author>Rich Apodaca</author>
      <link>http://depth-first.com/articles/2008/05/23/small-molecule-3d-coordinates-from-pubchem</link>
      <category>Tools</category>
      <category>chempedia</category>
      <category>3d</category>
      <category>conformer</category>
      <category>pubchem</category>
      <category>smi23d</category>
      <category>ftp</category>
    </item>
    <item>
      <title>Building Chempedia: Social Networking Applied to Chemistry</title>
      <description>&lt;p&gt;&lt;a href="http://chempedia.com"&gt;&lt;img src="http://depth-first.com/demo/20080513/chempedia.png" align="right"&gt;&lt;/img&gt;&lt;/a&gt;&lt;a href="http://chempedia.com"&gt;Chempedia&lt;/a&gt; is a free online chemical encyclopedia; it's also a work in progress, the contents of which are being written by numerous volunteers worldwide. A previous article described initial work toward &lt;a href="http://depth-first.com/articles/2008/05/15/building-chempedia-the-human-element"&gt;connecting the people behind Chempedia's content with the compound monographs they're writing&lt;/a&gt;. This article will describe new features that take this idea much further.&lt;/p&gt;

&lt;h4&gt;Contributors&lt;/h4&gt;

&lt;p&gt;Chempedia now uses the concept of a "Contributor" as part of its data model. Each Compound Monograph has one associated Contributor, the Wikipedia user who last edited it. In other words, a one-to-many relationship exists between a Contributor and a Monograph.&lt;/p&gt;

&lt;p&gt;Of course, this model is simplistic; Compound Monographs are edited by multiple users over time, and so the relationship should be many-to-many. Nevertheless, for now a one-to-many relationship works well enough.&lt;/p&gt;

&lt;h4&gt;Learning About Contributors&lt;/h4&gt;

&lt;p&gt;You can view a &lt;a href="http://chempedia.com/contributors"&gt;complete list of contributors to Chempedia&lt;/a&gt;. As you can see, over 1,000 Wikipedia users are currently listed. The number in parentheses appearing after each contributor's username is the number of Monographs for which Wikipedia lists them as the last editor.&lt;/p&gt;

&lt;p&gt;Chempedia contains just over 6,400 Compound Monographs; the fact that 1,000 Wikipedia users contributed to making that happen is remarkable. That such a large number of users contribute relative to the number Monographs may be surprising given the rule of thumb that &lt;a href="http://depth-first.com/articles/2007/10/05/what-makes-wikipedia-tick"&gt;only 2-10% of users are responsible for the majority of the work on community-driven projects&lt;/a&gt;. While the majority of work may well be done by a relatively small group of Contributors, these numbers demonstrate a &lt;a href="http://depth-first.com/articles/2006/08/19/history-of-abstracting-at-chemical-abstracts-service"&gt;widespread interest in creating and maintaining information about chemical compounds&lt;/a&gt;.&lt;/p&gt;

&lt;p&gt;Chempedia lets you learn more about what an individual Contributor has done. Clicking on a Contributor name takes us to a Contributor summary page showing all of the Monographs on which they are listed as Contributor, as well as their Wikipedia home and talk pages. The latter can be used to take part in discussions.&lt;/p&gt;

&lt;p&gt;A particularly active contributor (one of several) goes by the name of &lt;a href="http://chempedia.com/contributors/1"&gt;Edgar181&lt;/a&gt;. As of this writing, s/he is listed as the Contributor on 458 Compound monographs, and the last one s/he edited was &lt;a href="http://chempedia.com/monographs/methylparaben"&gt;Methylparaben&lt;/a&gt;.&lt;/p&gt;

&lt;p&gt;And 1,000 Contributors represents only a lower limit due the the large number of &lt;a href="http://chempedia.com/contributors/2"&gt;anonymous contributions&lt;/a&gt;, which on Chempedia are lumped together. As you can see, over 1,100 Compound Monographs were last edited by a Wikipedia user who didn't log in.&lt;/p&gt;

&lt;h4&gt;Thank Goodness for Robots&lt;/h4&gt;

&lt;p&gt;Quite a few &lt;a href="http://chempedia.com/contributors"&gt;Contributors&lt;/a&gt; have the letters 'Bot' in their names. A 'Bot is a script designed to do work on Wikipedia that would be tedious and/or error prone if done by humans.&lt;/p&gt;

&lt;p&gt;One of my favorites is &lt;a href="http://chempedia.com/contributors/17"&gt;ClueBot&lt;/a&gt;. From the &lt;a href="http://en.wikipedia.org/wiki/User:ClueBot"&gt;ClueBot Wikipedia user page&lt;/a&gt;, this script's purpose in life is to revert Wikipedia vandalism, a job it does with breathtaking efficiency and accuracy.&lt;/p&gt;

&lt;p&gt;For example, one of ClueBot's last pieces of work was to &lt;a href="http://en.wikipedia.org/w/index.php?title=Lysergic_acid_diethylamide&amp;amp;diff=213850493&amp;amp;oldid=213850472"&gt;revert an edit&lt;/a&gt; made to &lt;a href="http://chempedia.com/monographs/lysergic-acid-diethylamide"&gt;Lysergic Acid Diethylamide&lt;/a&gt; in which a user tried to include some enthusiastic, but subjective, comments about &lt;a href="http://en.wikipedia.org/wiki/Albert_Hofmann"&gt;Albert Hofmann's&lt;/a&gt; discovery.&lt;/p&gt;

&lt;p&gt;In less than one minute, ClueBot had not only identified the comment as vandalism (despite the fact that no 'offensive' language was used), but had removed it as well. Amazing.&lt;/p&gt;

&lt;h4&gt;Connecting Contributors to Monographs&lt;/h4&gt;

&lt;p&gt;Chempedia is quickly forming a densely connected network of people and molecules. What can we do with this?&lt;/p&gt;

&lt;p&gt;&lt;center&gt;&lt;img src="http://depth-first.com/demo/20080521/edit_status.png"&gt;&lt;/img&gt;&lt;/center&gt;&lt;/p&gt;

&lt;p&gt;A new edit status line has been added to Monographs summaries (above). With it, you can easily see the number of edits that have occurred, when the last one happened, and who did it. Links will take you directly to the Wikipedia edit history page and to the Chempedia Contributor page for the last editor.&lt;/p&gt;

&lt;p&gt;For example, the entry for &lt;a href="http://chempedia.com/monographs/modafinil"&gt;Modafinil&lt;/a&gt; currently lists &lt;a href="http://chempedia.com/contributors/314"&gt;Paul gene&lt;/a&gt; as the last contributor. Bringing up his Chempedia contributor page, we can see that he's listed as the last Contributor on three other Monographs, all of which are organic compounds of pharmacological interest. Curious about whether this might be one of Paul gene's interests, we click on the &lt;a href="http://en.wikipedia.org/wiki/User:Paul%20gene"&gt;User Page&lt;/a&gt; link at the top right of the contributor page and find out that this Wikipedia user received a Ph.D., works in academia, and has an interest in pharmacology, immunology, chemistry, kinase inhibitors, and antidepressants.&lt;/p&gt;

&lt;h4&gt;Newly-Edited Monographs&lt;/h4&gt;

&lt;p&gt;It might be of interest to know when Compound Monographs are edited. This can be done from the &lt;a href="http://chempedia.com/monographs"&gt;Browse&lt;/a&gt; link at the top-left main menu. On this page Monographs are sorted in descending order according to the last edit timestamp. The most recently-edited monographs appear on the first page, which is currently updated once every 30 minutes or so.&lt;/p&gt;

&lt;h4&gt;Hot Monographs&lt;/h4&gt;

&lt;p&gt;We may also be interested in which Compound Monographs are receiving the most edit activity. We can do that by choosing the &lt;a href="http://chempedia.com/monographs?sortby=activity"&gt;Active&lt;/a&gt; link at the top-right submenu. As of this writing, &lt;a href="http://chempedia.com/monographs/heroin"&gt;Heroin&lt;/a&gt; is the most actively edited Monograph, with 10 edits since May 19&lt;sup&gt;th&lt;/sup&gt;. Clicking on &lt;a href="http://en.wikipedia.org/w/index.php?title=Heroin&amp;amp;action=history"&gt;the link&lt;/a&gt; in the edit status line, we can see what all the activity is about.&lt;/p&gt;

&lt;h4&gt;Conclusions&lt;/h4&gt;

&lt;p&gt;None of the technology described here is especially new or innovative; social networking has been part of information systems for several years now and relational databases are designed to make discoveries possible by linking disparate pieces of information. What is new is Chempedia's application of social networking, facilitated by relational databases, to chemistry. I'm unaware of any other chemical information system that takes the possibilities of social networking as far as Chempedia has taken them.&lt;/p&gt;

&lt;p&gt;There's quite a bit more that could be done to link people and molecules on Chempedia, but for now, it's time to move onto some related areas. It turns out that the use of CAS numbers, when used outside of the CAS database system itself, raises all kinds of difficult and interesting questions around trust and authority in which social networking ideas can be applied. But that's a story for another time.&lt;/p&gt;</description>
      <pubDate>Wed, 21 May 2008 10:40:00 +0000</pubDate>
      <guid isPermaLink="false">urn:uuid:6e4b6bf3-e0db-417e-9f28-87412710fdf1</guid>
      <author>Rich Apodaca</author>
      <link>http://depth-first.com/articles/2008/05/21/building-chempedia-social-networking-applied-to-chemistry</link>
      <category>Tools</category>
      <category>chempedia</category>
      <category>socialnetworking</category>
      <category>wikipedia</category>
      <category>vandalism</category>
      <category>cluebot</category>
      <category>contributor</category>
    </item>
    <item>
      <title>Building Chempedia: Resizable Structures With ChemWriter</title>
      <description>&lt;p&gt;&lt;a href="http://chempedia.com"&gt;&lt;img src="http://depth-first.com/demo/20080513/chempedia.png" align="right"&gt;&lt;/img&gt;&lt;/a&gt;One of the difficulties in viewing 2D chemical structures is that molecules vary in size. In particular, larger molecules become difficult to read when confined to a small section of a screen. This article shows how this problem has been addressed in &lt;a href="http://chempedia.com"&gt;Chempedia&lt;/a&gt; using the 2D rendering capabilities of the &lt;a href="http://metamolecular.com/chemwriter"&gt;ChemWriter&lt;/a&gt; package.&lt;/p&gt;

&lt;p&gt;&lt;a href="http://chempedia.com/monographs/aluminon"&gt;&lt;img src="http://depth-first.com/demo/20080519/screen.png" align="right"&gt;&lt;/img&gt;&lt;/a&gt;As an example, consider Chempedia's entry for &lt;a href="http://chempedia.com/monographs/aluminon"&gt;Aluminon&lt;/a&gt;. Although the summary box at the right shows the structure for Aluminon, it may not be completely readable due to the large size of the molecule.&lt;/p&gt;

&lt;p&gt;To solve this problem, Chempedia has implemented a "zoom" link for all monographs containing a chemical structure. Clicking on the zoom link for Aluminon gives a magnified, scaled, stretchable, and resizable view of the molecule.&lt;/p&gt;

&lt;p&gt;To implement this feature, Chempedia uses the &lt;a href="http://metamolecular.com/chemwriter"&gt;ChemWriter&lt;/a&gt; &lt;tt&gt;PainterApplet&lt;/tt&gt;. Simply setting the &lt;tt&gt;width&lt;/tt&gt; and &lt;tt&gt;height&lt;/tt&gt; attributes of the &lt;a href="http://depth-first.com/articles/2008/03/10/demystifying-java-applets-part-2-dry-deployment-with-the-javay-method"&gt;&lt;tt&gt;&amp;lt;object&amp;gt;&lt;/tt&gt; tag&lt;/a&gt; to "100%" gives an applet that resizes itself as the surrounding window is resized.&lt;/p&gt;

&lt;p&gt;Implementing a resizable 2D structure image window using AJAX and dynamic images is possible, but would be much more difficult to implement. It could also potentially produce a much higher, and unpredictable demand on server memory, CPU cycles, and bandwidth.&lt;/p&gt;

&lt;p&gt;ChemWriter makes it possible for the server to delegate resizable image processing to the client, resulting in a much more responsive feature with minimal effect on the server.&lt;/p&gt;</description>
      <pubDate>Mon, 19 May 2008 09:42:00 +0000</pubDate>
      <guid isPermaLink="false">urn:uuid:97ac70b9-bb2d-491a-9f08-ca71d796ce75</guid>
      <author>Rich Apodaca</author>
      <link>http://depth-first.com/articles/2008/05/19/building-chempedia-resizable-structures-with-chemwriter</link>
      <category>Tools</category>
      <category>chempedia</category>
      <category>chemwriter</category>
      <category>2d</category>
      <category>resize</category>
      <category>chemicalstructure</category>
    </item>
    <item>
      <title>Building Chempedia: The Human Element</title>
      <description>&lt;p&gt;&lt;a href="http://chempedia.com" align="right"&gt;&lt;img src="http://depth-first.com/demo/20080513/chempedia.png" align="right"&gt;&lt;/img&gt;&lt;/a&gt;The study of chemistry is an inherently social activity. From the papers we use and cite, to the conferences we attend, to the informal discussions we engage in daily, being a chemist means interacting with your fellow chemists. Yet strangely, most chemical information systems either totally ignore this central fact, or provide only the most meager of tools to harness it to its full potential. This article discusses how &lt;a href="http://chempedia.com"&gt;Chempedia&lt;/a&gt; currently integrates the social with the scientific, and what may be in store for the future.&lt;/p&gt;

&lt;h4&gt;Chempedia as a Tool for Scientific Collaboration&lt;/h4&gt;

&lt;p&gt;Like all chemical reference works, Chempedia is written by people with their own interests, skills, and ambitions. Unlike almost every other chemical reference work, Chempedia (through Wikipedia, on which it's based) offers intriguing possibilities to directly collaborate and learn from its contributors - or even become one of them.&lt;/p&gt;

&lt;p&gt;How can Chempedia better facilitate scientific collaboration?&lt;/p&gt;

&lt;h4&gt;A Simple But Possibly Useful Feature&lt;/h4&gt;

&lt;p&gt;Yesterday, a new feature was added to Chempedia that makes it easier to understand the recent history of a Compound Monograph. The new feature shows the date that a Compound Monograph was last edited, and the Wikiepdia user who edited it:&lt;/p&gt;

&lt;p&gt;&lt;center&gt;&lt;img src="http://depth-first.com/demo/20080515/screen.png"&gt;&lt;/img&gt;&lt;/center&gt;&lt;/p&gt;

&lt;p&gt;Clicking on the link takes you to the Wikipedia users page, in this case the one for &lt;a href="http://en.wikipedia.org/wiki/User:Meodipt"&gt;Meodipt&lt;/a&gt;. (Wikipedia users frequently use handles rather than their given names.) From Meodipt's page, we can see that s/he received degrees in chemistry and pharmacology and is currently studying law. Meodipt's interests include pharmacology, chemistry, law, and science. We can also see that Meodipt is maintaining a &lt;a href="http://en.wikipedia.org/wiki/User:Meodipt/casnumbers"&gt;good-sized list of CAS numbers for drugs&lt;/a&gt;, grouped by indication.&lt;/p&gt;

&lt;p&gt;We might be curious about what Meodipt found worth changing, and how s/he changed it. We could do so by first clicking the Chempedia &lt;a href="http://chempedia.com/monographs/pravadoline/edit"&gt;edit link&lt;/a&gt;. In the Wikipedia box (framed by the red dotted lines), we would then click on the 'history' tab. Clicking on the 'last' link for the top entry shows us exactly what Meodipt changed on Pravadoline's compound monograph (also visible through &lt;a href="http://en.wikipedia.org/w/index.php?title=Pravadoline&amp;amp;diff=200731945&amp;amp;oldid=200731624"&gt;this link&lt;/a&gt;).&lt;/p&gt;

&lt;h4&gt;Looking Ahead&lt;/h4&gt;

&lt;p&gt;Linking a real person to changes in a Compound Monograph could be enormously useful, if done properly. After all, bringing people with highly focussed interests together is the essence of scientific collaboration. The Chempedia/Wikipedia combination provides one way to do that.&lt;/p&gt;

&lt;p&gt;As Chis Anderson puts it, "&lt;a href="http://www.longtail.com/the_long_tail/2007/09/social-networki.html"&gt;social networking should be a feature, not a destination&lt;/a&gt;." Scientists were social networking long before the Internet, the computer, and the telephone were invented; indeed scientists who fail to connect with their fellow scientists have a difficult time of prospering. When seen from this perspective, it's surprising that good 'social networking' features would not be viewed as a top priority in chemical information systems.&lt;/p&gt;

&lt;p&gt;The Chempedia author credit system in its current form is rather simplistic and may not actually promote scientific collaboration at all. But it's not hard to imagine ways to make it far more effective. Future articles will discuss some of the possibilities.&lt;/p&gt;</description>
      <pubDate>Thu, 15 May 2008 14:50:00 +0000</pubDate>
      <guid isPermaLink="false">urn:uuid:ae862028-7efd-4e91-b5ee-36b91cbed66e</guid>
      <author>Rich Apodaca</author>
      <link>http://depth-first.com/articles/2008/05/15/building-chempedia-the-human-element</link>
      <category>Tools</category>
      <category>chempedia</category>
      <category>wikipedia</category>
      <category>socialnetworking</category>
      <category>collaboration</category>
      <category>author</category>
    </item>
    <item>
      <title>The Daily Molecule: The Wonders of Chemistry - One Molecule at a Time</title>
      <description>&lt;p&gt;&lt;a href="http://blog.chempedia.com"&gt;&lt;img src="http://depth-first.com/demo/20080513/chempedia.png" align="right"&gt;&lt;/img&gt;&lt;/a&gt;Chemistry is a big field judged by any standard, including the &lt;a href="http://depth-first.com/articles/2008/05/07/1908-and-all-that-the-long-tail-and-chemistry"&gt;proliferation of American Chemical Society (ACS) divisions&lt;/a&gt;. Each subdiscipline in chemistry is in turn so big, that once a chemist becomes 'differentiated' it's easy to lose touch even with neighboring subdisciplines. It doesn't have to be that way. This article introduces a new service, &lt;a href="http://blog.chempedia.com"&gt;&lt;em&gt;The Daily Molecule&lt;/em&gt;&lt;/a&gt; designed to make it just a little bit easier (and hopefully fun) to stay in the chemical loop.&lt;/p&gt;

&lt;h4&gt;What Is It?&lt;/h4&gt;

&lt;p&gt;The idea is simple: every weekday, a new molecule will be featured on &lt;em&gt;The Daily Molecule&lt;/em&gt; with a short write-up and some leading references. Although molecules in the news will get first priority, any molecule is fair game.&lt;/p&gt;

&lt;p&gt;The material for &lt;em&gt;The Daily Molecule&lt;/em&gt; will be drawn from &lt;a href="http://chempedia.com"&gt;Chempedia&lt;/a&gt;, which in turn gets some of its content from &lt;a href="http://wikipedia.org"&gt;Wikipedia&lt;/a&gt;. In other words, the entries on the Daily Molecule will be largeley written by my fellow chemists.&lt;/p&gt;

&lt;p&gt;The process of creating a &lt;em&gt;Daily Molecule&lt;/em&gt; entry is not time-consuming, but much of what is being done manually now could be automated in the future. The technology platform lends itself well to many forms of chemistry-specific modification (see below).&lt;/p&gt;

&lt;p&gt;I hesitate to use the term 'blog' to describe &lt;em&gt;The Daily Molecule&lt;/em&gt;, but the description may be helpful to an extent.&lt;/p&gt;

&lt;p&gt;&lt;em&gt;The Daily Molecule&lt;/em&gt; is unlike a blog in that most content will be generated by others, selected by some criteria, reformatted for consistency, and published. In that sense, &lt;em&gt;The Daily Molecule&lt;/em&gt; is a something like a mini scientific journal, but it turns the process of acquiring content on its head.&lt;/p&gt;

&lt;p&gt;If chemistry ever evolves beyond the &lt;a href="http://depth-first.com/articles/2007/07/16/go-west-young-man-does-open-access-really-matter-in-the-long-run"&gt;current model of publication&lt;/a&gt;, which seems inevitable at this point, the journals of the future may resemble &lt;em&gt;The Daily Molecule&lt;/em&gt; in one or more ways.&lt;/p&gt;

&lt;h4&gt;Technology&lt;/h4&gt;

&lt;p&gt;The software running &lt;em&gt;The Daily Molecule&lt;/em&gt; is a modified version of &lt;a href="http://simplelog.net/"&gt;SimpleLog&lt;/a&gt;, a Web application based on &lt;a href="http://www.rubyonrails.org/"&gt;Ruby on Rails&lt;/a&gt;. Unlike most blogging engines, SimpleLog focuses on implementing only the most basic publication features, and doing them to perfection. If you know a little Ruby and can work with Rails, you can do a lot with SimpleLog.&lt;/p&gt;

&lt;p&gt;One of the first items of business will be to implement &lt;a href="http://depth-first.com/articles/2007/09/18/six-reasons-i-like-recaptcha-or-how-to-build-a-web-service-worth-talking-about"&gt;reCAPTCHA&lt;/a&gt; support and activate comments on articles.&lt;/p&gt;

&lt;p&gt;Some ideas for chemically-enabling &lt;em&gt;The Daily Molecule&lt;/em&gt; include a graphical abstract sidebar and (sub)structure search. Currently, the 2D chemical structure images posted to &lt;em&gt;The Daily Molecule&lt;/em&gt; &lt;a href="http://depth-first.com/articles/2007/08/08/never-draw-the-same-molecule-twice-viewing-image-metadata"&gt;have complete connection tables embedded as metadata&lt;/a&gt;, a feature with some interesting possibilities.&lt;/p&gt;

&lt;h4&gt;The Molecule of the Day/Week/Month&lt;/h4&gt;

&lt;p&gt;The basic idea behind &lt;em&gt;The Daily Molecule&lt;/em&gt; is not new. Many other services have sprung up over the last ten years that operate, at least on the surface, similarly. Some examples:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;&lt;a href="http://www.moleculeoftheday.com/"&gt;Molecule of the Day&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="http://portal.acs.org/portal/acs/corg/content?_nfpb=true&amp;amp;_pageLabel=PP_TRANSITIONMAIN&amp;amp;node_id=677&amp;amp;use_sec=false&amp;amp;sec_url_var=region1"&gt;ACS Molecule of the Week&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="http://www.drugsandpoisons.com/"&gt;Drugs and Poisons&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="http://the-half-decent-pharmaceutical-chemistry-blog.chemblogs.org/category/saturday-night-synthesis"&gt;Saturday Night Synthesis&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="http://www.chm.bris.ac.uk/motm/motm.htm"&gt;The Molecule of the Month&lt;/a&gt; (may be the oldest continuously-operated MOTM site in existence)&lt;/li&gt;
&lt;li&gt;&lt;a href="http://www.3dchem.com/motm.asp"&gt;3dchem.com Molecule of the Month&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="http://www.expasy.org/spotlight/"&gt;Protein Spotlight&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="http://mgl.scripps.edu/people/goodsell/illustration/pdb"&gt;PDB Molecule of the Month&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="http://www.prous.com/molecules/default.asp"&gt;Prous Molecule of the Month&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Quite a few others don't appear on this list.&lt;/p&gt;

&lt;p&gt;The different idea behind the &lt;em&gt;The Daily Molecule&lt;/em&gt; is that chemical content already exists in on the Web in machine-readable format with licenses that permit its re-use; all that's needed is a way to aggregate, format, and package that information in a form suitable for once-daily scanning and cheminformatics manipulation.&lt;/p&gt;

&lt;h4&gt;Conclusions&lt;/h4&gt;

&lt;p&gt;Like no other medium, the Web blurs artificial distinctions: between work and play; between private and public; between on-topic and off-topic; between fame and obscurity; between mine and yours; between big and small; and between profit and non-profit. Chemistry may be late to the party, but is not immune to its call.&lt;/p&gt;</description>
      <pubDate>Wed, 14 May 2008 11:58:00 +0000</pubDate>
      <guid isPermaLink="false">urn:uuid:804a7467-98a1-47ae-975a-b1fdd172f1c0</guid>
      <author>Rich Apodaca</author>
      <link>http://depth-first.com/articles/2008/05/14/the-daily-molecule-the-wonders-of-chemistry-one-molecule-at-a-time</link>
      <category>Meta</category>
      <category>dailymolecule</category>
      <category>scientificpublication</category>
      <category>chempedia</category>
      <category>wikipedia</category>
      <category>journal</category>
      <category>web</category>
      <category>rails</category>
      <category>ruby</category>
      <category>simplelog</category>
    </item>
  </channel>
</rss>
