<?xml version="1.0" encoding="UTF-8"?>
<?xml-stylesheet type="text/css" href="/stylesheets/rss.css"?>
<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:trackback="http://madskills.com/public/xml/rss/module/trackback/">
  <channel>
    <title>Depth-First: Category Meta</title>
    <link>http://depth-first.com/articles/category/meta</link>
    <language>en-us</language>
    <ttl>40</ttl>
    <description>Walking the Web of Chemical Informatics</description>
    <item>
      <title>The Daily Molecule: The Wonders of Chemistry - One Molecule at a Time</title>
      <description>&lt;p&gt;&lt;a href="http://blog.chempedia.com"&gt;&lt;img src="http://depth-first.com/demo/20080513/chempedia.png" align="right"&gt;&lt;/img&gt;&lt;/a&gt;Chemistry is a big field judged by any standard, including the &lt;a href="http://depth-first.com/articles/2008/05/07/1908-and-all-that-the-long-tail-and-chemistry"&gt;proliferation of American Chemical Society (ACS) divisions&lt;/a&gt;. Each subdiscipline in chemistry is in turn so big, that once a chemist becomes 'differentiated' it's easy to lose touch even with neighboring subdisciplines. It doesn't have to be that way. This article introduces a new service, &lt;a href="http://blog.chempedia.com"&gt;&lt;em&gt;The Daily Molecule&lt;/em&gt;&lt;/a&gt; designed to make it just a little bit easier (and hopefully fun) to stay in the chemical loop.&lt;/p&gt;

&lt;h4&gt;What Is It?&lt;/h4&gt;

&lt;p&gt;The idea is simple: every weekday, a new molecule will be featured on &lt;em&gt;The Daily Molecule&lt;/em&gt; with a short write-up and some leading references. Although molecules in the news will get first priority, any molecule is fair game.&lt;/p&gt;

&lt;p&gt;The material for &lt;em&gt;The Daily Molecule&lt;/em&gt; will be drawn from &lt;a href="http://chempedia.com"&gt;Chempedia&lt;/a&gt;, which in turn gets some of its content from &lt;a href="http://wikipedia.org"&gt;Wikipedia&lt;/a&gt;. In other words, the entries on the Daily Molecule will be largeley written by my fellow chemists.&lt;/p&gt;

&lt;p&gt;The process of creating a &lt;em&gt;Daily Molecule&lt;/em&gt; entry is not time-consuming, but much of what is being done manually now could be automated in the future. The technology platform lends itself well to many forms of chemistry-specific modification (see below).&lt;/p&gt;

&lt;p&gt;I hesitate to use the term 'blog' to describe &lt;em&gt;The Daily Molecule&lt;/em&gt;, but the description may be helpful to an extent.&lt;/p&gt;

&lt;p&gt;&lt;em&gt;The Daily Molecule&lt;/em&gt; is unlike a blog in that most content will be generated by others, selected by some criteria, reformatted for consistency, and published. In that sense, &lt;em&gt;The Daily Molecule&lt;/em&gt; is a something like a mini scientific journal, but it turns the process of acquiring content on its head.&lt;/p&gt;

&lt;p&gt;If chemistry ever evolves beyond the &lt;a href="http://depth-first.com/articles/2007/07/16/go-west-young-man-does-open-access-really-matter-in-the-long-run"&gt;current model of publication&lt;/a&gt;, which seems inevitable at this point, the journals of the future may resemble &lt;em&gt;The Daily Molecule&lt;/em&gt; in one or more ways.&lt;/p&gt;

&lt;h4&gt;Technology&lt;/h4&gt;

&lt;p&gt;The software running &lt;em&gt;The Daily Molecule&lt;/em&gt; is a modified version of &lt;a href="http://simplelog.net/"&gt;SimpleLog&lt;/a&gt;, a Web application based on &lt;a href="http://www.rubyonrails.org/"&gt;Ruby on Rails&lt;/a&gt;. Unlike most blogging engines, SimpleLog focuses on implementing only the most basic publication features, and doing them to perfection. If you know a little Ruby and can work with Rails, you can do a lot with SimpleLog.&lt;/p&gt;

&lt;p&gt;One of the first items of business will be to implement &lt;a href="http://depth-first.com/articles/2007/09/18/six-reasons-i-like-recaptcha-or-how-to-build-a-web-service-worth-talking-about"&gt;reCAPTCHA&lt;/a&gt; support and activate comments on articles.&lt;/p&gt;

&lt;p&gt;Some ideas for chemically-enabling &lt;em&gt;The Daily Molecule&lt;/em&gt; include a graphical abstract sidebar and (sub)structure search. Currently, the 2D chemical structure images posted to &lt;em&gt;The Daily Molecule&lt;/em&gt; &lt;a href="http://depth-first.com/articles/2007/08/08/never-draw-the-same-molecule-twice-viewing-image-metadata"&gt;have complete connection tables embedded as metadata&lt;/a&gt;, a feature with some interesting possibilities.&lt;/p&gt;

&lt;h4&gt;The Molecule of the Day/Week/Month&lt;/h4&gt;

&lt;p&gt;The basic idea behind &lt;em&gt;The Daily Molecule&lt;/em&gt; is not new. Many other services have sprung up over the last ten years that operate, at least on the surface, similarly. Some examples:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;&lt;a href="http://www.moleculeoftheday.com/"&gt;Molecule of the Day&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="http://portal.acs.org/portal/acs/corg/content?_nfpb=true&amp;amp;_pageLabel=PP_TRANSITIONMAIN&amp;amp;node_id=677&amp;amp;use_sec=false&amp;amp;sec_url_var=region1"&gt;ACS Molecule of the Week&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="http://www.drugsandpoisons.com/"&gt;Drugs and Poisons&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="http://the-half-decent-pharmaceutical-chemistry-blog.chemblogs.org/category/saturday-night-synthesis"&gt;Saturday Night Synthesis&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="http://www.chm.bris.ac.uk/motm/motm.htm"&gt;The Molecule of the Month&lt;/a&gt; (may be the oldest continuously-operated MOTM site in existence)&lt;/li&gt;
&lt;li&gt;&lt;a href="http://www.3dchem.com/motm.asp"&gt;3dchem.com Molecule of the Month&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="http://www.expasy.org/spotlight/"&gt;Protein Spotlight&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="http://mgl.scripps.edu/people/goodsell/illustration/pdb"&gt;PDB Molecule of the Month&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="http://www.prous.com/molecules/default.asp"&gt;Prous Molecule of the Month&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Quite a few others don't appear on this list.&lt;/p&gt;

&lt;p&gt;The different idea behind the &lt;em&gt;The Daily Molecule&lt;/em&gt; is that chemical content already exists in on the Web in machine-readable format with licenses that permit its re-use; all that's needed is a way to aggregate, format, and package that information in a form suitable for once-daily scanning and cheminformatics manipulation.&lt;/p&gt;

&lt;h4&gt;Conclusions&lt;/h4&gt;

&lt;p&gt;Like no other medium, the Web blurs artificial distinctions: between work and play; between private and public; between on-topic and off-topic; between fame and obscurity; between mine and yours; between big and small; and between profit and non-profit. Chemistry may be late to the party, but is not immune to its call.&lt;/p&gt;</description>
      <pubDate>Wed, 14 May 2008 11:58:00 -0400</pubDate>
      <guid isPermaLink="false">urn:uuid:804a7467-98a1-47ae-975a-b1fdd172f1c0</guid>
      <author>Rich Apodaca</author>
      <link>http://depth-first.com/articles/2008/05/14/the-daily-molecule-the-wonders-of-chemistry-one-molecule-at-a-time</link>
      <category>Meta</category>
      <category>dailymolecule</category>
      <category>scientificpublication</category>
      <category>chempedia</category>
      <category>wikipedia</category>
      <category>journal</category>
      <category>web</category>
      <category>rails</category>
      <category>ruby</category>
      <category>simplelog</category>
    </item>
    <item>
      <title>Building Chempedia: Start Simple, Then Iterate</title>
      <description>&lt;p&gt;&lt;a href="http://chempedia.com"&gt;&lt;img src="http://depth-first.com/demo/20080513/chempedia.png" align="right"&gt;&lt;/img&gt;&lt;/a&gt;As a medium for building software, the Web offers unparalleled adaptability. With nothing to download or install, users of Web applications automatically see the newest version - always. This may sound like a small thing, and technically it is. But it dramatically increases the effectiveness with which software can be created. &lt;a href="http://depth-first.com/articles/2008/04/28/building-chempedia-indexing-wikipedias-6-411-compound-monographs"&gt;The previous article in this series&lt;/a&gt; introduced &lt;a href="http://chempedia.com"&gt;Chempedia&lt;/a&gt;, the free Chemical encyclopedia and cheminformatics Web application. This article will discuss the process by which Chempedia will become a better service over time.&lt;/p&gt;

&lt;h4&gt;Iterative Web Application Development&lt;/h4&gt;

&lt;p&gt;Chempedia, like all actively-developed software, is a work in progress. It will be built in stages starting with the addition of new features, followed by a round of user feedback, bug fixing, and stabilization. This will then be followed by the next major iteration, and so on.&lt;/p&gt;

&lt;p&gt;This iterative design style is ideally suited for Web applications. Because the barrier to pushing out new versions is essentially non-existent, a Web application can evolve at a much more rapid rate than other kinds of software. Indeed, the first version of a Web application need only work well enough to prove a point.&lt;/p&gt;

&lt;p&gt;One of the keys to iterative Web development is a technology framework designed to facilitate it. Chempedia is being developed with &lt;a href="http://rubyonrails.com/"&gt;Ruby on Rails&lt;/a&gt;, a tool that enables Web developers to take full advantage of the iterative development style the Web makes possible.&lt;/p&gt;

&lt;p&gt;Another key element of iterative Web development is users willing to explore the system and offer criticism. Evolution succeeds only when the environment stresses an ecosystem; the same is true in Web application development.&lt;/p&gt;

&lt;p&gt;Chempedia will take full advantage of the evolutionary nature of Web application development. As features are added and (hopefully) use of the service grows, Chempedia will evolve in ways that are impossible to predict today.&lt;/p&gt;

&lt;h4&gt;What's Wrong With Chempedia?&lt;/h4&gt;

&lt;p&gt;If you happened to take a look at Chempedia last week (that version is now no longer visible), you probably noticed many, many things that needed improvement. Some concerns were in the areas of:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;&lt;p&gt;Navigation. Navigation works best when the right granularity of options is achieved. Chempedia's navigation system grouped both closely-related and dissimilar actions at the same level.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;Metaphor. The initial idea behind Chempedia was to see what happened when PubChem's chemical structures were mashed up with Wikiepia articles, using &lt;a href="http://depth-first.com/articles/2007/05/21/simple-cas-number-lookup-with-pubchem"&gt;CAS numbers&lt;/a&gt; as the common link. The site design reflected this, with no clear organizing principle other than mashup. However, after the initial demonstration of the success of this approach, it became clear that Chempedia was strikingly similar in both form and function to the &lt;a href="http://depth-first.com/articles/2008/04/28/building-chempedia-indexing-wikipedias-6-411-compound-monographs"&gt;Merck Index&lt;/a&gt;. Perhaps this should be used as a clue in deriving a better organizing principle.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;Wikipedia integration. The old Chempedia site didn't make it nearly as convenient as is should be to create or edit compound monographs. Because Chempedia serves as a chemically-aware front-end for Wikipedia, the easier it is to get to Wikipedia from Chempedia, the better.&lt;/p&gt;&lt;/li&gt;
&lt;/ul&gt;

&lt;h4&gt;What Changed?&lt;/h4&gt;

&lt;p&gt;During the process of trying to fix Chempedia's problems, it became clear that a major redesign was in order. This consisted of:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Creating a landing page oriented toward search.&lt;/strong&gt; Using the Merck Index as a metaphor suggested that &lt;a href="http://chempedia.com"&gt;Chempedia's landing page&lt;/a&gt; should be designed around search, not browsing - as it was originally designed.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Emphasizing compound monographs, not compounds.&lt;/strong&gt; Chempedia's central organizing principle is now the Compound Monograph. One way this is seen is in the new URL structure, which makes it very easy to see where a Chempedia link is about to take you. For example, consider the URL for &lt;a href="http://chempedia.com/monographs/benzene"&gt;benzene&lt;/a&gt;. Another way this can be seen is in the inclusion of &lt;a href="http://chempedia.com/monographs/virginiamycin"&gt;Compound Monographs lacking a chemical structure&lt;/a&gt;.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Designing a streamlined menu system.&lt;/strong&gt; The main menu system has been broken down into just three main categories: &lt;a href="http://chempedia.com/"&gt;Search&lt;/a&gt;; &lt;a href="http://chempedia.com/monographs"&gt;Browse&lt;/a&gt;; and &lt;a href="http://chempedia.com/monographs/new"&gt;Create&lt;/a&gt;. These headings refer to actions on Compound Monographs, again in line with their importance as an organizing principle.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Promoting better integration with Wikipedia.&lt;/strong&gt; After experimenting with a few implementation possibilities, it is now possible to edit Wikipedia articles directly from the Chempedia site, thanks to the use of &lt;a href="http://en.wikipedia.org/wiki/IFrame"&gt;inline frame&lt;/a&gt;. Once again, this capability is tied to the Compound Monograph, from which editing and updating links are accessible.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Striving for comprehensive Wikipedia coverage.&lt;/strong&gt; Wikipedia had far more compound monographs than could be found on Chempedia, &lt;a href="http://depth-first.com/articles/2008/04/28/building-chempedia-indexing-wikipedias-6-411-compound-monographs"&gt;6,411 of them&lt;/a&gt;, to be precise. Chempedia now contains all of them, regardless of whether a chemical structure can be found based on a CAS number in PubChem. This includes inorganics, organometallics, polymers, mixtures, and polypeptides.&lt;/p&gt;&lt;/li&gt;
&lt;/ul&gt;

&lt;h4&gt;Miles to Go Yet&lt;/h4&gt;

&lt;p&gt;Chempedia is far from being finished. For example, you'll notice many instances in which a Compound Monograph is &lt;a href="http://chempedia.com/monographs/parthenolide"&gt;truncated&lt;/a&gt;. This arises from difficulties in parsing Wikipedia's &lt;a href="http://en.wikipedia.org/wiki/Wikilink"&gt;Wikitext&lt;/a&gt; format (more on this later).&lt;/p&gt;

&lt;p&gt;Ultimately, the full text of each Wikipedia article will be present on Chempedia rather than just the first introductory paragraph. But it will take a significant amount of work to ensure that each article's Wikitext entry can be parsed faithfully.&lt;/p&gt;

&lt;p&gt;Chempedia allows search by CAS number, PubChem CID and exact title. Full-text searching is not yet implemented, nor is autocomplete search, both of which would greatly enhance the usability of the service.&lt;/p&gt;

&lt;p&gt;Exact structure searching is made possible by the &lt;a href="http://metamolecular.com/chemwriter"&gt;ChemWriter&lt;/a&gt; editor in combination with &lt;a href="http://en.wikipedia.org/wiki/SHA-1"&gt;SHA-1&lt;/a&gt; hashed &lt;a href="http://depth-first.com/articles/2007/09/27/inchi-for-newbies"&gt;InChIs&lt;/a&gt;. Substructure search and query atom search will ultimately be added, but for an encyclopedia containing relatively few molecules, most of which having trivial names, this isn't yet seen as being critical.&lt;/p&gt;

&lt;p&gt;You'll notice many Monographs on Chempedia that have no structure information. Behind the scenes, Chempedia uses the 350,000+ CAS numbers now contained in the &lt;a href="http://pubchem.ncbi.nlm.nih.gov/"&gt;PubChem&lt;/a&gt; database to associate a chemical structure with a Wikipedia article. In the future, these associations will be made by Chempedia and Wikipedia users, which will allow every Chempedia small-molecule Monograph to have a structure associated with it. (It will also create a rather large, publicly-curated, open database of CAS numbers linked to chemical structures, but that's a story for another time).&lt;/p&gt;

&lt;h4&gt;Your Feedback is Essential&lt;/h4&gt;

&lt;p&gt;Finally, many of the changes made in this iteration were the result of conversions with chemists and developers. If you see something on Chempedia that just doesn't work for you, please don't be shy about &lt;a href="http://chempedia.com/messages/new"&gt;saying so&lt;/a&gt;. Feedback is an essential ingredient in making Chempedia the best service it can be.&lt;/p&gt;</description>
      <pubDate>Tue, 13 May 2008 11:38:00 -0400</pubDate>
      <guid isPermaLink="false">urn:uuid:63df5614-92fb-4363-a060-212645be6315</guid>
      <author>Rich Apodaca</author>
      <link>http://depth-first.com/articles/2008/05/13/building-chempedia-start-simple-then-iterate</link>
      <category>Meta</category>
      <category>chempedia</category>
      <category>evolution</category>
      <category>webapplication</category>
      <category>rails</category>
      <category>compoundmonograph</category>
      <category>merckindex</category>
      <category>iteration</category>
    </item>
    <item>
      <title>The Economics of Free: Chris Anderson on Charlie Rose</title>
      <description>&lt;p&gt;&lt;center&gt;&lt;embed id="VideoPlayback" style="width: 400px; height: 326px" src="http://video.google.com/googleplayer.swf?docId=-8119949202706402691:17000:1338000&amp;amp;hl=en" type="application/x-shockwave-flash" flashvars=""&gt;&lt;/center&gt;&lt;/p&gt;

&lt;p&gt;Anderson's comments on the Long Tail and social networks are especially on-target, and &lt;a href="http://depth-first.com/articles/2008/05/07/1908-and-all-that-the-long-tail-and-chemistry"&gt;relevant to the sciences&lt;/a&gt;.&lt;/p&gt;</description>
      <pubDate>Sat, 10 May 2008 13:35:00 -0400</pubDate>
      <guid isPermaLink="false">urn:uuid:98981297-e456-4c32-b466-f5327a070b43</guid>
      <author>Rich Apodaca</author>
      <link>http://depth-first.com/articles/2008/05/10/the-economics-of-free-chris-anderson-on-charlie-rose</link>
      <category>Meta</category>
      <category>thelongtail</category>
      <category>chrisanderson</category>
      <category>video</category>
    </item>
    <item>
      <title>Building a Unique Chemistry Journal: Responses to Questions from Nature Chemistry</title>
      <description>&lt;p&gt;&lt;a href="http://www.nature.com/nchem/index.html"&gt;&lt;img src="http://depth-first.com/demo/20080508/nature_chemistry.gif" align="right"&gt;&lt;/img&gt;&lt;/a&gt;&lt;a href="http://blogs.nature.com/thescepticalchymist/author/neil_withers/"&gt;Neil Withers&lt;/a&gt; of the soon-to-be-launched chemistry journal &lt;a href="http://www.nature.com/nchem/index.html"&gt;&lt;em&gt;Nature Chemistry&lt;/em&gt;&lt;/a&gt; has &lt;a href="http://blogs.nature.com/thescepticalchymist/2008/05/jj_day_98_service_with_a_simpl.html"&gt;asked for feedback&lt;/a&gt; to some questions about the best ways to display chemistry research papers on the Web. Here are some responses:&lt;/p&gt;

&lt;blockquote&gt;
    &lt;p&gt;(1) HTML vs PDF: does anyone read the HTML articles? Do you read the PDF on-screen or print it out?&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;I've used PDFs both for offline archiving and sharing of especially important articles as well as one-off printing of a paper I'm interested in. I rarely read a paper on-screen if I can avoid it.&lt;/p&gt;

&lt;p&gt;Typical workflow: (1) download PDF; (2) print it out; (3); let paper sit while I go do something in the lab that can't wait (or bring it with me); (4) put paper onto a rather large stack of papers just like it; (5) pull paper out of stack from time to time as needed; (6) (optional) file paper in an increasingly chaotic system of folders or recycle it.&lt;/p&gt;

&lt;p&gt;This system is bad, and &lt;a href="http://depth-first.com/articles/2007/03/22/why-i-still-dont-use-connotea"&gt;I cursed it weekly during my time as a research chemist&lt;/a&gt;. Most of my colleagues had similar experiences.&lt;/p&gt;

&lt;p&gt;There are plenty of opportunities to address pain points with the Web. Some ideas:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;&lt;p&gt;Make it &lt;em&gt;very&lt;/em&gt; easy to find papers on the &lt;em&gt;Nature Chemistry&lt;/em&gt; site. If I know a paper is trivial to find, I'm less likely to print it out in the first place. Good search may not be enough (see question 3).&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;Make the online version as readable as it can be. Minimize fluff like menus, ads and general clutter. Maximize things that promote readability like reasonable column-widths, appropriate fonts, and attractive and readable images.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;Add conveniences that make it easier to read the paper online such as hover-popups that display 2D chemical structures for trivial names and IUPAC nomenclature (see below).&lt;/p&gt;&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Paper is portable but Web documents are alive. Both can be readable - for example, I never print out a blog posting to read it.&lt;/p&gt;

&lt;blockquote&gt;
    &lt;p&gt;(2) Big vs little graphics: what does everyone else think about the tiny size of the graphics in ACS html articles?&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;Graphics should be sized appropriately. ACS HTML articles are a good example of failing to &lt;a href="http://depth-first.com/articles/2007/09/28/designing-the-obvious"&gt;design the obvious&lt;/a&gt;. You'd never read a blog post that looked like those articles, so it's not surprising everyone prints out the PDF.&lt;/p&gt;

&lt;p&gt;Another problem is over-wide columns. It's puzzling why journal publishers would ignore all of their hard-won design experience just because a document appears as a Web page. If the ACS used a narrower column width, the Web version would be more readable. For example, check out &lt;a href="http://www.beilstein-journals.org/bjoc/single/articleFullText.htm?vt=f&amp;amp;publicId=1860-5397-4-2&amp;amp;bpn=latest&amp;amp;dos=0"&gt;this article&lt;/a&gt; from &lt;a href="http://www.beilstein-journals.org/bjoc"&gt;&lt;em&gt;Beilstein Journal of Organic Chemistry&lt;/em&gt;&lt;/a&gt;. The only thing I'd change is to make the font larger.&lt;/p&gt;

&lt;p&gt;Both problems are correctable using the right software and techniques.&lt;/p&gt;

&lt;blockquote&gt;
    &lt;p&gt;(3) Tagging/&#8217;semantic web&#8217;: what do you think about the toys on the RSC&#8217;s Project Prospect? What kind of things would you like to see tagged/linked to other content in Nature Chemistry? For instance, Steve would love to do something with named reactions.&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;If by tagging, you mean giving users the ability to tag articles like &lt;a href="http://flickr.com"&gt;Flickr&lt;/a&gt; allows photos to be tagged, and for other users to make use of those tags while searching, I think it's &lt;a href="http://depth-first.com/articles/2007/01/18/collective-intelligence-and-the-dumbness-of-crowds"&gt;long overdue and could be a game-changer&lt;/a&gt;. It would clearly play to the strength of the Web as a medium.&lt;/p&gt;

&lt;p&gt;I must confess that I'm not a fan of the implementation of &lt;a href="http://www.rsc.org/Publishing/Journals/ProjectProspect/FAQ.asp"&gt;Project Prospect&lt;/a&gt;, although the idea has a lot going for it. There's too much bling and a lot of it fails on my Linux/Firefox 2 system.&lt;/p&gt;

&lt;p&gt;The one Prospect feature well worth adapting would be the one that lets you get a 2D structure by clicking on a trivial name or IUPAC name. But there's a much better way to implement it:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;&lt;p&gt;Turn it on by default and get rid of the floating right-hand menu.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;Make the structure appear, without clicking, by simply hovering the mouse over the trivial name or IUPAC nomenclature. Be sure the delay is set right so that it's not popping up unintentionally.&lt;/p&gt;&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;That's all there is to it. It needn't be complex, just usable.&lt;/p&gt;

&lt;p&gt;Another possibility: harvest all of the 2D molecular structures appearing in articles over a given period of time to be displayed in a dense, hyperlinked &lt;a href="http://depth-first.com/articles/2006/12/11/hacking-molbank-creating-a-graphical-table-of-contents"&gt;graphical abstract format&lt;/a&gt; ideal for quick browsing.&lt;/p&gt;

&lt;blockquote&gt;
    &lt;p&gt;(4) 3D molecular structures: do these help your understanding of a paper?&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;Rarely, and in many cases they just add clutter. For almost all small molecules, a properly laid-out and well-drawn 2D chemical structure is more useful. If a central point of discussion in a paper is a 3D structure, then that &lt;em&gt;would&lt;/em&gt; be a good use of the technology.&lt;/p&gt;

&lt;blockquote&gt;
    &lt;p&gt;(5) How useful to you are InChIs and SMILES?&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;Not very. Research chemists rarely care about this kind of technology. They'd much rather have &lt;a href="http://depth-first.com/articles/2008/02/12/the-art-and-science-of-chemical-structure-diagrams-double-trouble"&gt;a good-looking 2D chemical structure&lt;/a&gt;. InChIs and SMILES, if available, should be &lt;a href="http://depth-first.com/articles/2006/09/05/the-automatic-encoding-of-chemical-structures"&gt;hidden away and only brought out when requested&lt;/a&gt;. A more basic problem is &lt;a href="http://depth-first.com/articles/tag/flexmol"&gt;neither system will be able to encode all of the molecules&lt;/a&gt; your journal's authors are likely to discuss.&lt;/p&gt;

&lt;blockquote&gt;
    &lt;p&gt;(6) Forward linking: the RSC and Elsevier/Science Direct offer this &#8211; do you use it? Would you use an RSS feed that alerted you to new citations of a particular paper.&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;It could be useful provided that clutter could be kept to a minimum. It's essentially a form of linkback (see below).&lt;/p&gt;

&lt;p&gt;An RSS feed that published linkback activity might be useful, but many of the chemists I know still don't know what RSS is. On the other hand, a page (or email service) that could keep an interested reader updated on linkback activity on all of their papers of interest simultaneously could be very useful.&lt;/p&gt;

&lt;blockquote&gt;
    &lt;p&gt;(7) Would you actually comment on papers if there was a comments box at the end?&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;&lt;a href="http://chem-bla-ics.blogspot.com/2008/05/re-what-should-nature-chemistry-paper.html"&gt;Like Egon Willighagen&lt;/a&gt;, I'd probably use &lt;a href="http://depth-first.com"&gt;my blog&lt;/a&gt; to do it.&lt;/p&gt;

&lt;p&gt;However, most chemists don't maintain blogs or other websites and for them I can see how the ability to post comments would be useful.&lt;/p&gt;

&lt;p&gt;Both kinds of users could be accommodated through a combination of comments and &lt;a href="http://en.wikipedia.org/wiki/Linkback"&gt;linkbacks&lt;/a&gt;. Provided that a good spam filtration system were used, this two-pronged approach might be very useful to readers.&lt;/p&gt;

&lt;p&gt;Blogs are just the tip of the iceberg, though. Web publication technologies are creating all kinds of opportunities for creating &lt;a href="http://depth-first.com/articles/2008/05/07/1908-and-all-that-the-long-tail-and-chemistry"&gt;highly focused, constantly evolving, collaborative mini-reviews on special topics&lt;/a&gt;. Linkbacks would create value for both readers and authors of these mini-reviews as well as forward-thinking scientific publications that embrace them.&lt;/p&gt;

&lt;blockquote&gt;
    &lt;p&gt;(8) We really like the Biochemical Society&#8217;s HTML article style (&lt;a href="http://www.biochemj.org/bj/ev/381/0329/bj3810329_ev.htm"&gt;sample one here&lt;/a&gt;) &#8211; do you?&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;No. Frames makes that site very difficult to navigate.&lt;/p&gt;

&lt;p&gt;It will be very interesting to see how Nature Publishing Group takes advantage of its opportunity to create something unique among chemistry publications. Asking the kinds of questions they're asking now, and doing so in the way they're doing it, shows they're at least on the right track.&lt;/p&gt;</description>
      <pubDate>Thu, 08 May 2008 14:48:00 -0400</pubDate>
      <guid isPermaLink="false">urn:uuid:22584ebd-fdde-4369-9924-b63213df357c</guid>
      <author>Rich Apodaca</author>
      <link>http://depth-first.com/articles/2008/05/08/building-a-unique-chemistry-journal-responses-to-questions-from-nature-chemistry</link>
      <category>Meta</category>
      <category>naturechemistry</category>
      <category>scientificpublication</category>
      <category>journal</category>
      <category>designingtheobvious</category>
      <category>linkback</category>
      <category>minireview</category>
      <category>openaccess</category>
    </item>
    <item>
      <title>1908 and All That: The Long Tail and Chemistry</title>
      <description>&lt;p&gt;&lt;a href="http://longtail.com/"&gt;&lt;img src="http://depth-first.com/demo/20080507/longtail.jpg" align="right"&gt;&lt;/img&gt;&lt;/a&gt;Quite a few &lt;a href="http://acs.org"&gt;American Chemical Society&lt;/a&gt; (ACS) divisions are celebrating their 100th anniversaries this year. While this fact may at first glance seem like just a piece of nerdy trivia, Rudy Baum, Editor-in-chief of &lt;a href="http://pubs.acs.org/cen/"&gt;C&amp;amp;E News&lt;/a&gt; decided to dig deeper. And what he found was the &lt;a href="http://longtail.com/"&gt;Long Tail&lt;/a&gt; of chemistry, alive and well - in 1908.&lt;/p&gt;

&lt;p&gt;&lt;a href="http://pubs.acs.org/cen/editor/86/8618editor.html"&gt;In his editorial&lt;/a&gt;, Baum describes how he looked for the causes of the sudden appearance of so many ACS divisions in 1908. At its core, he found a growing realization on the part of influential chemists at the time that ACS membership was becoming too diverse in their interests and areas of specialization:&lt;/p&gt;

&lt;blockquote&gt;
    &lt;p&gt;Specialization in subdisciplines of chemistry was also much on ACS members' minds in these years. Some members felt strongly that subdivisions of some sort should be created in the society to provide a venue for chemists from these areas to meet separate from the society as a whole. It was noted that chemists were going off and forming their own specialized organizations in areas like electrochemistry, biological chemistry, and agricultural chemistry.&lt;/p&gt;
    
    &lt;p&gt;As early as 1903, ACS established a committee of five distinguished members to look into this issue, with Massachusetts Institute of Technology's Arthur A. Noyes as the chairman. (Throughout its history, ACS has responded to challenges by creating committees!) The committee reported to the ACS Council at its June 1, 1903, meeting, and strongly recommended that "Divisions of the Society be established representing different important branches of chemistry."&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;For those familiar with the work of Chris Anderson, what's being described is nothing other than the &lt;a href="http://www.longtail.com/about.html"&gt;Long Tail&lt;/a&gt;:&lt;/p&gt;

&lt;blockquote&gt;
    &lt;p&gt;The theory of the Long Tail is that our culture and economy is increasingly shifting away from a focus on a relatively small number of "hits" (mainstream products and markets) at the head of the demand curve and toward a huge number of niches in the tail. As the costs of production and distribution fall, especially online, there is now less need to lump products and consumers into one-size-fits-all containers. In an era without the constraints of physical shelf space and other bottlenecks of distribution, narrowly-targeted goods and services can be as economically attractive as mainstream fare.&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;How much money does it cost to set up a new ACS division? Probably not that much. How big is the field of chemistry? Vast. Put the two together, and you have a recipe for today's ACS. A &lt;a href="http://depth-first.com/articles/2007/08/27/the-long-tail-and-chemistry-why-so-many-acs-meeting-talks-are-uninteresting"&gt;recent Depth-First article&lt;/a&gt; described this phenomenon. And C&amp;amp;E News itself maintains a (static?) &lt;a href="http://cenlongtail.wordpress.com/"&gt;blog on the Long Tail as it applies to chemical employment&lt;/a&gt;.&lt;/p&gt;

&lt;p&gt;What does any of this have to do with chemical informatics? Although it may be tempting to think of chemists as a homogeneous group sharing a great deal of experience and knowledge, the proliferation of ACS divisions suggests otherwise. It seems reasonable to think that successful chemical information systems would do well to &lt;a href="http://depth-first.com/articles/2008/04/28/building-chempedia-indexing-wikipedias-6-411-compound-monographs"&gt;take this into account in their design and implementation&lt;/a&gt;.&lt;/p&gt;</description>
      <pubDate>Wed, 07 May 2008 10:37:00 -0400</pubDate>
      <guid isPermaLink="false">urn:uuid:022d9246-a3c9-4d03-95ba-131a255a8a45</guid>
      <author>Rich Apodaca</author>
      <link>http://depth-first.com/articles/2008/05/07/1908-and-all-that-the-long-tail-and-chemistry</link>
      <category>Meta</category>
      <category>longtail</category>
      <category>chemistry</category>
      <category>acs</category>
      <category>divisions</category>
    </item>
    <item>
      <title>Building Chempedia: Indexing Wikipedia's 6,411 Compound Monographs</title>
      <description>&lt;p&gt;&lt;img src="http://depth-first.com/demo/20080428/merck.png" align="right"&gt;&lt;/img&gt;&lt;a href="http://www.merckbooks.com/mindex/"&gt;The Merck Index&lt;/a&gt; is one of chemistry's most useful reference works. Organized like an encyclopedia, each entry, or "Compound Monograph," describes a single compound complete with chemical structure, CAS Number, IUPAC name, trivial names, physical properties, and leading primary literature references describing uses. Unlike other chemistry databases, the Merck Index focuses on only those compounds with important industrial, biological, medical, or technical applications.&lt;/p&gt;

&lt;h4&gt;What's Wrong with the Merck Index?&lt;/h4&gt;

&lt;p&gt;Wonderful product though it may be, the Merck Index has some limitations. For starters, online versions are not free. The disadvantages of this access model go well beyond a simple price barrier; it prevents the very thing the Web was designed to promote: linking. Another limitation is the time it takes for new versions to appear, which is typically measured in years. Still another limitation is in the cost of adding entries for niche compounds that may not be suitable for a general audience, a major barrier to exposing &lt;a href="http://depth-first.com/articles/2007/08/27/the-long-tail-and-chemistry-why-so-many-acs-meeting-talks-are-uninteresting"&gt;chemistry's long tail&lt;/a&gt;.&lt;/p&gt;

&lt;h4&gt;What's Chempedia?&lt;/h4&gt;

&lt;p&gt;If we wanted to create a free, online service that worked like the Merck Index but which took full advantage of today's powerful collaboration and information technology tools, how could we go about doing so?&lt;/p&gt;

&lt;p&gt;This article, the first in a series, discusses &lt;a href="http://chempedia.com"&gt;Chempedia&lt;/a&gt;, a free, structure-oriented online encyclopedia of useful chemical compounds designed to answer this question.&lt;/p&gt;

&lt;h4&gt;Background&lt;/h4&gt;

&lt;p&gt;The following articles may be useful in understanding Chempedia's approach and underlying technology:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;&lt;p&gt;&lt;a href="http://depth-first.com/articles/2008/04/17/user-created-compound-monographs-on-chempedia-net-open-sourcing-the-collation-and-indexing-of-chemical-information"&gt;User-Created Compound Monographs on Chempedia.net: Open Sourcing the Collation and Indexing of Chemical Information&lt;/a&gt;&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;a href="http://depth-first.com/articles/2008/04/04/chempedia-net-mashing-up-pubchem-and-wikipedia"&gt;Chempedia.net: Mashing Up PubChem and Wikipedia&lt;/a&gt;&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;a href="http://depth-first.com/articles/2008/04/02/wikipedia-for-cheminformatics-a-simple-web-api-for-finding-cas-numbers-in-compound-monographs"&gt;Wikipedia for Cheminformatics: A Simple Web API for Finding CAS Numbers in Compound Monographs&lt;/a&gt;&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;a href="http://depth-first.com/articles/2007/01/24/thirty-two-free-chemistry-databases"&gt;Thirty-Two Free Chemistry Databases&lt;/a&gt;&lt;/p&gt;&lt;/li&gt;
&lt;/ul&gt;

&lt;h4&gt;Where to Begin?&lt;/h4&gt;

&lt;p&gt;One of the first problems we'd face in building a free Web-based version of the Merck Index is where to get the compound monographs.&lt;/p&gt;

&lt;p&gt;It turns out that &lt;a href="http://wikipedia.org"&gt;Wikipedia&lt;/a&gt; (yes, Wikipedia) hosts a growing collection of compound monographs that, when viewed together, bear a striking resemblance to the Merck Index. And the effort is becoming increasingly organized with respect to content and data provenance.&lt;/p&gt;

&lt;p&gt;Why not start here?&lt;/p&gt;

&lt;h4&gt;The Task at Hand&lt;/h4&gt;

&lt;p&gt;To get an idea of just how Wikipedia's collection of compound monographs compares to the Merck Index, it helps to know: (1) how to find Wikipedia compound monographs; and (2) the range of information available for each entry.&lt;/p&gt;

&lt;p&gt;This tutorial will describe a simple method to index Wikipedia's compound monographs using nothing but free tools and data. Subsequent articles will discuss qualitative aspects of Wikipedia's compound monographs and the challenges involved in organizing them into a chemically-aware service.&lt;/p&gt;

&lt;h4&gt;Indexing Wikipedia's Compound Monographs&lt;/h4&gt;

&lt;p&gt;We can index Wikipedia compound monographs via a simple procedure.&lt;/p&gt;

&lt;p&gt;Most compound monographs employ one of four precompiled Wikpedia templates: &lt;a href="http://en.wikipedia.org/wiki/Template:Chembox"&gt;Chembox&lt;/a&gt; (deprecated); &lt;a href="http://en.wikipedia.org/wiki/Template:Chembox_new"&gt;Chembox new&lt;/a&gt;; &lt;a href="http://en.wikipedia.org/wiki/Template:Drugbox"&gt;Drugbox&lt;/a&gt;; and &lt;a href="http://en.wikipedia.org/wiki/Template:Explosivebox"&gt;Explosivebox&lt;/a&gt;. As an example of what these templates look like, see the right-hand box on Wikipedia's entry on &lt;a href="http://en.wikipedia.org/wiki/Modafinil"&gt;modafinil&lt;/a&gt;. To index Wikipedia's compound monographs, all we need to do is find the titles of all articles using one of these four templates.&lt;/p&gt;

&lt;p&gt;To get started, we'll need a local copy of Wikipedia. The complete set of all Wikipedia articles, as of March 12, 2008 can be &lt;a href="http://download.wikimedia.org/enwiki/20080312/enwiki-20080312-pages-articles.xml.bz2"&gt;downloaded here&lt;/a&gt;. This data dump is updated periodically, so you may have access to a more recent version.&lt;/p&gt;

&lt;p&gt;The Wikipedia dump, which contains the full text of every article in Wikipedia, consists of a 3.5 GB file in &lt;a href="http://www.bzip.org/"&gt;BZip2&lt;/a&gt; format. Fortunately, we won't need to inflate it to index its chemical content.&lt;/p&gt;

&lt;p&gt;The following code will scan the raw Wikipedia dump and produce a list of all compound monograph titles:&lt;/p&gt;

&lt;div class="typocode"&gt;&lt;pre&gt;&lt;code class="typocode_ruby "&gt;&lt;span class="ident"&gt;title&lt;/span&gt; &lt;span class="punct"&gt;=&lt;/span&gt; &lt;span class="punct"&gt;&amp;quot;&lt;/span&gt;&lt;span class="string"&gt;&lt;/span&gt;&lt;span class="punct"&gt;&amp;quot;&lt;/span&gt;
&lt;span class="ident"&gt;log&lt;/span&gt; &lt;span class="punct"&gt;=&lt;/span&gt; &lt;span class="constant"&gt;File&lt;/span&gt;&lt;span class="punct"&gt;.&lt;/span&gt;&lt;span class="ident"&gt;new&lt;/span&gt; &lt;span class="punct"&gt;'&lt;/span&gt;&lt;span class="string"&gt;monographs.txt&lt;/span&gt;&lt;span class="punct"&gt;',&lt;/span&gt; &lt;span class="punct"&gt;&amp;quot;&lt;/span&gt;&lt;span class="string"&gt;w&lt;/span&gt;&lt;span class="punct"&gt;&amp;quot;&lt;/span&gt;

&lt;span class="keyword"&gt;while&lt;/span&gt;&lt;span class="punct"&gt;((&lt;/span&gt;&lt;span class="ident"&gt;line&lt;/span&gt; &lt;span class="punct"&gt;=&lt;/span&gt; &lt;span class="constant"&gt;STDIN&lt;/span&gt;&lt;span class="punct"&gt;.&lt;/span&gt;&lt;span class="ident"&gt;gets&lt;/span&gt;&lt;span class="punct"&gt;))&lt;/span&gt;
  &lt;span class="ident"&gt;line&lt;/span&gt;&lt;span class="punct"&gt;.&lt;/span&gt;&lt;span class="ident"&gt;match&lt;/span&gt; &lt;span class="punct"&gt;/&amp;lt;&lt;/span&gt;&lt;span class="ident"&gt;title&lt;/span&gt;&lt;span class="punct"&gt;&amp;gt;(.*)&amp;lt;\/&lt;/span&gt;&lt;span class="regex"&gt;title&amp;gt;&lt;/span&gt;&lt;span class="punct"&gt;/&lt;/span&gt;

  &lt;span class="ident"&gt;if&lt;/span&gt; &lt;span class="global"&gt;$1&lt;/span&gt;
    &lt;span class="ident"&gt;title&lt;/span&gt; &lt;span class="punct"&gt;=&lt;/span&gt; &lt;span class="global"&gt;$1&lt;/span&gt;

    &lt;span class="keyword"&gt;next&lt;/span&gt;
  &lt;span class="keyword"&gt;end&lt;/span&gt;

  &lt;span class="keyword"&gt;if&lt;/span&gt; &lt;span class="ident"&gt;line&lt;/span&gt;&lt;span class="punct"&gt;.&lt;/span&gt;&lt;span class="ident"&gt;match&lt;/span&gt; &lt;span class="punct"&gt;/\{\{(&lt;/span&gt;&lt;span class="ident"&gt;chembox&lt;/span&gt;&lt;span class="punct"&gt;|&lt;/span&gt;&lt;span class="ident"&gt;drugbox&lt;/span&gt;&lt;span class="punct"&gt;|&lt;/span&gt;&lt;span class="ident"&gt;explosivebox&lt;/span&gt;&lt;span class="punct"&gt;)/&lt;/span&gt;&lt;span class="ident"&gt;i&lt;/span&gt;
    &lt;span class="keyword"&gt;unless&lt;/span&gt; &lt;span class="ident"&gt;title&lt;/span&gt; &lt;span class="punct"&gt;==&lt;/span&gt; &lt;span class="punct"&gt;&amp;quot;&lt;/span&gt;&lt;span class="string"&gt;&lt;/span&gt;&lt;span class="punct"&gt;&amp;quot;&lt;/span&gt; &lt;span class="punct"&gt;||&lt;/span&gt; &lt;span class="ident"&gt;title&lt;/span&gt;&lt;span class="punct"&gt;.&lt;/span&gt;&lt;span class="ident"&gt;match&lt;/span&gt;&lt;span class="punct"&gt;(/&lt;/span&gt;&lt;span class="regex"&gt;:&lt;/span&gt;&lt;span class="punct"&gt;/)&lt;/span&gt;
      &lt;span class="ident"&gt;puts&lt;/span&gt; &lt;span class="ident"&gt;title&lt;/span&gt;
      &lt;span class="ident"&gt;log&lt;/span&gt;&lt;span class="punct"&gt;.&lt;/span&gt;&lt;span class="ident"&gt;puts&lt;/span&gt; &lt;span class="ident"&gt;title&lt;/span&gt;
      &lt;span class="ident"&gt;log&lt;/span&gt;&lt;span class="punct"&gt;.&lt;/span&gt;&lt;span class="ident"&gt;flush&lt;/span&gt;

      &lt;span class="ident"&gt;title&lt;/span&gt; &lt;span class="punct"&gt;=&lt;/span&gt; &lt;span class="punct"&gt;&amp;quot;&lt;/span&gt;&lt;span class="string"&gt;&lt;/span&gt;&lt;span class="punct"&gt;&amp;quot;&lt;/span&gt;
    &lt;span class="keyword"&gt;end&lt;/span&gt;
  &lt;span class="keyword"&gt;end&lt;/span&gt;
&lt;span class="keyword"&gt;end&lt;/span&gt;

&lt;span class="ident"&gt;log&lt;/span&gt;&lt;span class="punct"&gt;.&lt;/span&gt;&lt;span class="ident"&gt;close&lt;/span&gt;&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;

&lt;p&gt;Saving this code into a file called &lt;strong&gt;filter.rb&lt;/strong&gt;, we can run it by piping the output of &lt;tt&gt;bzcat&lt;/tt&gt; on the raw dump file:&lt;/p&gt;

&lt;div class="console"&gt;
&lt;pre&gt;
$ bzcat &amp;lt;path_to_dump&amp;gt;/enwiki-20080312-pages-articles.xml.bz2 | ruby filter.rb
&lt;/pre&gt;
&lt;/div&gt;

&lt;p&gt;Alphabetizing the output file gives a complete listing of Wikipedia's compound monograph titles (all 6,411 of them), which for convenience can be &lt;a href="http://depth-first.com/demo/20080428/compound_monographs_20080315.txt"&gt;downloaded here&lt;/a&gt;.&lt;/p&gt;

&lt;p&gt;We can construct a URL to each Wikipedia compound monograph by prepending the title with &lt;strong&gt;http://wikipedia.org/wiki/&lt;/strong&gt;. In other words, our program's output can be used both as a list of chemical names and as a hash of chemical names to Wikipedia URLs. And with the URL in hand, &lt;a href="http://depth-first.com/articles/2008/04/02/wikipedia-for-cheminformatics-a-simple-web-api-for-finding-cas-numbers-in-compound-monographs"&gt;all kinds of interesting things can be done&lt;/a&gt;.&lt;/p&gt;

&lt;h4&gt;Limitations&lt;/h4&gt;

&lt;p&gt;Although easy to carry out, the procedure described here has some limitations:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Monographs added after March 12, 2008 are not visible.&lt;/li&gt;
&lt;li&gt;Monographs that don't use the chembox, chembox new, drugbox, or explosivebox templates are not visible.&lt;/li&gt;
&lt;li&gt;A very small number of articles erroneously use the chembox template, for example &lt;a href="http://en.wikipedia.org/wiki/Iraq%27s_Chemical_Warfare"&gt;this one&lt;/a&gt;.&lt;/li&gt;
&lt;/ul&gt;

&lt;h4&gt;Chempedia Redesign&lt;/h4&gt;

&lt;p&gt;Currently, Chempedia doesn't include all 6,411 monographs but rather a subset created by a much less comprehensive indexing method. As part of a major redesign of the site, all Wikipedia compound monographs will be available on Chempedia, which should result in a much more useful service.&lt;/p&gt;

&lt;h4&gt;Conclusions&lt;/h4&gt;

&lt;p&gt;Wikipedia is fast becoming a major storehouse of chemical information with tantalizing potential for creating powerful new services for chemists. More to the point for cheminformatics, the entire Wikipedia dataset can be downloaded and reprocessed free of charge; Wikipedia is one of those rare cheminformatics datasets that is &lt;a href="http://depth-first.com/articles/2006/09/27/hacking-pubchem-free-speech-or-free-beer"&gt;both free as in speech and free as in beer&lt;/a&gt;.&lt;/p&gt;

&lt;p&gt;As this article has shown, some simple programming is all it takes to begin doing useful things with Wikipedia's chemical content. Future articles will discuss some of the possibilities.&lt;/p&gt;</description>
      <pubDate>Mon, 28 Apr 2008 18:22:00 -0400</pubDate>
      <guid isPermaLink="false">urn:uuid:6980ce0d-0482-48ba-9489-ca1235632f66</guid>
      <author>Rich Apodaca</author>
      <link>http://depth-first.com/articles/2008/04/28/building-chempedia-indexing-wikipedias-6-411-compound-monographs</link>
      <category>Meta</category>
      <category>chempedia</category>
      <category>wikipedia</category>
      <category>compoundmonograph</category>
      <category>bzip2</category>
      <category>merckindex</category>
    </item>
    <item>
      <title>Thinking of Founding a Science Startup? Look to What's Getting Cheaper</title>
      <description>&lt;p&gt;&lt;a href="http://flickr.com/photos/mathoov/2428918221/"&gt;&lt;img src="http://depth-first.com/demo/20080422/startup.jpg" align="right"&gt;&lt;/img&gt;&lt;/a&gt;&lt;a href="http://mndoci.com/blog/blog/"&gt;Deepak Singh&lt;/a&gt; recently started an &lt;a href="http://mndoci.com/blog/2008/04/20/will-we-ever-see-a-biosciences-startup-school/"&gt;interesting discussion&lt;/a&gt; (and &lt;a href="follow-up](http://mndoci.com/blog/2008/04/21/continuing-thoughts-on-innovation-models/"&gt;follow-up&lt;/a&gt;) about the need for organizations that help early-stage bioscience startups in the same way that &lt;a href="http://ycombinator.com/"&gt;YCombinator&lt;/a&gt; does in the Web space. But having just attended my second YC &lt;a href="http://startupschool.org/"&gt;Startup School&lt;/a&gt;, I'm left with a new-found appreciation of the role startup economics plays in shaping not just the startup landscape, but the culture of entrepreneurship that goes with it.&lt;/p&gt;

&lt;p&gt;There's a world of difference between the kinds of startups YCombinator is interested in and the kind of startup most chemists and biologists would be in a position to found. As told by Paul Graham of YCombinator, &lt;a href="http://paulgraham.com/webstartups.html"&gt;founding a Web startup is cheap, and that changes everything&lt;/a&gt;:&lt;/p&gt;

&lt;blockquote&gt;
    &lt;p&gt;There's something interesting happening right now. Startups are undergoing the same transformation that technology does when it becomes cheaper.&lt;/p&gt;
    
    &lt;p&gt;It's a pattern we see over and over in technology. Initially there's some device that's very expensive and made in small quantities. Then someone discovers how to make them cheaply; many more get built; and as a result they can be used in new ways.&lt;/p&gt;
    
    &lt;p&gt;Computers are a familiar example. When I was a kid, computers were big, expensive machines built one at a time. Now they're a commodity. Now we can stick computers in everything.&lt;/p&gt;
    
    &lt;p&gt;This pattern is very old. Most of the turning points in economic history are instances of it. It happened to steel in the 1850s, and to power in the 1780s. It happened to cloth manufacture in the thirteenth century, generating the wealth that later brought about the Renaissance. Agriculture itself was an instance of this pattern.&lt;/p&gt;
    
    &lt;p&gt;Now as well as being produced by startups, this pattern is happening to startups. It's so cheap to start web startups that orders of magnitudes more will be started. If the pattern holds true, that should cause dramatic changes.&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;Contrast the options available for a computer science student with those of a biology or chemistry student.&lt;/p&gt;

&lt;p&gt;The computer science student enjoys access to state-of-the-art tools that have been commoditized to the point of being either completely free or very close to it: hardware; hosting; operating systems; programming languages; development frameworks; source code management tools; and, increasingly Web services. More than one multimillion-dollar Web startup has been founded with nothing more than a laptop, a dorm room, some macaroni, a few friends, and a good idea or two.&lt;/p&gt;

&lt;p&gt;The life- or physical science student is faced with quite a different reality. Everything needed in getting started costs money - lots of money: lab space; instruments; consumables; a patent lawyer or two; and regulatory approval, both for day-to-day operations and possibly for the product to be sold.&lt;/p&gt;

&lt;p&gt;Then there's the problem of time to market. A Web startup can go from nothing to finished product over a summer vacation. Depending on the product being sold, a science startup may take ten years or more to do the same.&lt;/p&gt;

&lt;p&gt;This glacial product development cycle leaves the science startup with almost no room for error. In contrast, the Web startup is in a position to start offering a significantly flawed product early on and then iterate until it's perfect.&lt;/p&gt;

&lt;p&gt;These contrasting situations go a long way to explaining why bioscience startups tend to be founded by thirty- or fourtysomethings and Web startups can be and are founded by teenagers.&lt;/p&gt;

&lt;p&gt;With ready access to cheap means of production, Web startups enjoy many advantages that science startups can only dream of. For one thing, a product can actually be developed before approaching outside investors even becomes necessary. It's even possible to build a profitable Web startup &lt;a href="http://depth-first.com/articles/2008/04/21/building-a-technology-company-the-old-fashioned-way"&gt;purely from the profits&lt;/a&gt; created by selling the finished product.&lt;/p&gt;

&lt;p&gt;The bio or chemistry startup, on the other hand, will tend to be dependent to varying degrees on outside investors from the beginning. In some cases, the University hosting a science startup's early-phase research will play the role of outside investor, much to the founders' disadvantage.&lt;/p&gt;

&lt;p&gt;What do you get when you combine a need for large sums of money up-front with a need for almost perfect execution? A recipe for failing in business more frequently than anybody else.&lt;/p&gt;

&lt;p&gt;We might expect this situation to change if the cost of founding a startup in the life- or physical sciences dropped significantly. It may take a little imagination to see this as a possibility right now. But the process of new markets forming when technolgy becomes radically cheaper is a fundamental feature of captitalist societies that has played out time and again over the last several hundred years.&lt;/p&gt;

&lt;p&gt;If a transformation is in store for the economics of biotech and chemistry startups, what could trigger it?&lt;/p&gt;

&lt;p&gt;&lt;em&gt;Image Credit: &lt;a href="http://flickr.com/photos/mathoov/"&gt;mathoov&lt;/a&gt;&lt;/em&gt;&lt;/p&gt;</description>
      <pubDate>Tue, 22 Apr 2008 17:59:00 -0400</pubDate>
      <guid isPermaLink="false">urn:uuid:8fdb3c62-fab7-46ff-8978-06174c9d6bca</guid>
      <author>Rich Apodaca</author>
      <link>http://depth-first.com/articles/2008/04/22/thinking-of-founding-a-science-startup-look-to-whats-getting-cheaper</link>
      <category>Meta</category>
      <category>startup</category>
      <category>bbgm</category>
      <category>startupschool</category>
      <category>commoditization</category>
      <category>biotech</category>
      <category>chemistry</category>
    </item>
    <item>
      <title>Building a Technology Company the Old-Fashioned Way</title>
      <description>&lt;p&gt;&lt;center&gt;&lt;object type="application/x-shockwave-flash" height="263" width="320" id="jtv_player_flash" data="http://www.justin.tv/widgets/jtv_tip_embed.swf" bgcolor="#000000"&gt;&lt;param name="movie" value="http://www.justin.tv/widgets/jtv_tip_embed.swf" /&gt;&lt;param name="allowFullScreen" value="true" /&gt;&lt;param name="flashvars" value="auto_play=false&amp;amp;start_volume=25&amp;amp;title=DHH Talk - Startup School 2008&amp;amp;start_time=1208631951000&amp;amp;end_time=1208633866000&amp;amp;channel=hackertv&amp;amp;tip_id=97862" /&gt;&lt;/object&gt;&lt;/center&gt;&lt;br /&gt;&lt;/p&gt;

&lt;p&gt;What's the secret to making money on the Web? &lt;a href="http://www.loudthinking.com/"&gt;David Heinemeier Hansson's&lt;/a&gt; talk at &lt;a href="http://depth-first.com/articles/2008/03/19/startup-school-2008-at-stanford"&gt;Startup School 2008&lt;/a&gt; offers a much-needed, but all too easy to ignore, dose of reality. Although some of his comments aren't nearly as funny out of &lt;a href="http://www.justin.tv/hackertv/episodes?order=most_recent"&gt;context&lt;/a&gt;, this presentation is well worth the time for anyone serious about building a solid technology business.&lt;/p&gt;</description>
      <pubDate>Mon, 21 Apr 2008 08:44:00 -0400</pubDate>
      <guid isPermaLink="false">urn:uuid:6b34c1c6-8e96-4664-a023-a9300046991a</guid>
      <author>Rich Apodaca</author>
      <link>http://depth-first.com/articles/2008/04/21/building-a-technology-company-the-old-fashioned-way</link>
      <category>Meta</category>
      <category>dhh</category>
      <category>startupschool</category>
      <category>business</category>
      <category>company</category>
      <category>price</category>
      <category>profit</category>
    </item>
    <item>
      <title>Is reCAPTCHA Trying to Tell Me Something?</title>
      <description>&lt;p&gt;&lt;center&gt;&lt;img src="http://depth-first.com/demo/20080418/recap.png"&gt;&lt;/img&gt;&lt;/center&gt; &lt;/p&gt;</description>
      <pubDate>Fri, 18 Apr 2008 10:59:00 -0400</pubDate>
      <guid isPermaLink="false">urn:uuid:2c3be54c-4b6c-40d9-b5d0-6163afb99b12</guid>
      <author>Rich Apodaca</author>
      <link>http://depth-first.com/articles/2008/04/18/is-recaptcha-trying-to-tell-me-something</link>
      <category>Meta</category>
      <category>recaptcha</category>
    </item>
    <item>
      <title>Casual Saturdays: Periodicity is Just a Theory</title>
      <description>&lt;p&gt;&lt;center&gt;&lt;img src="http://depth-first.com/demo/20080329/kansas_periodic.png"&gt;&lt;/img&gt;&lt;/center&gt;&lt;/p&gt;

&lt;p&gt;&lt;em&gt;Credit: &lt;a href="http://www.re-discovery.org/per_table.html"&gt;reDiscovery Institute&lt;/a&gt;&lt;/em&gt;&lt;/p&gt;</description>
      <pubDate>Sat, 29 Mar 2008 09:16:00 -0400</pubDate>
      <guid isPermaLink="false">urn:uuid:6bb33adb-2fdf-4c72-84cf-76bd2d1e4264</guid>
      <author>Rich Apodaca</author>
      <link>http://depth-first.com/articles/2008/03/29/casual-saturdays-periodicity-is-just-a-theory</link>
      <category>Meta</category>
      <category>periodictable</category>
      <category>kansas</category>
      <category>casualsaturdays</category>
    </item>
  </channel>
</rss>
