Roll Your Own Chemical Database With Free Components
Are you thinking of building a free chemical database but would rather not rent and maintain a bunch of proprietary software components? Norbert Haider has thought a lot about this problem and offers some helpful resources to get you started:
- Creating a web-based, searchable molecular structure database using free software Step-by step case study
- How to create a web-based molecular structure database with free software A presentation
- checkmol/matchmol Open source command-line utility for 2D (sub)structure matching
- mol2ps Command-line utility for converting molfiles into Postscript files
Haider's system can be deployed on commodity hardware running open source operating systems. In other words, the cost of setting up a system like the one he describes is practically zero.
Creating and open sourcing your own custom components is one way to go. Building on top of existing open source tools like CDK, Open Babel, Octet and JOELib is another.
Haider's work raises an interesting question. Has anyone assembled a complete, ready to install general purpose chemical database package built from open source components? It for no other reason, such an exercise would give an excellent idea of what the dogfood tastes like.