<?xml version='1.0' encoding='UTF-8'?><?xml-stylesheet href="http://www.blogger.com/styles/atom.css" type="text/css"?><feed xmlns='http://www.w3.org/2005/Atom' xmlns:openSearch='http://a9.com/-/spec/opensearchrss/1.0/' xmlns:georss='http://www.georss.org/georss' xmlns:gd='http://schemas.google.com/g/2005' xmlns:thr='http://purl.org/syndication/thread/1.0'><id>tag:blogger.com,1999:blog-14568785</id><updated>2012-01-12T13:21:56.436-05:00</updated><category term='imine'/><category term='logs'/><category term='ai'/><category term='open science'/><category term='collaboration'/><category term='4-benzyltoluene'/><category term='furfuryl'/><category term='NSF'/><category term='American Chemical Society'/><category term='laboratory protocol'/><category term='events'/><category term='proposal'/><category term='x-ray crystallography'/><category term='asist2007'/><category term='JSpecView'/><category term='4-hydroxybenzaldehyde'/><category term='Wikispaces'/><category term='academia'/><category term='aiai09'/><category term='t-shirt'/><category term='job'/><category term='melting point'/><category term='niso'/><category term='acfas'/><category term='PI'/><category term='license'/><category term='video'/><category term='parallel synthesis'/><category term='scio11'/><category term='priority'/><category term='open access'/><category term='green chemistry'/><category term='TFA'/><category term='phenylacetaldehyde'/><category term='Southampton'/><category term='dtp'/><category term='strychnine'/><category term='open notebook science'/><category term='THF'/><category term='thescientist'/><category term='msds'/><category term='prize'/><category term='Nature'/><category term='science2.0'/><category term='visualization'/><category term='synapticleap'/><category term='nature precedings'/><category term='workshop'/><category term='RDF'/><category term='onschallenge'/><category term='zazzle'/><category term='webinar'/><category term='drugbank'/><category term='lab notebook'/><category term='success'/><category term='definitions'/><category term='drug design'/><category term='falcipain'/><category term='praziquantel'/><category term='precipitation'/><category term='cheminformatics'/><category term='Crossfire'/><category term='chemistry'/><category term='organic chemistry'/><category term='bioscreencast'/><category term='chemistry central'/><category term='green solvent'/><category term='CAS'/><category term='VBA'/><category term='lecture'/><category term='interview'/><category term='patent'/><category term='drexel'/><category term='Mettler-Toledo'/><category term='Grubbs'/><category term='Gephi'/><category term='ACS'/><category term='paclitaxel'/><category term='Scribd'/><category term='clinical trials'/><category term='Google books'/><category term='screencasting'/><category term='wiley'/><category term='lise'/><category term='eScience'/><category term='Haumea'/><category term='Excel'/><category term='submeta'/><category term='hubbub'/><category term='education'/><category term='thesis'/><category term='challenge'/><category term='furan'/><category term='QSAR'/><category term='eChemistry'/><category term='ChemDoodle'/><category term='precedings'/><category term='#scio11'/><category term='ijcai09'/><category term='chembiofoo'/><category term='iminium salt'/><category term='FriendFeed'/><category term='lawva'/><category term='creative commons'/><category term='snapshot'/><category term='NaH'/><category term='leadership'/><category term='bradley'/><category term='logo'/><category term='Ugi reaction'/><category term='telephone gambit'/><category term='hemiacetal'/><category term='nature chemistry'/><category term='scifoo08'/><category term='biology'/><category term='course'/><category term='oxidation'/><category term='#oss2010'/><category term='docking'/><category term='Abraham'/><category term='proteins'/><category term='posters'/><category term='spectra prize NMR game'/><category term='Grant'/><category term='cml'/><category term='MiniMapper'/><category term='spectra'/><category term='23andme'/><category term='game SpectralGame'/><category term='methanol'/><category term='artificial intelligence'/><category term='repository'/><category term='ecrystals'/><category term='teaching'/><category term='#scio10'/><category term='reactive reports'/><category term='periodic table'/><category term='bryan vickery'/><category term='sigma-aldrich'/><category term='Google Co-op'/><category term='nci'/><category term='PLoS ONE'/><category term='opal'/><category term='RSC'/><category term='virtual library'/><category term='scifoo scifoo08'/><category term='enoyl reductase'/><category term='sflo'/><category term='scholarship'/><category term='reaction optimization'/><category term='syntheticpages'/><category term='NERM'/><category term='crowdsoucing'/><category term='ego'/><category term='happy accidents'/><category term='quiz'/><category term='Google'/><category term='ASIST'/><category term='wikipedia'/><category term='energy'/><category term='sql'/><category term='CombiUgi'/><category term='ELN'/><category term='Drug discovery'/><category term='Marketplace'/><category term='BCCE'/><category term='standards'/><category term='inkspot'/><category term='lab'/><category term='social media'/><category term='acsfall2008'/><category term='symposium'/><category term='Mechanical Turk'/><category term='cancer'/><category term='beer'/><category term='cmlreact'/><category term='astronomy'/><category term='JCAMP'/><category term='Warr'/><category term='chemical and engineering news'/><category term='collaborator'/><category term='Amazon'/><category term='funding'/><category term='TunedIT'/><category term='per contra'/><category term='open chemistry'/><category term='wolframalpha'/><category term='polymer'/><category term='application note'/><category term='temperature'/><category term='experimental design'/><category term='blue obelisk'/><category term='reactionattempts'/><category term='library'/><category term='solubility'/><category term='NCCB08'/><category term='screening'/><category term='density'/><category term='regression'/><category term='NIST'/><category term='second life'/><category term='spectral game'/><category term='sodium hydride'/><category term='tips'/><category term='Mendeley'/><category term='journal'/><category term='web service'/><category term='breast cancer'/><category term='ugichem2cml'/><category term='malaria'/><category term='IP'/><category term='resveratrol'/><category term='eclipse'/><category term='glatiramer'/><category term='science careers'/><category term='relaxation time'/><category term='ischool'/><category term='molecular descriptors'/><category term='research data'/><category term='open collaboration'/><category term='chemspider'/><category term='1cellPK'/><category term='nanoinformatics'/><category term='hubzero'/><category term='molecules'/><category term='DiscoveryGate'/><category term='curation'/><category term='tenure-track'/><category term='triples'/><category term='experiments'/><category term='XML'/><category term='Joan Slonczewski'/><category term='wald'/><category term='game'/><category term='GoogleDoc'/><category term='science online'/><category term='google scripts'/><category term='usefulchem'/><category term='chemtiles'/><category term='claims'/><category term='cdd'/><category term='social networks'/><category term='integration'/><category term='distillation'/><category term='#acs'/><category term='lulu'/><category term='BMS'/><category term='automation'/><category term='crowdsourcing'/><category term='scivee'/><category term='reaction attempts'/><category term='teaching lab'/><category term='IGERT'/><category term='Vioxx'/><category term='high throughput'/><category term='#acs_sf'/><category term='Gates Foundation'/><category term='trust'/><category term='planets'/><category term='open data'/><category term='nobel'/><category term='youtube'/><category term='hydrino'/><category term='conference'/><category term='chemmantis'/><category term='acetate'/><category term='MiniBlocks'/><category term='GoogleSpreadsheet'/><category term='inchi'/><category term='acid'/><category term='panel'/><category term='archive'/><category term='spreadsheet'/><category term='x2project'/><category term='peer review'/><category term='wikis'/><category term='SpectralGame'/><category term='tumor'/><category term='class'/><category term='biomolecules'/><category term='chemical information'/><category term='Indiana University'/><category term='O&apos;Reilly'/><category term='acsspring2008'/><category term='prediction'/><category term='reproducibility'/><category term='database'/><category term='science blogging'/><category term='cssp'/><category term='Beilstein'/><category term='UCSF'/><category term='postdoc'/><category term='science commons'/><category term='pKa'/><category term='back-up'/><category term='jove'/><category term='pubchem'/><category term='justscience2008'/><category term='inchimatic'/><category term='berkeley'/><category term='metathesis'/><category term='award'/><category term='book'/><category term='sponsor'/><category term='SLA'/><category term='demographics'/><category term='student'/><category term='open science summit'/><category term='mechanism'/><category term='brighttalk'/><category term='web interface'/><category term='3D'/><category term='SMILES'/><category term='web2.0'/><category term='NMR'/><category term='decompress'/><category term='intellectual property'/><category term='scienceblogging.com'/><category term='microsoft'/><category term='scifinder'/><category term='chemADVISOR'/><category term='SLA2011'/><category term='myexperiment'/><category term='model'/><category term='failure'/><category term='scholar2scholar'/><category term='scifoo'/><category term='dark data'/><category term='NASA'/><category term='medicine'/><category term='discovery'/><category term='Kenyon College'/><title type='text'>Useful Chemistry</title><subtitle type='html'>This blog chronicles the research of the UsefulChem project in the Bradley lab at Drexel University. The main project currently involves the synthesis of novel anti-malarial compounds. The work is done under Open Notebook Science conditions with the actual detailed lab notebook located at usefulchem.wikispaces.com. More general comments posted here relate to Open Science, especially when associated with chemistry.</subtitle><link rel='http://schemas.google.com/g/2005#feed' type='application/atom+xml' href='http://usefulchem.blogspot.com/feeds/posts/default'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/14568785/posts/default?max-results=100'/><link rel='alternate' type='text/html' href='http://usefulchem.blogspot.com/'/><link rel='hub' href='http://pubsubhubbub.appspot.com/'/><link rel='next' type='application/atom+xml' href='http://www.blogger.com/feeds/14568785/posts/default?start-index=101&amp;max-results=100'/><author><name>Jean-Claude Bradley</name><uri>http://www.blogger.com/profile/09157209601339686455</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='31' height='32' src='http://photos1.blogger.com/img/267/3447/320/headshot.jpg'/></author><generator version='7.00' uri='http://www.blogger.com'>Blogger</generator><openSearch:totalResults>522</openSearch:totalResults><openSearch:startIndex>1</openSearch:startIndex><openSearch:itemsPerPage>100</openSearch:itemsPerPage><entry><id>tag:blogger.com,1999:blog-14568785.post-7764966488390301203</id><published>2011-10-05T12:47:00.000-04:00</published><updated>2011-10-05T12:47:44.785-04:00</updated><title type='text'>MiniSymposium Bradley Lab 2011</title><content type='html'>I recently presented a 15 minute summary of the current research in my lab on September 29, 2011 at the Drexel University Department of Chemistry Faculty Mini-Symposium. The main project discussed was the &lt;a href="http://usefulchem.blogspot.com/2011/08/open-melting-point-collection-book.html"&gt;Open Melting Point Collection&lt;/a&gt; done in collaboration with &lt;a href="http://en.wikipedia.org/wiki/Andrew_S.I.D._Lang"&gt;Andrew Lang &lt;/a&gt;and &lt;a href="http://en.wikipedia.org/wiki/Antony_John_Williams"&gt;Antony Williams&lt;/a&gt;. Work by &lt;a href="http://onschallenge.wikispaces.com/students"&gt;Evan Curtin&lt;/a&gt; is also shown, demonstrating the application of melting point and solubility in reaction design.  I'll discuss this imine synthesis project in more detail later but &lt;a href="http://usefulchem.wikispaces.com/All+Reactions"&gt;Evan's experiments are listed in the notebook&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;&lt;div style="width:425px" id="__ss_9560399"&gt;&lt;strong style="display:block;margin:12px 0 4px"&gt;&lt;a href="http://www.slideshare.net/jcbradley/minisymp2011-bradley" title="MiniSymp2011 Bradley"&gt;MiniSymp2011 Bradley&lt;/a&gt;&lt;/strong&gt;&lt;object id="__sse9560399" height="355" width="425"&gt;&lt;param name="movie" value="http://static.slidesharecdn.com/swf/ssplayer2.swf?doc=minisymp2011-111005113007-phpapp02&amp;amp;stripped_title=minisymp2011-bradley&amp;amp;userName=jcbradley"&gt;&lt;param name="allowFullScreen" value="true"&gt;&lt;param name="allowScriptAccess" value="always"&gt;&lt;embed name="__sse9560399" src="http://static.slidesharecdn.com/swf/ssplayer2.swf?doc=minisymp2011-111005113007-phpapp02&amp;amp;stripped_title=minisymp2011-bradley&amp;amp;userName=jcbradley" type="application/x-shockwave-flash" allowscriptaccess="always" allowfullscreen="true" height="355" width="425"&gt;&lt;/embed&gt;&lt;/object&gt;&lt;div style="padding:5px 0 12px"&gt;View more &lt;a href="http://www.slideshare.net/"&gt;presentations&lt;/a&gt; from &lt;a href="http://www.slideshare.net/jcbradley"&gt;Jean-Claude Bradley&lt;/a&gt;.&lt;/div&gt;&lt;/div&gt;&lt;br /&gt;&lt;br /&gt;&lt;object height="400" width="480"&gt;&lt;param name="movie" value="http://www.scivee.tv/flash/embedCast.swf"&gt;&lt;param name="allowfullscreen" value="true"&gt;&lt;param name="allowscriptaccess" value="always"&gt;&lt;param name="flashvars" value="id=34677&amp;amp;type=3"&gt;&lt;param name="wmode" value="transparent"&gt;&lt;embed src="http://www.scivee.tv/flash/embedCast.swf" allowfullscreen="true" wmode="transparent" allowscriptaccess="always" flashvars="id=34677&amp;amp;type=3" height="400" width="480"&gt;&lt;/embed&gt;&lt;/object&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/14568785-7764966488390301203?l=usefulchem.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://usefulchem.blogspot.com/feeds/7764966488390301203/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=14568785&amp;postID=7764966488390301203' title='2 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/14568785/posts/default/7764966488390301203'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/14568785/posts/default/7764966488390301203'/><link rel='alternate' type='text/html' href='http://usefulchem.blogspot.com/2011/10/minisymposium-bradley-lab-2011.html' title='MiniSymposium Bradley Lab 2011'/><author><name>Jean-Claude Bradley</name><uri>http://www.blogger.com/profile/09157209601339686455</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='31' height='32' src='http://photos1.blogger.com/img/267/3447/320/headshot.jpg'/></author><thr:total>2</thr:total></entry><entry><id>tag:blogger.com,1999:blog-14568785.post-6078283422587262019</id><published>2011-08-18T15:04:00.000-04:00</published><updated>2011-08-18T15:04:39.588-04:00</updated><title type='text'>Patrick Ndungu talk at Drexel on Nanotechnology</title><content type='html'>One of my former Ph. D. students, &lt;a href="http://chemistrywst.ukzn.ac.za/Staff_profiles/Patrick-Ndungu.aspx"&gt;Patrick Ndungu&lt;/a&gt; (now at University of KwaZulu Natal, South Africa) will be speaking at Drexel University on &lt;span style="font-weight: bold;"&gt;Friday August 19, 2011 at 12:30&lt;/span&gt; in Disque 109.&lt;br /&gt;&lt;blockquote&gt;&lt;br /&gt;&lt;span style="font-weight:bold;"&gt;Some Interesting Perspectives on the Integration of Nanomaterials with Energy and Water Treatment Technologies&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;As part of various key concerns in a developing economy, clean energy and access to potable water are an integral part of most strategic visions for sustainable socio-economic development. Of particular interest is the search for greener energy solutions that includes R&amp;amp;D into hydrogen energy technologies, and devices that utilize solar energy. Whilst clean water concerns centre on indigenous, cost-effective, and relatively simple technologies that can be easily deployed in remote or off-grid areas. Within this framework, this presentation will look at the evolution of a select body of work that has focused on the integration of carbon Nanomaterials into systems for hydrogen storage, fuel cells, and photo-catalytic materials for water treatment.&lt;/blockquote&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/14568785-6078283422587262019?l=usefulchem.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://usefulchem.blogspot.com/feeds/6078283422587262019/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=14568785&amp;postID=6078283422587262019' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/14568785/posts/default/6078283422587262019'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/14568785/posts/default/6078283422587262019'/><link rel='alternate' type='text/html' href='http://usefulchem.blogspot.com/2011/08/patrick-ndungu-talk-at-drexel-on.html' title='Patrick Ndungu talk at Drexel on Nanotechnology'/><author><name>Jean-Claude Bradley</name><uri>http://www.blogger.com/profile/09157209601339686455</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='31' height='32' src='http://photos1.blogger.com/img/267/3447/320/headshot.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-14568785.post-6099063938600010097</id><published>2011-08-17T15:59:00.001-04:00</published><updated>2011-08-17T16:01:09.854-04:00</updated><title type='text'>Google Apps Scripts Workshop at Drexel University</title><content type='html'>&lt;a href="http://en.wikipedia.org/wiki/Andrew_S.I.D._Lang"&gt;Andrew Lang&lt;/a&gt; will be in Philadelphia next week and we will be running a workshop on &lt;span style="font-weight: bold;"&gt;Leveraging Google Spreadsheets with Scripts for Research and Teaching&lt;/span&gt;.  Now that our institution is no longer providing Microsoft Office for students in the fall term, it seems like a good time to explore converting some assignments and projects relying on Excel to freely available Google Spreadsheets. (&lt;a href="http://onswebservices.wikispaces.com/GoogleAppsScripts"&gt;Resources available here&lt;/a&gt;)&lt;br /&gt;&lt;blockquote&gt;Andrew Lang (Department of Mathematics at Oral Roberts University) and Jean-Claude Bradley (Department of Chemistry at Drexel University) will host a workshop on Google Apps Scripts from 10:30 to 12:00 on Tuesday August 23, 2011 at the Hagerty Library in room L13C. They will demonstrate how users with no programming experience can easily add functions and drop-down menus to a Google Spreadsheet.  Some chemistry examples will be detailed, such as inter-converting compound identifiers (common name, SMILES, CAS number, etc.) and reporting properties (melting points, solubility, density, etc.) with a single click. Participants are encouraged to suggest applications in other fields to explore during the workshop.&lt;/blockquote&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/14568785-6099063938600010097?l=usefulchem.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://usefulchem.blogspot.com/feeds/6099063938600010097/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=14568785&amp;postID=6099063938600010097' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/14568785/posts/default/6099063938600010097'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/14568785/posts/default/6099063938600010097'/><link rel='alternate' type='text/html' href='http://usefulchem.blogspot.com/2011/08/google-apps-scripts-workshop-at-drexel.html' title='Google Apps Scripts Workshop at Drexel University'/><author><name>Jean-Claude Bradley</name><uri>http://www.blogger.com/profile/09157209601339686455</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='31' height='32' src='http://photos1.blogger.com/img/267/3447/320/headshot.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-14568785.post-4940284562628598534</id><published>2011-08-11T16:02:00.000-04:00</published><updated>2011-08-11T16:03:04.939-04:00</updated><title type='text'>Open Melting Point Collection Book Edition 1</title><content type='html'>Several months of work through a collaboration between myself, &lt;a href="http://en.wikipedia.org/wiki/Andrew_S.I.D._Lang"&gt;Andrew Lang&lt;/a&gt;, &lt;a href="http://en.wikipedia.org/wiki/Antony_John_Williams"&gt;Antony Williams&lt;/a&gt; and &lt;a href="http://usefulchem.blogspot.com/2011/05/evan-curtin-is-may-2011-rsc-ons.html"&gt;Evan Curtin&lt;/a&gt; have culminated in the publication of an Open Melting Point Collection Book.  Like our other books on &lt;a href="http://onsbooks.wikispaces.com/ONS+Solubility+Challenge"&gt;solubility&lt;/a&gt; and &lt;a href="http://onsbooks.wikispaces.com/Reaction+Attempts"&gt;Reaction Attempts&lt;/a&gt;, the conversion from a database format to a PDF has several advantages.&lt;br /&gt;&lt;br /&gt;Now that the &lt;a href="http://precedings.nature.com/documents/6229/version/1"&gt;book has been accepted by Nature Precedings&lt;/a&gt;, it provides a convenient mechanism for citation via DOI, a formal author list, version control, etc.  The book is also now available from &lt;a href="http://www.lulu.com/product/paperback/open-notebook-science-melting-point-data/16526708"&gt;LuLu.com either as a free PDF download or a physical copy&lt;/a&gt;.   Because the book runs 699 pages (it covers 2706 unique compounds) the lowest price we could get is $30.96, which just covers printing and shipping.&lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://2.bp.blogspot.com/-1hRwlM2B8K0/TkQkB0UYd6I/AAAAAAAABFw/st0z99I4OEk/s1600/mpbook1.jpg"&gt;&lt;img style="display:block; margin:0px auto 10px; text-align:center;cursor:pointer; cursor:hand;width: 212px; height: 320px;" src="http://2.bp.blogspot.com/-1hRwlM2B8K0/TkQkB0UYd6I/AAAAAAAABFw/st0z99I4OEk/s400/mpbook1.jpg" alt="" id="BLOGGER_PHOTO_ID_5639672247017306018" border="0" /&gt;&lt;/a&gt;&lt;br /&gt;Even though we have melting points for about &lt;a href="http://onschallenge.wikispaces.com/Open+Melting+Point+Datasets"&gt;20,000 unique compounds&lt;/a&gt;, most of these are from single sources.  Unless we can get another major donation of melting points (not using any of the sources we already have), progress in curating single values manually will take time.&lt;br /&gt;&lt;br /&gt;As described in the abstract:&lt;br /&gt;&lt;blockquote&gt;This book represents a PDF version of Dataset ONSMP029 (2706 unique compounds, 7413 measurements) from a project to collect and curate melting points made available as Open Data. This particular collection was selected from the application of a threshold to favor the likelihood of reliability. Specifically, the entire range of averaged values for a data point was set to 0.01 C to 5 C, with at least two different measurements within this range. Measurements were pooled and processed from the following sources: Alfa Aesar, MDPI, Bergstrom, PhysProp, DrugBank, Bell, Oxford MSDS, Hughes, Griffiths and the Chemical Information Validation Spreadsheet. Links to all the information sources and web services are available from the Open Melting Point Resource page:&lt;a href="http://onschallenge.wikispaces.com/Open+Melting+Point+Datasets"&gt; http://onswebservices.wikispaces.com/meltingpoint&lt;/a&gt;&lt;/blockquote&gt;&lt;br /&gt;This filtering of double validated melting point measurements within a range of 5C is an attempt to provide a "reasonably" good source,  It is imperative to understand that this is not a "trusted source" - as I've mentioned several time there is no such thing.  However, since absolute trusted sources do not exist, this double validated dataset of 2706 compounds is probably the best we can do for now.  In fact, use of this double validated to build melting point model has led to some &lt;a href="http://onschallenge.wikispaces.com/MeltingPointModel004"&gt;excellent models&lt;/a&gt;, which are far superior to models constructed from the entire database of 20,000 compounds.&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/14568785-4940284562628598534?l=usefulchem.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://usefulchem.blogspot.com/feeds/4940284562628598534/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=14568785&amp;postID=4940284562628598534' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/14568785/posts/default/4940284562628598534'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/14568785/posts/default/4940284562628598534'/><link rel='alternate' type='text/html' href='http://usefulchem.blogspot.com/2011/08/open-melting-point-collection-book.html' title='Open Melting Point Collection Book Edition 1'/><author><name>Jean-Claude Bradley</name><uri>http://www.blogger.com/profile/09157209601339686455</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='31' height='32' src='http://photos1.blogger.com/img/267/3447/320/headshot.jpg'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://2.bp.blogspot.com/-1hRwlM2B8K0/TkQkB0UYd6I/AAAAAAAABFw/st0z99I4OEk/s72-c/mpbook1.jpg' height='72' width='72'/><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-14568785.post-4742848701341284676</id><published>2011-07-19T21:53:00.000-04:00</published><updated>2011-07-19T21:54:07.918-04:00</updated><title type='text'>Rapid analysis of melting point trends and models using Google Apps Scripts</title><content type='html'>I &lt;a href="http://usefulchem.blogspot.com/2011/07/practical-tips-on-using-google-apps.html"&gt;recently reported&lt;/a&gt; on how Google Apps Scripts can be used to facilitate the recording and calculations associated with a chemistry laboratory notebook. (also see &lt;a href="http://onswebservices.wikispaces.com/GoogleAppsScripts"&gt;resource page&lt;/a&gt;)&lt;br /&gt;&lt;br /&gt;I will demonstrate here how these scripts can be used to rapidly discover trends in the melting points of analogs for the curation of data and the evaluation of models.  The two melting point services that &lt;a href="http://en.wikipedia.org/wiki/Andrew_S.I.D._Lang"&gt;Andrew Lang&lt;/a&gt; created under the gONS menu were used to keep track of the measured and predicted melting points for all reactants and product as part of a "dashboard view" of the reaction being performed.&lt;br /&gt;&lt;br /&gt;For looking at melting point trends, the following &lt;a href="https://spreadsheets.google.com/spreadsheet/ccc?key=0Ak1R8T6wt4YQdHhvSVFvd0x6ZExqdlQ0NEd1MURZY0E&amp;amp;hl=en_US#gid=23"&gt;template sheet&lt;/a&gt; can be used.&lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://4.bp.blogspot.com/-ozsUnSEcIRQ/TiWArdzX-6I/AAAAAAAABEA/ROYX5zk7wDw/s1600/mptemp1.png"&gt;&lt;img style="display:block; margin:0px auto 10px; text-align:center;cursor:pointer; cursor:hand;width: 400px; height: 265px;" src="http://4.bp.blogspot.com/-ozsUnSEcIRQ/TiWArdzX-6I/AAAAAAAABEA/ROYX5zk7wDw/s400/mptemp1.png" alt="" id="BLOGGER_PHOTO_ID_5631048393319512994" border="0" /&gt;&lt;/a&gt;&lt;br /&gt;For &lt;a href="http://usefulchem.blogspot.com/2011/07/practical-tips-on-using-google-apps.html"&gt;reasons explained previously&lt;/a&gt;, the template sheet has no active scripts in the page (except for the images). These are just the values generated from running the scripts corresponding to the column headings on the common names. In order to use for another series of compounds just make a copy of the entire &lt;a href="https://spreadsheets.google.com/spreadsheet/ccc?key=0Ak1R8T6wt4YQdHhvSVFvd0x6ZExqdlQ0NEd1MURZY0E&amp;amp;hl=en_US#gid=23"&gt;Google Spreadsheet&lt;/a&gt; (File-&amp;gt;Make a Copy) then enter the new list and pick the desired script to run from the menus.  Once the values are computed remember to copy and paste as values.&lt;br /&gt;&lt;br /&gt;It is important to understand that &lt;span style="font-weight: bold;"&gt;our melting point service is not a "trusted source"&lt;/span&gt; - it simply reports the average of all recorded data sources, ignoring values marked as DONOUSE.  &lt;span style="font-weight: bold;"&gt;That means that not all data points are equal and it is up to the user to determine a threshold of some type to decide how to use a particular data point.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;In this investigation, I have marked in green averaged experimental values where at least 3 different values are clustered within a few degrees.  A link in column H is automatically generated from the CSID to provide a very convenient way to evaluate the data sources.  For example the &lt;a href="http://lxsrv7.oru.edu/%7Ealang/meltingpoints/meltingpointof.php?csid=864"&gt;link for methanol&lt;/a&gt; has &lt;span style="font-weight:bold;"&gt;3 very close but different&lt;/span&gt; melting point values: -98 C, -97.6 C and -97.53 C.  The -98 C value is repeated 7 times because this resulted from the automatic merging of several &lt;a href="http://onschallenge.wikispaces.com/Open+Melting+Point+Datasets"&gt;Open Collections&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://2.bp.blogspot.com/-hA8kOukZ_cY/TiWKZW9ZTPI/AAAAAAAABEI/21i3P0iGGU4/s1600/mptrends2.png"&gt;&lt;img style="display:block; margin:0px auto 10px; text-align:center;cursor:pointer; cursor:hand;width: 400px; height: 361px;" src="http://2.bp.blogspot.com/-hA8kOukZ_cY/TiWKZW9ZTPI/AAAAAAAABEI/21i3P0iGGU4/s400/mptrends2.png" alt="" id="BLOGGER_PHOTO_ID_5631059077361126642" border="0" /&gt;&lt;/a&gt;&lt;br /&gt;In general we don't manually add values that are identical from different sources because it is likely that these all originate from the same source.  We have to make that assumption because proper data provenance is usually lacking in chemical information sources today.  A Google search will often return the same one or two melting points from dozens of sites, which may turn out to be an outlier when compared with other independent sources. (CAS numbers are generated in the template sheet because they are useful for searching Google for melting points - for example &lt;a href="http://www.google.com/search?q=%2267-56-1%22+%22melting+point%22"&gt;see here for methanol&lt;/a&gt;)&lt;br /&gt;&lt;br /&gt;In another scenario where there are 3 or more different but close values and a  few clear marked outliers, I considered these averages as having passed my threshold and colored these green as well.  A good example is &lt;a href="http://lxsrv7.oru.edu/%7Ealang/meltingpoints/meltingpointof.php?csid=682"&gt;ethanol&lt;/a&gt;, which I have previously used to &lt;a href="http://usefulchem.blogspot.com/2011/05/more-open-melting-points-from-epi-and.html"&gt;illustrate our curation method&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://4.bp.blogspot.com/-9qfblY3Pbgg/TiWRZ9tBFyI/AAAAAAAABEQ/0DuN5exscKQ/s1600/mptemp3.png"&gt;&lt;img style="display:block; margin:0px auto 10px; text-align:center;cursor:pointer; cursor:hand;width: 389px; height: 400px;" src="http://4.bp.blogspot.com/-9qfblY3Pbgg/TiWRZ9tBFyI/AAAAAAAABEQ/0DuN5exscKQ/s400/mptemp3.png" alt="" id="BLOGGER_PHOTO_ID_5631066784342808354" border="0" /&gt;&lt;/a&gt;It turns out that for the &lt;a href="https://spreadsheets.google.com/spreadsheet/ccc?key=0Ak1R8T6wt4YQdHhvSVFvd0x6ZExqdlQ0NEd1MURZY0E&amp;amp;hl=en_US#gid=23"&gt;series of n-alcohols from methanol to 1-decanol&lt;/a&gt;, I was able to mark in green every experimental melting point average, making the confidence level of the following plot about as high as it can get from current chemical information sources.&lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://2.bp.blogspot.com/--IuGUxdLBkQ/TiWWLV2tKdI/AAAAAAAABEY/fH9LcxYzHiU/s1600/mptemp4.png"&gt;&lt;img style="display:block; margin:0px auto 10px; text-align:center;cursor:pointer; cursor:hand;width: 400px; height: 132px;" src="http://2.bp.blogspot.com/--IuGUxdLBkQ/TiWWLV2tKdI/AAAAAAAABEY/fH9LcxYzHiU/s400/mptemp4.png" alt="" id="BLOGGER_PHOTO_ID_5631072030685997522" border="0" /&gt;&lt;/a&gt;&lt;br /&gt;It is particularly gratifying to note that the predicted melting points based on Andrew Lang's random forest &lt;a href="http://onschallenge.wikispaces.com/MeltingPointModel002"&gt;Model002&lt;/a&gt; perform very well here, even predicting a melting point minimum at 3 carbons. Note that this model is Open Source and uses Open Descriptors derived from the CDK.  It does not yet include the results of our most recent curation efforts. Any new models incorporating improved datasets will be &lt;a href="http://onswebservices.wikispaces.com/meltingpoint"&gt;listed here&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;Extending the analysis to n-alkyl carboxylic acids from &lt;a href="https://spreadsheets.google.com/spreadsheet/ccc?key=0Ak1R8T6wt4YQdHhvSVFvd0x6ZExqdlQ0NEd1MURZY0E&amp;amp;hl=en_US#gid=24"&gt;formic acid to decanoic acid&lt;/a&gt; provides the following plot, with the same confidence for the experimental averages.&lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://2.bp.blogspot.com/-lECCXmw3lAo/TiXZumu8SOI/AAAAAAAABEg/23grE3UwW5k/s1600/mptemp5.jpg"&gt;&lt;img style="display:block; margin:0px auto 10px; text-align:center;cursor:pointer; cursor:hand;width: 400px; height: 148px;" src="http://2.bp.blogspot.com/-lECCXmw3lAo/TiXZumu8SOI/AAAAAAAABEg/23grE3UwW5k/s400/mptemp5.jpg" alt="" id="BLOGGER_PHOTO_ID_5631146303791319266" border="0" /&gt;&lt;/a&gt;&lt;br /&gt;For this series, the random forest model not only predicts that the lowest melting point is for the 5 carbon analog but it also appears to take the shape of a zig-zag pattern, especially for the first 6 acids.  Since this alternating pattern has been attributed to the way that carboxylic acid dimer bilayers pack in 3D (&lt;a href="http://pubs.rsc.org/en/Content/ArticleLanding/2004/NJ/b307208h"&gt;Bond2004&lt;/a&gt;), it is hard to imagine how simple 2D descriptors from the CDK can predict this.  We will have to investigate this in more detail.&lt;br /&gt;&lt;br /&gt;More generally, molecular symmetry can greatly affect the melting point via the way that crystals pack in 3D (see &lt;a href="http://en.wikipedia.org/wiki/Melting_point"&gt;Carnelley's Rule&lt;/a&gt;, &lt;a href="http://pubs.acs.org/doi/abs/10.1021/ed077p724"&gt;Brown2000&lt;/a&gt;).  At some point we would like to incorporate this factor in our models.  The current model should not be able to make predictions based on symmetry or stereochemistry.&lt;br /&gt;&lt;br /&gt;We can also explore the melting point patterns of cyclic systems.  Going from &lt;a href="https://spreadsheets.google.com/spreadsheet/ccc?key=0Ak1R8T6wt4YQdHhvSVFvd0x6ZExqdlQ0NEd1MURZY0E&amp;amp;hl=en_US#gid=14"&gt;cyclopropane to cyclohexane&lt;/a&gt; there is a large jump from a 5 to a 6 membered ring and this is roughly reflected in the model:&lt;br /&gt;&lt;br /&gt;&lt;div style="text-align: left;"&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://2.bp.blogspot.com/-H5gpuQEmcss/TiYTCD9hFPI/AAAAAAAABE4/cWQoA-erVV4/s1600/mptemp6.jpg"&gt;&lt;img style="display:block; margin:0px auto http://www.blogger.com/img/blank.gif10px; text-align:center;cursor:pointer; cursor:hand;width: 400px; height: 227px;" src="http://2.bp.blogspot.com/-H5gpuQEmcss/TiYTCD9hFPI/AAAAAAAABE4/cWQoA-erVV4/s400/mptemp6.jpg" alt="" id="BLOGGER_PHOTO_ID_5631209310217377010" border="0" /&gt;&lt;/a&gt;&lt;br /&gt;&lt;/div&gt;Cycloalkanones behave similarly to cycloalkanes, showing a jump from 5 to 6 membered rings which agrees well with the model going from &lt;a href="https://spreadsheets.google.com/spreadsheet/ccc?key=0Ak1R8T6wt4YQdHhvSVFvd0x6ZExqdlQ0NEd1MURZY0E&amp;amp;hl=en_US#gid=22"&gt;cyclobutanone to cyclohexanone&lt;/a&gt;:&lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://3.bp.blogspot.com/-TdqEfUvaaUw/TiYe2hYk8VI/AAAAAAAABFY/GMX0NrSFESU/s1600/mptemp8.jpg"&gt;&lt;img style="display:block; margin:0px auto 10px; text-align:center;cursor:pointer; cursor:hand;width: 400px; height: 199px;" src="http://3.bp.blogspot.com/-TdqEfUvaaUw/TiYe2hYk8VI/AAAAAAAABFY/GMX0NrSFESU/s400/mptemp8.jpg" alt="" id="BLOGGER_PHOTO_ID_5631222306096607570" border="0" /&gt;&lt;/a&gt;&lt;br /&gt;However, in going from &lt;a href="https://spreadsheets.google.com/spreadsheet/ccc?key=0Ak1R8T6wt4YQdHhvSVFvd0x6ZExqdlQ0NEd1MURZY0E&amp;amp;hl=en_US#gid=20"&gt;methylcylopropane to methylcyclohexane&lt;/a&gt;, the model diverges substantially from experimental results.  It does start to get harder to find corroborating melting points and only 2 values can be found for &lt;a href="http://lxsrv7.oru.edu/%7Ealang/meltingpoints/meltingpointof.php?csid=11232"&gt;methylcyclobutane&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://2.bp.blogspot.com/-NtJw-0VCGfI/TiYVhJdB7SI/AAAAAAAABFI/ETwYXx-VRrE/s1600/mptemp7.jpg"&gt;&lt;img style="display:block; margin:0px auto 10px; text-align:center;cursor:pointer; cursor:hand;width: 400px; height: 228px;" src="http://2.bp.blogspot.com/-NtJw-0VCGfI/TiYVhJdB7SI/AAAAAAAABFI/ETwYXx-VRrE/s400/mptemp7.jpg" alt="" id="BLOGGER_PHOTO_ID_5631212043290930466" border="0" /&gt;&lt;/a&gt;Going from &lt;a href="https://spreadsheets.google.com/spreadsheet/ccc?key=0Ak1R8T6wt4YQdHhvSVFvd0x6ZExqdlQ0NEd1MURZY0E&amp;amp;hl=en_US#gid=19"&gt;cyclopropanecarboxylic acid to cyclohexanecarboxylic acid&lt;/a&gt; shows a U-type pattern and is not well matched by the model.  However, there is additional uncertainty about the melting point of &lt;a href="http://lxsrv7.oru.edu/%7Ealang/meltingpoints/meltingpointof.php?csid=17789"&gt;cyclopentanecarboxylic acid&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://4.bp.blogspot.com/--SbQYxU3lwk/TiYjGYxq9JI/AAAAAAAABFg/PPOcJJGJZ2g/s1600/mptemp9.jpg"&gt;&lt;img style="display:block; margin:0px auto 10px; text-align:center;cursor:pointer; cursor:hand;width: 400px; height: 166px;" src="http://4.bp.blogspot.com/--SbQYxU3lwk/TiYjGYxq9JI/AAAAAAAABFg/PPOcJJGJZ2g/s400/mptemp9.jpg" alt="" id="BLOGGER_PHOTO_ID_5631226976710358162" border="0" /&gt;&lt;/a&gt;&lt;br /&gt;For the series from &lt;a href="https://spreadsheets.google.com/spreadsheet/ccc?key=0Ak1R8T6wt4YQdHhvSVFvd0x6ZExqdlQ0NEd1MURZY0E&amp;amp;hl=en_US#gid=21"&gt;cyclopropylamine to cyclohexylamine&lt;/a&gt;, there initially appears to be a significant mismatch between the model and experiment.  However, because we have retained the provenance information in the spreadsheet it becomes clear that the &lt;a href="http://lxsrv7.oru.edu/%7Ealang/meltingpoints/meltingpointof.php?csid=68166"&gt;cyclobutylamine number (in the orange square below) comes from a single source&lt;/a&gt;.  There is actually a good match between the other 3 values.  However, as demonstrated here, there has not been enough information on when the model is reliable to assign the source of the discrepancy at this point.&lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://2.bp.blogspot.com/-I6nAY_EIP6A/TiYk70aODFI/AAAAAAAABFo/FmSrXn5Hnx4/s1600/mptemp10.jpg"&gt;&lt;img style="display:block; margin:0px auto 10px; text-align:center;cursor:pointer; cursor:hand;width: 400px; height: 179px;" src="http://2.bp.blogspot.com/-I6nAY_EIP6A/TiYk70aODFI/AAAAAAAABFo/FmSrXn5Hnx4/s400/mptemp10.jpg" alt="" id="BLOGGER_PHOTO_ID_5631228994172882002" border="0" /&gt;&lt;/a&gt;&lt;br /&gt;These examples show that provenance information is a critical dimension in the analysis of trends in melting point data.  The Google Apps Scripts and associated Google Spreadsheet template presented here offer a quick and convenient way to provide access to both averaged values and a way of assessing confidence in an averaged value.  Performing these tasks manually is generally too time-consuming to encourage researchers to follow such a practice.  This is perhaps the reason that the current peer-review process accepts a single "trusted source" in analyses of this kind, even though such a practice inevitably leads to mis-interpretations and errors that &lt;a href="http://usefulchem.blogspot.com/2011/06/more-on-4-benzyltoluene-and-impact-of.html"&gt;cascade through the scientific literature&lt;/a&gt;.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/14568785-4742848701341284676?l=usefulchem.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://usefulchem.blogspot.com/feeds/4742848701341284676/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=14568785&amp;postID=4742848701341284676' title='3 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/14568785/posts/default/4742848701341284676'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/14568785/posts/default/4742848701341284676'/><link rel='alternate' type='text/html' href='http://usefulchem.blogspot.com/2011/07/rapid-analysis-of-melting-point-trends.html' title='Rapid analysis of melting point trends and models using Google Apps Scripts'/><author><name>Jean-Claude Bradley</name><uri>http://www.blogger.com/profile/09157209601339686455</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='31' height='32' src='http://photos1.blogger.com/img/267/3447/320/headshot.jpg'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://4.bp.blogspot.com/-ozsUnSEcIRQ/TiWArdzX-6I/AAAAAAAABEA/ROYX5zk7wDw/s72-c/mptemp1.png' height='72' width='72'/><thr:total>3</thr:total></entry><entry><id>tag:blogger.com,1999:blog-14568785.post-4721651843860172381</id><published>2011-07-14T14:52:00.001-04:00</published><updated>2011-07-14T14:55:28.508-04:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='GoogleSpreadsheet'/><category scheme='http://www.blogger.com/atom/ns#' term='google scripts'/><category scheme='http://www.blogger.com/atom/ns#' term='open notebook science'/><category scheme='http://www.blogger.com/atom/ns#' term='organic chemistry'/><title type='text'>Practical Tips on using Google Apps Scripts for Chemistry Applications</title><content type='html'>A few weeks ago I &lt;a href="http://usefulchem.blogspot.com/2011/06/google-apps-scripts-for-intuitive.html"&gt;described our use of Google Apps Scripts&lt;/a&gt;, developed by &lt;a href="http://depth-first.com/about/"&gt;Rich Apodaca&lt;/a&gt; and &lt;a href="http://en.wikipedia.org/wiki/Andrew_S.I.D._Lang"&gt;Andrew Lang&lt;/a&gt;, as an intuitive interface to information related to a chemistry laboratory notebook.  Since then we have been using these tools to actively plan and record experiments (e.g. &lt;a href="http://usefulchem.wikispaces.com/Exp269"&gt;UC-EXP269&lt;/a&gt;) and we have learned their strengths and weaknesses.&lt;br /&gt;&lt;br /&gt;The most problematic aspect of Google Apps Scripts running within Google Spreadsheets turns out to be the way caching and refreshing operate.  There does not appear to be an obvious way to refresh a single cell.  So if a script times out or fails, Google stores that failed output on their servers and will not run it again until some time has elapsed (which seems to be on the order of about an hour).  Typing in a new input for that cell will cause the script to run again but entering a previously entered input will only retrieve the cached output, even a failed output.  For example, if you have a cell calculating the MW from "benzene" entered in another cell and the script fails for any reason, typing in "ethanol" will get it to run again for the new input, but going back to "benzene" will just pull up the cached output of "Failed".&lt;br /&gt;&lt;br /&gt;Nevertheless, I did come across some tricks to force a refresh indirectly.  If you insert a row or column then re-enter the desired scripts in the new cells, they will run again.  You simply need to then delete the old column with failed outputs.  This is fine for simple sheets but it can be a headache for sheets that have several calculation dependencies between cells.&lt;br /&gt;&lt;br /&gt;To avoid these complications, simply refresh the entire sheet by duplicating it, deleting the old sheet and then renaming the new one to the original name.  The problem now is that it will refresh all the cells, not just those that had failed outputs.  And if there are a large number of scripts on that sheet the odds are good that at least one will fail on that particular attempt, especially if several are hitting the same web server.&lt;br /&gt;&lt;br /&gt;As a result of all these problems, I would not recommend using these services as I had initially hoped, where a researcher would enter data into a template sheet loaded with scripts to automatically generate a series of calculated outputs.  There is a way to achieve this end but it requires thinking about the scripts in a slightly different way.&lt;br /&gt;&lt;br /&gt;As I mentioned above, there are tricks for refreshing an entire sheet or a column or row.  In order to avoid re-running the scripts that already returned desired outputs, we need to lock them in.  This can be done by highlighting the completed cells, copying them (either control-c or Edit-&amp;gt;Copy) then &lt;span style="font-weight: bold;"&gt;pasting them as values&lt;/span&gt; (from the Edit menu).  Now refreshing will only be done on the cells with failed outputs and these can be locked in as well as soon as they complete.&lt;br /&gt;&lt;br /&gt;The downside of this approach is that you lose the information about which script was run to generate the output values.  And to change an input requires re-selecting the desired script.  But in practice it is so convenient to hit a dropdown menu and hit getMW (for example) that this downside is quite minimal, especially when contrasted with the upside of knowing that others will see your information reliably, independent of how the services are running at a particular time.&lt;br /&gt;&lt;br /&gt;Over the past few weeks we have found that some services fail more often than others and it would be advantageous to have some redundancies.  This has been particularly problematic for the &lt;a href="http://cactus.nci.nih.gov/"&gt;cactus services&lt;/a&gt; recently, which we often use for resolving common names.  By using ChemSpiderIDs (CSIDs), the cactus services can be bypassed for several of the gONS services.  So a good practice for any application is to generate and lock in SMILES and CSIDs right away from the common name.  CAS numbers can be used too but the gChem service that Rich has created sometimes yields multiple CAS numbers and these will fail as input for a subsequent script.&lt;br /&gt;&lt;br /&gt;We now have a &lt;a href="https://spreadsheets.google.com/spreadsheet/ccc?key=0Ak1R8T6wt4YQdEw4RVp3RXlLOU1faGRwampYX0FWRmc&amp;amp;hl=en_US#gid=0"&gt;chemistry Google Apps Scripts spreadsheet&lt;/a&gt; to keep track of which inputs are allowed for all the available services, along with information about the output, creator and description.  We also keep track of requests and plans for new scripts, marked as "pending" under the status field.&lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://4.bp.blogspot.com/-J02pKXgOqSI/Th7h513oM_I/AAAAAAAABDw/GAyviWPxE0U/s1600/gscripts.png"&gt;&lt;img style="display:block; margin:0px auto 10px; text-align:center;cursor:pointer; cursor:hand;width: 400px; height: 171px;" src="http://4.bp.blogspot.com/-J02pKXgOqSI/Th7h513oM_I/AAAAAAAABDw/GAyviWPxE0U/s400/gscripts.png" alt="" id="BLOGGER_PHOTO_ID_5629184968088892402" border="0" /&gt;&lt;/a&gt;&lt;br /&gt;Surprisingly, pasting images "as values" within a Google Spreadsheet cell does not ensure that they will appear consistently - often the cells are just blank upon loading.  This makes the idea of using an embedded sheet to display reaction schemes within a wiki lab notebook page not practical.  However, using the scripts and a template to generate the scheme by just typing the name, SMILES or CSID for the reactants and product is a very efficient way to generate a consistent look for schemes within a notebook.  It only requires a final step of taking the image of the screen and cropping using Paint. For example, here is a scheme thus generated for &lt;a href="http://usefulchem.wikispaces.com/exp269"&gt;UC-EXP269&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://3.bp.blogspot.com/-ESYc0E02z6E/Th7ymMmiVxI/AAAAAAAABD4/68Ms4N3PQQ0/s1600/UC-EXP269scheme.png"&gt;&lt;img style="display:block; margin:0px auto 10px; text-align:center;cursor:pointer; cursor:hand;width: 400px; height: 315px;" src="http://3.bp.blogspot.com/-ESYc0E02z6E/Th7ymMmiVxI/AAAAAAAABD4/68Ms4N3PQQ0/s400/UC-EXP269scheme.png" alt="" id="BLOGGER_PHOTO_ID_5629203322291509010" border="0" /&gt;&lt;/a&gt;&lt;br /&gt;Taking into account all of these factors, the &lt;a href="http://usefulchem.wikispaces.com/exp269"&gt;reaction template sheet&lt;/a&gt; we provide does not have by default any scripts running within cells (except for the images).  However, it is set up to quickly adapt to other reactions for planning amounts of reactants (by weight or volume), calculating concentrations, yields, melting points (experimental and predicted), solubilities, links to ChemSpider, 2D rendering of structures (including full schemes) and links to interactive NMR spectra using ChemDoodle.  It simply requires users to hit one of the 3 drop-down menus (gChem, gCDK or gONS) and select the appropriate script for a particular cell.&lt;br /&gt;&lt;br /&gt;Even if the user does not want to use this particular reaction template it still makes sense to make a copy of the template sheet because it is an easy way to copy all of the necessary Google Script without opening the editor.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/14568785-4721651843860172381?l=usefulchem.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://usefulchem.blogspot.com/feeds/4721651843860172381/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=14568785&amp;postID=4721651843860172381' title='2 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/14568785/posts/default/4721651843860172381'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/14568785/posts/default/4721651843860172381'/><link rel='alternate' type='text/html' href='http://usefulchem.blogspot.com/2011/07/practical-tips-on-using-google-apps.html' title='Practical Tips on using Google Apps Scripts for Chemistry Applications'/><author><name>Jean-Claude Bradley</name><uri>http://www.blogger.com/profile/09157209601339686455</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='31' height='32' src='http://photos1.blogger.com/img/267/3447/320/headshot.jpg'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://4.bp.blogspot.com/-J02pKXgOqSI/Th7h513oM_I/AAAAAAAABDw/GAyviWPxE0U/s72-c/gscripts.png' height='72' width='72'/><thr:total>2</thr:total></entry><entry><id>tag:blogger.com,1999:blog-14568785.post-1182745138870839237</id><published>2011-07-01T09:09:00.000-04:00</published><updated>2011-07-01T09:09:14.082-04:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='hubbub'/><category scheme='http://www.blogger.com/atom/ns#' term='open notebook science'/><category scheme='http://www.blogger.com/atom/ns#' term='hubzero'/><title type='text'>Open Notebook Science Talk at HUBbub 2011</title><content type='html'>On April 6, 2011 I presented at the &lt;a href="https://hubzehttp//www.blogger.com/img/blank.gifhttp//www.blogger.com/img/blank.gifro.org/hubbub2011"&gt;HUBzero Conference&lt;/a&gt; in Indianapolis on "&lt;span style="font-weight: bold;"&gt;Open Notebook Science: Does Transparency Work?&lt;/span&gt;".&lt;br /&gt;&lt;blockquote&gt;This presentation will first describe Open Notebook Science, the practice of making the laboratory notebook and all associated raw data available to the public in real time. Examples of current applications in organic chemistry - solubility and chemical reactions - will be detailed. Key details of the current technical implementation will be described and possible applicability to nanotechnology projects will be explored. Finally, the implications for Intellectual Property protection, claims of priority, subsequent publication in peer reviewed journals and the eventual automation of the scientific process will be explored.&lt;/blockquote&gt;The organizers did a great job in making the recording available as &lt;a href="https://hubzero.org/resources/419"&gt;either a video or audio podcast&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;I learned a great deal at the conference about how researchers from various fields use the HUBzero software to manage and share their data. As described on their &lt;a href="http://hubzero.org/"&gt;website&lt;/a&gt;:&lt;br /&gt;&lt;blockquote&gt;HUBzero® is a platform used to create dynamic web sites for scientific research and educational activities. With HUBzero, you can easily publish your research software and related educational materials on the web.&lt;/blockquote&gt; Although the system is not primarily designed for completely Open sharing, I did get the impression that for some applications there was significant interest in making data and processes more Open.  There is certainly an enthusiastic user community around HUBzero - check out the &lt;a href="https://hubzero.org/hubbub2011"&gt;recordings for some of the other talks here&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;&lt;div style="width:425px" id="__ss_7533037"&gt; &lt;strong style="display:block;margin:12px 0 4px"&gt;&lt;a href="http://www.slideshare.net/jcbradley/open-notebook-science-hubzero-2011" title="Open Notebook Science HUBzero 2011" target="_blank"&gt;Open Notebook Science HUBzero 2011&lt;/a&gt;&lt;/strong&gt; &lt;iframe src="http://www.slideshare.net/slideshow/embed_code/7533037" marginwidth="0" marginheight="0" scrolling="no" frameborder="0" height="355" width="425"&gt;&lt;/iframe&gt; &lt;div style="padding:5px 0 12px"&gt; View more &lt;a href="http://www.slideshare.net/" target="_blank"&gt;presentations&lt;/a&gt; from &lt;a href="http://www.slideshare.net/jcbradley" target="_blank"&gt;Jean-Claude Bradley&lt;/a&gt; &lt;/div&gt; &lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/14568785-1182745138870839237?l=usefulchem.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://usefulchem.blogspot.com/feeds/1182745138870839237/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=14568785&amp;postID=1182745138870839237' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/14568785/posts/default/1182745138870839237'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/14568785/posts/default/1182745138870839237'/><link rel='alternate' type='text/html' href='http://usefulchem.blogspot.com/2011/07/open-notebook-science-talk-at-hubbub.html' title='Open Notebook Science Talk at HUBbub 2011'/><author><name>Jean-Claude Bradley</name><uri>http://www.blogger.com/profile/09157209601339686455</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='31' height='32' src='http://photos1.blogger.com/img/267/3447/320/headshot.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-14568785.post-1221603377249628502</id><published>2011-06-22T19:56:00.000-04:00</published><updated>2011-06-22T19:56:40.911-04:00</updated><title type='text'>The 4-benzyltoluene melting point twist</title><content type='html'>&lt;a href="http://usefulchem.blogspot.com/2011/05/evan-curtin-is-may-2011-rsc-ons.html"&gt;Evan Curtin&lt;/a&gt; and I were in the lab this morning to follow up on our effort to &lt;a href="http://lxsrv7.oru.edu/%7Ealang/meltingpoints/meltingpointof.php?csid=62501"&gt;curate the melting point of 4-benzyltoluene&lt;/a&gt;.  I &lt;a href="http://usefulchem.blogspot.com/2011/06/more-on-4-benzyltoluene-and-impact-of.html"&gt;identified the next step&lt;/a&gt; to confirm an upper limit of -15 C:&lt;br /&gt;&lt;blockquote&gt;With the information available thus far from our experiments (&lt;a href="http://usefulchem.wikispaces.com/Exp266"&gt;UC-EXP266&lt;/a&gt;), we think it is unlikely that the +4.6 C value can be correct because we observed no solidification after 2 days at -15 C. The patent reports that solidification of some viscous mixtures took up to a full week but we did not observe an appreciable increase in viscosity for 4-benzyltoluene at -15 C. But in order to be sure we will first freeze the sample again below -40 C and let it warm up to -15 C in the freezer and confirm that it melts completely.&lt;/blockquote&gt;But when we took the sample out of the freezer after 16 days it was completely frozen!&lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://4.bp.blogspot.com/-t37zsHeyuFA/TgJzAqdRl3I/AAAAAAAABCg/HN1Z89gLZKM/s1600/UC-EXP266-pic5.png"&gt;&lt;img style="display:block; margin:0px auto 10px; text-align:center;cursor:pointer; cursor:hand;width: 267px; height: 400px;" src="http://4.bp.blogspot.com/-t37zsHeyuFA/TgJzAqdRl3I/AAAAAAAABCg/HN1Z89gLZKM/s400/UC-EXP266-pic5.png" alt="" id="BLOGGER_PHOTO_ID_5621181740146071410" border="0" /&gt;&lt;/a&gt;&lt;br /&gt;This now effectively ruled out the -30 C value and re-opened the possibility that the +4.6 C value could be the best estimate.  Learning from our &lt;a href="http://usefulchem.blogspot.com/2011/06/quest-to-determine-melting-point-of-4.html"&gt;previous failed attempt to observe a temperature plateau&lt;/a&gt; when heating the sample, this time we let it warm as slowly as possible by leaving it in an ice water bath inside of a Styrofoam container.  This worked much better as the sample warmed a few degrees over several hours.  This time Evan observed a clear transition from the solid to the liquid phase in the 4-6 C range.(&lt;a href="http://usefulchem.wikispaces.com/Exp266"&gt;UC-EXP266&lt;/a&gt;)&lt;br /&gt;&lt;br /&gt;The curation record for the melting point of 4-benzyltoluene now &lt;a href="http://lxsrv7.oru.edu/%7Ealang/meltingpoints/meltingpointof.php?csid=62501"&gt;looks like this&lt;/a&gt;:&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://1.bp.blogspot.com/-RCi5tuoKvzc/TgJ3Fj1WzgI/AAAAAAAABCo/R_9xZUfAU3Y/s1600/4-BTfinal.png"&gt;&lt;img style="display:block; margin:0px auto 10px; text-align:center;cursor:pointer; cursor:hand;width: 400px; height: 317px;" src="http://1.bp.blogspot.com/-RCi5tuoKvzc/TgJ3Fj1WzgI/AAAAAAAABCo/R_9xZUfAU3Y/s400/4-BTfinal.png" alt="" id="BLOGGER_PHOTO_ID_5621186222313885186" border="0" /&gt;&lt;/a&gt;&lt;br /&gt;When I introduce the concept of &lt;a href="http://en.wikipedia.org/wiki/Open_Notebook_Science"&gt;Open Notebook Science&lt;/a&gt; in my &lt;a href="http://usefulchem.blogspot.com/2011/06/my-talk-at-sla-on-trust-in-science-and.html"&gt;talks&lt;/a&gt; I usually make the point that &lt;span style="font-weight: bold;"&gt;there are no facts - just measurements embedded within assumptions&lt;/span&gt;.&lt;br /&gt;&lt;br /&gt;The 4-benzyltoluene melting point story is a really good example of this principle.  When I stated that I thought that "&lt;span style="font-weight:bold;"&gt;it is unlikely that the +4.6 C value can be correct because we observed no solidification after 2 days at -15 C&lt;/span&gt;", it was not the measurement that was in error - it was the interpretation.  And when &lt;a href="http://usefulchem.blogspot.com/2011/06/more-on-4-benzyltoluene-and-impact-of.html"&gt;new information came to light&lt;/a&gt;, an experiment was proposed to either challenge or further support that interpretation.  There were never any "facts" in this story (nor is the +4.6 C value a "fact" from these results).&lt;br /&gt;&lt;br /&gt;I think that this is how science functions best and most efficiently.  Unfortunately we don't usually have access to all pertinent raw measurements, assumptions and interpretations.  I would be extremely interested in seeing how the -30 C value was determined.  This is actually the value provided by the company that sold us this batch of material (as well as the PhysProp entry in the image above).  Because of slow crystallization, I can see how this could happen if the temperature was dropped until solidification was observed.  In our observations, the -30 C to -35 C range is roughly where we observed rapid solidification upon cooling. (&lt;a href="http://usefulchem.wikispaces.com/Exp266"&gt;UC-EXP266&lt;/a&gt;)&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/14568785-1221603377249628502?l=usefulchem.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://usefulchem.blogspot.com/feeds/1221603377249628502/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=14568785&amp;postID=1221603377249628502' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/14568785/posts/default/1221603377249628502'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/14568785/posts/default/1221603377249628502'/><link rel='alternate' type='text/html' href='http://usefulchem.blogspot.com/2011/06/4-benzyltoluene-melting-point-twist.html' title='The 4-benzyltoluene melting point twist'/><author><name>Jean-Claude Bradley</name><uri>http://www.blogger.com/profile/09157209601339686455</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='31' height='32' src='http://photos1.blogger.com/img/267/3447/320/headshot.jpg'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://4.bp.blogspot.com/-t37zsHeyuFA/TgJzAqdRl3I/AAAAAAAABCg/HN1Z89gLZKM/s72-c/UC-EXP266-pic5.png' height='72' width='72'/><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-14568785.post-3911544018491207636</id><published>2011-06-18T12:46:00.001-04:00</published><updated>2011-06-18T12:48:03.720-04:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='reaction attempts'/><category scheme='http://www.blogger.com/atom/ns#' term='google scripts'/><category scheme='http://www.blogger.com/atom/ns#' term='melting point'/><category scheme='http://www.blogger.com/atom/ns#' term='usefulchem'/><title type='text'>Google Apps Scripts for an intuitive interface to organic chemistry Open Notebooks</title><content type='html'>Rich Apodaca &lt;a href="http://metamolecular.com/blog/2011/05/31/gchem-convert-names-and-cas-numbers-to-chemical-structures-in-google-spreadsheets/"&gt;recently demonstrated&lt;/a&gt; how &lt;a href="http://code.google.com/googleapps/appsscript/"&gt;Google Apps Scripts&lt;/a&gt; can be added to Google Spreadsheets to enable simple calling of web services for chemistry applications (gChem).  Although we have been using web service calls from within a Google spreadsheet for some time (solubility calculation by NMR &lt;a href="http://onswebservices.wikispaces.com/NMR"&gt;link #3&lt;/a&gt; and misc chem conversions &lt;a href="http://onswebservices.wikispaces.com/misc"&gt;link #1&lt;/a&gt;), the process wasn't as intuitive as it could be because one had to find then paste lengthy urls.&lt;br /&gt;&lt;br /&gt;Rich's approach enables simply clicking the desired web service from a menu on Google Spreadsheets and these functions have simple names like getSMILES.  &lt;a href="http://en.wikipedia.org/wiki/Andrew_S.I.D._Lang"&gt;Andrew Lang&lt;/a&gt; has now added several &lt;a href="http://onswebservices.wikispaces.com/"&gt;web services from our ONS projects&lt;/a&gt; and the &lt;a href="http://en.wikipedia.org/wiki/Chemistry_Development_Kit"&gt;CDK&lt;/a&gt;.  There are now 3 menus to choose from: &lt;span style="font-weight:bold;"&gt;gChem, gCDK and gONS&lt;/span&gt;.&lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://1.bp.blogspot.com/-sJOY5Z-5_28/Tfy2-lQwvfI/AAAAAAAABCI/bBeqlSNHZvg/s1600/3menus.png"&gt;&lt;img style="display:block; margin:0px auto 10px; text-align:center;cursor:pointer; cursor:hand;width: 400px; height: 282px;" src="http://1.bp.blogspot.com/-sJOY5Z-5_28/Tfy2-lQwvfI/AAAAAAAABCI/bBeqlSNHZvg/s400/3menus.png" alt="" id="BLOGGER_PHOTO_ID_5619567621322096114" border="0" /&gt;&lt;/a&gt;&lt;br /&gt;To demonstrate the power of these tools consider the rapid construction of a &lt;a href="https://spreadsheets.google.com/spreadsheet/ccc?key=0Ak1R8T6wt4YQdHZRYXVPakxtRWZRd3AxQTNwcVdIcUE&amp;amp;hl=en_US#gid=0"&gt;customized interface&lt;/a&gt; to an experiment in a lab notebook (in this example &lt;a href="http://usefulchem.wikispaces.com/Exp263"&gt;UC-EXP263&lt;/a&gt;).&lt;br /&gt;&lt;br /&gt;1) Because Andy has added a gONS service to render images of molecules from ChemSpider, consistent &lt;a href="https://spreadsheets.google.com/spreadsheet/pub?hl=en_US&amp;amp;hl=en_US&amp;amp;key=0Ak1R8T6wt4YQdHZRYXVPakxtRWZRd3AxQTNwcVdIcUE&amp;amp;single=true&amp;amp;gid=1&amp;amp;output=html"&gt;reaction schemes&lt;/a&gt; can now be constructed from this template by &lt;span style="font-weight:bold;"&gt;simply typing the name of the reactants and products&lt;/span&gt; then embedding in the wiki.&lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://2.bp.blogspot.com/-ZjQdXHkjHww/Tfy-ShJR5iI/AAAAAAAABCQ/glH27MZS_rs/s1600/schemeUC-EXP233.jpg"&gt;&lt;img style="display:block; margin:0px auto 10px; text-align:center;cursor:pointer; cursor:hand;width: 400px; height: 121px;" src="http://2.bp.blogspot.com/-ZjQdXHkjHww/Tfy-ShJR5iI/AAAAAAAABCQ/glH27MZS_rs/s400/schemeUC-EXP233.jpg" alt="" id="BLOGGER_PHOTO_ID_5619575660395750946" border="0" /&gt;&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;2) Planning of the reaction to calculate reactant amounts and product yield can then be processed by simply typing the name of the chemicals.  Services calling molecular weight and density are automatic based on the chemical name as input.&lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://1.bp.blogspot.com/-zkjs9m7rqVQ/Tfy_ExKjodI/AAAAAAAABCY/vkSNvJOL9vo/s1600/calcsheet.png"&gt;&lt;img style="display:block; margin:0px auto 10px; text-align:center;cursor:pointer; cursor:hand;width: 400px; height: 197px;" src="http://1.bp.blogspot.com/-zkjs9m7rqVQ/Tfy_ExKjodI/AAAAAAAABCY/vkSNvJOL9vo/s400/calcsheet.png" alt="" id="BLOGGER_PHOTO_ID_5619576523689533906" border="0" /&gt;&lt;/a&gt;&lt;br /&gt;3) Typing the name of the solvent then allows easy access to the solubility properties of the reaction components.  The calculated concentrations of the reactants and product can be directly compared with their measured maximum solubility. In this experiment the observed separation of the product from the solution is consistent with these measurements.&lt;br /&gt;&lt;br /&gt;4) Both experimental and predicted melting points (using &lt;a href="http://onswebservices.wikispaces.com/meltingpoint"&gt;Model002&lt;/a&gt;) can then be lined up for comparison.  A large discrepancy between the two would flag a possible error - in this case good agreement is found.  Noting that the &lt;a href="http://lxsrv7.oru.edu/%7Ealang/meltingpoints/meltingpointof.php?csid=10400"&gt;product's melting point&lt;/a&gt; is near room temperature (53 C) explains why two layers were were observed to form during the course of the reaction and cooling to 0 C induced the product to precipitate.  Links to the melting measurements are also provided in column N for easy exploration.&lt;br /&gt;&lt;br /&gt;5) Column O provides a quick link to the ChemSpider entries for all compounds and column P provides links to the Reaction Attempts Explorer where, for example, one can explore other &lt;a href="http://showme.physics.drexel.edu/onsc/reactionattempts/advancedsearch.php?compound=10400"&gt;reactions where the product was involved&lt;/a&gt;.  Finally columns Q and R provide one click access to an &lt;a href="http://lxsrv7.oru.edu/%7Ealang/ChemDoodleWeb/spectrumwidget.php?url=http://usefulchem.wikispaces.com/file/view/UCexp263_1B_CDCl3.jdx/227835724/UCexp263_1B_CDCl3.jdxs=1"&gt;interactive NMR spectrum of the product&lt;/a&gt;, powered by ChemDoodle.&lt;br /&gt;&lt;br /&gt;The last few columns still use our older code to call web services but over time these should be added to the gONS collection for convenience.&lt;br /&gt;&lt;br /&gt;The easiest way to experiment with this interface is probably to just &lt;a href="https://spreadsheets.google.com/spreadsheet/ccc?key=0Ak1R8T6wt4YQdHZRYXVPakxtRWZRd3AxQTNwcVdIcUE&amp;amp;hl=en_US#gid=0"&gt;make a copy&lt;/a&gt; (File -&amp;gt; Make a Copy from the Google Spreadsheet menu).  The sheet can then be customized for other applications.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/14568785-3911544018491207636?l=usefulchem.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://usefulchem.blogspot.com/feeds/3911544018491207636/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=14568785&amp;postID=3911544018491207636' title='1 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/14568785/posts/default/3911544018491207636'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/14568785/posts/default/3911544018491207636'/><link rel='alternate' type='text/html' href='http://usefulchem.blogspot.com/2011/06/google-apps-scripts-for-intuitive.html' title='Google Apps Scripts for an intuitive interface to organic chemistry Open Notebooks'/><author><name>Jean-Claude Bradley</name><uri>http://www.blogger.com/profile/09157209601339686455</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='31' height='32' src='http://photos1.blogger.com/img/267/3447/320/headshot.jpg'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://1.bp.blogspot.com/-sJOY5Z-5_28/Tfy2-lQwvfI/AAAAAAAABCI/bBeqlSNHZvg/s72-c/3menus.png' height='72' width='72'/><thr:total>1</thr:total></entry><entry><id>tag:blogger.com,1999:blog-14568785.post-3381570950237470718</id><published>2011-06-16T14:43:00.001-04:00</published><updated>2011-06-16T14:43:22.156-04:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='eclipse'/><category scheme='http://www.blogger.com/atom/ns#' term='Haumea'/><category scheme='http://www.blogger.com/atom/ns#' term='astronomy'/><category scheme='http://www.blogger.com/atom/ns#' term='open science'/><title type='text'>Live Tweeting Haumea: the Open Science Ratchet at work?</title><content type='html'>&lt;a href="http://www.nasw.org/users/essreich/"&gt;Eugenie Samuel Reich&lt;/a&gt; just announced on the &lt;a href="http://blogs.nature.com/news/2011/06/killer_of_pluto_on_livetweetin.html"&gt;Nature NewsBlog&lt;/a&gt; that astronomer &lt;a href="http://en.wikipedia.org/wiki/Michael_E._Brown"&gt;Mike Brown&lt;/a&gt; &lt;a href="http://twitter.com/#%21/plutokiller"&gt;live-tweeted&lt;/a&gt; his observations of a transit of dwarf planet &lt;a href="http://en.wikipedia.org/wiki/Haumea_%28dwarf_planet%29"&gt;Haumea&lt;/a&gt; by its moon, &lt;a href="http://en.wikipedia.org/wiki/Namaka_%28moon%29"&gt;Namaka&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;About a year ago, &lt;a href="http://usefulchem.blogspot.com/2010/07/secrecy-in-astronomy-and-open-science.html"&gt;I wrote about Mike Brown&lt;/a&gt; and the &lt;a href="http://en.wikipedia.org/wiki/Controversy_over_the_discovery_of_Haumea"&gt;controversy about the discovery of Haumea&lt;/a&gt; stemming from a competitor's more aggressive data dissemination practice.  In that post I speculated that we could expect accelerated data sharing over time due to the &lt;span style="font-weight: bold;"&gt;Open Science Ratchet, where the actions of scientists that are most open set the pace for everyone else working on that particular project, regardless of their views on how secretive science should be.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;I don't know if Mike Brown has changed his views on data sharing - or if he has always felt this way but thought it was too risky until now.  Either way, he certainly is taking the lead at this point to demonstrate how radical openness can be done in astronomy!&lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://3.bp.blogspot.com/-ryMbgfbexHM/TfpN0-sXpJI/AAAAAAAABCA/aiEIw8gk8yA/s1600/haumea.png"&gt;&lt;img style="display:block; margin:0px auto 10px; text-align:center;cursor:pointer; cursor:hand;width: 323px; height: 369px;" src="http://3.bp.blogspot.com/-ryMbgfbexHM/TfpN0-sXpJI/AAAAAAAABCA/aiEIw8gk8yA/s400/haumea.png" alt="" id="BLOGGER_PHOTO_ID_5618889057675748498" border="0" /&gt;&lt;/a&gt;&lt;br /&gt;&lt;div style="text-align: center; font-style: italic;"&gt;&lt;span style="font-size:78%;"&gt;&lt;a href="http://en.wikipedia.org/wiki/Haumea_%28dwarf_planet%29"&gt;image from Wikipedia entry for Haumea&lt;/a&gt;&lt;/span&gt;&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/14568785-3381570950237470718?l=usefulchem.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://usefulchem.blogspot.com/feeds/3381570950237470718/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=14568785&amp;postID=3381570950237470718' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/14568785/posts/default/3381570950237470718'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/14568785/posts/default/3381570950237470718'/><link rel='alternate' type='text/html' href='http://usefulchem.blogspot.com/2011/06/live-tweeting-haumea-open-science.html' title='Live Tweeting Haumea: the Open Science Ratchet at work?'/><author><name>Jean-Claude Bradley</name><uri>http://www.blogger.com/profile/09157209601339686455</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='31' height='32' src='http://photos1.blogger.com/img/267/3447/320/headshot.jpg'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://3.bp.blogspot.com/-ryMbgfbexHM/TfpN0-sXpJI/AAAAAAAABCA/aiEIw8gk8yA/s72-c/haumea.png' height='72' width='72'/><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-14568785.post-760485297715084322</id><published>2011-06-16T11:28:00.001-04:00</published><updated>2011-06-16T11:33:24.106-04:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='trust'/><category scheme='http://www.blogger.com/atom/ns#' term='SLA'/><category scheme='http://www.blogger.com/atom/ns#' term='open data'/><category scheme='http://www.blogger.com/atom/ns#' term='melting point'/><category scheme='http://www.blogger.com/atom/ns#' term='SLA2011'/><title type='text'>My talk at SLA on Trust in Science and Open Melting Point Collections</title><content type='html'>On June 14 and 15, 2011 I attended the &lt;a href="http://sla2011.tornado1.com/"&gt;Special Libraries Association conference&lt;/a&gt; and made presentations on two panels on the role of trust in science with a case-study of the &lt;a href="http://usefulchem.blogspot.com/2011/06/more-on-4-benzyltoluene-and-impact-of.html"&gt;Open Melting Point collections&lt;/a&gt; that &lt;a href="http://en.wikipedia.org/wiki/Andrew_S.I.D._Lang"&gt;Andrew Lang&lt;/a&gt;, &lt;a href="http://en.wikipedia.org/wiki/Antony_John_Williams"&gt;Antony Williams&lt;/a&gt; and I have been assembling and curating.&lt;br /&gt;&lt;br /&gt;The first panel was on the "&lt;span style="font-weight: bold;"&gt;International Year of Chemistry: Perils and Promises of Modern Communication in the Sciences&lt;/span&gt;".  My colleague &lt;a href="http://www.drexel.edu/culturecomm/ccdept/faculty/souder.asp"&gt;Laurence Souder&lt;/a&gt; from the Department of Culture and Communications at Drexel presented on "&lt;span style="font-weight: bold;"&gt;Trust in Science and Science by Blogging&lt;/span&gt;", using as an example the &lt;a href="http://science.nasa.gov/science-news/science-at-nasa/2010/02dec_monolake/"&gt;NASA press release on arsenic&lt;/a&gt; replacing phosphorus in bacteria and subsequent controversy taking place in the blogosphere. (see post in &lt;a href="http://www.scientificamerican.com/blog/post.cfm?id=arsenic-bacteria-have-changed-scien-2011-06-16&amp;amp;WT.mc_id=SA_Twitter_sciam"&gt;Scientific American blog&lt;/a&gt; today)&lt;br /&gt;&lt;br /&gt;Watch Lawrence Souder's presentation &lt;a href="http://www.scivee.tv/node/31170"&gt;screencast&lt;/a&gt; and &lt;a href="http://www.slideshare.net/jcbradley/souder-trust-in-science-sla-2011"&gt;slides&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;The second panel was on "&lt;span style="font-weight:bold;"&gt;New Forms of Scholarly Communications in the Sciences&lt;/span&gt;".  Don Hagen from the &lt;a href="http://www.ntis.gov/"&gt;National Technical Information Service&lt;/a&gt; presented on "&lt;span style="font-weight: bold;"&gt;NTIS Focus on Science and Data: Open and Sustainable Models for Science Information Discovery&lt;/span&gt;" and &lt;a href="http://www.linkedin.com/in/dorotheasalo"&gt;Dorothea Salo&lt;/a&gt; discussed the evolving role of libraries and institutional repositories on scholarly communication and archiving.&lt;br /&gt;&lt;br /&gt;Watch Don Hagen's presentation &lt;a href="http://www.scivee.tv/node/31169"&gt;screencast&lt;/a&gt; and &lt;a href="http://www.slideshare.net/jcbradley/ntis-sla-15-june2011iia"&gt;slides&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;My own slides and screencast from the second panel are available below:&lt;br /&gt;&lt;br /&gt;&lt;object height="400" width="480"&gt;&lt;param name="movie" value="http://www.scivee.tv/flash/embedCast.swf"&gt;&lt;param name="allowfullscreen" value="true"&gt;&lt;param name="allowscriptaccess" value="always"&gt;&lt;param name="flashvars" value="id=31161&amp;amp;type=3"&gt;&lt;param name="wmode" value="transparent"&gt;&lt;embed src="http://www.scivee.tv/flash/embedCast.swf" allowfullscreen="true" wmode="transparent" allowscriptaccess="always" flashvars="id=31161&amp;amp;type=3" height="400" width="480"&gt;&lt;/embed&gt;&lt;/object&gt;&lt;br /&gt;&lt;br /&gt;&lt;div style="width:425px" id="__ss_8326752"&gt; &lt;strong style="display:block;margin:12px 0 4px"&gt;&lt;a href="http://www.slideshare.net/jcbradley/bradley-sla-talk-on-open-melting-point-collections" title="Bradley SLA Talk on Open Melting Point Collections"&gt;Bradley SLA Talk on Open Melting Point Collections&lt;/a&gt;&lt;/strong&gt; &lt;iframe src="http://www.slideshare.net/slideshow/embed_code/8326752" marginwidth="0" marginheight="0" frameborder="0" height="355" scrolling="no" width="425"&gt;&lt;/iframe&gt; &lt;div style="padding:5px 0 12px"&gt; View more &lt;a href="http://www.slideshare.net/"&gt;presentations&lt;/a&gt; from &lt;a href="http://www.slideshare.net/jcbradley"&gt;Jean-Claude Bradley&lt;/a&gt; &lt;/div&gt; &lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/14568785-760485297715084322?l=usefulchem.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://usefulchem.blogspot.com/feeds/760485297715084322/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=14568785&amp;postID=760485297715084322' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/14568785/posts/default/760485297715084322'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/14568785/posts/default/760485297715084322'/><link rel='alternate' type='text/html' href='http://usefulchem.blogspot.com/2011/06/my-talk-at-sla-on-trust-in-science-and.html' title='My talk at SLA on Trust in Science and Open Melting Point Collections'/><author><name>Jean-Claude Bradley</name><uri>http://www.blogger.com/profile/09157209601339686455</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='31' height='32' src='http://photos1.blogger.com/img/267/3447/320/headshot.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-14568785.post-9213244404403577209</id><published>2011-06-11T21:59:00.003-04:00</published><updated>2011-06-12T08:53:58.769-04:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='melting point'/><category scheme='http://www.blogger.com/atom/ns#' term='4-benzyltoluene'/><title type='text'>More on 4-benzyltoluene and the impact of melting point data curation and transparency</title><content type='html'>There are many motivations for performing scientific research.  One of these is the desire to advance public scientific knowledge.&lt;br /&gt;&lt;br /&gt;This is a difficult concept to quantify or even qualitatively assess.  One can try to use literature citations and impact factors but that captures only a small fraction of the true scientific impact.  For example, &lt;a href="http://usefulchem.blogspot.com/2011/02/ons-solubility-challenge-book-cited-in.html"&gt;one formal citation&lt;/a&gt; of our solubility dataset doesn't represent the &lt;a href="http://www.sitemeter.com/?a=stats&amp;amp;s=s40onschallenge&amp;amp;r=0"&gt;100,000&lt;/a&gt; anonymous solubility queries made directly to our database.  And of these the actual impact will depend on exactly how the information was used.  Egon Willighagen has identified this as a &lt;a href="http://chem-bla-ics.blogspot.com/2010/02/further-statistics-on-papers-citing-cdk.html"&gt;problem&lt;/a&gt; for the Chemistry Development Kit (CDK) as well: many more people use the CDK than reflected simply by the number of citations to the &lt;a href="http://pubs.acs.org/doi/abs/10.1021/ci025584y"&gt;original paper&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;There are a few of us who believe that curating chemistry data is a high impact activity. &lt;a href="http://en.wikipedia.org/wiki/Antony_John_Williams"&gt;Antony Williams&lt;/a&gt; spends a considerable amount of time on this activity and &lt;a href="http://www.chemconnector.com/"&gt;frequently uncovers&lt;/a&gt; very serious errors from a number of data sources.  &lt;a href="http://en.wikipedia.org/wiki/Andrew_S.I.D._Lang"&gt;Andrew Lang&lt;/a&gt; and I have put in a similar effort in collecting and curating solubility measurements openly - and recently (with Antony) we have been doing the same for &lt;a href="http://usefulchem.blogspot.com/2011/05/more-open-melting-points-from-epi-and.html"&gt;melting points&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;Although attempting to estimate the total impact of the curation activity isn't really practical, we can look at a specific and representative example to capture the scope.&lt;br /&gt;&lt;br /&gt;I recently exposed the situation with the melting point measurements of &lt;a href="http://usefulchem.blogspot.com/2011/06/quest-to-determine-melting-point-of-4.html"&gt;4-benzyltoluene&lt;/a&gt;.  In brief, the literature provided contradictory information that could not be resolved without performing an experiment.  Although an exact measurement was not found, a limit was determined that ruled out all measurements except for one.&lt;br /&gt;&lt;br /&gt;Ironically it turns out that the melting point of this compound is its most important property for industrial use!  Derivatives of diphenylmethane were sought out to replace PCBs as electrical insulating oils for capacitors because of toxicity concerns.  As described in this patent (&lt;a href="http://www.google.com/patents?id=JJUbAAAAEBAJ&amp;amp;printsec=frontcover&amp;amp;dq=5134761&amp;amp;hl=en&amp;amp;ei=x3PzTZ3nHoPMgQe3xcG3Cw&amp;amp;sa=X&amp;amp;oi=book_result&amp;amp;ct=result&amp;amp;resnum=1&amp;amp;ved=0CCkQ6AEwAA"&gt;US5134761&lt;/a&gt;), for this application one requires the oil to remain liquid down to -50 C. Another key requirement is the ability to absorb hydrogen gas liberated at the electrode surface (a solubility property).  Since this is optimal for smaller alkyl groups on the rings, it places benzyltoluene isomers at the focal point of research for this application.&lt;br /&gt;&lt;br /&gt;The patent states: "According to references, the melting points of the position isomers of benzyltoluenes are as follows..." but does not make a specific reference.  However, by comparing the numbers with other sources we can presume that the reference is the &lt;a href="http://pubs.acs.org/doi/abs/10.1021/ja01633a050"&gt;Lemneck1954&lt;/a&gt; paper I &lt;a href="http://usefulchem.blogspot.com/2011/06/quest-to-determine-melting-point-of-4.html"&gt;discussed&lt;/a&gt; previously.&lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://2.bp.blogspot.com/-vFTtdESymSU/TfO_o9ORu9I/AAAAAAAABBo/ofU5vmIKM60/s1600/mpcompare.png"&gt;&lt;img style="display:block; margin:0px auto 10px; text-align:center;cursor:pointer; cursor:hand;width: 232px; height: 257px;" src="http://2.bp.blogspot.com/-vFTtdESymSU/TfO_o9ORu9I/AAAAAAAABBo/ofU5vmIKM60/s400/mpcompare.png" alt="" id="BLOGGER_PHOTO_ID_5617043870611258322" border="0" /&gt;&lt;/a&gt;The patent then uses these melting points to &lt;span style="font-weight:bold;"&gt;calculate&lt;/span&gt; the melting behavior of &lt;span style="font-weight:bold;"&gt;mixtures of these isomers&lt;/span&gt;, as they obtain without further purification from a &lt;a href="http://en.wikipedia.org/wiki/Friedel%E2%80%93Crafts_reaction"&gt;Friedel-Crafts&lt;/a&gt; reaction.&lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://1.bp.blogspot.com/-2kF16anT7t4/TfQG_CveusI/AAAAAAAABBw/538iJdxJruo/s1600/benzyltolmix.png"&gt;&lt;img style="display:block; margin:0px auto 10px; text-align:center;cursor:pointer; cursor:hand;width: 400px; height: 243px;" src="http://1.bp.blogspot.com/-2kF16anT7t4/TfQG_CveusI/AAAAAAAABBw/538iJdxJruo/s400/benzyltolmix.png" alt="" id="BLOGGER_PHOTO_ID_5617122315375393474" border="0" /&gt;&lt;/a&gt;&lt;br /&gt;If our results are correct and the melting point of 4-benzyltoluene is not +4.6 C but well below -15 C, then the calculated properties in the patent may be significantly in error as well.  With the information available thus far from our experiments (&lt;a href="http://usefulchem.wikispaces.com/Exp266"&gt;UC-EXP266&lt;/a&gt;), we think it is unlikely that the +4.6 C value can be correct because we observed no solidification after 2 days at -15 C.  The patent reports that solidification of some viscous mixtures took up to a full week but we did not observe an appreciable increase in viscosity for 4-benzyltoluene at -15 C.  But in order to be sure we will first freeze the sample again below -40 C and let it warm up to -15 C in the freezer and confirm that it melts completely.&lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully()http://www.blogger.com/img/blank.gif;} catch(e) {}" href="http://3.bp.blogspot.com/-LNeS4QMj4gY/TfQIf9BVGcI/AAAAAAAABB4/KP4oWU4b6T4/s1600/longcryst.png"&gt;&lt;img style="display:block; margin:0px auto 10px; text-align:center;cursor:pointer; cursor:hand;width: 397px; height: 239px;" src="http://3.bp.blogspot.com/-LNeS4QMj4gY/TfQIf9BVGcI/AAAAAAAABB4/KP4oWU4b6T4/s400/longcryst.png" alt="" id="BLOGGER_PHOTO_ID_5617123980286958018" border="0" /&gt;&lt;/a&gt;&lt;br /&gt;It is in light of this analysis that I make the case that open curation of melting point data is likely to be a high impact activity relative to the amount of time required to perform it.  The problem is that errors such as these cascade through the scientific record and likely retard scientific progress by causing confusion and wasted effort.  Consider the total cost in terms of research and legal fees for just one patent.  As I &lt;a href="http://usefulchem.blogspot.com/2011/03/validating-melting-point-data-from-alfa.html"&gt;discussed previously&lt;/a&gt;, consider the effect of compromised and contradictory data now known to exist within training sets on the pace of developing reliable melting point models (cascading down to solubility models dependent upon melting point predictions or measurements - and ultimately cascading to the efficiency of drug design).&lt;br /&gt;&lt;br /&gt;It is important to note that the benefits of curation would be greatly diminished without the component of transparency.  &lt;span style="font-weight: bold;"&gt;We are not claiming to provide a "trusted source" of melting point data.&lt;/span&gt;  There is no such thing - and operating under the illusion of the trusted source model has resulted in the mess we are in now - with multiple melting point values for the same compound cascading and multiplying to different databases (a good and still unresolved example is &lt;a href="http://lxsrv7.oru.edu/%7Ealang/meltingpoints/meltingpointof.php?csid=7223"&gt;benzylamine&lt;/a&gt;).&lt;br /&gt;&lt;br /&gt;What we are doing is reporting all the sources we can use and marking some sources as DONOTUSE so they are not included in the calculation of the average - with an explanation.  We &lt;span style="font-weight: bold;"&gt;never delete data&lt;/span&gt; so users can make informed choices and not be in a position of having to trust our judgement. If someone does not agree with me that failure to freeze after 2 days at -15 C does not necessarily rule out the +4.6 C value for the melting point for 4-benzyltoluene then they are &lt;a href="http://lxsrv7.oru.edu/%7Ealang/meltingpoints/meltingpointof.php?csid=62501"&gt;free to use it&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;Using a trusted source model, all values within a collection are equally valid.  In the transparency model not all values are equal - we are justifiably more confident in a melting point value near &lt;a href="http://lxsrv7.oru.edu/%7Ealang/meltingpoints/meltingpointof.php?csid=682"&gt;-114 C for ethanol&lt;/a&gt; than for a melting point with a single source (&lt;a href="http://lxsrv7.oru.edu/%7Ealang/meltingpoints/meltingpointof.php?csid=60237"&gt;like this compound&lt;/a&gt;).&lt;br /&gt;&lt;br /&gt;And finally, an important factor for having an impact on science is discoverability. It is likely that someone doing research involving the melting behavior of 4-benzyltoluene would perform at least quick Google search.  What they are likely to find is not just a simple number without provenance but rather a &lt;a href="http://www.google.com/search?q=4-benzyltoluene+melting+point"&gt;collection of results capturing the full subtlety of the situation under discussion&lt;/a&gt;. This is a natural outcome of working transparently.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/14568785-9213244404403577209?l=usefulchem.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://usefulchem.blogspot.com/feeds/9213244404403577209/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=14568785&amp;postID=9213244404403577209' title='2 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/14568785/posts/default/9213244404403577209'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/14568785/posts/default/9213244404403577209'/><link rel='alternate' type='text/html' href='http://usefulchem.blogspot.com/2011/06/more-on-4-benzyltoluene-and-impact-of.html' title='More on 4-benzyltoluene and the impact of melting point data curation and transparency'/><author><name>Jean-Claude Bradley</name><uri>http://www.blogger.com/profile/09157209601339686455</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='31' height='32' src='http://photos1.blogger.com/img/267/3447/320/headshot.jpg'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://2.bp.blogspot.com/-vFTtdESymSU/TfO_o9ORu9I/AAAAAAAABBo/ofU5vmIKM60/s72-c/mpcompare.png' height='72' width='72'/><thr:total>2</thr:total></entry><entry><id>tag:blogger.com,1999:blog-14568785.post-2073003435413234979</id><published>2011-06-10T14:32:00.000-04:00</published><updated>2011-06-10T14:32:29.316-04:00</updated><title type='text'>Open Melting Points on iPhone via MMDS</title><content type='html'>As Alex Clark &lt;a href="http://cheminf20.org/2011/06/09/melting-point-prediction-for-mmds-by-way-of-open-notebook-science/"&gt;explained on his blog Cheminformatics 2.0&lt;/a&gt;, both predicted and experimental melting points from our &lt;a href="http://onswebservices.wikispaces.com/meltingpoint"&gt;Open Data collection&lt;/a&gt; are now available on iPhones via his &lt;a href="http://molmatinf.com/products.html"&gt;MMDS webservices protocol&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://4.bp.blogspot.com/-QBXmQ9OPcoI/TfJdo6dwMXI/AAAAAAAABBY/2OVR03idohY/s1600/mmds_ons1.png"&gt;&lt;img style="display:block; margin:0px auto 10px; text-align:center;cursor:pointer; cursor:hand;width: 206px; height: 400px;" src="http://4.bp.blogspot.com/-QBXmQ9OPcoI/TfJdo6dwMXI/AAAAAAAABBY/2OVR03idohY/s400/mmds_ons1.png" alt="" id="BLOGGER_PHOTO_ID_5616654642754630002" border="0" /&gt;&lt;/a&gt;&lt;br /&gt;Although the app is not free, the &lt;a href="http://lxsrv7.oru.edu/%7Ealang/mmds/"&gt;web service&lt;/a&gt; (&lt;a href="http://onswebservices.wikispaces.com/meltingpoint"&gt;#7 from our collection&lt;/a&gt;) that &lt;a href="http://en.wikipedia.org/wiki/Andrew_S.I.D._Lang"&gt;Andrew Lang&lt;/a&gt; and Alex created for this purpose is Open and available for anyone to use.  It reads an XML formatted molfile and returns the average measured melting point, &lt;a href="http://onschallenge.wikispaces.com/MeltingPointModel002"&gt;predicted melting point&lt;/a&gt;, SMILES, CSID and a link to the ChemSpider entry.&lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://2.bp.blogspot.com/-nAHGLsNP6gU/TfJgz2jawDI/AAAAAAAABBg/ibuLVwDqRpU/s1600/mpserv7.png"&gt;&lt;img style="display:block; margin:0px auto 10px; text-align:center;cursor:pointer; cursor:hand;width: 400px; height: 307px;" src="http://2.bp.blogspot.com/-nAHGLsNP6gU/TfJgz2jawDI/AAAAAAAABBg/ibuLVwDqRpU/s400/mpserv7.png" alt="" id="BLOGGER_PHOTO_ID_5616658129218093106" border="0" /&gt;&lt;/a&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/14568785-2073003435413234979?l=usefulchem.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://usefulchem.blogspot.com/feeds/2073003435413234979/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=14568785&amp;postID=2073003435413234979' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/14568785/posts/default/2073003435413234979'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/14568785/posts/default/2073003435413234979'/><link rel='alternate' type='text/html' href='http://usefulchem.blogspot.com/2011/06/open-melting-points-on-iphone-via-mmds.html' title='Open Melting Points on iPhone via MMDS'/><author><name>Jean-Claude Bradley</name><uri>http://www.blogger.com/profile/09157209601339686455</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='31' height='32' src='http://photos1.blogger.com/img/267/3447/320/headshot.jpg'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://4.bp.blogspot.com/-QBXmQ9OPcoI/TfJdo6dwMXI/AAAAAAAABBY/2OVR03idohY/s72-c/mmds_ons1.png' height='72' width='72'/><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-14568785.post-7037557318602508483</id><published>2011-06-09T19:57:00.000-04:00</published><updated>2011-06-09T19:57:49.107-04:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='melting point'/><title type='text'>The quest to determine the melting point of 4-benzyltoluene</title><content type='html'>I recently &lt;a href="http://usefulchem.blogspot.com/2011/05/more-open-melting-points-from-epi-and.html"&gt;reported&lt;/a&gt; that we are attempting to curate the open melting point measurements collected from  multiple sources such as &lt;a href="http://www.alfa.com/"&gt;Alfa Aesar&lt;/a&gt;, &lt;a href="http://www.srcinc.com/what-we-do/product.aspx?id=133"&gt;PhysProp (EPIsuite)&lt;/a&gt; and several smaller collections.  I mentioned that some values - like &lt;a href="http://lxsrv7.oru.edu/%7Ealang/meltingpoints/meltingpointof.php?csid=7223"&gt;benzylamine&lt;/a&gt; - simply don't converge and the only way to resolve the issue is to actually get a high purity sample and do a measurement.&lt;br /&gt;&lt;br /&gt;Since that report, we found another non-converging situation with &lt;a href="http://lxsrv7.oru.edu/%7Ealang/meltingpoints/meltingpointof.php?csid=62501"&gt;4-benzyltoluene&lt;/a&gt;.  As shown below, reported measurements range from -30 C to 125C.&lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://1.bp.blogspot.com/-JsKUxaNiK0c/TfD7YN4uVQI/AAAAAAAABAw/37-psMuP1M4/s1600/4btmplist.jpg"&gt;&lt;img style="display:block; margin:0px auto 10px; text-align:center;cursor:pointer; cursor:hand;width: 400px; height: 300px;" src="http://1.bp.blogspot.com/-JsKUxaNiK0c/TfD7YN4uVQI/AAAAAAAABAw/37-psMuP1M4/s400/4btmplist.jpg" alt="" id="BLOGGER_PHOTO_ID_5616265128794019074" border="0" /&gt;&lt;/a&gt;The values in red have been removed from the calculation of the average based on evidence we obtained from ordering the compound from &lt;a href="http://www.transworldchemicals.com/"&gt;TransWorld Chemicals&lt;/a&gt; and observing its behavior when exposed to various temperatures.  The details can be found from &lt;a href="http://usefulchem.wikispaces.com/Exp266"&gt;UC-EXP266&lt;/a&gt; (which I performed with &lt;a href="http://usefulchem.blogspot.com/2011/05/evan-curtin-is-may-2011-rsc-ons.html"&gt;Evan Curtin&lt;/a&gt;).&lt;br /&gt;&lt;br /&gt;Immediately after opening the package it was clear that the compound was a liquid and thus the 125C and 98.5C values became improbable enough to remove.&lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://2.bp.blogspot.com/-JHSJ3U1j3Y8/Te6CfTM4I9I/AAAAAAAABAQ/qYEJAGHccxM/s1600/UC-EXP266-pic1.png"&gt;&lt;img style="display:block; margin:0px auto 10px; text-align:center;cursor:pointer; cursor:hand;width: 400px; height: 218px;" src="http://2.bp.blogspot.com/-JHSJ3U1j3Y8/Te6CfTM4I9I/AAAAAAAABAQ/qYEJAGHccxM/s400/UC-EXP266-pic1.png" alt="" id="BLOGGER_PHOTO_ID_5615569259619427282" border="0" /&gt;&lt;/a&gt;&lt;br /&gt;First  Evan Curtin and I dropped the still sealed bottle into an ice bath (0C) and after 10 minutes there was no trace of solidification.&lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://3.bp.blogspot.com/-UCyekRjiYQM/Te6CzZNFuNI/AAAAAAAABAY/LRGNVcbu2zQ/s1600/UC-EXP266-pic2.png"&gt;&lt;img style="display:block; margin:0px auto 10px; text-align:center;cursor:pointer; cursor:hand;width: 267px; height: 400px;" src="http://3.bp.blogspot.com/-UCyekRjiYQM/Te6CzZNFuNI/AAAAAAAABAY/LRGNVcbu2zQ/s400/UC-EXP266-pic2.png" alt="" id="BLOGGER_PHOTO_ID_5615569604828313810" border="0" /&gt;&lt;/a&gt;&lt;br /&gt;At this point, this does not necessarily rule out the values near 5C because of the short time in the bath.&lt;br /&gt;&lt;br /&gt;We then used an acetone/dry ice bath and did see a rapid and clear solidification after reaching -30C to -35C.&lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://2.bp.blogspot.com/-aQdHVwT0OAk/Te6DBsx7P8I/AAAAAAAABAg/Kv3dBIgClLg/s1600/UC-EXP266-pic3.png"&gt;&lt;img style="display:block; margin:0px auto 10px; text-align:center;cursor:pointer; cursor:hand;width: 267px; height: 400px;" src="http://2.bp.blogspot.com/-aQdHVwT0OAk/Te6DBsx7P8I/AAAAAAAABAg/Kv3dBIgClLg/s400/UC-EXP266-pic3.png" alt="" id="BLOGGER_PHOTO_ID_5615569850601258946" border="0" /&gt;&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;Letting the bath temperature rise it was difficult to tell what was happening but there seemed to be some liquefaction  around -12C.&lt;br /&gt;&lt;br /&gt;In order to get a more precise measurement, we transferred about 2 mls of the sample into a test tube and introduced the &lt;span style="font-weight: bold;"&gt;thermometer directly in contact with the substance&lt;/span&gt;.  After quickly freezing the contents in a dry ice/acetone bath, the sample was removed and its behavior was observed over time, as shown below.&lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://4.bp.blogspot.com/-yEE727HXizs/Te6DT57s9UI/AAAAAAAABAo/a3kDM1HwjPE/s1600/UC-EXP266-pic4.png"&gt;&lt;img style="display:block; margin:0px auto 10px; text-align:center;cursor:pointer; cursor:hand;width: 243px; height: 400px;" src="http://4.bp.blogspot.com/-yEE727HXizs/Te6DT57s9UI/AAAAAAAABAo/a3kDM1HwjPE/s400/UC-EXP266-pic4.png" alt="" id="BLOGGER_PHOTO_ID_5615570163369571650" border="0" /&gt;&lt;/a&gt;&lt;br /&gt;I was expecting to see the internal temperature rise then plateau at the melting point until all the solid disappeared and then finally observe a second temperature rise.  This comes from experience in making 0C baths within minutes by simply throwing ice into pure water.&lt;br /&gt;&lt;br /&gt;As shown above that is not at all what happened.  The liquid formed gradually starting at about -9C and never reached a plateau even up to +7C, where there was still much solid left.&lt;br /&gt;&lt;br /&gt;If we look at the method used to generate the 4.58 C value (&lt;a href="http://pubs.acs.org/doi/abs/10.1021/ja01633a050"&gt;Lamneck1954&lt;/a&gt;) we find that a similar method was cited - but not actually described there.  The actual curves are not available either.  However, this paper provides melting points for several compounds within a series, which is often useful for spotting possible errors - unless of course these are systematic errors.  In this particular case it doesn't help much because the 2-methyl derivative is similar but the 3-methyl analogue is very close to -30 C value listed in our sources.&lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://1.bp.blogspot.com/-560ZVJilb78/TfEEGlWHHeI/AAAAAAAABBI/lUnggS9JBio/s1600/Lamneck54fig.jpg"&gt;&lt;img style="display:block; margin:0px auto 10px; text-align:center;cursor:pointer; cursor:hand;width: 234px; height: 188px;" src="http://1.bp.blogspot.com/-560ZVJilb78/TfEEGlWHHeI/AAAAAAAABBI/lUnggS9JBio/s400/Lamneck54fig.jpg" alt="" id="BLOGGER_PHOTO_ID_5616274721458298338" border="0" /&gt;&lt;/a&gt;Notice that one of the "melting points"  (3-methyldicyclohexylmethane) is &lt;span style="font-weight: bold;"&gt;not even measurable&lt;/span&gt; because it forms a glass.  It is easy to see how melting points below room temperature can generate very different values - and very difficult to assess if the full experimental details of the measurements are not reported.&lt;br /&gt;&lt;br /&gt;Trying to get at more details lets look at the referenced paper (&lt;a href="http://pubs.acs.org/doi/abs/10.1021/ja01163a075"&gt;Goodman1950&lt;/a&gt;).  Indeed the researchers determine the melting point by plotting the temperature over time as the sample is heated and looking for a plateau.  The obvious difference is that the heating rate is about an order of magnitude slower than in our experiment.&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://4.bp.blogspot.com/-QpwPTskfJco/TfEBhN_odAI/AAAAAAAABA4/lJwjX_UoN-M/s1600/mpcurve1.jpg"&gt;&lt;img style="display:block; margin:0px auto 10px; text-align:center;cursor:pointer; cursor:hand;width: 400px; height: 297px;" src="http://4.bp.blogspot.com/-QpwPTskfJco/TfEBhN_odAI/AAAAAAAABA4/lJwjX_UoN-M/s400/mpcurve1.jpg" alt="" id="BLOGGER_PHOTO_ID_5616271880511583234" border="0" /&gt;&lt;/a&gt;This paper also highlights the fact that there are more twists and turns in the melting point story.  One compound (&lt;a href="http://www.chemspider.com/Chemical-Structure.126310.html"&gt;2-butylbiphenyl&lt;/a&gt;) was found to have &lt;span style="font-weight: bold;"&gt;2 melting points&lt;/span&gt; that can be observed by seeding with different polymorphic crystals.&lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://4.bp.blogspot.com/-SOImwhN5Xhk/TfFT4FY46NI/AAAAAAAABBQ/FEG7YmyeCgo/s1600/2mps.jpg"&gt;&lt;img style="display:block; margin:0px auto 10px; text-align:center;cursor:pointer; cursor:hand;width: 400px; height: 195px;" src="http://4.bp.blogspot.com/-SOImwhN5Xhk/TfFT4FY46NI/AAAAAAAABBQ/FEG7YmyeCgo/s400/2mps.jpg" alt="" id="BLOGGER_PHOTO_ID_5616362433292069074" border="0" /&gt;&lt;/a&gt;&lt;br /&gt;At this point, our objective of obtaining an actual melting point was replaced with trying to at least mark a reasonably confident upper limit.  &lt;span style="font-weight: bold;"&gt;After leaving the sample at -15 C in a freezer for two days, no solidification was observed&lt;/span&gt; - not even an appreciable increase in viscosity.  For this reason, all melting point values above -15C were removed from the calculation of the average and &lt;a href="http://lxsrv7.oru.edu/%7Ealang/meltingpoints/meltingpointof.php?csid=62501"&gt;show up in red&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;With only the -30 C measurement left, this is now the default value for 4-benzyltoluene - until further experimentation.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/14568785-7037557318602508483?l=usefulchem.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://usefulchem.blogspot.com/feeds/7037557318602508483/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=14568785&amp;postID=7037557318602508483' title='2 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/14568785/posts/default/7037557318602508483'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/14568785/posts/default/7037557318602508483'/><link rel='alternate' type='text/html' href='http://usefulchem.blogspot.com/2011/06/quest-to-determine-melting-point-of-4.html' title='The quest to determine the melting point of 4-benzyltoluene'/><author><name>Jean-Claude Bradley</name><uri>http://www.blogger.com/profile/09157209601339686455</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='31' height='32' src='http://photos1.blogger.com/img/267/3447/320/headshot.jpg'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://1.bp.blogspot.com/-JsKUxaNiK0c/TfD7YN4uVQI/AAAAAAAABAw/37-psMuP1M4/s72-c/4btmplist.jpg' height='72' width='72'/><thr:total>2</thr:total></entry><entry><id>tag:blogger.com,1999:blog-14568785.post-5163971830821461543</id><published>2011-05-25T21:11:00.000-04:00</published><updated>2011-05-25T21:12:05.491-04:00</updated><title type='text'>More Open Melting Points from EPI and other sources: on the path to ultimate curation</title><content type='html'>As recently as 2008, Hughes et al published a paper asking: &lt;a href="http://pubs.acs.org/doi/abs/10.1021/ci700307p"&gt;Why Are Some Properties More Difficult To Predict than Others? A Study of QSPR of Solubility, Melting Point, and Log P&lt;/a&gt;&lt;br /&gt;&lt;blockquote&gt;The question then is: &lt;span style="font-weight: bold;"&gt;why do QSPR models consistently perform significantly worse with regard to melting point?&lt;/span&gt;  In the Introduction, we proposed three reasons for the failure of QSPR models: problems with the data, the descriptors, or the modeling methods. &lt;span style="font-weight:bold;"&gt;We find issues with the data unlikely to be the only source of error in Log S, Tm, and Log P predictions&lt;/span&gt;. Although the accuracy of the data provides a fundamental limit on the quality of a QSPR model, we attempted to minimize its influence by &lt;span style="font-weight: bold;"&gt;selecting consistent, high quality data&lt;/span&gt;... With regards to the accuracy of Tm and Log P data, both properties are associated with smaller errors than Log S measurement. &lt;span style="font-weight: bold;"&gt;Moreover, the melting point model performed the worst, yet it is by far the most straightforward property to measure&lt;/span&gt;...We suggest that the failure of existing chemoinformatics descriptors adequately to describe interactions in the crystalline solid phase may be a significant cause of error in melting point prediction.&lt;/blockquote&gt;Indeed, I have often heard that melting point prediction is notoriously difficult.  This paper attempted to discover why and suggested that it is more likely that the problem is related to a deficiency in available descriptors rather than data quality.  The authors seem to argue that taking a melting point is so straightforward that the resulting dataset is almost self-evidently high quality.&lt;br /&gt;&lt;br /&gt;I might have thought the same before we started collecting &lt;a href="http://onschallenge.wikispaces.com/Open+Melting+Point+Datasets"&gt;melting point datasets&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;It turns out that validating melting points can be very challenging and we have found enormous errors - even cases where the &lt;a href="http://usefulchem.blogspot.com/2011/03/validating-melting-point-data-from-alfa.html"&gt;same compound in the same dataset is assigned very different melting points&lt;/a&gt;.  Under such conditions it is mathematically impossible to obtain high correlations between predicted and "measured" values.&lt;br /&gt;&lt;br /&gt;Since we have no additional information to go on (no spectral proof of purity, reports of heating rate, observations of melting behavior, etc.) the only way we can validate data points is to look for strong convergence from multiple sources.   For example, consider the -130 C value for the melting point of ethanol (as &lt;a href="http://usefulchem.blogspot.com/2011/03/validating-melting-point-data-from-alfa.html"&gt;discussed previously&lt;/a&gt; in detail).  It is clearly an outlier from the very closely clustered values near -114 C.&lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://4.bp.blogspot.com/-gl5eY6p1tas/TdxDSE6Vi-I/AAAAAAAAA_U/gxL6J3-kspo/s1600/ethanolcurated.jpg"&gt;&lt;img style="display:block; margin:0px auto 10px; text-align:center;cursor:pointer; cursor:hand;width: 400px; height: 333px;" src="http://4.bp.blogspot.com/-gl5eY6p1tas/TdxDSE6Vi-I/AAAAAAAAA_U/gxL6J3-kspo/s400/ethanolcurated.jpg" alt="" id="BLOGGER_PHOTO_ID_5610433213631925218" border="0" /&gt;&lt;/a&gt;&lt;br /&gt;This outlier value is now highlighted in red to indicate that it was explicitly identified to not be used in calculating the average.  Andrew Lang has now updated the melting point explorer to allow a convenient way to select or deselect outliers and indicate a reason (&lt;a href="http://onswebservices.wikispaces.com/meltingpoint"&gt;service #3&lt;/a&gt;).  For large separate datasets - such as the Alfa Aesar collection - this can be done right on the melting point explorer interface with a click.  For values recorded in the &lt;a href="http://onswebservices.wikispaces.com/ChemInfoValidation"&gt;Chemical Information Validation&lt;/a&gt; sheet, one has to update the spreadsheet directly.&lt;br /&gt;&lt;br /&gt;This is the same &lt;a href="http://precedings.nature.com/documents/4918/version/1/files/npre20104918-1.pdf"&gt;strategy that we used for our solubility data&lt;/a&gt; - in that case by marking outliers with "DONOTUSE".  This way, we never delete data so that anyone can question our decision to exclude data points.  Also by not deleting data, meaningful statistical analyses of the quality of currently available chemical information can be performed for a variety of applications.&lt;br /&gt;&lt;br /&gt;The donation of the &lt;a href="http://usefulchem.blogspot.com/2011/02/alfa-aesar-melting-point-data-now.html"&gt;Alfa Aesar dataset&lt;/a&gt; to the public domain was instrumental in allowing us to start systematically validating or excluding data points for practical or modeling applications.  We have also just received confirmation that the entire &lt;span style="font-weight: bold;"&gt;&lt;a href="http://esc.syrres.com/interkow/EpiSuiteData.htm"&gt;EPI (PhysProp) melting point dataset &lt;/a&gt;can be used as Open Data&lt;/span&gt;.  Many thanks to Antony Williams for coordinating this agreement and for approval and advice from Bob Boethling at the EPA and Bill Meylan at SRC.&lt;br /&gt;&lt;br /&gt;In the best case scenario, most of the melting point values will quickly converge as in the ethanol case above.  However, we have also observed cases where convergence simply doesn't happen.&lt;br /&gt;&lt;br /&gt;Consider the collection of &lt;a href="http://lxsrv7.oru.edu/%7Ealang/meltingpoints/meltingpointof.php?csid=7223"&gt;reported melting points for benzylamine&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://2.bp.blogspot.com/-4HLq9a3HxJE/Tdxw1lpNOBI/AAAAAAAAA_c/XjWdyfT78jE/s1600/benzylaminesum.jpg"&gt;&lt;img style="display:block; margin:0px auto 10px; text-align:center;cursor:pointer; cursor:hand;width: 395px; height: 400px;" src="http://2.bp.blogspot.com/-4HLq9a3HxJE/Tdxw1lpNOBI/AAAAAAAAA_c/XjWdyfT78jE/s400/benzylaminesum.jpg" alt="" id="BLOGGER_PHOTO_ID_5610483301737117714" border="0" /&gt;&lt;/a&gt;&lt;br /&gt;One has to be careful when determining how many "different" values are in this collection. Identical values are suspicious since they may very well originate from the same ultimate source.  Convergence for the ethanol value above is credible because most of the values are very close but not completely identical, suggesting truly independent measurements.&lt;br /&gt;&lt;br /&gt;In this case values actually diverge into sources of either +10 C, - 10 C, -30 C or about -45 C.  If you want to play the "trusted source" game, do you trust more the Sigma-Aldrich value at +10C or the Alfa Aesar value at -43 C?&lt;br /&gt;&lt;br /&gt;Lets try looking at the peer-reviewed literature.  A search on SciFinder gives the following ranges:&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://1.bp.blogspot.com/-7fsMO_eYWBU/Tdx3SJp6wXI/AAAAAAAAA_k/wRvOGwidNlY/s1600/benzylaminemp.jpg"&gt;&lt;img style="display:block; margin:0px auto 10px; text-align:center;cursor:pointer; cursor:hand;width: 400px; height: 316px;" src="http://1.bp.blogspot.com/-7fsMO_eYWBU/Tdx3SJp6wXI/AAAAAAAAA_k/wRvOGwidNlY/s400/benzylaminemp.jpg" alt="" id="BLOGGER_PHOTO_ID_5610490389509882226" border="0" /&gt;&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;The &lt;span style="font-weight:bold;"&gt;lowest&lt;/span&gt; melting point listed there is the +10C value we already have in our collection but these references are to other databases.  The &lt;span style="font-weight:bold;"&gt;lowest&lt;/span&gt; value from a &lt;span style="font-weight:bold;"&gt;peer-reviewed paper&lt;/span&gt; is 37-38 C.&lt;br /&gt;&lt;br /&gt;This is strange because I have a bottle of benzylamine in my lab and it is definitely a liquid.  Investigating the individual references reveals a variety of errors.  In one, benzylamine is listed as a product but from the context of the reaction it should be phenylbenzylamine:&lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://3.bp.blogspot.com/-157Dm20TI0k/Td2RiT6ihYI/AAAAAAAAA_s/_uU7n0EZlbE/s1600/benzylaminempkametani.jpg"&gt;&lt;img style="display:block; margin:0px auto 10px; text-align:center;cursor:pointer; cursor:hand;width: 400px; height: 183px;" src="http://3.bp.blogspot.com/-157Dm20TI0k/Td2RiT6ihYI/AAAAAAAAA_s/_uU7n0EZlbE/s400/benzylaminempkametani.jpg" alt="" id="BLOGGER_PHOTO_ID_5610800729420891522" border="0" /&gt;&lt;/a&gt;&lt;br /&gt;(In a strange co-incidence the actual intermediate - &lt;a href="http://www.chemspider.com/Chemical-Structure.10400.html"&gt;benzalaniline&lt;/a&gt; - is the imine that &lt;a href="http://usefulchem.blogspot.com/2011/05/evan-curtin-is-may-2011-rsc-ons.html"&gt;Evan Curtain has synthesized&lt;/a&gt; recently in order to measure its solubility)&lt;br /&gt;&lt;br /&gt;In another example, the melting point of a product is incorrectly associated with the &lt;span style="font-weight:bold;"&gt;reactant&lt;/span&gt; benzylamine:&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://2.bp.blogspot.com/-wHwlDU_pXgA/Td2SfAxKM5I/AAAAAAAAA_0/8ITR1LkleJY/s1600/benzylaminebad2.jpg"&gt;&lt;img style="display:block; margin:0px auto 10px; text-align:center;cursor:pointer; cursor:hand;width: 400px; height: 168px;" src="http://2.bp.blogspot.com/-wHwlDU_pXgA/Td2SfAxKM5I/AAAAAAAAA_0/8ITR1LkleJY/s400/benzylaminebad2.jpg" alt="" id="BLOGGER_PHOTO_ID_5610801772253295506" border="0" /&gt;&lt;/a&gt;&lt;br /&gt;The erroneous melting points range all the way up to 280 C and I suspect that many of these are for salts of benzylamine, as I &lt;a href="http://usefulchem.blogspot.com/2009/11/cas-curates-strychnine-mp-cheminfo.html"&gt;reported previously for the strychnine&lt;/a&gt; melting point results from SciFinder.&lt;br /&gt;&lt;br /&gt;With no other obvious recourse from the literature to resolve this issue, Evan attempted to freeze a sample of benzylamine from our lab.(&lt;a href="http://usefulchem.wikispaces.com/Exp265"&gt;UC-EXP265&lt;/a&gt;)&lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://2.bp.blogspot.com/-Pi0LPBUSW6M/Td2bL9crz-I/AAAAAAAABAE/NKCOLDFlJgY/s1600/UCEXP265-pic1.jpg"&gt;&lt;img style="display:block; margin:0px auto 10px; text-align:center;cursor:pointer; cursor:hand;width: 365px; height: 400px;" src="http://2.bp.blogspot.com/-Pi0LPBUSW6M/Td2bL9crz-I/AAAAAAAABAE/NKCOLDFlJgY/s400/UCEXP265-pic1.jpg" alt="" id="BLOGGER_PHOTO_ID_5610811340549246946" border="0" /&gt;&lt;/a&gt;&lt;br /&gt;Unfortunately, the benzylamine sample proved to be too impure (&amp;lt;85% by NMR) and didn't solidify even down to -78 C.  We'll have to try again from a much more pure source.  It would be useful to get reports from a few labs who happen to have benzylamine handy and provide proof of purity by NMR and a pic to demonstrate solidification.&lt;br /&gt;&lt;br /&gt;As most organic chemists will attest, amines are notorious for appearing as oils below their melting points in the presence of small amounts of impurities.  I wonder if the divergence of melting points in this case is due to this effect.  By providing NMR data from various samples subjected to freezing, it might be possible to quantify the effect of purity on the apparent freezing point.  I think the images of the solidification are also important because I think that some may mistake very high viscosity with actual formation of a solid.  At -78 C we observed the sample to exhibit a viscosity similar to that of syrup.&lt;br /&gt;&lt;br /&gt;Our model &lt;a href="http://lxsrv7.oru.edu/%7Ealang/meltingpoints/meltingpointof.php?csid=7223"&gt;predicts a melting point of about -38 C&lt;/a&gt; for benzylamine and so I suspect that the values of -43 C and -46 C are most likely to be close to the correct range.  Lets find out.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/14568785-5163971830821461543?l=usefulchem.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://usefulchem.blogspot.com/feeds/5163971830821461543/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=14568785&amp;postID=5163971830821461543' title='2 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/14568785/posts/default/5163971830821461543'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/14568785/posts/default/5163971830821461543'/><link rel='alternate' type='text/html' href='http://usefulchem.blogspot.com/2011/05/more-open-melting-points-from-epi-and.html' title='More Open Melting Points from EPI and other sources: on the path to ultimate curation'/><author><name>Jean-Claude Bradley</name><uri>http://www.blogger.com/profile/09157209601339686455</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='31' height='32' src='http://photos1.blogger.com/img/267/3447/320/headshot.jpg'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://4.bp.blogspot.com/-gl5eY6p1tas/TdxDSE6Vi-I/AAAAAAAAA_U/gxL6J3-kspo/s72-c/ethanolcurated.jpg' height='72' width='72'/><thr:total>2</thr:total></entry><entry><id>tag:blogger.com,1999:blog-14568785.post-3724301614475456038</id><published>2011-05-10T10:33:00.001-04:00</published><updated>2011-05-10T10:33:55.953-04:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='open notebook science'/><category scheme='http://www.blogger.com/atom/ns#' term='acfas'/><title type='text'>La Science par Cahier de Laboratoire Ouvert à l'Acfas</title><content type='html'>On May 9, 2011 I presented remotely for the &lt;a href="http://en.wikipedia.org/wiki/Association_francophone_pour_le_savoir"&gt;French-Canadian Association for the Advancement of Science&lt;/a&gt; (&lt;a href="http://www.acfas.ca/"&gt;ACFAS&lt;/a&gt;).  This was the first time I gave a talk about &lt;a href="http://en.wikipedia.org/wiki/Open_Notebook_Science"&gt;Open Notebook Science&lt;/a&gt; in French. In fact the last time I gave a scientific talk in French was probably in 1995, when I was doing a postdoc at the &lt;a href="http://en.wikipedia.org/wiki/Coll%C3%A8ge_de_France"&gt;Collège de France&lt;/a&gt; in Paris.  I remember being teased for my French Canadian accent back then so happily that wasn't an issue this time.  Even though I was a bit rusty I think I managed to communicate the key points well enough. (At least I hope I did)&lt;br /&gt;&lt;br /&gt;My presentation was a good fit for the theme of the conference: &lt;a style="font-weight: bold;" href="http://uneautrescienceestpossible.wikispaces.com/"&gt;Une autre science est possible&lt;/a&gt;&lt;span style="font-weight: bold;"&gt; : science collaborative, science ouverte, science engagée, contre la marchandisation du savoir&lt;/span&gt;. (Another Science is possible: collaborative science, open science, against the commercialization of knowledge).  I would like to thank the organizers (&lt;a href="http://fr-ca.facebook.com/people/M%C3%A9lissa-Lieutenant-Gosselin/590192274"&gt;Mélissa Lieutenant-Gosselin&lt;/a&gt; and &lt;a href="http://www.com.ulaval.ca/no-cache/departement/personnel/dic/retour-dic/85/nom/florence-piron/"&gt;Florence Piron&lt;/a&gt;) for inviting me to participate.&lt;br /&gt;&lt;br /&gt;I was able to record most of the talk (see below) but very near the end Skype decided to install an update and shut down so the recording ends somewhat abruptly.  Given what people use Skype for, that default setting for updates really doesn't make much sense.&lt;br /&gt;&lt;br /&gt;&lt;div style="width:425px" id="__ss_7900534"&gt; &lt;strong style="http://www.blogger.com/img/blank.gif&amp;quot;display:block;margin:12px" 0=""&gt;&lt;a href="http://www.slideshare.net/jcbradley/la-science-a-cahier-de-laboratoire-ouvert-7900534" title="La Science a Cahier de Laboratoire Ouvert"&gt;La Science a Cahier de Laboratoire Ouvert&lt;/a&gt;&lt;/strong&gt; &lt;iframe src="http://www.slideshare.net/slideshow/embed_code/7900534" marginwidth="0" marginheight="0" frameborder="0" height="355" scrolling="no" width="425"&gt;&lt;/iframe&gt; &lt;div style="padding:5px 0 12px"&gt; View more &lt;a href="http://www.slideshare.net/"&gt;presentations&lt;/a&gt; from &lt;a href="http://www.slideshare.net/jcbradley"&gt;Jean-Claude Bradley&lt;/a&gt; &lt;/div&gt; &lt;/div&gt;&lt;br /&gt;&lt;br /&gt;&lt;object height="400" width="480"&gt;&lt;param name="movie" value="http://www.scivee.tv/flash/embedCast.swf"&gt;&lt;param name="allowfullscreen" value="true"&gt;&lt;param name="allowscriptaccess" value="always"&gt;&lt;param name="flashvars" value="id=30225&amp;amp;type=3"&gt;&lt;param name="wmode" value="transparent"&gt;&lt;embed src="http://www.scivee.tv/flash/embedCast.swf" allowfullscreen="true" wmode="transparent" allowscriptaccess="always" flashvars="id=30225&amp;amp;type=3" height="400" width="480"&gt;&lt;/embed&gt;&lt;/object&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/14568785-3724301614475456038?l=usefulchem.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://usefulchem.blogspot.com/feeds/3724301614475456038/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=14568785&amp;postID=3724301614475456038' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/14568785/posts/default/3724301614475456038'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/14568785/posts/default/3724301614475456038'/><link rel='alternate' type='text/html' href='http://usefulchem.blogspot.com/2011/05/la-science-par-cahier-de-laboratoire.html' title='La Science par Cahier de Laboratoire Ouvert à l&apos;Acfas'/><author><name>Jean-Claude Bradley</name><uri>http://www.blogger.com/profile/09157209601339686455</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='31' height='32' src='http://photos1.blogger.com/img/267/3447/320/headshot.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-14568785.post-1758996501666630224</id><published>2011-05-08T16:28:00.000-04:00</published><updated>2011-05-08T16:29:20.930-04:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='paclitaxel'/><category scheme='http://www.blogger.com/atom/ns#' term='drug design'/><category scheme='http://www.blogger.com/atom/ns#' term='breast cancer'/><category scheme='http://www.blogger.com/atom/ns#' term='open notebook science'/><category scheme='http://www.blogger.com/atom/ns#' term='cancer'/><category scheme='http://www.blogger.com/atom/ns#' term='solubility'/><title type='text'>Breast Cancer Coalition talk on ONS and Taxol solubility</title><content type='html'>On May 1, 2011 I presented "&lt;span style="font-weight: bold;"&gt;Accelerating Discovery by Sharing: a case for Open Notebook Science&lt;/span&gt;" at the &lt;a href="http://act.breastcancerdeadline2020.org/site/PageServer?pagename=2011Conference"&gt;National Breast Cancer Coalition Annual Advocacy Conference&lt;/a&gt; in Arlington, VA. This was the first year where they had a session on an Open Science related theme and the organizers invited me to highlight some of the tools and practices in chemistry which might be applicable to cancer research.&lt;br /&gt;&lt;br /&gt;I was really touched by the passion from those in the audience as well as the other speakers and conference participants I met afterward.  For many, their deep connection with the cause was strongly rooted in a personal experience as breast cancer survivors themselves or their loved ones.  Several expressed a frustration with the current system of sharing results from scientific studies.  They felt that knowledge sharing is much slower than it needs to be and that potentially useful "negative" results are generally not disclosed at all.&lt;br /&gt;&lt;br /&gt;The NBCC has ambitiously set 2020 as the deadline to end breast cancer  (including a &lt;a href="http://www.breastcancerdeadline2020.org/"&gt;countdown clock&lt;/a&gt;).  It seems reasonable to me that encouraging transparency in research is a good strategy to accelerate progress.  Of course, great care must be exercised wherever patient confidentiality is a factor.  But health care researchers are already experienced with following protocols to anonymize datasets for publication.  Opting to work more openly would not change that but it might affect when and how results are shared.  Also there is a great deal of science related to breast cancer that does not directly involve human subjects.&lt;br /&gt;&lt;br /&gt;One initiative that particularly impressed me was &lt;a href="https://komentissuebank.iu.edu/"&gt;The Susan G. Komen for the Cure Tissue Bank&lt;/a&gt;, presented by &lt;a href="http://www.cancer.iu.edu/research/members/member_bio.php?id=1263"&gt;Susan Clare&lt;/a&gt; from Indiana University and moderated by &lt;a href="http://www.ibcresearch.org/stories/inspirational-stories-from-ibc-patients/ginny-mason/"&gt;Virginia Mason&lt;/a&gt; from the &lt;a href="http://www.ibcresearch.org/"&gt;Inflammatory Breast Cancer Research Foundation&lt;/a&gt;.  As a result of this effort, thousands of women have donated &lt;span style="font-style: italic;"&gt;healthy&lt;/span&gt; breast tissue to create a comprehensive database richly annotated with donor genetics and medical history. The idea of trying to tackle a disease state by first understanding normal functioning in great detail was apparently somewhat of a paradigm shift for the cancer research community and it was challenging to implement.  According to Dr. Clare, data from the Tissue Bank have shown that the common practice of using &lt;span style="font-style: italic;"&gt;apparently unaffected&lt;/span&gt; tissue adjacent to a tumor as a control may not be valid.&lt;br /&gt;&lt;br /&gt;This example highlights one of the key principles of Open Science: &lt;span style="font-weight: bold;"&gt;there is value in everyone knowing more - even if it isn't immediately clear how that knowledge will prove to be useful&lt;/span&gt;.&lt;br /&gt;&lt;br /&gt;In my experience, this is a fundamental point that distinguishes those who are likely to favor Open Science from those who reject its value.  If two researchers are discussing Open Science and only one of them views this philosophy as being self-evident the conversation will likely be about &lt;span style="font-weight: bold;"&gt;why&lt;/span&gt; someone would want (or not want) to share more and the focus will fall on extrinsic motivators such as academic credit, intellectual property, etc.  If both researchers view this philosophy as self-evident the conversation will probably gravitate towards &lt;span style="font-weight: bold;"&gt;how&lt;/span&gt; and &lt;span style="font-weight: bold;"&gt;what&lt;/span&gt; to share.&lt;br /&gt;&lt;br /&gt;I refer to this philosophy as being self-evident because I don't think people can become convinced through &lt;a href="http://en.wikipedia.org/wiki/Argumentation_theory"&gt;argumentation&lt;/a&gt; (I've never seen that happen).  Within the realm of &lt;a href="http://en.wikipedia.org/wiki/Open_Notebook_Science"&gt;Open Notebook Science&lt;/a&gt; I have been involved in countless discussions about the value of sharing all experimental details - even when errors are discovered.  I can think of a few ways in which this is useful - for example telegraphing a research direction to those in the field or providing data for researchers who study how science is actually done (such as &lt;a href="http://usefulchem.blogspot.com/2010/12/visualizing-social-networks-in-open.html"&gt;Don Pellegrino&lt;/a&gt;).  But even if I couldn't think of a single application I &lt;a href="http://en.wikipedia.org/wiki/Belief"&gt;believe&lt;/a&gt; that there is value in sharing all available data.&lt;br /&gt;&lt;br /&gt;A good example of this philosophy at work is the &lt;a href="http://www.jcheminf.com/content/1/1/9"&gt;Spectral Game&lt;/a&gt;.  Researchers who uploaded spectral data to &lt;a href="http://www.chemspider.com/"&gt;ChemSpider&lt;/a&gt; as &lt;a href="http://en.wikipedia.org/wiki/Open_science_data"&gt;Open Data&lt;/a&gt; did not anticipate how their contribution would be used.  They didn't do it for extrinsic motives such as traditional academic credit.  Assuming that their motivation was similar to our group's, they did it because they &lt;span style="font-weight: bold;"&gt;believed it was an obviously useful thing to do&lt;/span&gt;.  It is only much later - after a critical mass of open spectra were collected - that the idea arose to create a game from the dataset.&lt;br /&gt;&lt;br /&gt;With this mindset, I explored what contribution we might make to breast cancer research by performing a &lt;a href="http://usefulchem.blogspot.com/2011/03/towards-automated-discovery-of-useful.html"&gt;phrase search strategy&lt;/a&gt;.  Doing a simple Google search for &lt;a href="http://www.google.com/#hl=en&amp;amp;sugexp=ldymls&amp;amp;xhr=t&amp;amp;q=%22breast+cancer%22+solubility"&gt;"breast cancer" solubility&lt;/a&gt; generated mainly two types of results.&lt;br /&gt;&lt;br /&gt;The first set involve the solubility behavior of biomolecules within the cellular environment.  An example would be the observed&lt;a href="http://scripps.academia.edu/edcho/Papers/153282/Delocalization_of_g-tubulin_due_to_increased_solubility_in_human_breast_cancer_cell_lines"&gt; increased solubility of gamma-tubulin&lt;/a&gt; in cancerous cells.&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://4.bp.blogspot.com/-f0DnX5y1qi0/TcVqp5EFeMI/AAAAAAAAA-U/3e5CN4kLzEM/s1600/tubulin1.png"&gt;&lt;img style="display: block; margin: 0px auto 10px; text-align: center; cursor: pointer; width: 400px; height: 91px;" src="http://4.bp.blogspot.com/-f0DnX5y1qi0/TcVqp5EFeMI/AAAAAAAAA-U/3e5CN4kLzEM/s400/tubulin1.png" alt="" id="BLOGGER_PHOTO_ID_5604002579257587906" border="0" /&gt;&lt;/a&gt;The second type of results address the difficulty in preparing formulations for cancer drugs due to solubility problems.  A good example of this is &lt;a href="http://en.wikipedia.org/wiki/Paclitaxel"&gt;Taxol (paclitaxel)&lt;/a&gt;, where existing excipients are not completely satisfactory - in the case of Cremophor EL some patients experience a &lt;a href="http://jnci.oxfordjournals.org/content/90/4/300.full"&gt;hypersensitivity&lt;/a&gt;.&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://2.bp.blogspot.com/-EqlOH5fH_Ys/TcVqvVdNh6I/AAAAAAAAA-c/0U72IoaPAmU/s1600/pacli1.png"&gt;&lt;img style="display: block; margin: 0px auto 10px; text-align: center; cursor: pointer; width: 400px; height: 68px;" src="http://2.bp.blogspot.com/-EqlOH5fH_Ys/TcVqvVdNh6I/AAAAAAAAA-c/0U72IoaPAmU/s400/pacli1.png" alt="" id="BLOGGER_PHOTO_ID_5604002672778512290" border="0" /&gt;&lt;/a&gt;Since our modeling efforts thus far have focused on non-aqueous solubility, there is possibly an opportunity to contribute by exploring the solubility behavior of paclitaxel.  By inputting solubility data from a paper by &lt;a href="http://dx.doi.org/10.1016/S0378-5173%2801%2900986-3"&gt;Singla 2002&lt;/a&gt; into our &lt;a href="http://onswebservices.wikispaces.com/solubility"&gt;solubility database&lt;/a&gt;, &lt;a href="http://usefulchem.blogspot.com/2009/12/ons-solubility-book-edition-2-with.html"&gt;Abraham descriptors&lt;/a&gt; for paclitaxel are automatically calculated and the &lt;a href="http://showme.physics.drexel.edu/onsc/models/solventselector.php?csids=10368587&amp;amp;limreact=0.3&amp;amp;limprod=6&amp;amp;bp=400&amp;amp;washes=0"&gt;solubilities in over 70 solvents are predicted&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://2.bp.blogspot.com/-kJe8SMQuBdE/TcWQ2QkRf6I/AAAAAAAAA-k/kku9mZgUPVg/s1600/soltaxol1.jpg"&gt;&lt;img style="display: block; margin: 0px auto 10px; text-align: center; cursor: pointer; width: 400px; height: 327px;" src="http://2.bp.blogspot.com/-kJe8SMQuBdE/TcWQ2QkRf6I/AAAAAAAAA-k/kku9mZgUPVg/s400/soltaxol1.jpg" alt="" id="BLOGGER_PHOTO_ID_5604044573166895010" border="0" /&gt;&lt;/a&gt;In addition, by simply adding the melting point of paclitaxel, we &lt;a href="http://usefulchem.blogspot.com/2011/02/predicting-temperature-dependent.html"&gt;automatically predict its solubility at any temperature&lt;/a&gt; where these solvents are liquids (see for example &lt;a href="http://lxsrv7.oru.edu/%7Ealang/solubility/soltemp.php?csids=10368587&amp;amp;cs=0.001&amp;amp;solvent=water&amp;amp;solmp=&amp;amp;solbp=100&amp;amp;limreact=0.3&amp;amp;limprod=6"&gt;water&lt;/a&gt;).&lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://1.bp.blogspot.com/-69Dps1hV3bo/TcbnbkyG5rI/AAAAAAAAA-0/7sn05u0LeUw/s1600/paclisoltemp.png"&gt;&lt;img style="display:block; margin:0px auto 10px; text-align:center;cursor:pointer; cursor:hand;width: 400px; height: 201px;" src="http://1.bp.blogspot.com/-69Dps1hV3bo/TcbnbkyG5rI/AAAAAAAAA-0/7sn05u0LeUw/s400/paclisoltemp.png" alt="" id="BLOGGER_PHOTO_ID_5604421247225489074" border="0" /&gt;&lt;/a&gt;Because of the way we expose our results to the web, a &lt;a href="http://www.google.com/search?q=paclitaxel+solubility+acetonitrile"&gt;Google search for "paclitaxel solubility acetonitrile"&lt;/a&gt; now returns the actual value in the Google summary on the first page of results (currently 7th on the first page).  The other hits have all 3 keywords somewhere in the document but one has to click on each link then perform a search within the document to find out if the acetonitrile solubility for paclitaxel is actually reported. (Note that clicking on &lt;a href="http://old.oru.edu/cccda/sl/solubility/ugidata.php?solute=paclitaxel&amp;amp;solvent=acetonitrile"&gt;our link&lt;/a&gt; ultimately takes you to the &lt;a href="http://dx.doi.org/10.1016/S0378-5173%2801%2900986-3"&gt;peer-reviewed paper&lt;/a&gt; with the original measurement.)&lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://1.bp.blogspot.com/-LkWb59IG41k/TcbpvedzphI/AAAAAAAAA-8/Dl-ZbgLylFo/s1600/pacliacetonitrile.jpg"&gt;&lt;img style="display:block; margin:0px auto 10px; text-align:center;cursor:pointer; cursor:hand;width: 400px; height: 79px;" src="http://1.bp.blogspot.com/-LkWb59IG41k/TcbpvedzphI/AAAAAAAAA-8/Dl-ZbgLylFo/s400/pacliacetonitrile.jpg" alt="" id="BLOGGER_PHOTO_ID_5604423788150367762" border="0" /&gt;&lt;/a&gt;To be clear about what we are doing here - we are not claiming to be the first to predict the solubility of paclitaxel in these solvents using Abraham descriptors or any other method.  Nor are we claiming that we have directly made a dent in the formulation problem of paclitaxel.  We are not even indicating that we have done a thorough search of the literature - that would take a lot more time than we have had given the enormous amount of work on paclitaxel and its derivatives.&lt;br /&gt;&lt;br /&gt;All we are doing is fleshing out the natural interface between the knowledge space of the UsefulChem/ONS Challenge projects and that of breast cancer research - AND - we are exposing the results of that intersection through easily discoverable channels.  By design, these results are exposed as self-contained "smallest publishable units" and they are shared as quickly (and as automatically) as possible.  The traditional publication system does not have mechanism to disseminate this type of information.  (Of course when enough of these are collected and woven into a narrative that fits the criteria for a traditional paper they can and should be submitted for peer-reviewed publication).&lt;br /&gt;&lt;br /&gt;Here is a scenario for how this could work in this specific instance.  A graduate student (who has never heard of Open Science or UsefulChem, the ONS Challenge, etc.) is asked to look for new formulations for paclitaxel (or other difficult to solubilize anti-cancer agents).  They do a search on commercial databases offered by their university for various solubilities of paclitaxel and cannot find a measurement for acetonitrile.  They then do a search on Google and find a hit directly answering their query, as I detailed above.  This leads them to our prediction services and they start using those numbers in their own models.&lt;br /&gt;&lt;br /&gt;That is a good outcome - and that is exactly what has been happening (see the &lt;a href="http://usefulchem.blogspot.com/2011/02/ons-solubility-challenge-book-cited-in.html"&gt;gold nanodot paper&lt;/a&gt; and the &lt;a href="http://usefulchem.blogspot.com/2009/06/crowdsourcing-solubility-requests-from.html"&gt;phenanthrene soil contamination study&lt;/a&gt; as examples).  But the real paydirt would come from the graduate student recognizing that we've done a lot of work collecting measurements and building models for solubility and melting points, and contact us about a collaboration.  As long as they are comfortable with working openly we would be happy actively work together.&lt;br /&gt;&lt;br /&gt;I'm using the formulation of paclitaxel as an example but I'm sure that there are many more intersections between solubility and breast cancer research.  With a bit of luck I hope we can find a few researchers who are open to this type of collaboration.&lt;br /&gt;&lt;br /&gt;As another twist to this story, I will briefly mention here too that &lt;a href="http://en.wikipedia.org/wiki/Andrew_S.I.D._Lang"&gt;Andrew Lang&lt;/a&gt; has started to screen our Ugi product virtual library for &lt;a href="http://en.wikipedia.org/wiki/Docking_%28molecular%29"&gt;docking&lt;/a&gt; with the site where paclitaxel binds to gamma-tubulin (&lt;a href="http://usefulchem.wikispaces.com/D-EXP018"&gt;D-EXP018&lt;/a&gt;).  This might shed some light on some much cheaper alternatives to the extremely expensive paclitaxel and derivatives.   The drug binds through 3 hydrogen bonds, shown below - rendered in 2D and 3D representations (obtained from the &lt;a href="http://www.pdb.org/pdb/explore/viewerLaunch.do?viewerType=LX&amp;amp;structureId=1JFF&amp;amp;hetId=TA1"&gt;PDB ligand viewer&lt;/a&gt;)&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://2.bp.blogspot.com/-FaB9eIXYjIY/Tcb4StNj8SI/AAAAAAAAA_E/IQvc7_xuMoc/s1600/paclibonds2.jpg"&gt;&lt;img style="display:block; margin:0px auto 10px; text-align:center;cursor:pointer; cursor:hand;width: 400px; height: 345px;" src="http://2.bp.blogspot.com/-FaB9eIXYjIY/Tcb4StNj8SI/AAAAAAAAA_E/IQvc7_xuMoc/s400/paclibonds2.jpg" alt="" id="BLOGGER_PHOTO_ID_5604439786566971682" border="0" /&gt;&lt;/a&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://4.bp.blogspot.com/-RXz77n94C2w/Tcb4aIAX7aI/AAAAAAAAA_M/6jnJKA_UtIQ/s1600/pacliHbonds.jpg"&gt;&lt;img style="display:block; margin:0px auto 10px; text-align:center;cursor:pointer; cursor:hand;width: 400px; height: 253px;" src="http://4.bp.blogspot.com/-RXz77n94C2w/Tcb4aIAX7aI/AAAAAAAAA_M/6jnJKA_UtIQ/s400/pacliHbonds.jpg" alt="" id="BLOGGER_PHOTO_ID_5604439914018500002" border="0" /&gt;&lt;/a&gt;&lt;br /&gt;The slides and recording of my talk are embedded below:&lt;br /&gt;&lt;div style="width: 425px;" id="__ss_7793659"&gt; &lt;strong style="display: block; margin: 12px 0pt 4px;"&gt;&lt;a href="http://www.slideshare.net/jcbradley/nbcc-open-notebook-science-talk" title="NBCC Open Notebook Science Talk"&gt;NBCC Open Notebook Science Talk&lt;/a&gt;&lt;/strong&gt; &lt;iframe src="http://www.slideshare.net/slideshow/embed_code/7793659" marginwidth="0" marginheight="0" frameborder="0" height="355" scrolling="no" width="425"&gt;&lt;/iframe&gt; &lt;div style="padding: 5px 0pt 12px;"&gt; View more &lt;a href="http://www.slideshare.net/"&gt;presentations&lt;/a&gt; from &lt;a href="http://www.slideshare.net/jcbradley"&gt;Jean-Claude Bradley&lt;/a&gt; &lt;/div&gt; &lt;/div&gt;&lt;br /&gt;&lt;br /&gt;&lt;object height="400" width="480"&gt;&lt;param name="movie" value="http://www.scivee.tv/flash/embedCast.swf"&gt;&lt;param name="allowfullscreen" value="true"&gt;&lt;param name="allowscriptaccess" value="always"&gt;&lt;param name="flashvars" value="id=29145&amp;amp;type=3"&gt;&lt;param name="wmode" value="transparent"&gt;&lt;embed src="http://www.scivee.tv/flash/embedCast.swf" allowfullscreen="true" wmode="transparent" allowscriptaccess="always" flashvars="id=29145&amp;amp;type=3" height="400" width="480"&gt;&lt;/embed&gt;&lt;/object&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/14568785-1758996501666630224?l=usefulchem.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://usefulchem.blogspot.com/feeds/1758996501666630224/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=14568785&amp;postID=1758996501666630224' title='1 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/14568785/posts/default/1758996501666630224'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/14568785/posts/default/1758996501666630224'/><link rel='alternate' type='text/html' href='http://usefulchem.blogspot.com/2011/05/breast-cancer-coalition-talk-on-ons-and.html' title='Breast Cancer Coalition talk on ONS and Taxol solubility'/><author><name>Jean-Claude Bradley</name><uri>http://www.blogger.com/profile/09157209601339686455</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='31' height='32' src='http://photos1.blogger.com/img/267/3447/320/headshot.jpg'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://4.bp.blogspot.com/-f0DnX5y1qi0/TcVqp5EFeMI/AAAAAAAAA-U/3e5CN4kLzEM/s72-c/tubulin1.png' height='72' width='72'/><thr:total>1</thr:total></entry><entry><id>tag:blogger.com,1999:blog-14568785.post-8996026475934222778</id><published>2011-05-08T11:37:00.000-04:00</published><updated>2011-05-08T11:37:29.206-04:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='open notebook science'/><category scheme='http://www.blogger.com/atom/ns#' term='collaboration'/><category scheme='http://www.blogger.com/atom/ns#' term='wiley'/><category scheme='http://www.blogger.com/atom/ns#' term='book'/><title type='text'>Collaboration using Open Notebook Science in Academia book chapter</title><content type='html'>I am very pleased to report that the book chapter that I co-wrote with &lt;a href="http://en.wikipedia.org/wiki/Andrew_S.I.D._Lang"&gt;Andrew Lang&lt;/a&gt;, &lt;a href="http://www.linkedin.com/in/stevekoch"&gt;Steve Koch&lt;/a&gt; and &lt;a href="http://cameronneylon.net/"&gt;Cameron Neylon&lt;/a&gt; is now available online: &lt;a href="http://media.wiley.com/product_data/excerpt/36/04706380/0470638036-1.pdf"&gt;Collaboration using Open Notebook Science in Academia&lt;/a&gt;.  This is the 25th chapter of &lt;a href="http://www.wiley.com/WileyCDA/WileyTitle/productCd-0470638036.html"&gt;Collaborative Computational Technologies for Biomedical Research&lt;/a&gt;, edited by &lt;a href="http://en.wikipedia.org/wiki/Sean_Ekins"&gt;Sean Ekins&lt;/a&gt;, &lt;a href="http://www.linkedin.com/pub/maggie-hupcey/0/273/434"&gt;Maggie Hupcey&lt;/a&gt;, &lt;a href="http://en.wikipedia.org/wiki/Antony_John_Williams"&gt;Antony Williams&lt;/a&gt; and &lt;a href="http://www.linkedin.com/pub/alph-bingham/3/877/157"&gt;Alpheus Bingham&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;Our chapter provides some fairly detailed examples of how Open Notebook Science can be used to enhance collaboration between researchers from both similar or distant fields. It also suggests certain paths towards machine/human collaboration in science.  Hopefully it will encourage researchers who have an interest in Open Science to experiment with some of the tools and strategies mentioned.&lt;br /&gt;&lt;br /&gt;I am also grateful to Wiley for choosing our chapter as the &lt;a href="http://media.wiley.com/product_data/excerpt/36/04706380/0470638036-1.pdf"&gt;free online sample&lt;/a&gt; for the book!&lt;br /&gt;&lt;blockquote&gt;This book discusses the state-of-the-art collaborative and computing techniques for the pharmaceutical industry, the present and future implications and opportunities to advance healthcare research. The book tackles problems thoroughly, from both the human collaborative and the data and informatics side, and is very relevant to the day-to-day activities running a laboratory or a collaborative R&amp;amp;D project. It can be applied to help organizations make critical decisions about managing drug discovery and development partnership. The book follows a “man- methods-machine” format with sections on how to get people to collaborate, collaborative methods, and computational tools for collaboration. This book offers the reader a “getting started guide” or instruction on “how to collaborate” for new laboratories, new companies, and new partnerships, as well as a user manual for how to troubleshoot existing collaborations.&lt;/blockquote&gt;&lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://4.bp.blogspot.com/-f7DwT-NSsvQ/Tcazk3TYH0I/AAAAAAAAA-s/rTdY4E2GwU4/s1600/wileybookcover.jpg"&gt;&lt;img style="display:block; margin:0px auto 10px; text-align:center;cursor:pointer; cursor:hand;width: 265px; height: 400px;" src="http://4.bp.blogspot.com/-f7DwT-NSsvQ/Tcazk3TYH0I/AAAAAAAAA-s/rTdY4E2GwU4/s400/wileybookcover.jpg" alt="" id="BLOGGER_PHOTO_ID_5604364232211046210" border="0" /&gt;&lt;/a&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/14568785-8996026475934222778?l=usefulchem.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://usefulchem.blogspot.com/feeds/8996026475934222778/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=14568785&amp;postID=8996026475934222778' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/14568785/posts/default/8996026475934222778'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/14568785/posts/default/8996026475934222778'/><link rel='alternate' type='text/html' href='http://usefulchem.blogspot.com/2011/05/collaboration-using-open-notebook.html' title='Collaboration using Open Notebook Science in Academia book chapter'/><author><name>Jean-Claude Bradley</name><uri>http://www.blogger.com/profile/09157209601339686455</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='31' height='32' src='http://photos1.blogger.com/img/267/3447/320/headshot.jpg'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://4.bp.blogspot.com/-f7DwT-NSsvQ/Tcazk3TYH0I/AAAAAAAAA-s/rTdY4E2GwU4/s72-c/wileybookcover.jpg' height='72' width='72'/><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-14568785.post-3151659689937720985</id><published>2011-05-07T10:23:00.001-04:00</published><updated>2011-05-07T10:24:19.385-04:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='RSC'/><category scheme='http://www.blogger.com/atom/ns#' term='imine'/><category scheme='http://www.blogger.com/atom/ns#' term='award'/><category scheme='http://www.blogger.com/atom/ns#' term='onschallenge'/><category scheme='http://www.blogger.com/atom/ns#' term='solubility'/><title type='text'>Evan Curtin is the May 2011 RSC ONS Challenge Winner</title><content type='html'>&lt;a href="http://www.linkedin.com/pub/evan-curtin/30/495/149"&gt;Evan Curtin&lt;/a&gt;, a chemistry freshman student working under the supervision of Jean-Claude Bradley at Drexel University, is the May 2011 &lt;a href="http://onschallenge.wikispaces.com/"&gt;Royal Society of Chemistry Open Notebook Science Challenge Award&lt;/a&gt; winner. He wins a cash prize from the RSC.&lt;br /&gt;&lt;br /&gt;Evan's primary focus has centered on synthesizing &lt;a href="http://onschallenge.wikispaces.com/imine"&gt;aromatic imines&lt;/a&gt; and measuring their solubility in a number of organic solvents.  This will allow us to generate Abraham descriptors for this class of compounds in order to predict their solubility in 70+ solvents. Coupled with our new model to include &lt;a href="http://usefulchem.blogspot.com/2011/02/predicting-temperature-dependent.html"&gt;temperature dependent solubility&lt;/a&gt;, this should greatly facilitate optimal solvent prediction for this and related reactions.&lt;br /&gt;&lt;br /&gt;Imine formation is of particular interest to the &lt;a href="http://usefulchem.wikispaces.com"&gt;UsefulChem&lt;/a&gt; group because it is the first step of the &lt;a href="http://usefulchem.blogspot.com/2007/01/anatomy-of-ugi-reaction.html"&gt;Ugi reaction&lt;/a&gt;, which we have used to synthesize compounds with &lt;a href="http://usefulchem.blogspot.com/2008/01/we-have-anti-malarial-activity.html"&gt;anti-malarial activity&lt;/a&gt;.  But it is also a simple convenient reaction in itself to test our &lt;a href="http://usefulchem.blogspot.com/2010/08/reaction-attempts-solvent-selector.html"&gt;Solvent Selector&lt;/a&gt;'s ability to predict optimal conditions (solvent and temperature) for isolation of products by precipitation.&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Evan's synthesis experiments are available here:&lt;/span&gt;&lt;br /&gt;&lt;a href="http://usefulchem.wikispaces.com/Exp263"&gt;http://usefulchem.wikispaces.com/Exp263&lt;/a&gt;&lt;br /&gt;&lt;a href="http://usefulchem.wikispaces.com/Exp262"&gt;http://usefulchem.wikispaces.com/Exp262&lt;/a&gt;&lt;br /&gt;&lt;a href="http://usefulchem.wikispaces.com/Exp261"&gt;http://usefulchem.wikispaces.com/Exp261&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;&lt;br /&gt;and his solubility experiments are listed here:&lt;/span&gt;&lt;br /&gt;&lt;a href="http://onschallenge.wikispaces.com/Exp207"&gt;http://onschallenge.wikispaces.com/Exp207&lt;/a&gt;&lt;br /&gt;&lt;a href="http://onschallenge.wikispaces.com/Exp206"&gt;http://onschallenge.wikispaces.com/Exp206&lt;/a&gt;&lt;br /&gt;&lt;a href="http://onschallenge.wikispaces.com/Exp205"&gt;http://onschallenge.wikispaces.com/Exp205&lt;/a&gt;&lt;br /&gt;&lt;a href="http://onschallenge.wikispaces.com/Exp204"&gt;http://onschallenge.wikispaces.com/Exp204&lt;/a&gt;&lt;br /&gt;&lt;a href="http://onschallenge.wikispaces.com/Exp201"&gt;http://onschallenge.wikispaces.com/Exp201&lt;/a&gt;&lt;br /&gt;&lt;a href="http://onschallenge.wikispaces.com/Exp198"&gt;http://onschallenge.wikispaces.com/Exp198&lt;/a&gt;&lt;br /&gt;&lt;a href="http://onschallenge.wikispaces.com/Exp197"&gt;http://onschallenge.wikispaces.com/Exp197&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;Three more RSC ONS Awards will be made during 2011. Submissions from students in the US and the UK are still welcome.&lt;br /&gt;For more information see:&lt;br /&gt;&lt;a href="http://onschallenge.wikispaces.com"&gt;http://onschallenge.wikispaces.com&lt;/a&gt;&lt;br /&gt;&lt;a href="http://onschallenge.wikispaces.com/RSCAwards2010"&gt;http://onschallenge.wikispaces.com/RSCAwards2010&lt;/a&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/14568785-3151659689937720985?l=usefulchem.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://usefulchem.blogspot.com/feeds/3151659689937720985/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=14568785&amp;postID=3151659689937720985' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/14568785/posts/default/3151659689937720985'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/14568785/posts/default/3151659689937720985'/><link rel='alternate' type='text/html' href='http://usefulchem.blogspot.com/2011/05/evan-curtin-is-may-2011-rsc-ons.html' title='Evan Curtin is the May 2011 RSC ONS Challenge Winner'/><author><name>Jean-Claude Bradley</name><uri>http://www.blogger.com/profile/09157209601339686455</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='31' height='32' src='http://photos1.blogger.com/img/267/3447/320/headshot.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-14568785.post-4322575203719035296</id><published>2011-04-04T14:50:00.001-04:00</published><updated>2011-05-24T13:47:33.043-04:00</updated><title type='text'>ACS and ACRL presentations on web services and trust in science</title><content type='html'>&lt;span style="font-weight:bold;"&gt;Update: the recording of my ACS talk on Rapid Dissemination of Chemical Information for people and machines using Open Notebook Science is now &lt;a href="http://www.softconference.com/ACSchem/sessionDetail.asp?SID=250355"&gt;available here&lt;/a&gt;.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;On March 30 and 31, 2011 I presented two related talks - the first remotely for the &lt;a href="http://abstracts.acs.org/chem/241nm/program/divisionindex.php?act=presentations&amp;amp;val=Internet+and+Chemistry%3A+Social+Networking&amp;amp;ses=Internet+and+Chemistry%3A+Social+Networking&amp;amp;prog=54108"&gt;American Chemical Society (ACS) Meeting&lt;/a&gt; and the second in Philadelphia at the meeting for the &lt;a href="http://www.ala.org/ala/mgrps/divs/acrl/events/national/2011/index.cfm"&gt;Association of College and Research Libraries (ACRL)&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;In the ACS talk "&lt;span style="font-weight: bold;"&gt;Rapid Dissemination of Chemical Information for people and machines using Open Notebook Science&lt;/span&gt;", I spoke for the first time in detail about the results of the &lt;a href="http://usefulchem.blogspot.com/2011/03/open-modeling-of-melting-point-data.html"&gt;open modeling&lt;/a&gt; &lt;a href="http://en.wikipedia.org/wiki/Andrew_S.I.D._Lang"&gt;Andrew Lang&lt;/a&gt; and I carried out on the open dataset of melting points we collected starting with the &lt;a href="http://usefulchem.blogspot.com/2011/02/alfa-aesar-melting-point-data-now.html"&gt;Alfa Aesar dataset recently made public&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;We used Skype and Google Presenter with the help of &lt;a href="http://en.wikipedia.org/wiki/Peter_Murray-Rust"&gt;Peter Murray-Rust&lt;/a&gt; on site at the conference and it went fairly well I think.  &lt;a href="http://en.wikipedia.org/wiki/Henry_Rzepa"&gt;Henry Rzepa&lt;/a&gt; had a good question about &lt;a href="http://en.wikipedia.org/wiki/Polymorphism_%28materials_science%29"&gt;polymorphism&lt;/a&gt; possibly being responsible for different melting points from various sources.  I don't think that is the problem in most of these cases but we can certainly spend some time investigating the reports of polymorphism for cases where the information is available.  One of the big problems is that we don't know the history of the sample used for a melting point from most sources like chemical vendor sites.  At least in journal articles we might be told which solvent was used to crystallize the sample.  If multiple sources agree on a certain melting point and there is one outlier, I think it is reasonable to assume that the common melting point is likely to correspond to the thermodynamically favored polymorph.  This might not be correct in all cases but - without the means to discover more information about the sample histories - I think it makes sense to proceed in this way.  Since we don't consider polymorphism in our modeling, there is an implicit assumption that - in the case of polymorphism - we are dealing with the thermodynamically most stable form.&lt;br /&gt;&lt;br /&gt;My ACRL talk "&lt;span style="font-weight: bold;"&gt;Is there a role for Trust in Science?&lt;/span&gt;" focused more on the &lt;a href="http://usefulchem.blogspot.com/2011/01/chemical-information-validation-results.html"&gt;Chemical Information Validation study and outcomes&lt;/a&gt;.  There were several good questions at the end.  One particularly good comment addressed my speculation that within a few years, the open models in most of the useful chemical spaces will be sufficiently good that it will be as easy to Google a melting point or a solubility as it is now to get driving directions.  The question was: weren't we just replacing trust from one information source to another, namely these models.  I don't think the concept of trust applies in these cases because the training sets, the descriptors and the performance of the models are (and will be) open.  This is in sharp contrast with most commercial software generating predictions for solubility and melting points - these are generally black boxes because either the training set, the model or the descriptors are not open.&lt;br /&gt;&lt;br /&gt;&lt;div style="width: 425px;" id="__ss_7453631"&gt; &lt;strong style="display: block; margin: 12px 0pt 4px;"&gt;&lt;a href="http://www.slideshare.net/jcbradley/bradley-acs-sp2011" title="Open Notebook Science Web Services - ACS Spring 2011"&gt;Open Notebook Science Web Services - ACS Spring 2011&lt;/a&gt;&lt;/strong&gt; &lt;object id="__sse7453631" width="425" height="355"&gt; &lt;param name="movie" value="http://static.slidesharecdn.com/swf/ssplayer2.swf?doc=bradleyacs-sp2011-110330165149-phpapp01&amp;amp;stripped_title=bradley-acs-sp2011&amp;amp;userName=jcbradley"&gt; &lt;param name="allowFullScreen" value="true"&gt; &lt;param name="allowScriptAccess" value="always"&gt; &lt;embed name="__sse7453631" src="http://static.slidesharecdn.com/swf/ssplayer2.swf?doc=bradleyacs-sp2011-110330165149-phpapp01&amp;amp;stripped_title=bradley-acs-sp2011&amp;amp;userName=jcbradley" type="application/x-shockwave-flash" allowscriptaccess="always" allowfullscreen="true" width="425" height="355"&gt;&lt;/embed&gt; &lt;/object&gt; &lt;div style="padding: 5px 0pt 12px;"&gt; View more &lt;a href="http://www.slideshare.net/"&gt;presentations&lt;/a&gt; from &lt;a href="http://www.slideshare.net/jcbradley"&gt;Jean-Claude Bradley&lt;/a&gt; &lt;/div&gt; &lt;/div&gt;&lt;br /&gt;&lt;div style="width: 425px;" id="__ss_7460372"&gt; &lt;strong style="display: block; margin: 12px 0pt 4px;"&gt;&lt;a href="http://www.slideshare.net/jcbradley/acrl-trust-in-science-talk" title="ACRL Trust in Science Talk"&gt;ACRL Trust in Science Talk&lt;/a&gt;&lt;/strong&gt; &lt;object id="__sse7460372" width="425" height="355"&gt; &lt;param name="movie" value="http://static.slidesharecdn.com/swf/ssplayer2.swf?doc=acrl2011-110331051805-phpapp02&amp;amp;stripped_title=acrl-trust-in-science-talk&amp;amp;userName=jcbradley"&gt; &lt;param name="allowFullScreen" value="true"&gt; &lt;param name="allowScriptAccess" value="always"&gt; &lt;embed name="__sse7460372" src="http://static.slidesharecdn.com/swf/ssplayer2.swf?doc=acrl2011-110331051805-phpapp02&amp;amp;stripped_title=acrl-trust-in-science-talk&amp;amp;userName=jcbradley" type="application/x-shockwave-flash" allowscriptaccess="always" allowfullscreen="true" width="425" height="355"&gt;&lt;/embed&gt; &lt;/object&gt; &lt;div style="padding: 5px 0pt 12px;"&gt; View more &lt;a href="http://www.slideshare.net/"&gt;presentations&lt;/a&gt; from &lt;a href="http://www.slideshare.net/jcbradley"&gt;Jean-Claude Bradley&lt;/a&gt; &lt;/div&gt; &lt;/div&gt;&lt;script src="http://b.scorecardresearch.com/beacon.js?c1=7&amp;amp;c2=7400849&amp;amp;c3=1&amp;amp;c4=&amp;amp;c5=&amp;amp;c6="&gt;&lt;/script&gt;&lt;script src="http://b.scorecardresearch.com/beacon.js?c1=7&amp;amp;c2=7400849&amp;amp;c3=1&amp;amp;c4=&amp;amp;c5=&amp;amp;c6="&gt;&lt;/script&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/14568785-4322575203719035296?l=usefulchem.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://usefulchem.blogspot.com/feeds/4322575203719035296/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=14568785&amp;postID=4322575203719035296' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/14568785/posts/default/4322575203719035296'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/14568785/posts/default/4322575203719035296'/><link rel='alternate' type='text/html' href='http://usefulchem.blogspot.com/2011/04/acs-and-acrl-presentations-on-web.html' title='ACS and ACRL presentations on web services and trust in science'/><author><name>Jean-Claude Bradley</name><uri>http://www.blogger.com/profile/09157209601339686455</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='31' height='32' src='http://photos1.blogger.com/img/267/3447/320/headshot.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-14568785.post-489150242492541988</id><published>2011-03-27T21:57:00.000-04:00</published><updated>2011-03-27T21:57:35.161-04:00</updated><title type='text'>Towards the automated discovery of useful solubility applications</title><content type='html'>Last week, I came across (via &lt;a href="http://ff.im/zwRuH"&gt;David Bradley&lt;/a&gt;) a paper by an MIT group regarding the desalination of water using a very clever application of solubility behavior:&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;&lt;/span&gt;&lt;blockquote&gt;&lt;span style="font-weight: bold;"&gt;Anurag Bajpayee, Tengfei Luo, Andrew Muto and Gang Chen&lt;/span&gt;, &lt;span style="font-style: italic;"&gt;Energy Environ. Sci.&lt;/span&gt;, &lt;span style="font-weight: bold;"&gt;2011&lt;/span&gt; Very low temperature membrane-free desalination by directional solvent extraction (&lt;a href="http://pubs.rsc.org/en/Content/ArticleLanding/2011/EE/c1ee01027a"&gt;article&lt;/a&gt;, &lt;a href="http://www.rsc.org/chemistryworld/News/2011/March/18031105.asp"&gt;summary&lt;/a&gt;)&lt;/blockquote&gt;The technique simply involves the heating of saltwater with molten decanoic acid to 40-80 C.  Some water dissolves into the decanoic acid, leaving the salt behind.  The layers are then separated and, upon cooling to 34C, sufficiently pure water separates out.  Any traces of decanoic acid are inconsequential since this compound is already present in many foods at higher levels.&lt;br /&gt;&lt;br /&gt;From a technological standpoint, I can't think of a reason why this solution could not have been discovered and implemented 100 years ago.  It makes you wonder how many other elegant solutions to real problems could be uncovered by connecting the right pieces together.&lt;br /&gt;&lt;br /&gt;To me, this is where the efforts of Open Science and the automation of the scientific process will pay off first.  For this to happen on a global level, two key requirements must be met:&lt;br /&gt;&lt;blockquote&gt;1) Information must be freely available, optimally as a web service (measurements if possible - otherwise a predicted value, preferably from an &lt;a href="http://usefulchem.blogspot.com/2011/03/open-modeling-of-melting-point-data.html"&gt;Open Model&lt;/a&gt;)&lt;br /&gt;2) There has to be a significantly automated way of identifying what is important enough to be solved.&lt;/blockquote&gt;Since we have been working on fulfilling the first requirement for solubility data, I first looked at our &lt;a href="http://onswebservices.wikispaces.com/"&gt;available services&lt;/a&gt; to see if there was anything there that could have pointed towards this solution.&lt;br /&gt;&lt;br /&gt;Although we have a &lt;a href="http://old.oru.edu/cccda/sl/solubility/ugidata.php?solute=decanoic+acid&amp;amp;solvent=water"&gt;measured&lt;/a&gt; (0.0004 M) and &lt;a href="http://showme.physics.drexel.edu/onsc/models/solutesolventprediction.php?solute=decanoic+acid&amp;amp;solvent=water"&gt;predicted&lt;/a&gt; (0.001 M) room temperature solubility of &lt;a href="http://www.chemspider.com/Chemical-Structure.2863.html"&gt;decanoic acid&lt;/a&gt; in water, our best prediction service can't do the opposite: the solubility of &lt;span style="font-style: italic;"&gt;water in decanoic acid&lt;/span&gt;.  For that we would need the &lt;a href="http://usefulchem.blogspot.com/2010/07/general-transparent-solubility.html"&gt;Abraham descriptors&lt;/a&gt; for decanoic acid as a &lt;span style="font-style: italic;"&gt;solvent&lt;/span&gt; and those are not yet available as far as I'm aware.&lt;br /&gt;&lt;br /&gt;Also, we use a &lt;a href="http://usefulchem.blogspot.com/2011/02/predicting-temperature-dependent.html"&gt;model to predict solubility at different temperatures&lt;/a&gt; - but it assumes that the solute is miscible with the solvent at its melting point.  This is probably a reasonable assumption for the most part but it fails when the solute and the solvent are too radically dissimilar (e.g. water/hydrophobic organic compounds).  In this particular application, &lt;a href="http://lxsrv7.oru.edu/%7Ealang/meltingpoints/meltingpointof.php?csid=2863"&gt;decanoic acid melts at 31C&lt;/a&gt; and the process occurs in the 34-80 C range.&lt;br /&gt;&lt;br /&gt;But even if we had the necessary models (and corresponding &lt;a href="http://onswebservices.wikispaces.com/"&gt;web services&lt;/a&gt;) for the decanoic acid/water/NaCl system, could it have been flagged in an automated way as being potentially "useful" or even "interesting"?&lt;br /&gt;&lt;br /&gt;For utility assessment, humans are still the best source.  Luckily, they often record this information tagged with common phrases in the introductory paragraphs of scientific documents. (In fact, this is the &lt;a href="http://usefulchem.blogspot.com/2005/07/sample-search-phrases.html"&gt;origin of the UsefulChem project&lt;/a&gt;).  For example, if we search for &lt;a href="http://www.google.com/#q=%22there+is+a+pressing+need+for%22+solubility"&gt;"there is a pressing need for" AND solubility in a Google search&lt;/a&gt;, most of the results provide reasonable answers to the question of what a useful application of solubility might be.  I have summarized the &lt;a href="https://spreadsheets.google.com/ccc?hl=en&amp;amp;key=t1dKqZv_zq0U8wzbkd5fG3A&amp;amp;hl=en#gid=0"&gt;initial results in this sheet&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;The first result is:&lt;br /&gt;"there is a pressing need for &lt;span style="font-weight: bold;"&gt;new materials for efficient CO2 separation&lt;/span&gt;" from a &lt;a href="http://uwadmnweb.uwyo.edu/rpc/Papers%20powerpoints%20attachments/04-088%20publication%20II.pdf"&gt;Macromolecules article in 2005&lt;/a&gt;.  The general problem needing solving would correspond to "&lt;span style="font-weight: bold;"&gt;global warming/CO2 sequestration&lt;/span&gt;" and the modeling challenge would be "&lt;span style="font-weight: bold;"&gt;gas solubility&lt;/span&gt;".&lt;br /&gt;&lt;br /&gt;Analyzing the first 9 results in this way gives us the following &lt;span style="font-weight: bold;"&gt;problem types&lt;/span&gt;:&lt;br /&gt;&lt;ol&gt;&lt;li&gt;global warming/CO2 sequestration&lt;/li&gt;&lt;li&gt;fire control&lt;/li&gt;&lt;li&gt;global warming/refrigeration fluid&lt;/li&gt;&lt;li&gt;AIDS prevention&lt;/li&gt;&lt;li&gt;Iron absorption in developing countries&lt;/li&gt;&lt;li&gt;agriculture/making phosphate from rock bioavailable&lt;/li&gt;&lt;li&gt;water treatment/flocculation&lt;/li&gt;&lt;li&gt;natural gas purification/environmental&lt;/li&gt;&lt;li&gt;waste water treatment&lt;/li&gt;&lt;/ol&gt;and the following &lt;span style="font-weight: bold;"&gt;modeling challenges&lt;/span&gt;:&lt;br /&gt;&lt;ol&gt;&lt;li&gt;gas solubility&lt;/li&gt;&lt;li&gt;polymer solubility&lt;/li&gt;&lt;li&gt;hydrofluoroether solubility&lt;/li&gt;&lt;li&gt;solubility of drug in gels&lt;/li&gt;&lt;li&gt;inorganics&lt;/li&gt;&lt;li&gt;inorganics/pH dependence of solubility&lt;/li&gt;&lt;li&gt;polymer solubility/flocculation/colloidal dispersions&lt;/li&gt;&lt;li&gt;gas solubility&lt;/li&gt;&lt;li&gt;inorganics&lt;/li&gt;&lt;/ol&gt;These preliminary results are instructive.  The problem types are broad and varied - and I think they will be helpful for keeping in mind as we continue to work on solubility.  The modeling challenges can be compared directly with our existing services - and none of them overlap at this time!  All of these involve either gasses, polymers, gels, salts, inorganics or colloids while our services are strictly for small, non-ionic organic compounds in liquid solvents.&lt;br /&gt;&lt;br /&gt;Part of the reason for our focus on these types of compounds relates to our ulterior objective of assessing and synthesizing &lt;a href="http://usefulchem.blogspot.com/2008/01/we-have-anti-malarial-activity.html"&gt;drug-like compounds&lt;/a&gt;.  But a more important consideration is what type of information is available and what can be processed related to cheminformatics.  Currently most cheminformatics tools deal only with organic chemicals, with essential sources such as &lt;a href="http://www.chemspider.com"&gt;ChemSpider&lt;/a&gt; and the &lt;a href="http://sourceforge.net/apps/mediawiki/cdk/index.php?title=Main_Page"&gt;CDK&lt;/a&gt; providing measurements, models, descriptors, etc.&lt;br /&gt;&lt;br /&gt;Even though some inorganic compounds are on ChemSpider, most of the properties are unavailable.  Consider the example of &lt;a href="http://www.chemspider.com/Chemical-Structure.5044.html"&gt;sodium chloride&lt;/a&gt;:&lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://2.bp.blogspot.com/-EsqCwxxqEn8/TY_i5JaYXuI/AAAAAAAAA-M/UW3fCkSpQd0/s1600/naclcs.png"&gt;&lt;img style="display: block; margin: 0px auto 10px; text-align: center; cursor: pointer; width: 400px; height: 293px;" src="http://2.bp.blogspot.com/-EsqCwxxqEn8/TY_i5JaYXuI/AAAAAAAAA-M/UW3fCkSpQd0/s400/naclcs.png" alt="" id="BLOGGER_PHOTO_ID_5588935133997063906" border="0" /&gt;&lt;/a&gt;&lt;br /&gt;This doesn't mean that the situation is hopeless but it does make the challenge much more difficult.  Solubility measurements and models for inorganic salts do exist (for example see &lt;a href="http://www.eis.hu.edu.jo/deanshipfiles/pub10224209.pdf"&gt;Abdel-Halim et al.&lt;/a&gt;) but they are much more fragmented.&lt;br /&gt;&lt;br /&gt;With the feedback we obtain from this search phrase approach - and hopefully help from experts in the chemistry community - we can piece together a federated service to provide reasonable estimates for most types of solubility behavior.&lt;br /&gt;&lt;br /&gt;I think that this desalination solution will prove to be a good test for automated (or at least semi-automated) scientific discovery in the realm of open solubility information.  In order to pass the test, the phrase searching algorithm should eventually identify desalination as a "useful problem to solve" and should connect with the predicted behavior of water/NaCl/decanoic acid (or other similar compound).&lt;br /&gt;&lt;br /&gt;Luckily we have &lt;a href="http://usefulchem.blogspot.com/2010/12/visualizing-social-networks-in-open.html"&gt;Don Pellegrino&lt;/a&gt; on board.  His expertise on automated scientific discovery should prove quite valuable for this approach.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/14568785-489150242492541988?l=usefulchem.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://usefulchem.blogspot.com/feeds/489150242492541988/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=14568785&amp;postID=489150242492541988' title='2 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/14568785/posts/default/489150242492541988'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/14568785/posts/default/489150242492541988'/><link rel='alternate' type='text/html' href='http://usefulchem.blogspot.com/2011/03/towards-automated-discovery-of-useful.html' title='Towards the automated discovery of useful solubility applications'/><author><name>Jean-Claude Bradley</name><uri>http://www.blogger.com/profile/09157209601339686455</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='31' height='32' src='http://photos1.blogger.com/img/267/3447/320/headshot.jpg'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://2.bp.blogspot.com/-EsqCwxxqEn8/TY_i5JaYXuI/AAAAAAAAA-M/UW3fCkSpQd0/s72-c/naclcs.png' height='72' width='72'/><thr:total>2</thr:total></entry><entry><id>tag:blogger.com,1999:blog-14568785.post-3562128357493831131</id><published>2011-03-22T16:31:00.000-04:00</published><updated>2011-03-22T16:31:49.592-04:00</updated><title type='text'>Open modeling of melting point data</title><content type='html'>The contribution of &lt;a href="http://usefulchem.blogspot.com/2011/02/alfa-aesar-melting-point-data-now.html"&gt;Alfa Aesar melting point data&lt;/a&gt; to our open collection has facilitated the &lt;a href="http://usefulchem.blogspot.com/2011/03/validating-melting-point-data-from-alfa.html"&gt;validation&lt;/a&gt; of a significant amount of the entire dataset.  However, this process of curation is never-ending.  A good example is the discovery of an error in one of the sources for the melting point of &lt;a href="http://www.chemspider.com/Chemical-Structure.10442445.html"&gt;warfarin&lt;/a&gt;.  Following &lt;a href="http://www.hyperorg.com/blogger/2011/03/19/2b2k-melting-points-a-model-for-open-data/"&gt;David Weinberger's post&lt;/a&gt; about our melting point explorer, his brother Andy noticed a problem and this &lt;a href="http://ff.im/zwGGD"&gt;enabled us to fix it&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;In a way, creating an open environment to make it easy to find and report errors - as well as add new data - complicates scientific evaluation.  In order to report a reproducible process and outcome, it is necessary to take a snapshot of the dataset.  Choosing the exact composition of a dataset for a particular application is somewhat arbitrary.  Aside from selecting a threshold for excluding measurements that deviate too much, compounds may be excluded based on their type.&lt;br /&gt;&lt;br /&gt;For the sake of clarity, we &lt;a href="http://onschallenge.wikispaces.com/Open+Melting+Point+Datasets"&gt;archived the various datasets&lt;/a&gt; we created from multiple sources with brief descriptions of the filtering and merging at each step.  From the perspective of an organic chemist, &lt;a href="http://onschallenge.wikispaces.com/Open+Melting+Point+Datasets"&gt;ONSMP013&lt;/a&gt; is probably the most useful at this time.  It contains averaged measurements for 12634 organic compounds and excludes salts, inorganics or organometallics.  The original file provided by Alfa Aesar contained several of these excluded compounds and can be obtained from &lt;a href="http://onschallenge.wikispaces.com/Open+Melting+Point+Datasets"&gt;ONSMP000&lt;/a&gt;.  It might be interesting at some point to create a collection of melting points for inorganics or salts.  We would welcome contributions of collections of melting points with different filters.&lt;br /&gt;&lt;br /&gt;One of the advantages of &lt;a href="http://onschallenge.wikispaces.com/Open+Melting+Point+Datasets"&gt;ONSMP013&lt;/a&gt; is that it is possible to generate &lt;a href="http://sourceforge.net/apps/mediawiki/cdk/index.php?title=Main_Page"&gt;CDK descriptors&lt;/a&gt; for each entry (and these are included in the spreadsheet).  By not using commercial software to generate descriptors, it enables fully transparent modeling - and extension of that modeling by anyone.&lt;br /&gt;&lt;br /&gt;With this in mind, &lt;a href="http://en.wikipedia.org/wiki/Andrew_S.I.D._Lang"&gt;Andrew Lang&lt;/a&gt; has used &lt;a href="http://onschallenge.wikispaces.com/Open+Melting+Point+Datasets"&gt;ONSMP013&lt;/a&gt; to generate a &lt;a href="http://en.wikipedia.org/wiki/Random_forest"&gt;Random forest&lt;/a&gt; melting point model (&lt;a href="http://onschallenge.wikispaces.com/MeltingPointModel002"&gt;MPM002&lt;/a&gt;).  The most important descriptors turned out to be the number of hydrogen bond donors and the Topological Polar Surface Area (TPSA).  The scatter plot below shows the correlation (R2 = 0.79) between the predicted and experimental values. (color represents TPSA and size relates to H-bond donors)&lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://3.bp.blogspot.com/--kpxYaEUALI/TYjWNgPWSxI/AAAAAAAAA98/ig6Ym-UwiEs/s1600/observedpredicted.png"&gt;&lt;img style="display: block; margin: 0px auto 10px; text-align: center; cursor: pointer; width: 400px; height: 278px;" src="http://3.bp.blogspot.com/--kpxYaEUALI/TYjWNgPWSxI/AAAAAAAAA98/ig6Ym-UwiEs/s400/observedpredicted.png" alt="" id="BLOGGER_PHOTO_ID_5586950865234053906" border="0" /&gt;&lt;/a&gt;&lt;br /&gt;Andy has described in much more detail the rationale for selecting the Random forest approach over a linear model in &lt;a href="http://onschallenge.wikispaces.com/MeltingPointModel001"&gt;MPM001&lt;/a&gt;.  He has also compared the performance of CDK descriptors versus those from a commercial program for a small set of drug melting points in &lt;a href="http://onschallenge.wikispaces.com/MeltingPointModel003"&gt;MPM003&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;The Random forest model (&lt;a href="http://onschallenge.wikispaces.com/MeltingPointModel002"&gt;MPM002&lt;/a&gt;) is also now available as a web service by entering the ChemSpiderID (CSID) of a compound in a URL.  See &lt;a href="http://lxsrv7.oru.edu/%7Ealang/meltingpoints/meltingpointof.php?csid=238"&gt;this example for benzoic acid&lt;/a&gt;.  If experimental results exist they will appear on top and a link to obtain the predicted melting point will appear underneath.&lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://3.bp.blogspot.com/-iwxD4fUDpPo/TYkDpZ7G4-I/AAAAAAAAA-E/7k5KTOFm13o/s1600/predictedbenzoic.jpg"&gt;&lt;img style="display: block; margin: 0px auto 10px; text-align: center; cursor: pointer; width: 370px; height: 400px;" src="http://3.bp.blogspot.com/-iwxD4fUDpPo/TYkDpZ7G4-I/AAAAAAAAA-E/7k5KTOFm13o/s400/predictedbenzoic.jpg" alt="" id="BLOGGER_PHOTO_ID_5587000822598132706" border="0" /&gt;&lt;/a&gt;Note that the current web service for predicting melting points can be slow - it may take a minute to process.&lt;br /&gt;&lt;br /&gt;Additional web services for melting point data will be listed on the &lt;a href="http://onswebservices.wikispaces.com/meltingpoint"&gt;ONS web services wiki&lt;/a&gt;.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/14568785-3562128357493831131?l=usefulchem.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://usefulchem.blogspot.com/feeds/3562128357493831131/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=14568785&amp;postID=3562128357493831131' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/14568785/posts/default/3562128357493831131'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/14568785/posts/default/3562128357493831131'/><link rel='alternate' type='text/html' href='http://usefulchem.blogspot.com/2011/03/open-modeling-of-melting-point-data.html' title='Open modeling of melting point data'/><author><name>Jean-Claude Bradley</name><uri>http://www.blogger.com/profile/09157209601339686455</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='31' height='32' src='http://photos1.blogger.com/img/267/3447/320/headshot.jpg'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://3.bp.blogspot.com/--kpxYaEUALI/TYjWNgPWSxI/AAAAAAAAA98/ig6Ym-UwiEs/s72-c/observedpredicted.png' height='72' width='72'/><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-14568785.post-1319518207911794807</id><published>2011-03-04T20:03:00.000-05:00</published><updated>2011-03-04T20:03:19.786-05:00</updated><title type='text'>Validating Melting Point Data from Alfa Aesar, EPI and MDPI</title><content type='html'>I recently reported that &lt;a href="http://usefulchem.blogspot.com/2011/02/alfa-aesar-melting-point-data-now.html"&gt;Alfa Aesar publicly released their melting point dataset&lt;/a&gt; for us to use to take into account &lt;a href="http://usefulchem.blogspot.com/2011/02/predicting-temperature-dependent.html"&gt;temperature in solubility measurements&lt;/a&gt;.  Since then, &lt;a href="http://en.wikipedia.org/wiki/Andrew_S.I.D._Lang"&gt;Andrew Lang&lt;/a&gt;, &lt;a href="http://en.wikipedia.org/wiki/Antony_John_Williams"&gt;Antony Williams&lt;/a&gt; and I have had the opportunity to look into the details of this and other open melting point datasets. (See &lt;a href="http://onschallenge.wikispaces.com/Open+Melting+Point+Datasets"&gt;here&lt;/a&gt; for links and definitions of each dataset)&lt;br /&gt;&lt;br /&gt;An &lt;a href="http://onschallenge.wikispaces.com/Alfa+Aesar+and+Karthikeyan"&gt;initial evaluation&lt;/a&gt; by Andy found that the Alfa Aesar collection yielded better correlations with selected molecular descriptors compared to the &lt;a href="http://dx.doi.org/10.1021/ci0500132"&gt;Karthikeyan dataset&lt;/a&gt;  (originally from &lt;a href="http://www.mdpi.org/"&gt;MDPI&lt;/a&gt;), an open collection of melting points used by several researchers to provide predictive melting point models. This suggested that the quality of the Alfa Aesar dataset might be higher.&lt;br /&gt;&lt;br /&gt;Inspection of the Karthikeyan dataset did reveal some anomalies that may account for the poor correlations.  First there were several duplicates - identical compounds with different melting points, sometimes radically different (&lt;span style="font-weight: bold;"&gt;up to 176 C&lt;/span&gt;). A total of 33 duplicates (66 measurements) were found with a difference in melting points greater than 10 C.(&lt;a href="http://onschallenge.wikispaces.com/Open+Melting+Point+Datasets"&gt;see ONSMP008 dataset&lt;/a&gt;)  Here are some examples.&lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://4.bp.blogspot.com/-tK5-vm6qEaQ/TW_5tlpANlI/AAAAAAAAA88/kXGlxI4z6lg/s1600/KarthDup1.jpg"&gt;&lt;img style="display: block; margin: 0px auto 10px; text-align: center; cursor: pointer; width: 400px; height: 228px;" src="http://4.bp.blogspot.com/-tK5-vm6qEaQ/TW_5tlpANlI/AAAAAAAAA88/kXGlxI4z6lg/s400/KarthDup1.jpg" alt="" id="BLOGGER_PHOTO_ID_5579953024928855634" border="0" /&gt;&lt;/a&gt;&lt;br /&gt;A second problem we ran into involved difficulty processing the SMILES in the Karthikeyan collection.  Most of these involved SO2 groups.  An attempt to view this SMILES string in &lt;a href="http://download.cnet.com/ACD-ChemSketch-Freeware/3000-2054_4-10591465.html"&gt;ChemSketch&lt;/a&gt; ends up with two extra hydrogens on the sulfur.&lt;br /&gt;&lt;blockquote style="font-weight: bold;"&gt;[S+2]([O-])([O-])(OCC#N)c1ccc(C)cc1&lt;/blockquote&gt;Other SMILES strings render with 5 bonds on a carbon and ChemSketch draws these with a red X on the problematic atom.  See for example this SMILES string:&lt;br /&gt;&lt;blockquote style="font-weight: bold;"&gt;O=C(OC=1=C2C=CC=CC2=NC=1c1ccccc1)C&lt;/blockquote&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://2.bp.blogspot.com/-bUSpQl1qp2w/TXFNqVBpZ4I/AAAAAAAAA9c/q79-dg0iep8/s1600/BadSMILES1.jpg"&gt;&lt;img style="display: block; margin: 0px auto 10px; text-align: center; cursor: pointer; width: 400px; height: 240px;" src="http://2.bp.blogspot.com/-bUSpQl1qp2w/TXFNqVBpZ4I/AAAAAAAAA9c/q79-dg0iep8/s400/BadSMILES1.jpg" alt="" id="BLOGGER_PHOTO_ID_5580326802882652034" border="0" /&gt;&lt;/a&gt;&lt;br /&gt;Note that the sulfur compounds appear to render correctly on &lt;a href="http://www.daylight.com/daycgi_tutorials/depictmatch.cgi"&gt;Daylight's Depict site&lt;/a&gt;:&lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://4.bp.blogspot.com/-SnEzgde01y4/TW__9cLZc4I/AAAAAAAAA9M/29bDDcUxzxQ/s1600/BadSMILES2.jpg"&gt;&lt;img style="display: block; margin: 0px auto 10px; text-align: center; cursor: pointer; width: 278px; height: 176px;" src="http://4.bp.blogspot.com/-SnEzgde01y4/TW__9cLZc4I/AAAAAAAAA9M/29bDDcUxzxQ/s400/BadSMILES2.jpg" alt="" id="BLOGGER_PHOTO_ID_5579959894336435074" border="0" /&gt;&lt;/a&gt;In total 311 problematic SMILES from the Karthikeyan collection were removed (see &lt;a href="http://onschallenge.wikispaces.com/Open+Melting+Point+Datasets"&gt;ONSMP009&lt;/a&gt;).&lt;br /&gt;&lt;br /&gt;With the accumulation of melting point sources, overlapping coverage is revealing likely incorrect values.  For example, &lt;a href="http://lxsrv7.oru.edu/%7Ealang/meltingpoints/meltingpointof.php?csid=10181341"&gt;5 measurements are reported for phenylacetic acid&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://1.bp.blogspot.com/-tKsBoAHbDao/TXFOXf-wSrI/AAAAAAAAA9k/SbEnH1QIF54/s1600/phenylaceticmp1.jpg"&gt;&lt;img style="display: block; margin: 0px auto 10px; text-align: center; cursor: pointer; width: 400px; height: 369px;" src="http://1.bp.blogspot.com/-tKsBoAHbDao/TXFOXf-wSrI/AAAAAAAAA9k/SbEnH1QIF54/s400/phenylaceticmp1.jpg" alt="" id="BLOGGER_PHOTO_ID_5580327578917423794" border="0" /&gt;&lt;/a&gt;Four of the values cluster very close to 77 C and the other - from the Karthikeyan dataset - is clearly an outlier at 150 C.&lt;br /&gt;&lt;br /&gt;In order to predict the temperature dependence for the solutes in our database, Andy collected the &lt;a href="http://www.epa.gov/opptintr/exposure/pubs/episuite.htm"&gt;EPI&lt;/a&gt; experimental melting points, which are listed under the predicted properties tab in &lt;a href="http://www.chemspider.com/"&gt;ChemSpider&lt;/a&gt; (ultimately from the &lt;a href="http://www.epa.gov/opptintr/exposure/pubs/episuite.htm"&gt;EPA&lt;/a&gt;). (There are predicted EPI values there but we only used the ones marked &lt;span style="font-weight: bold;"&gt;exp&lt;/span&gt;).&lt;br /&gt;&lt;br /&gt;This collection of 150 compounds was then listed in a spreadsheet (&lt;a href="http://onschallenge.wikispaces.com/Open+Melting+Point+Datasets"&gt;ONSMP010&lt;/a&gt;) and each entry was marked as having only an EPI value (44 compounds) or having at least one other measurement from another source (106 compounds).  Out of those having at least one more value, 10 reported significant differences (&gt; 5C) between the measurements.  Upon investigation, many of these point strongly to the error lying with the EPI dataset.  For example, the EPI &lt;a href="http://www.blogger.com/href=" edu="" 7ealang="" meltingpoints="" csid="8058&amp;quot;"&gt;melting point for phenyl salicylate&lt;/a&gt; is over 85 C higher than that reported by both Sigma-Aldrich and Alfa Aesar.&lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://4.bp.blogspot.com/-3XaZ5Epe-vY/TXF4XREbspI/AAAAAAAAA9s/RpYJwvaJF9E/s1600/phensalmp1.jpg"&gt;&lt;img style="display: block; margin: 0px auto 10px; text-align: center; cursor: pointer; width: 400px; height: 307px;" src="http://4.bp.blogspot.com/-3XaZ5Epe-vY/TXF4XREbspI/AAAAAAAAA9s/RpYJwvaJF9E/s400/phensalmp1.jpg" alt="" id="BLOGGER_PHOTO_ID_5580373754403074706" border="0" /&gt;&lt;/a&gt;&lt;br /&gt;These preliminary results suggest that as much as 10% of the EPI experimental melting point dataset is significantly in error.  Only a systematic analysis over time will reveal the full extent of the deficiencies.&lt;br /&gt;&lt;br /&gt;So far the Alfa Aesar dataset has not produced many outliers, when other sources are available for comparison.  However, even here, there are some surprising results.  One of the most well studied organic compounds - &lt;a href="http://lxsrv7.oru.edu/%7Ealang/meltingpoints/meltingpointof.php?csid=682"&gt;ethanol - is listed with a melting point&lt;/a&gt; of -130 C by Alfa Aesar, clearly an outlier from the other values clustered around -114 C.&lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://1.bp.blogspot.com/-XbhrNd5u79M/TXF89426IaI/AAAAAAAAA90/yTEhob_3KZg/s1600/ethanolmp1.jpg"&gt;&lt;img style="display: block; margin: 0px auto 10px; text-align: center; cursor: pointer; width: 366px; height: 400px;" src="http://1.bp.blogspot.com/-XbhrNd5u79M/TXF89426IaI/AAAAAAAAA90/yTEhob_3KZg/s400/ethanolmp1.jpg" alt="" id="BLOGGER_PHOTO_ID_5580378815965307298" border="0" /&gt;&lt;/a&gt;When downloading the Karthikeyan dataset from &lt;a href="http://www.cheminformatics.org/"&gt;Cheminformatics.org&lt;/a&gt;, a Trust Level field indicates: "High - Original Author Data". &lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://3.bp.blogspot.com/-bxOyJUICcc4/TXEVnZSG25I/AAAAAAAAA9U/BSnbt17Olok/s1600/trust1.jpg"&gt;&lt;img style="display: block; margin: 0px auto 10px; text-align: center; cursor: pointer; width: 400px; height: 83px;" src="http://3.bp.blogspot.com/-bxOyJUICcc4/TXEVnZSG25I/AAAAAAAAA9U/BSnbt17Olok/s400/trust1.jpg" alt="" id="BLOGGER_PHOTO_ID_5580265179834669970" border="0" /&gt;&lt;/a&gt;It would be nice if it were that simple.  Unfortunately there are no shortcuts. There is no place for trust in science.  The best we can do is to collect several measurements from truly independent sources and look for consensus over time.  Where consensus is not obvious and information sources are exhausted, performing new measurements will be the only option left to progress.&lt;br /&gt;&lt;br /&gt;The idea that a dataset has been validated - and can be trusted completely - simply because it is attached to a peer-reviewed paper is a dangerous one.  This is perhaps the rationale used by projects such as &lt;a href="http://datadryad.org/"&gt;Dryad&lt;/a&gt;, where datasets are not accepted unless they are associated with a peer-reviewed paper.  Peer review was not designed to validate datasets - even if we wanted it to, reviewers don't typically have access to enough information to do so.&lt;br /&gt;&lt;br /&gt;The usefulness of a measurement is related much more to the details in the raw data provided by following the chain of provenance (when available) than it is in where it is published.  To be fair, in the case of melting point measurements, there really isn't that much additional experimental information to provide, except perhaps an NMR of the sample to prove that it was completely dry.  In such a case, we have no choice but to use redundancy until a consensus number is finally reached.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/14568785-1319518207911794807?l=usefulchem.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://usefulchem.blogspot.com/feeds/1319518207911794807/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=14568785&amp;postID=1319518207911794807' title='2 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/14568785/posts/default/1319518207911794807'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/14568785/posts/default/1319518207911794807'/><link rel='alternate' type='text/html' href='http://usefulchem.blogspot.com/2011/03/validating-melting-point-data-from-alfa.html' title='Validating Melting Point Data from Alfa Aesar, EPI and MDPI'/><author><name>Jean-Claude Bradley</name><uri>http://www.blogger.com/profile/09157209601339686455</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='31' height='32' src='http://photos1.blogger.com/img/267/3447/320/headshot.jpg'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://4.bp.blogspot.com/-tK5-vm6qEaQ/TW_5tlpANlI/AAAAAAAAA88/kXGlxI4z6lg/s72-c/KarthDup1.jpg' height='72' width='72'/><thr:total>2</thr:total></entry><entry><id>tag:blogger.com,1999:blog-14568785.post-6677994057004332009</id><published>2011-02-26T15:32:00.000-05:00</published><updated>2011-02-26T15:33:01.702-05:00</updated><title type='text'>ONS Solubility Challenge Book cited in a Langmuir nanotechnology paper</title><content type='html'>An interesting application of the data from the &lt;a href="http://onschallenge.wikispaces.com/"&gt;Open Notebook Science Solubility Challenge&lt;/a&gt; has recently been reported in Langmuir: "&lt;span style="font-weight: bold;"&gt;Enhanced Ordering in Gold Nanoparticles Self-Assembly through Excess Free Ligands&lt;/span&gt;" &lt;span style="font-style: italic;"&gt;by Cindy Y. Lau, Huigao Duan, Fuke Wang, Chao Bin He, Hong Yee Low and Joel K. W. Yang&lt;/span&gt; (&lt;a href="http://pubs.acs.org/doi/abs/10.1021/la104786z"&gt;Feb 24, 2011&lt;/a&gt;).&lt;br /&gt;&lt;br /&gt;The context is as follows, and the reference is to &lt;a href="http://onsbooks.wikispaces.com/ONS+Solubility+Challenge"&gt;Edition 3 of the ONS Solubility Challenge Book&lt;/a&gt;.&lt;br /&gt;&lt;blockquote&gt;Although to our best knowledge there lacks literature value of OA solubility in the two solvents, the 10-fold better solubility of 1-otadecylamine (&lt;span style="font-style: italic;"&gt;sic&lt;/span&gt;), the saturated version of oleylamine, in toluene than hexane is in line with our hypothesis.(33) This increased solubility caused the OA molecules that were originally attached to the AuNPs to gradually detach from the AuNPs, which is supported by our observations in poor AuNP stability and surface-pressure isotherms.&lt;/blockquote&gt;This is a nice application of solubility to understand and control the behavior of gold nanoparticles.  It is in line with some of the applications I &lt;a href="http://usefulchem.blogspot.com/2010/11/nanoinformatics-2010-conference-report.html"&gt;discussed at a recent Nanoinformatics conference&lt;/a&gt;, where I think there is a place for the interlinking of information between solubility and nanotechnology databases.&lt;br /&gt;&lt;br /&gt;I have to admit that it is somewhat ironic to see this citation in Langmuir, given the controversy about a year ago (&lt;a href="http://usefulchem.blogspot.com/2010/04/bipolar-electrodeposition-of-cds.html"&gt;post&lt;/a&gt; and &lt;a href="http://ff.im/ixztk"&gt;FF discussion&lt;/a&gt;) regarding the citation of non-traditional literature.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/14568785-6677994057004332009?l=usefulchem.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://usefulchem.blogspot.com/feeds/6677994057004332009/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=14568785&amp;postID=6677994057004332009' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/14568785/posts/default/6677994057004332009'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/14568785/posts/default/6677994057004332009'/><link rel='alternate' type='text/html' href='http://usefulchem.blogspot.com/2011/02/ons-solubility-challenge-book-cited-in.html' title='ONS Solubility Challenge Book cited in a Langmuir nanotechnology paper'/><author><name>Jean-Claude Bradley</name><uri>http://www.blogger.com/profile/09157209601339686455</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='31' height='32' src='http://photos1.blogger.com/img/267/3447/320/headshot.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-14568785.post-5236971753103038889</id><published>2011-02-21T21:01:00.000-05:00</published><updated>2011-02-21T21:02:08.947-05:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='open data'/><category scheme='http://www.blogger.com/atom/ns#' term='melting point'/><title type='text'>Alfa Aesar melting point data now openly available</title><content type='html'>A few weeks ago, &lt;a href="http://www.linkedin.com/pub/john-shirley/16/690/b46"&gt;John Shirley&lt;/a&gt; - Global Marketing Manager at &lt;a href="http://www.alfa.com/"&gt;Alfa Aesar&lt;/a&gt; - contacted me to discuss the &lt;a href="http://usefulchem.blogspot.com/2011/01/chemical-information-validation-results.html"&gt;Chemical Information Validation results I posted&lt;/a&gt; from my 2010 Chemical Information Retrieval class.  Our research showed that Alfa Aesar was the second most common source of chemical property information from the class assignment.&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://3.bp.blogspot.com/-tZf9Xi-_oOA/TWLN5MqGF1I/AAAAAAAAA8k/QE7Qv6cJvM4/s1600/cheminfosources.jpg"&gt;&lt;img style="display: block; margin: 0px auto 10px; text-align: center; cursor: pointer; width: 313px; height: 286px;" src="http://3.bp.blogspot.com/-tZf9Xi-_oOA/TWLN5MqGF1I/AAAAAAAAA8k/QE7Qv6cJvM4/s400/cheminfosources.jpg" alt="" id="BLOGGER_PHOTO_ID_5576245671172118354" border="0" /&gt;&lt;/a&gt;We explored some possible ways that we could collaborate.  With our recent report of the use of &lt;a href="http://usefulchem.blogspot.com/2011/02/predicting-temperature-dependent.html"&gt;melting point measurements to predict temperature solubility curves&lt;/a&gt;, the Alfa Aesar melting point data collection could prove immensely useful for our &lt;a href="http://onschallenge.wikispaces.com/"&gt;Open Notebook Science solubility project&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;However, since we are committed to working transparently, the only way we could accept the dataset is if it were shared as Open Data.  I am extremely pleased to report that Alfa Aesar has agreed to this requirement and we hope that this gesture will encourage other chemical companies to follow suit.&lt;br /&gt;&lt;br /&gt;The initial file provided by Alfa Aesar did not store melting points in a database ready format - it included ranges, non-numeric characters and entries reporting decomposition or sublimation.  One of benefits we could provide back to the company was cleaning up the melting point field to pure numerical values ready for sorting and other database processing.  This processed &lt;a href="http://lxsrv7.oru.edu/%7Ealang/meltingpoints/AlfaAesar_AllSimpleMPSingleNumbers.xlsx"&gt;collection contains 12986 entries&lt;/a&gt;.  Note that these entries are not necessarily different chemical compositions since they refer to specific catalog entries with different purities or packaging.&lt;br /&gt;&lt;br /&gt;For our purposes of prioritizing organic chemicals for solubility modeling and applications we curated this initial dataset by collapsing redundant chemical compositions and excluded inorganics (including organometallics) and salts. We did retain organosilicon, organophosphorus and organoboron compounds.  Because the primary key for all of our projects depend on ChemSpiderIDs, all compounds were assigned CSIDs by deposition in the ChemSpider database if necessary.  SMILES were also provided for each entry, as well as a corresponding link to the Alfa Aesar catalog page.  This &lt;a href="http://lxsrv7.oru.edu/%7Ealang/meltingpoints/AlfaAesarCurated.xlsx"&gt;curated collection contains 8739 entries&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;For completeness, we thought it would be useful to merge the Alfa Aesar curated dataset with other collections for convenient federated searches.  We thus added the &lt;a href="http://dx.doi.org/10.1021/ci0500132"&gt;Karthikeyan melting point dataset&lt;/a&gt;, which has been used in several cases to model melting point predictions.  This &lt;a href="http://lxsrv7.oru.edu/%7Ealang/meltingpoints/4173_185_92_DataSetMTPSMIDescr.zip"&gt;dataset&lt;/a&gt; was downloaded from &lt;a href="http://www.cheminformatics.org/"&gt;Cheminformatics.org&lt;/a&gt;. Although we were able to use most of the structures in that collection, a few hundred were left out because of some difficulty in resolving some of the SMILES, perhaps related to the differences in algorithms used by OpenBabel and OpenEye.  Hopefully this issue will be resolved in a simple way and the whole dataset can be incorporated in the near future.  This &lt;a href="http://lxsrv7.oru.edu/%7Ealang/meltingpoints/KarthikeyanCurated.xlsx"&gt;final curated collection contains 4084 entries&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;Similarly the smaller &lt;a href="http://dx.doi.org/10.1021/ci020280x"&gt;Bergstrom dataset&lt;/a&gt; was included after processing the &lt;a href="http://lxsrv7.oru.edu/%7Ealang/meltingpoints/SMILES_substances_JCICS2003_mtp.xls"&gt;original file&lt;/a&gt; to a &lt;a href="http://lxsrv7.oru.edu/%7Ealang/meltingpoints/BergstromCurated.xls"&gt;curated collection of 277 drug molecules&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;Finally, the melting point entries from the&lt;a href="https://spreadsheets.google.com/ccc?key=0Ak1R8T6wt4YQdDdNd0wzRVBQdWRMSWpxZjd3X2QtVUE#gid=0"&gt; ChemInfo Validation sheet&lt;/a&gt; itself, generated by student contributions, is added to amount to a collection of currently &lt;span style="font-weight: bold;"&gt;13,436 Open Data melting point values&lt;/span&gt;.  We believe that this is currently the largest such collection and that it should facilitate the development of completely transparent and free models for the prediction of melting points.  As we have &lt;a href="http://usefulchem.blogspot.com/2011/02/predicting-temperature-dependent.html"&gt;argued recently&lt;/a&gt;, improved access to measured or predicted melting points is critical to the prediction of the temperature dependence of solubility.&lt;br /&gt;&lt;br /&gt;In addition to providing the melting point data in tabular format, Andrew Lang has created a convenient&lt;a href="http://lxsrv7.oru.edu/%7Ealang/meltingpoints/"&gt; web based tool to explore the combined dataset&lt;/a&gt;. A drop down menu at the top allows quick access to a specific compound and reports the average melting point as well as a link to the information source.  In the case of an Alfa Aesar source, a link to the catalog is provided, where the compound can be conveniently ordered if desired.&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://1.bp.blogspot.com/-hFHmn-hd1O8/TWMQDL8zIYI/AAAAAAAAA8s/U7D8yMnmiXo/s1600/mpexp1.jpg"&gt;&lt;img style="display: block; margin: 0px auto 10px; text-align: center; cursor: pointer; width: 400px; height: 260px;" src="http://1.bp.blogspot.com/-hFHmn-hd1O8/TWMQDL8zIYI/AAAAAAAAA8s/U7D8yMnmiXo/s400/mpexp1.jpg" alt="" id="BLOGGER_PHOTO_ID_5576318410548257154" border="0" /&gt;&lt;/a&gt;In another type of search, a SMARTS string can be entered with an optional range limit for the melting points.  In the following example 14 hits are obtained for benzoic acid derivatives with melting points between 0C and 25C.  Clicking on an image will reveal its source. (BTW even if you don't know how to perform sophisticated SMARTS queries, simply looking up the SMILES for a substructure on &lt;a href="http://www.chemspider.com"&gt;ChemSpider&lt;/a&gt; or ChemSketch will likely be sufficient for most types of queries).&lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://1.bp.blogspot.com/-tHaNrJ_a-dI/TWMSjWYK_GI/AAAAAAAAA80/qa-a--WpnTQ/s1600/mpexp2.jpg"&gt;&lt;img style="display: block; margin: 0px auto 10px; text-align: center; cursor: pointer; width: 400px; height: 268px;" src="http://1.bp.blogspot.com/-tHaNrJ_a-dI/TWMSjWYK_GI/AAAAAAAAA80/qa-a--WpnTQ/s400/mpexp2.jpg" alt="" id="BLOGGER_PHOTO_ID_5576321162126490722" border="0" /&gt;&lt;/a&gt;Preliminary tests on a Droid smartphone indicate that these search capabilities work quite well.&lt;br /&gt;&lt;br /&gt;Finally, I would like to thank &lt;a href="http://en.wikipedia.org/wiki/Antony_John_Williams"&gt;Antony Williams&lt;/a&gt;, &lt;a href="http://en.wikipedia.org/wiki/Andrew_S.I.D._Lang"&gt;Andrew Lang&lt;/a&gt; and the people at &lt;a href="http://www.alfa.com/"&gt;Alfa Aesar&lt;/a&gt; (now added as an &lt;a href="http://onschallenge.wikispaces.com/"&gt;official sponsor&lt;/a&gt;) who contributed many hours to collecting, curating and coding for the final product we are presenting here.  We hope that this will be of value to the researchers in the cheminformatics community for a variety of open projects where melting points play a role.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/14568785-5236971753103038889?l=usefulchem.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://usefulchem.blogspot.com/feeds/5236971753103038889/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=14568785&amp;postID=5236971753103038889' title='5 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/14568785/posts/default/5236971753103038889'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/14568785/posts/default/5236971753103038889'/><link rel='alternate' type='text/html' href='http://usefulchem.blogspot.com/2011/02/alfa-aesar-melting-point-data-now.html' title='Alfa Aesar melting point data now openly available'/><author><name>Jean-Claude Bradley</name><uri>http://www.blogger.com/profile/09157209601339686455</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='31' height='32' src='http://photos1.blogger.com/img/267/3447/320/headshot.jpg'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://3.bp.blogspot.com/-tZf9Xi-_oOA/TWLN5MqGF1I/AAAAAAAAA8k/QE7Qv6cJvM4/s72-c/cheminfosources.jpg' height='72' width='72'/><thr:total>5</thr:total></entry><entry><id>tag:blogger.com,1999:blog-14568785.post-4251011229296338492</id><published>2011-02-13T19:56:00.002-05:00</published><updated>2011-02-13T20:02:02.564-05:00</updated><title type='text'>Predicting temperature-dependent solubility for solvent selection</title><content type='html'>During the summer of 2010 I &lt;a href="http://usefulchem.blogspot.com/2010/08/reaction-attempts-solvent-selector.html"&gt;reported&lt;/a&gt; on the &lt;a href="http://onswebservices.wikispaces.com/reactions"&gt;Solvent Selector&lt;/a&gt; web service that &lt;a href="http://en.wikipedia.org/wiki/Andrew_S.I.D._Lang"&gt;Andrew Lang&lt;/a&gt; and I constructed.  The idea was to flag potential solvents with a high solubility for the reactants and a low solubility for the product, so that the work-up would require a simple filtration.&lt;br /&gt;&lt;br /&gt;If available, the &lt;a href="http://onswebservices.wikispaces.com/reactions"&gt;Solvent Selector service&lt;/a&gt; uses measured solubility values.  If not available, it attempts to predict the room temperature solubility using one of two models based on Abraham descriptors.&lt;br /&gt;&lt;br /&gt;We now have modified the Solvent Selector so that it takes into account temperature.  Andrew has inserted a thermometer icon next to each solvent in the report.  When clicked, a plot is displayed over the entire range of temperatures where the solvent is a liquid. Curves for each starting material and the product are provided - and hovering over a data point provides numerical values.&lt;br /&gt;&lt;br /&gt;There are several ways this resource could be used by chemists.&lt;br /&gt;&lt;br /&gt;For reactions where some starting materials are not soluble enough at room temperature, the reaction could be carried out at a higher temperature. A higher temperature might also be desirable simply to speed up the reaction. Being able to predict the solubility of the product at that higher temperature would allow the course of the reaction to be monitored by the appearance of a precipitate.&lt;br /&gt;&lt;br /&gt;For reactions where the solubility of the product is too high at room temperature, the curves could be used to estimate how low one could cool the reaction mixture without any chance that one of the starting materials would precipitate out.  For example, consider the following Ugi reaction.&lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://1.bp.blogspot.com/-0rmk_chXTUs/TVhLBu4E6yI/AAAAAAAAA8U/MPwIwErCS_M/s1600/solsel3.gif"&gt;&lt;img style="display: block; margin: 0px auto 10px; text-align: center; cursor: pointer; width: 400px; height: 141px;" src="http://1.bp.blogspot.com/-0rmk_chXTUs/TVhLBu4E6yI/AAAAAAAAA8U/MPwIwErCS_M/s400/solsel3.gif" alt="" id="BLOGGER_PHOTO_ID_5573287032006175522" border="0" /&gt;&lt;/a&gt;&lt;br /&gt;An optimization study was performed and found that methanol and ethanol provided much better yields than THF.(see &lt;a href="http://www.jove.com/details.stp?ID=942"&gt;JoVE article&lt;/a&gt;)  This makes sense from a solubility standpoint, where the &lt;a href="http://www.chemspider.com/Chemical-Structure.21106059.html"&gt;Ugi product&lt;/a&gt; room temperature solubility is less than 0.05 M for methanol and ethanol but is 0.26 M for THF &lt;a href="http://showme.physics.drexel.edu/onsc/models/solventselector.php?csids=70660%7C21106059"&gt;Solvent Selector results&lt;/a&gt;)&lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://3.bp.blogspot.com/-zspvTlrmgOQ/TVhMu0GpeHI/AAAAAAAAA8c/JHkN9XxPBk8/s1600/solsel2.jpg"&gt;&lt;img style="display: block; margin: 0px auto 10px; text-align: center; cursor: pointer; width: 400px; height: 298px;" src="http://3.bp.blogspot.com/-zspvTlrmgOQ/TVhMu0GpeHI/AAAAAAAAA8c/JHkN9XxPBk8/s400/solsel2.jpg" alt="" id="BLOGGER_PHOTO_ID_5573288906015209586" border="0" /&gt;&lt;/a&gt;&lt;br /&gt;However, by clicking on the thermometer icon for the THF entry, one gets the following &lt;a href="http://lxsrv7.oru.edu/%7Ealang/solubility/soltemp.php?csids=70660%7C21106059&amp;amp;cs=3.716%7C0.258&amp;amp;solvent=THF&amp;amp;solmp=-108.4&amp;amp;solbp=68.3&amp;amp;limreact=0.3&amp;amp;limprod=0.03"&gt;temperature curves for solubility&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://1.bp.blogspot.com/-YOyJ2yzJ_7s/TVhJsviAp8I/AAAAAAAAA8E/smfQVfsSQis/s1600/solsel1.jpg"&gt;&lt;img style="display: block; margin: 0px auto 10px; text-align: center; cursor: pointer; width: 400px; height: 266px;" src="http://1.bp.blogspot.com/-YOyJ2yzJ_7s/TVhJsviAp8I/AAAAAAAAA8E/smfQVfsSQis/s400/solsel1.jpg" alt="" id="BLOGGER_PHOTO_ID_5573285571893176258" border="0" /&gt;&lt;/a&gt;By hovering over the curve for the Ugi product we find that the predicted solubility in THF at -78C (conveniently a &lt;a href="http://en.wikipedia.org/wiki/Cooling_bath"&gt;dry ice in acetone bath&lt;/a&gt;) is 0.01M, while that for the starting material &lt;a href="http://www.chemspider.com/Chemical-Structure.70660.html"&gt;boc-glycine&lt;/a&gt; is 0.65 M, well above the 0.5 M concentration at the start of the reaction.  This means that even if the reaction did not take place to a significant extent, we would not expect the starting material to precipitate at -78C.  Any precipitate should be the pure product.&lt;br /&gt;&lt;br /&gt;Of course another obvious application is for solvent selection for re-crystallization.&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;How it works&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;For some time now we have been collecting literature on the temperature dependence of solubility in various systems (&lt;a href="http://showme.physics.drexel.edu/onsc/mendeley/?tags=temperature+dependence"&gt;live Mendeley collection here&lt;/a&gt; - be patient it might take a minute to display).  Although equations vary depending on the specific approach there seem to be the following commonalities:&lt;br /&gt;&lt;ol&gt;&lt;li&gt;An assumption is made that miscibility is reached at the melting point of the solute.&lt;/li&gt;&lt;li&gt;The log of the solubility is linearly proportional to the inverse of the temperature in Kelvin.&lt;/li&gt;&lt;/ol&gt;I have looked into a few examples that we have of solubility over a temperature range and the above do seem to hold.  This means that with only a room temperature solubility and a melting point for a given solute, the solubility at any temperature can be interpolated or extrapolated. (The concentration at the miscibility point is calculated from the predicted density of the solute divided by its molecular weight).&lt;br /&gt;&lt;br /&gt;Although there are situations where two liquids are not miscible because of extreme dissimilarity (e.g. methanol and hexane), for the most part our experience shows that the first assumption is valid. Also, we are not correcting for changes in density for the solute or solvent at different temperatures.  Nevertheless, when only a single solubility measurement in a given solvent and a melting point are known, this simple model may prove to be of use as a rough guide for reaction design or re-crystallization. We'll report on its practicality over time as we put it to use.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/14568785-4251011229296338492?l=usefulchem.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://usefulchem.blogspot.com/feeds/4251011229296338492/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=14568785&amp;postID=4251011229296338492' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/14568785/posts/default/4251011229296338492'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/14568785/posts/default/4251011229296338492'/><link rel='alternate' type='text/html' href='http://usefulchem.blogspot.com/2011/02/predicting-temperature-dependent.html' title='Predicting temperature-dependent solubility for solvent selection'/><author><name>Jean-Claude Bradley</name><uri>http://www.blogger.com/profile/09157209601339686455</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='31' height='32' src='http://photos1.blogger.com/img/267/3447/320/headshot.jpg'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://1.bp.blogspot.com/-0rmk_chXTUs/TVhLBu4E6yI/AAAAAAAAA8U/MPwIwErCS_M/s72-c/solsel3.gif' height='72' width='72'/><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-14568785.post-1590126160790329932</id><published>2011-02-10T11:56:00.000-05:00</published><updated>2011-02-10T11:56:13.231-05:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='ChemDoodle'/><category scheme='http://www.blogger.com/atom/ns#' term='spectral game'/><category scheme='http://www.blogger.com/atom/ns#' term='NMR'/><title type='text'>The Spectral Game with ChemDoodle</title><content type='html'>In the summer of 2009, we published an &lt;a href="http://www.jcheminf.com/content/1/1/9"&gt;article on the Spectral Game&lt;/a&gt;.  This game is based on spectra uploaded as Open Data (in &lt;a href="http://www.acornnmr.com/JCAMP.htm"&gt;JCAMP-DX&lt;/a&gt; format) on &lt;a href="http://chemspider.com/"&gt;ChemSpider&lt;/a&gt; (currently about 2000 H NMRs and a few C NMRs, IRs and NIRs).  Students get points by clicking on the molecule associated with the spectrum on display.&lt;br /&gt;&lt;br /&gt;Although this has proved to be a useful tool to teach spectroscopy (especially H NMR), there have been some limitations, which are related to the use of Java (&lt;a href="http://jspecview.sourceforge.net/"&gt;JSpecView&lt;/a&gt;) to provide an interactive display of the spectra.&lt;br /&gt;&lt;br /&gt;1) Spectra do not display properly on Macs - there are problems with the "right-click" options in JSpecView.  It took me a really long time to understand why some of my friends were really unimpressed by JSpecView.  When I recognized that they were all Mac users I took a look and it became clear.&lt;br /&gt;&lt;br /&gt;2) Spectra do not display at all on smartphones because of the Java components&lt;br /&gt;&lt;br /&gt;I am very happy to report that these issues have been overcome (for the most part) using &lt;a href="http://www.chemdoodle.com/"&gt;ChemDoodle&lt;/a&gt;.  Through a collaborative effort between &lt;a href="http://www.ichemlabs.com/management"&gt;Kevin Theisen&lt;/a&gt;, &lt;a href="http://en.wikipedia.org/wiki/Andrew_S.I.D._Lang"&gt;Andrew Lang&lt;/a&gt;, &lt;a href="http://www.linkedin.com/in/antonywilliams"&gt;Antony Williams&lt;/a&gt; and myself, we now have a non-Java based version of the Spectral Game at &lt;a href="http://spectralgame.com/"&gt;SpectralGame.com&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://1.bp.blogspot.com/-FTk9aiRLgNE/TVQNt0Xn8lI/AAAAAAAAA70/4N96M3PE808/s1600/spdoodle1.jpg"&gt;&lt;img style="display: block; margin: 0px auto 10px; text-align: center; cursor: pointer; width: 400px; height: 300px;" src="http://1.bp.blogspot.com/-FTk9aiRLgNE/TVQNt0Xn8lI/AAAAAAAAA70/4N96M3PE808/s400/spdoodle1.jpg" alt="" id="BLOGGER_PHOTO_ID_5572093719767151186" border="0" /&gt;&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://1.bp.blogspot.com/-rnegpedliIE/TVQU96d3N4I/AAAAAAAAA78/IQrFW37SC2s/s1600/sgdoodle2.jpg"&gt;&lt;img style="display: block; margin: 0px auto 10px; text-align: center; cursor: pointer; width: 400px; height: 299px;" src="http://1.bp.blogspot.com/-rnegpedliIE/TVQU96d3N4I/AAAAAAAAA78/IQrFW37SC2s/s400/sgdoodle2.jpg" alt="" id="BLOGGER_PHOTO_ID_5572101692863231874" border="0" /&gt;&lt;/a&gt;The game plays well on Mac, &lt;a href="http://ff.im/xBDFM"&gt;iPhone and iPad&lt;/a&gt;.  However I have seen it fail on 2 Androids so there are still a few kinks to work out.  Luckily I happen to be teaching NMR right now in my &lt;a href="http://chem242.wikispaces.com/content"&gt;organic chemistry course&lt;/a&gt; so my students will be testing out the ChemDoodle version extensively.&lt;br /&gt;&lt;br /&gt;There are some really nice additional features as well.  My favorite is the auto-scaling of the integration line when zooming in.  In the JSpecView version, integration is problematic because, when zooming into high field peaks, the start of the integration line does not reset to zero and this requires several iterations of changing the integration offset to get a usable measurement.&lt;br /&gt;&lt;br /&gt;Another advantage in the ChemDoodle design is the simplicity of the interface.  There are no right-click options: everything available is clearly labeled at all times (toggle integration, reset spectrum and view header information).  This makes the game easier to learn and play.&lt;br /&gt;&lt;br /&gt;I would especially like to thank Kevin Theisen for being so responsive on the ChemDoodle end.  I was skeptical that we would have a playable game for this term but he addressed all of our major issues very quickly.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/14568785-1590126160790329932?l=usefulchem.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://usefulchem.blogspot.com/feeds/1590126160790329932/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=14568785&amp;postID=1590126160790329932' title='1 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/14568785/posts/default/1590126160790329932'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/14568785/posts/default/1590126160790329932'/><link rel='alternate' type='text/html' href='http://usefulchem.blogspot.com/2011/02/spectral-game-with-chemdoodle.html' title='The Spectral Game with ChemDoodle'/><author><name>Jean-Claude Bradley</name><uri>http://www.blogger.com/profile/09157209601339686455</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='31' height='32' src='http://photos1.blogger.com/img/267/3447/320/headshot.jpg'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://1.bp.blogspot.com/-FTk9aiRLgNE/TVQNt0Xn8lI/AAAAAAAAA70/4N96M3PE808/s72-c/spdoodle1.jpg' height='72' width='72'/><thr:total>1</thr:total></entry><entry><id>tag:blogger.com,1999:blog-14568785.post-5796048120230512599</id><published>2011-01-17T15:59:00.001-05:00</published><updated>2011-01-17T16:38:45.189-05:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='open notebook science'/><category scheme='http://www.blogger.com/atom/ns#' term='#scio11'/><category scheme='http://www.blogger.com/atom/ns#' term='scio11'/><title type='text'>Science Online 2011 Thoughts</title><content type='html'>On January 15, 2011 I co-moderated a &lt;a href="http://scio11.wikispaces.com/"&gt;Science Online 2011&lt;/a&gt; session on &lt;a href="http://en.wikipedia.org/wiki/Open_Notebook_Science"&gt;Open Notebook Science&lt;/a&gt; with &lt;a href="http://www.chemconnector.com/chemunicating/"&gt;Antony Williams&lt;/a&gt; and &lt;a href="http://www.carlboettiger.info/"&gt;Carl Boettiger&lt;/a&gt;.  The projector failed so we did our best to introduce the topic without relying on visual aids.  My main objective was to demonstrate that it is not necessary for researchers (or their machines) to interface with the actual lab notebook to benefit from the information generated from the work.  By introducing simple and rapid abstraction steps, both solubility and reaction information can be converted to &lt;a href="http://onswebservices.wikispaces.com/"&gt;web services&lt;/a&gt; for a variety of uses. As long as a link to the original lab notebook page (including the raw data) is attached, no information is lost and details can be investigated on demand.&lt;br /&gt;&lt;br /&gt;One of the most powerful tools to use in this context is the tracking of chemical entities as &lt;a href="http://www.chemspider.com/"&gt;ChemSpider&lt;/a&gt; IDs.  This enables direct access to many other web services which &lt;a href="http://en.wikipedia.org/wiki/Andrew_S.I.D._Lang"&gt;Andrew Lang&lt;/a&gt; and I have leveraged to generate our own services.  Tony spoke a bit more about this in his part and outlined some of the benefits and frustrations with crowdsourcing.  Carl spoke eloquently about his experiences with Open Notebook Science as a graduate student for computational projects.  The slides from all of us are provided below.&lt;br /&gt;&lt;div style="width: 425px;" id="__ss_6600687"&gt;&lt;strong style="display: block; margin: 12px 0pt 4px;"&gt;&lt;a href="http://www.slideshare.net/jcbradley/science-online-2011-ons-session" title="Science Online 2011 ONS session"&gt;Science Online 2011 ONS session&lt;/a&gt;&lt;/strong&gt;&lt;object id="__sse6600687" width="425" height="355"&gt;&lt;param name="movie" value="http://static.slidesharecdn.com/swf/ssplayer2.swf?doc=onsscio11-110117094031-phpapp02&amp;amp;stripped_title=science-online-2011-ons-session&amp;amp;userName=jcbradley"&gt;&lt;param name="allowFullScreen" value="true"&gt;&lt;param name="allowScriptAccess" value="always"&gt;&lt;embed name="__sse6600687" src="http://static.slidesharecdn.com/swf/ssplayer2.swf?doc=onsscio11-110117094031-phpapp02&amp;amp;stripped_title=science-online-2011-ons-session&amp;amp;userName=jcbradley" type="application/x-shockwave-flash" allowscriptaccess="always" allowfullscreen="true" width="425" height="355"&gt;&lt;/embed&gt;&lt;/object&gt;&lt;div style="padding: 5px 0pt 12px;"&gt;View more &lt;a href="http://www.slideshare.net/"&gt;presentations&lt;/a&gt; from &lt;a href="http://www.slideshare.net/jcbradley"&gt;Jean-Claude Bradley&lt;/a&gt;.&lt;/div&gt;&lt;/div&gt;&lt;br /&gt;The overall tone of the discussion during our session was quite positive and productive.  This was the case with all of the other sessions I attended, as it has been in prior years.  The Science Online conference has evolved to attract a large proportion of people advocating Open Science.  The presenters and the audience feel that they are among friends and the result is usually a free and easy exchange of ideas.  Not all conferences and symposia relating to the online aspects of science share this. I have seen many examples where the "online science" theme is overrun by Closed Science proponents, for example commercial databases or Electronic Laboratory Notebook (ELN) vendors.  Hopefully this conference will retain its Open Science focus in the future.&lt;br /&gt;&lt;br /&gt;&lt;a href="http://kaythaney.wordpress.com/"&gt;Kaitlin Thaney&lt;/a&gt; proved to be a very effective moderator during her session on "&lt;span style="font-weight: bold;"&gt;The Digital Toolbox: What's Needed?&lt;/span&gt;" and she stirred up some insightful discussion.  I also enjoyed &lt;a href="http://stevekochscience.blogspot.com/"&gt;Steve Koch&lt;/a&gt;'s session (co-moderated with &lt;a href="http://libraryadventures.com/"&gt;Kiyomi Deards&lt;/a&gt; and &lt;a href="http://cloud.lib.wfu.edu/blog/pd/"&gt;Molly Keener&lt;/a&gt;) on "&lt;span style="font-weight: bold;"&gt;Data Discoverability: Institutional Support Strategies&lt;/span&gt;".  Steve shared a particularly &lt;a href="http://arxiv.org/PS_cache/arxiv/pdf/1101/1101.2225v1.pdf"&gt;compelling example&lt;/a&gt; of the collaborative benefits of Open Notebook Science, where a computational research group came across images and videos from one of his group's notebooks and incorporated these in their paper - with all due credit acknowledged.&lt;br /&gt;&lt;br /&gt;I very much appreciated the opportunity to catch up with old friends and some new.  I had never met &lt;a href="http://www.carlboettiger.info/"&gt;Carl Boettiger&lt;/a&gt; in person before and we had some very interesting discussions about Open Science and Open Education.  It was good to meet &lt;a href="http://www.science3point0.com/"&gt;Mark Hahnel&lt;/a&gt; from &lt;a href="http://figshare.com/"&gt;FigShare&lt;/a&gt; and explore possible paths for data sharing.  I had some nice chats with &lt;a href="http://www.chemconnector.com/chemunicating/"&gt;Antony Williams&lt;/a&gt;, &lt;a href="http://stevekochscience.blogspot.com/"&gt;Steve Koch&lt;/a&gt;, &lt;a href="http://comporgchem.com/blog/"&gt;Steven Bachrach&lt;/a&gt;, &lt;a href="http://researchremix.org/"&gt;Heather Piwowar&lt;/a&gt; and &lt;a href="http://ananelson.com/"&gt;Ana Nelson&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;The Saturday evening banquet proved to be surprisingly entertaining. Despite the sedate title of her talk, "&lt;span style="font-weight: bold;"&gt;Out on a Limb: Challenges of Training Scientists to Communicate&lt;/span&gt;", &lt;a href="http://www.naturalsciences.org/"&gt;Meg Lowman&lt;/a&gt; pounded the audience with a hilarious performance.  Science comedian &lt;a href="http://www.sciencecomedian.com/"&gt;Brian Malow&lt;/a&gt; kicked this up a notch with some very clever material.  Later on, using a brilliant comedic judo technique, he repeated some choice derisive comments he received from his performances on YouTube.  I hope he comes back next year!&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/14568785-5796048120230512599?l=usefulchem.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://usefulchem.blogspot.com/feeds/5796048120230512599/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=14568785&amp;postID=5796048120230512599' title='3 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/14568785/posts/default/5796048120230512599'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/14568785/posts/default/5796048120230512599'/><link rel='alternate' type='text/html' href='http://usefulchem.blogspot.com/2011/01/science-online-2011-thoughts.html' title='Science Online 2011 Thoughts'/><author><name>Jean-Claude Bradley</name><uri>http://www.blogger.com/profile/09157209601339686455</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='31' height='32' src='http://photos1.blogger.com/img/267/3447/320/headshot.jpg'/></author><thr:total>3</thr:total></entry><entry><id>tag:blogger.com,1999:blog-14568785.post-8540171083827984054</id><published>2011-01-12T18:41:00.000-05:00</published><updated>2011-01-12T18:41:58.713-05:00</updated><title type='text'>Talk on Open Education in Chemistry at the University of the Sciences</title><content type='html'>I presented on "Open Education in Chemistry Research and Classroom" at the &lt;a href="http://www.usciences.edu/"&gt;University of the Sciences&lt;/a&gt; on January 11, 2011. The talk covered screencasting, wikis, Open Notebook Science, games and smartphones.&lt;br /&gt;&lt;br /&gt;This was also an opportunity to present some new work Andrew Lang and I did on &lt;a href="http://usefulchem.blogspot.com/2011/01/chemical-information-validation-results.html"&gt;Chemical Information Validation&lt;/a&gt;, resulting from the students in my &lt;a href="http://cheminfo2010.wikispaces.com/"&gt;Chemical Information Retrieval class&lt;/a&gt; during the Fall 2010 term.  I also highlighted Don Pellegrino's recent &lt;a href="http://usefulchem.blogspot.com/2010/12/visualizing-social-networks-in-open.html"&gt;social/chemical network&lt;/a&gt; visualization project.&lt;br /&gt;&lt;br /&gt;&lt;div style="width: 425px;" id="__ss_6518894"&gt;&lt;strong style="display: block; margin: 12px 0pt 4px;"&gt;&lt;a href="http://www.slideshare.net/jcbradley/philadelphia-u-sciences-2011" title="Philadelphia U Sciences 2011"&gt;Philadelphia U Sciences 2011&lt;/a&gt;&lt;/strong&gt;&lt;object id="__sse6518894" width="425" height="355"&gt;&lt;param name="movie" value="http://static.slidesharecdn.com/swf/ssplayer2.swf?doc=usciences2011-110111110334-phpapp02&amp;amp;stripped_title=philadelphia-u-sciences-2011&amp;amp;userName=jcbradley"&gt;&lt;param name="allowFullScreen" value="true"&gt;&lt;param name="allowScriptAccess" value="always"&gt;&lt;embed name="__sse6518894" src="http://static.slidesharecdn.com/swf/ssplayer2.swf?doc=usciences2011-110111110334-phpapp02&amp;amp;stripped_title=philadelphia-u-sciences-2011&amp;amp;userName=jcbradley" type="application/x-shockwave-flash" allowscriptaccess="always" allowfullscreen="true" width="425" height="355"&gt;&lt;/embed&gt;&lt;/object&gt;&lt;div style="padding: 5px 0pt 12px;"&gt;View more &lt;a href="http://www.slideshare.net/"&gt;presentations&lt;/a&gt; from &lt;a href="http://www.slideshare.net/jcbradley"&gt;Jean-Claude Bradley&lt;/a&gt;.&lt;/div&gt;&lt;/div&gt;&lt;br /&gt;&lt;br /&gt;&lt;object width="480" height="400"&gt;&lt;param name="movie" value="http://www.scivee.tv/flash/embedCast.swf"&gt;&lt;param name="allowfullscreen" value="true"&gt;&lt;param name="allowscriptaccess" value="always"&gt;&lt;param name="flashvars" value="id=26626&amp;amp;type=3"&gt;&lt;embed src="http://www.scivee.tv/flash/embedCast.swf" allowfullscreen="true" allowscriptaccess="always" flashvars="id=26626&amp;amp;type=3" width="480" height="400"&gt;&lt;/embed&gt;&lt;/object&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/14568785-8540171083827984054?l=usefulchem.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://usefulchem.blogspot.com/feeds/8540171083827984054/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=14568785&amp;postID=8540171083827984054' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/14568785/posts/default/8540171083827984054'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/14568785/posts/default/8540171083827984054'/><link rel='alternate' type='text/html' href='http://usefulchem.blogspot.com/2011/01/talk-on-open-education-in-chemistry-at.html' title='Talk on Open Education in Chemistry at the University of the Sciences'/><author><name>Jean-Claude Bradley</name><uri>http://www.blogger.com/profile/09157209601339686455</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='31' height='32' src='http://photos1.blogger.com/img/267/3447/320/headshot.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-14568785.post-2284628391747044494</id><published>2011-01-05T13:56:00.000-05:00</published><updated>2011-01-05T13:56:40.951-05:00</updated><title type='text'>Chemical Information Validation Results from Fall 2010</title><content type='html'>As I &lt;a href="http://usefulchem.blogspot.com/2010/10/cheminfo-retrieval-classes-1-and-2-in.html"&gt;mentioned earlier&lt;/a&gt;, one of the outcomes from my &lt;a href="http://cheminfo2010.wikispaces.com/"&gt;Fall 2010 Chemical Information Retrieval&lt;/a&gt; class involved the collection of chemical property information from different sources in a database format.  Now that the course is over, this has resulted in 567 measurements for 24 compounds (including one compound &lt;a href="http://usefulchem.blogspot.com/2010/01/dangerous-data-lessons-from-my-cheminfo.html"&gt;EGCG from the previous term&lt;/a&gt;).  I have curated the dataset to ensure that the original numbers, conversions to common units, categorizations, etc. are correct.  Links to the information source or to a screenshot of the source are available for each entry - so if I missed something, anyone can unambiguously verify it for correction. &lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;The dataset is available from a &lt;a href="https://spreadsheets0.google.com/ccc?key=t7MwL3EPPudLIjqf7w_d-UA&amp;amp;hl=en#gid=0"&gt;Google Spreadsheet&lt;/a&gt;.  Andrew Lang has also created a web based interface: the &lt;a href="http://showme.physics.drexel.edu/onsc/cir/"&gt;ChemInfo Validation Explorer&lt;/a&gt;.  By simply specifying the compound of interest and the property using drop-down menus the list of measurements from the relevant sources is provided with values outside of one standard deviation marked in orange.  Links to the information source, or an image in cases where the information source cannot be directly linked, are provided in the results.  Here is an example for the boiling point of benzene.&lt;/div&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://4.bp.blogspot.com/_R3jz5f_NkRo/TRjuHQbU6QI/AAAAAAAAA6c/LpBznSUAfVs/s1600/cheminfoval1.jpg"&gt;&lt;img style="display: block; margin: 0px auto 10px; text-align: center; cursor: pointer; width: 400px; height: 361px;" src="http://4.bp.blogspot.com/_R3jz5f_NkRo/TRjuHQbU6QI/AAAAAAAAA6c/LpBznSUAfVs/s400/cheminfoval1.jpg" alt="" id="BLOGGER_PHOTO_ID_5555451948797585666" border="0" /&gt;&lt;/a&gt;The visualization and analysis of the data was greatly facilitated by the use of &lt;a href="http://www.tableausoftware.com/public"&gt;Tableau Public&lt;/a&gt;.  After downloading the free program anyone can easily re-create the queries in this post by first &lt;a href="https://spreadsheets.google.com/pub?key=0Ak1R8T6wt4YQdDdNd0wzRVBQdWRMSWpxZjd3X2QtVUE&amp;amp;hl=en&amp;amp;output=xls"&gt;downloading the dataset as an Excel document&lt;/a&gt; then importing into Tableau Public. Interactive charts can then be freely hosted on the TP server and embedded as I have done in this post below.&lt;br /&gt;&lt;br /&gt;The students were shown how to search &lt;a href="http://cheminfo2010.wikispaces.com/resources"&gt;both commercial and free information sources&lt;/a&gt; and were given complete freedom for which compounds and chemical properties to target.  The results can be analyzed from the perspective of a reasonable sampling of the current state of chemical information available to the average chemist.  The 5 most frequently obtained properties were melting point, density, boiling point, flash point and refractive index.&lt;br /&gt;&lt;br /&gt;&lt;script type="text/javascript" src="http://public.tableausoftware.com/javascripts/api/viz_v1.js"&gt;&lt;/script&gt;&lt;object class="tableauViz" style="display: none;" width="400" height="400"&gt;&lt;param name="name" value="PropertyCount/Sheet1"&gt;&lt;param name="toolbar" value="yes"&gt;&lt;param name="tabs" value="no"&gt;&lt;param name="tabs" value="no"&gt;&lt;/object&gt;&lt;noscript&gt;Sheet 1 &lt;br /&gt;&lt;a href="#"&gt;&lt;img alt="Sheet 1 " src="http://public.tableausoftware.com/static/images/Pr/PropertyCount/Sheet1/1_rss.png" height="100%" /&gt;&lt;/a&gt;&lt;/noscript&gt;&lt;div style="width: 1027px; height: 22px; padding: 0px 10px 0px 0px; color: black; font: 8pt verdana,helvetica,arial,sans-serif;"&gt;&lt;div style="float: right; padding-right: 8px;"&gt;&lt;a href="http://www.tableausoftware.com/public?ref=http://public.tableausoftware.com/views/PropertyCount/Sheet1" target="_blank"&gt;Powered by Tableau&lt;/a&gt;&lt;/div&gt;&lt;/div&gt;&lt;br /&gt;&lt;br /&gt;The information sources were categorized and are reported below by frequency.  Chemical vendor sites were by far the most frequently used information source.&lt;br /&gt;&lt;script type="text/javascript" src="http://public.tableausoftware.com/javascripts/api/viz_v1.js"&gt;&lt;/script&gt;&lt;object class="tableauViz" style="display: none;" width="400" height="300"&gt;&lt;param name="name" value="DataSourceTypeCount/Sheet1"&gt;&lt;param name="toolbar" value="yes"&gt;&lt;param name="tabs" value="no"&gt;&lt;/object&gt;&lt;noscript&gt;Sheet 1 &lt;br /&gt;&lt;a href="#"&gt;&lt;img alt="Sheet 1 " src="http://public.tableausoftware.com/static/images/Da/DataSourceTypeCount/Sheet1/1_rss.png" height="100%" /&gt;&lt;/a&gt;&lt;/noscript&gt;&lt;div style="width: 675px; height: 22px; padding: 0px 10px 0px 0px; color: black; font: 8pt verdana,helvetica,arial,sans-serif;"&gt;&lt;div style="float: right; padding-right: 8px;"&gt;&lt;a href="http://www.tableausoftware.com/public?ref=http://public.tableausoftware.com/views/DataSourceTypeCount/Sheet1" target="_blank"&gt;Powered by Tableau&lt;/a&gt;&lt;/div&gt;&lt;/div&gt;&lt;br /&gt;&lt;br /&gt;It is important to note that the &lt;i&gt;information source&lt;/i&gt; does not represent the &lt;i&gt;method&lt;/i&gt; by which the measurements were found.  The source is simply the end of the chain of provenance: the document that provides no specific reference for the reported measurement. For example, even though ChemSpider was frequently used as a search engine, it would not be listed as an information source when it provided links to other sources (mainly MSDS sheets) for properties.  ChemSpider was treated as a source for some predicted properties.&lt;br /&gt;&lt;br /&gt;&lt;script type="text/javascript" src="http://public.tableausoftware.com/javascripts/api/viz_v1.js"&gt;&lt;/script&gt;&lt;object class="tableauViz" style="display: none;" width="400" height="450"&gt;&lt;param name="name" value="DataSourceCount/Sheet1"&gt;&lt;param name="toolbar" value="yes"&gt;&lt;param name="tabs" value="no"&gt;&lt;/object&gt;&lt;noscript&gt;Sheet 1 &lt;br /&gt;&lt;a href="#"&gt;&lt;img alt="Sheet 1 " src="http://public.tableausoftware.com/static/images/Da/DataSourceCount/Sheet1/1_rss.png" height="100%" /&gt;&lt;/a&gt;&lt;/noscript&gt;&lt;div style="width: 675px; height: 22px; padding: 0px 10px 0px 0px; color: black; font: 8pt verdana,helvetica,arial,sans-serif;"&gt;&lt;div style="float: right; padding-right: 8px;"&gt;&lt;a href="http://www.tableausoftware.com/public?ref=http://public.tableausoftware.com/views/DataSourceCount/Sheet1" target="_blank"&gt;Powered by Tableau&lt;/a&gt;&lt;/div&gt;&lt;/div&gt;&lt;br /&gt;&lt;br /&gt;The chemical vendor Sigma-Aldrich was the most frequently used information source, followed by Alfa Aesar.  Wolfram Alpha - categorized as a "free database" was third. Oxford University follows closely behind as fourth and is categorized as an "academic website", hosting MSDS sheets.  Many universities host MSDS sheets but the Oxford web site seems to turn up most frequently from chemical property queries on search engines.&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;The fifth most frequent information source was Wikipedia, reflecting the fact that specific references are usually not provided there for chemical properties.  Like ChemSpider, Wikipedia was categorized as a "crowdsourced database".&lt;/div&gt;&lt;br /&gt;&lt;b&gt;Flagging Outliers&lt;/b&gt;&lt;br /&gt;&lt;br /&gt;One of the advantages of this type of collection is that it is much easier to identify outliers.  In the case of non-aqueous solubility data, we were able to create an &lt;a href="http://usefulchem.blogspot.com/2009/06/crowdsourcing-solubility-requests-from.html"&gt;outlier bot&lt;/a&gt; to automatically flag potentially problematic results.  Since different properties may have very different typical variabilities, outliers are most easily discovered by comparisons within the same property.&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;For example consider the following plot showing the standard deviation to mean ratio for melting point measurements.&lt;br /&gt;&lt;br /&gt;&lt;script type="text/javascript" src="http://public.tableausoftware.com/javascripts/api/viz_v1.js"&gt;&lt;/script&gt;&lt;object class="tableauViz" style="display: none;" width="400" height="400"&gt;&lt;param name="name" value="OutliersMP/Sheet1"&gt;&lt;param name="toolbar" value="yes"&gt;&lt;param name="tabs" value="no"&gt;&lt;/object&gt;&lt;noscript&gt;Sheet 1 &lt;br /&gt;&lt;a href="#"&gt;&lt;img alt="Sheet 1 " src="http://public.tableausoftware.com/static/images/Ou/OutliersMP/Sheet1/1_rss.png" height="100%" /&gt;&lt;/a&gt;&lt;/noscript&gt;&lt;div style="width: 675px; height: 22px; padding: 0px 10px 0px 0px; color: black; font: 8pt verdana,helvetica,arial,sans-serif;"&gt;&lt;div style="float: right; padding-right: 8px;"&gt;&lt;a href="http://www.tableausoftware.com/public?ref=http://public.tableausoftware.com/views/OutliersMP/Sheet1" target="_blank"&gt;Powered by Tableau&lt;/a&gt;&lt;/div&gt;&lt;/div&gt;&lt;br /&gt;&lt;br /&gt;This reveals that the average melting point for EGCG is suspect.  At this point, an easy way to inspect the results is to use the &lt;a href="http://showme.physics.drexel.edu/onsc/cir/"&gt;Validation Explorer&lt;/a&gt; and look at the individual measurements.&lt;div&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://2.bp.blogspot.com/_R3jz5f_NkRo/TRzNOAyndDI/AAAAAAAAA7I/ZOjVnI0qJq0/s1600/cheminfoval6.jpg"&gt;&lt;img style="display: block; margin: 0px auto 10px; text-align: center; cursor: pointer; width: 400px; height: 253px;" src="http://2.bp.blogspot.com/_R3jz5f_NkRo/TRzNOAyndDI/AAAAAAAAA7I/ZOjVnI0qJq0/s400/cheminfoval6.jpg" alt="" id="BLOGGER_PHOTO_ID_5556541680882512946" border="0" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;br /&gt;By clicking on the images we can verify that the numbers have been correctly copied from the primary sources.  In this case we can also ascertain that the sources - a peer reviewed paper and the Merck Index - are considered by most chemists to be generally reliable. There is no compelling reason at this point to weigh one result over the other and one has to be careful when using the average value for any practical application. (Note that all temperature data is recorded as Kelvin.  A zero-based scale is necessary to ensure that the standard deviation to mean ratio is meaningful.)&lt;br /&gt;&lt;br /&gt;The next flagging hit in this collection is the melting point of cyclohexanone. In this case 5 results are returned and the &lt;a href="http://showme.physics.drexel.edu/onsc/cir/"&gt;Validation Explorer&lt;/a&gt; highlights the Alfa Aesar value as being more than one standard deviation from the average.&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://3.bp.blogspot.com/_R3jz5f_NkRo/TRzXW7-puGI/AAAAAAAAA7Q/pHlUfNk0RlM/s1600/cheminfoval7.jpg"&gt;&lt;img style="display: block; margin: 0px auto 10px; text-align: center; cursor: pointer; width: 400px; height: 316px;" src="http://3.bp.blogspot.com/_R3jz5f_NkRo/TRzXW7-puGI/AAAAAAAAA7Q/pHlUfNk0RlM/s400/cheminfoval7.jpg" alt="" id="BLOGGER_PHOTO_ID_5556552829325916258" border="0" /&gt;&lt;/a&gt;&lt;br /&gt;However, one has to be careful when assessing this and assuming that the Alfa Aesar value is most likely to being the odd value out.  Notice that 3 of the values - Sigma-Aldrich, Acros and Wolfram Alpha are identical.  The most likely explanation for this is that all three used the same information source and should thus be counted as a single measurement.  &lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;We can test this hypothesis by looking for cases where Sigma-Aldrich, Acros and Wolfram Alpha don't share identical values.  As shown below, for melting point measurements, there is no case where the values don't match.&lt;/div&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://1.bp.blogspot.com/_R3jz5f_NkRo/TRzm2swUeWI/AAAAAAAAA7Y/upUI4eqa45g/s1600/cheminfoval8.jpg"&gt;&lt;img style="display: block; margin: 0px auto 10px; text-align: center; cursor: pointer; width: 400px; height: 300px;" src="http://1.bp.blogspot.com/_R3jz5f_NkRo/TRzm2swUeWI/AAAAAAAAA7Y/upUI4eqa45g/s400/cheminfoval8.jpg" alt="" id="BLOGGER_PHOTO_ID_5556569867669502306" border="0" /&gt;&lt;/a&gt;&lt;br /&gt;The same is true for boiling points:&lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://4.bp.blogspot.com/_R3jz5f_NkRo/TRznfh67MNI/AAAAAAAAA7g/j8WIGTF6R3Y/s1600/cheminfoval9.jpg"&gt;&lt;img style="display: block; margin: 0px auto 10px; text-align: center; cursor: pointer; width: 400px; height: 212px;" src="http://4.bp.blogspot.com/_R3jz5f_NkRo/TRznfh67MNI/AAAAAAAAA7g/j8WIGTF6R3Y/s400/cheminfoval9.jpg" alt="" id="BLOGGER_PHOTO_ID_5556570569135829202" border="0" /&gt;&lt;/a&gt;However, in the case of flash points it is clear that the three are not using a common data source.&lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://2.bp.blogspot.com/_R3jz5f_NkRo/TRznpFqvchI/AAAAAAAAA7o/d_F0wZd9RVc/s1600/cheminfoval10.jpg"&gt;&lt;img style="display: block; margin: 0px auto 10px; text-align: center; cursor: pointer; width: 400px; height: 333px;" src="http://2.bp.blogspot.com/_R3jz5f_NkRo/TRznpFqvchI/AAAAAAAAA7o/d_F0wZd9RVc/s400/cheminfoval10.jpg" alt="" id="BLOGGER_PHOTO_ID_5556570733350449682" border="0" /&gt;&lt;/a&gt;&lt;br /&gt;Using the data we collected - and will continue to collect - we could start to identify which data sources are likely using the same ultimate sources and avoid over-counting measurements.  This would save time in searching since one would know which sources to check for a particular property while avoiding duplication.  This information is extremely difficult to obtain using other approaches.&lt;br /&gt;&lt;br /&gt;The same type of outlier analysis can be performed for &lt;a href="http://public.tableausoftware.com/views/OutliersAllProps/Sheet1?:embed=yes&amp;amp;:toolbar=yes&amp;amp;:tabs=no"&gt;all the properties&lt;/a&gt; collected in this study.&lt;br /&gt;&lt;br /&gt;I believe that there is much more useful analysis to be done on this dataset, especially for chemistry librarians.  When this class is run next year, more data will be added.  In the meantime, contributions from other sources would be welcome.&lt;br /&gt;&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/14568785-2284628391747044494?l=usefulchem.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://usefulchem.blogspot.com/feeds/2284628391747044494/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=14568785&amp;postID=2284628391747044494' title='9 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/14568785/posts/default/2284628391747044494'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/14568785/posts/default/2284628391747044494'/><link rel='alternate' type='text/html' href='http://usefulchem.blogspot.com/2011/01/chemical-information-validation-results.html' title='Chemical Information Validation Results from Fall 2010'/><author><name>Jean-Claude Bradley</name><uri>http://www.blogger.com/profile/09157209601339686455</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='31' height='32' src='http://photos1.blogger.com/img/267/3447/320/headshot.jpg'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://4.bp.blogspot.com/_R3jz5f_NkRo/TRjuHQbU6QI/AAAAAAAAA6c/LpBznSUAfVs/s72-c/cheminfoval1.jpg' height='72' width='72'/><thr:total>9</thr:total></entry><entry><id>tag:blogger.com,1999:blog-14568785.post-271224965066654315</id><published>2010-12-20T12:56:00.000-05:00</published><updated>2010-12-20T12:57:12.809-05:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='open notebook science'/><category scheme='http://www.blogger.com/atom/ns#' term='social networks'/><category scheme='http://www.blogger.com/atom/ns#' term='Gephi'/><title type='text'>Visualizing Social Networks in Open Notebooks</title><content type='html'>Increasing the role of automation in the scientific process has long been a fundamental objective of Open Notebook Science.  The automatic discovery of new connections in open scientific work is potentially a very important contribution to this end.&lt;br /&gt;&lt;br /&gt;Visualizing social networks within and between Open Notebooks is certainly a good first step.  Luckily, our Reaction Attempts project has already abstracted the key elements of organic chemical reactions within a collection of Open Notebooks.  This means that creating connection maps between people and chemicals can be attempted with reliable and semantically unambiguous database sources.&lt;br /&gt;&lt;br /&gt;The&lt;a href="http://onsbooks.wikispaces.com/Reaction+Attempts"&gt; Reaction Attempts&lt;/a&gt; database records the identity of reactants and products as ChemSpiderIDs for each reaction within a collection of notebooks. Also the name of the researcher, the solvent, the yield (when available) and a few more key identifiers are recorded.&lt;br /&gt;&lt;br /&gt;We are very fortunate that &lt;a href="http://www.donpellegrino.com/"&gt;Don Pellegrino&lt;/a&gt;, an IST student at Drexel, has selected the analysis of networks within Open Notebooks as part of his Ph.D. work. He has started to &lt;a href="http://onschallenge.wikispaces.com/Pellegrino+dissertation+-+novel+connections"&gt;report his progress on our wiki&lt;/a&gt; and is eager to receive any feedback as the work progresses (his FriendFeed account is &lt;a href="http://friendfeed.com/donpellegrino"&gt;donpellegrino&lt;/a&gt;).&lt;br /&gt;&lt;br /&gt;Don's first report is available &lt;a href="http://onschallenge.wikispaces.com/DC-Exp-001"&gt;here&lt;/a&gt;.  He is using the Open Source software Gephi for visualization and has provided all of the data and code on the associated wiki page. (also see &lt;a href="http://blog.ouseful.info/2010/10/03/graph-structure-of-an-open-science-notebook-linked-science-ftw/"&gt;Tony Hirst's description of mapping ONS work&lt;/a&gt; which provided some very useful insights) Don has provided a detailed report of his findings but I think the most important can be seen in the global plot below.&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://3.bp.blogspot.com/_R3jz5f_NkRo/TQ-KAWYg6NI/AAAAAAAAA6E/8MTa8FtLRbg/s1600/map1.PNG"&gt;&lt;img style="display:block; margin:0px auto 10px; text-align:center;cursor:pointer; cursor:hand;width: 400px; height: 216px;" src="http://3.bp.blogspot.com/_R3jz5f_NkRo/TQ-KAWYg6NI/AAAAAAAAA6E/8MTa8FtLRbg/s400/map1.PNG" border="0" alt="" id="BLOGGER_PHOTO_ID_5552808604184144082" /&gt;&lt;/a&gt;This represents a map connecting people through chemicals. The large top right structure represents the connections within the UsefulChem project and the main circle represents the activity of graduate student &lt;a href="http://usefulchem.blogspot.com/2010/12/mirza-phd-defense-on-ugi-reaction-for.html"&gt;Khalid Mirza&lt;/a&gt; who was the most active on this project.  The crescent structure to the right of the circle represents other students - mainly undergraduates - who worked with the same chemicals as Khalid.&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;At the top left there are 3 isolated small networks, representing completely separate projects: the &lt;a href="http://usefulchem.blogspot.com/2009/08/our-attempt-to-reproduce-oxidation-by.html"&gt;sodium hydride (NaH) oxidation study&lt;/a&gt;, &lt;a href="http://usefulchem.blogspot.com/2010/07/resveratrol-thesis-on-reaction-attempts.html"&gt;Dustin Sprouse&lt;/a&gt; and &lt;a href="http://showme.physics.drexel.edu/onsc/reactionattempts/advancedsearch.php?compound=&amp;amp;solvent=&amp;amp;reactiontype=&amp;amp;researcher=Sebastian+Petrik"&gt;Sebastian Petrik&lt;/a&gt;.  I'll be posting about Sebastian's work in a future post.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;Near the bottom middle there is another small network connected to the main group by a single link mediated by &lt;a href="http://www.chemspider.com/Chemical-Structure.80987.html"&gt;2,2-dimethoxyethylamine&lt;/a&gt;.&lt;/div&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://3.bp.blogspot.com/_R3jz5f_NkRo/TQ-OeJwN1GI/AAAAAAAAA6M/SfWZKsC4aoY/s1600/connectmol.jpg"&gt;&lt;img style="display:block; margin:0px auto 10px; text-align:center;cursor:pointer; cursor:hand;width: 400px; height: 232px;" src="http://3.bp.blogspot.com/_R3jz5f_NkRo/TQ-OeJwN1GI/AAAAAAAAA6M/SfWZKsC4aoY/s400/connectmol.jpg" border="0" alt="" id="BLOGGER_PHOTO_ID_5552813514236482658" /&gt;&lt;/a&gt;This represents the overlap between Open Notebooks (&lt;a href="http://usefulchem.blogspot.com/2010/05/synaptic-leap-experiments-on-reaction.html"&gt;Wolfle from Todd group&lt;/a&gt; and Mirza from Bradley group) that I &lt;a href="http://usefulchem.blogspot.com/2010/06/use-of-ons-to-protect-open-research.html"&gt;mentioned previously&lt;/a&gt;. &lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;I think that automatically discovering such connections as they occur could be a really useful outcome of this network analysis work.  For example, the researchers could be alerted by email that a new potentially interesting overlap between their projects now exists.  This could accelerate new collaborations.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;A key challenge in Don's work is to figure out the right questions so that the results will be genuinely useful and novel to the researchers involved and the research community.  I'm optimistic that he will succeed.  As a separate outcome, just learning how researchers collaborate and record their work over time is bound to be interesting.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;For a description of Don's planned work over the next several months take a look at his full Thesis Proposal: "&lt;a href="http://www.donpellegrino.com/is/Pellegrino-Proposal.pdf"&gt;Proposal of a System and Methods for Integrating Literature and Data&lt;/a&gt;".&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/14568785-271224965066654315?l=usefulchem.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://usefulchem.blogspot.com/feeds/271224965066654315/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=14568785&amp;postID=271224965066654315' title='3 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/14568785/posts/default/271224965066654315'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/14568785/posts/default/271224965066654315'/><link rel='alternate' type='text/html' href='http://usefulchem.blogspot.com/2010/12/visualizing-social-networks-in-open.html' title='Visualizing Social Networks in Open Notebooks'/><author><name>Jean-Claude Bradley</name><uri>http://www.blogger.com/profile/09157209601339686455</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='31' height='32' src='http://photos1.blogger.com/img/267/3447/320/headshot.jpg'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://3.bp.blogspot.com/_R3jz5f_NkRo/TQ-KAWYg6NI/AAAAAAAAA6E/8MTa8FtLRbg/s72-c/map1.PNG' height='72' width='72'/><thr:total>3</thr:total></entry><entry><id>tag:blogger.com,1999:blog-14568785.post-944492883866902910</id><published>2010-12-13T09:16:00.000-05:00</published><updated>2010-12-13T09:17:00.912-05:00</updated><title type='text'>Mirza PhD defense on the Ugi reaction for anti-malarial screening</title><content type='html'>My student Khalid Baig Mirza defended his Ph.D. thesis at Drexel University on December 6, 2010. It was nice to see all the pieces come together - even though there are still a few intriguing puzzles left to be fully resolved.  Like most research projects, every answer generates several interesting additional questions and Khalid did a good job of showing what these key issues are for his work.&lt;br /&gt;&lt;br /&gt;In his presentation, Khalid first discusses Open Notebook Science and his contribution to the sodium hydride oxidation controversy. Then he describes the UsefulChem project, involving the use of the Ugi reaction as an approach to synthesizing new anti-malarial agents, including a few unexpected side reactions and challenges. Finally he presents an overview of the ONS Solubility Challenge and its application to organic synthesis.&lt;br /&gt;&lt;br /&gt;&lt;div style="width:425px" id="__ss_6142937"&gt;&lt;strong style="display:block;margin:12px 0 4px"&gt;&lt;a href="http://www.slideshare.net/jcbradley/mirza-phd-defense-on-the-ugi-reaction-for-antimalarial-screening" title="Mirza PhD defense on the Ugi reaction for anti-malarial screening"&gt;Mirza PhD defense on the Ugi reaction for anti-malarial screening&lt;/a&gt;&lt;/strong&gt;&lt;object id="__sse6142937" width="425" height="355"&gt;&lt;param name="movie" value="http://static.slidesharecdn.com/swf/ssplayer2.swf?doc=phddefensekhalid-101213074747-phpapp01&amp;stripped_title=mirza-phd-defense-on-the-ugi-reaction-for-antimalarial-screening&amp;userName=jcbradley" /&gt;&lt;param name="allowFullScreen" value="true"/&gt;&lt;param name="allowScriptAccess" value="always"/&gt;&lt;embed name="__sse6142937" src="http://static.slidesharecdn.com/swf/ssplayer2.swf?doc=phddefensekhalid-101213074747-phpapp01&amp;stripped_title=mirza-phd-defense-on-the-ugi-reaction-for-antimalarial-screening&amp;userName=jcbradley" type="application/x-shockwave-flash" allowscriptaccess="always" allowfullscreen="true" width="425" height="355"&gt;&lt;/embed&gt;&lt;/object&gt;&lt;div style="padding:5px 0 12px"&gt;View more &lt;a href="http://www.slideshare.net/"&gt;presentations&lt;/a&gt; from &lt;a href="http://www.slideshare.net/jcbradley"&gt;Jean-Claude Bradley&lt;/a&gt;.&lt;/div&gt;&lt;/div&gt;&lt;br /&gt;&lt;br /&gt;&lt;object width="480" height="400"&gt;&lt;param name="movie" value="http://www.scivee.tv/flash/embedCast.swf" /&gt;&lt;param name="allowfullscreen" value="true" /&gt;&lt;param name="allowscriptaccess" value="always" /&gt;&lt;param name="flashvars" value="id=26244&amp;type=3" /&gt;&lt;embed src="http://www.scivee.tv/flash/embedCast.swf" allowfullscreen="true" allowscriptaccess="always" width="480" height="400" flashvars="id=26244&amp;type=3"&gt;&lt;/embed&gt;&lt;/object&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/14568785-944492883866902910?l=usefulchem.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://usefulchem.blogspot.com/feeds/944492883866902910/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=14568785&amp;postID=944492883866902910' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/14568785/posts/default/944492883866902910'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/14568785/posts/default/944492883866902910'/><link rel='alternate' type='text/html' href='http://usefulchem.blogspot.com/2010/12/mirza-phd-defense-on-ugi-reaction-for.html' title='Mirza PhD defense on the Ugi reaction for anti-malarial screening'/><author><name>Jean-Claude Bradley</name><uri>http://www.blogger.com/profile/09157209601339686455</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='31' height='32' src='http://photos1.blogger.com/img/267/3447/320/headshot.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-14568785.post-862046303384578982</id><published>2010-12-05T15:29:00.000-05:00</published><updated>2010-12-05T15:29:59.985-05:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='open data'/><category scheme='http://www.blogger.com/atom/ns#' term='malaria'/><category scheme='http://www.blogger.com/atom/ns#' term='BMS'/><category scheme='http://www.blogger.com/atom/ns#' term='Drug discovery'/><title type='text'>Dana Vanderwall on Cheminformatics at Drexel</title><content type='html'>&lt;div&gt;Dana Vanderwall, Associate Director of Cheminformatics at Bristol-Myers Squibb, presented for my last &lt;a href="http://cheminfo2010.wikispaces.com/"&gt;Chemical Information Retrieval class&lt;/a&gt; on December 2, 2010. &lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;The first part covered "&lt;b&gt;Cheminformatics &amp;amp;  The evolving relationship between data in the public domain &amp;amp; pharma&lt;/b&gt;" and included a general discussion of modern drug discovery and the details of a malaria dataset recently released from the pharmaceutical industry to the public. &lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;div style="width:425px" id="__ss_6039183"&gt;&lt;strong style="display:block;margin:12px 0 4px"&gt;&lt;a href="http://www.slideshare.net/jcbradley/vanderwall-cheminformatics-drexel-part-1" title="Vanderwall cheminformatics Drexel Part 1"&gt;Vanderwall cheminformatics Drexel Part 1&lt;/a&gt;&lt;/strong&gt;&lt;object id="__sse6039183" width="425" height="355"&gt;&lt;param name="movie" value="http://static.slidesharecdn.com/swf/ssplayer2.swf?doc=vanderwallcheminformaticsdrexel-101205140822-phpapp01&amp;amp;stripped_title=vanderwall-cheminformatics-drexel-part-1&amp;amp;userName=jcbradley"&gt;&lt;param name="allowFullScreen" value="true"&gt;&lt;param name="allowScriptAccess" value="always"&gt;&lt;embed name="__sse6039183" src="http://static.slidesharecdn.com/swf/ssplayer2.swf?doc=vanderwallcheminformaticsdrexel-101205140822-phpapp01&amp;amp;stripped_title=vanderwall-cheminformatics-drexel-part-1&amp;amp;userName=jcbradley" type="application/x-shockwave-flash" allowscriptaccess="always" allowfullscreen="true" width="425" height="355"&gt;&lt;/embed&gt;&lt;/object&gt;&lt;div style="padding:5px 0 12px"&gt;View more &lt;a href="http://www.slideshare.net/"&gt;presentations&lt;/a&gt; from &lt;a href="http://www.slideshare.net/jcbradley"&gt;Jean-Claude Bradley&lt;/a&gt;.&lt;/div&gt;&lt;/div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;The second part described a project based on "&lt;b&gt;Molecular Clinical Safety Intelligence&lt;/b&gt;", where tracking side effects from approved drugs can help in the design of new drugs.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;div style="width:425px" id="__ss_6039380"&gt;&lt;strong style="display:block;margin:12px 0 4px"&gt;&lt;a href="http://www.slideshare.net/jcbradley/dana-vanderwall-cheminformatics-drexel-part-2" title="Dana Vanderwall cheminformatics Drexel Part 2"&gt;Dana Vanderwall cheminformatics Drexel Part 2&lt;/a&gt;&lt;/strong&gt;&lt;object id="__sse6039380" width="425" height="355"&gt;&lt;param name="movie" value="http://static.slidesharecdn.com/swf/ssplayer2.swf?doc=mcsifordrexel-101205142338-phpapp02&amp;amp;stripped_title=dana-vanderwall-cheminformatics-drexel-part-2&amp;amp;userName=jcbradley"&gt;&lt;param name="allowFullScreen" value="true"&gt;&lt;param name="allowScriptAccess" value="always"&gt;&lt;embed name="__sse6039380" src="http://static.slidesharecdn.com/swf/ssplayer2.swf?doc=mcsifordrexel-101205142338-phpapp02&amp;amp;stripped_title=dana-vanderwall-cheminformatics-drexel-part-2&amp;amp;userName=jcbradley" type="application/x-shockwave-flash" allowscriptaccess="always" allowfullscreen="true" width="425" height="355"&gt;&lt;/embed&gt;&lt;/object&gt;&lt;div style="padding:5px 0 12px"&gt;View more &lt;a href="http://www.slideshare.net/"&gt;presentations&lt;/a&gt; from &lt;a href="http://www.slideshare.net/jcbradley"&gt;Jean-Claude Bradley&lt;/a&gt;.&lt;/div&gt;&lt;/div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;It was a very nice way to close out the course, showing very practical applications of the concepts we covered over the term.  The recording is available below.&lt;/div&gt;&lt;br /&gt;&lt;object width="480" height="400"&gt;&lt;param name="movie" value="http://www.scivee.tv/flash/embedCast.swf"&gt;&lt;param name="allowfullscreen" value="true"&gt;&lt;param name="allowscriptaccess" value="always"&gt;&lt;param name="flashvars" value="id=26112&amp;amp;type=3"&gt;&lt;embed src="http://www.scivee.tv/flash/embedCast.swf" allowfullscreen="true" allowscriptaccess="always" width="480" height="400" flashvars="id=26112&amp;amp;type=3"&gt;&lt;/embed&gt;&lt;/object&gt;&lt;script src="http://b.scorecardresearch.com/beacon.js?c1=7&amp;amp;c2=7400849&amp;amp;c3=1&amp;amp;c4=&amp;amp;c5=&amp;amp;c6="&gt;&lt;/script&gt;&lt;script src="http://b.scorecardresearch.com/beacon.js?c1=7&amp;amp;c2=7400849&amp;amp;c3=1&amp;amp;c4=&amp;amp;c5=&amp;amp;c6="&gt;&lt;/script&gt;&lt;script src="http://b.scorecardresearch.com/beacon.js?c1=7&amp;amp;c2=7400849&amp;amp;c3=1&amp;amp;c4=&amp;amp;c5=&amp;amp;c6="&gt;&lt;/script&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/14568785-862046303384578982?l=usefulchem.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://usefulchem.blogspot.com/feeds/862046303384578982/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=14568785&amp;postID=862046303384578982' title='2 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/14568785/posts/default/862046303384578982'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/14568785/posts/default/862046303384578982'/><link rel='alternate' type='text/html' href='http://usefulchem.blogspot.com/2010/12/dana-vanderwall-on-cheminformatics-at.html' title='Dana Vanderwall on Cheminformatics at Drexel'/><author><name>Jean-Claude Bradley</name><uri>http://www.blogger.com/profile/09157209601339686455</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='31' height='32' src='http://photos1.blogger.com/img/267/3447/320/headshot.jpg'/></author><thr:total>2</thr:total></entry><entry><id>tag:blogger.com,1999:blog-14568785.post-8649661215538139465</id><published>2010-11-04T12:03:00.000-04:00</published><updated>2010-11-04T12:04:22.883-04:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='nanoinformatics'/><category scheme='http://www.blogger.com/atom/ns#' term='open notebook science'/><title type='text'>Nanoinformatics 2010 Conference Report</title><content type='html'>On November 3, 2010 I presented on "The implications of Open Notebook Science and other new forms of scientific communication for Nanoinformatics" at the &lt;a href="http://nanotechinformatics.org/program"&gt;Nanoinformatics 2010&lt;/a&gt; conference. &lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;The presentation first covers the use of the laboratory knowledge management system SMIRP for nanotechnology applications during the period of 1999-2001 at Drexel University. The exporting of single experiments from &lt;a href="http://www.jalajournal.com/article/S1535-5535(04)00074-7/abstract"&gt;SMIRP&lt;/a&gt; and publication to the &lt;a href="http://www.sciencedirect.com/preprintarchive"&gt;Chemistry Preprint Archive&lt;/a&gt; is then described followed by the evolution to &lt;a href="http://en.wikipedia.org/wiki/Open_Notebook_Science"&gt;Open Notebook Science&lt;/a&gt; in 2005. Abstraction of semantic structure from ONS projects in the areas of drug discovery and solubility is then detailed as an efficient mechanism to provide &lt;a href="http://onswebservices.wikispaces.com/"&gt;web services&lt;/a&gt; and machine readable data feeds.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;This was a terrific opportunity to tie together my current ONS projects with my work in nanotechnology about 10 years ago, when the focus was to capture laboratory information in a structured format so that autonomous agent could begin to replace human workflows.   I found it really interesting that the most active workflows back then were related to processing reference information.  It took a team of students to find, photocopy and scan many of our key papers, with all the problems that come with training and managing new students.   Today, obtaining relevant papers and extracting metadata is not so much of a challenge with tools like &lt;a href="http://www.mendeley.com/"&gt;Mendeley&lt;/a&gt;.  I ended the talk with a mention of our use of &lt;a href="http://usefulchem.blogspot.com/2010/10/dynamic-links-to-private-tagged.html"&gt;Mendeley tags&lt;/a&gt; to share dynamic links of article collections.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;Another important development over the course of the past decade is the availability of free and hosted tools to easily communicate research.  This includes wikis, blogs, Nature Precedings, institutional repositories, Google Spreadsheets and many others.  It also includes some &lt;a href="http://usefulchem.blogspot.com/2010/04/bipolar-electrodeposition-of-cds.html"&gt;failed attempts like the Chemistry Preprint Archive&lt;/a&gt;. &lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;I didn't anticipate in the late 90s just how crucial openness would prove to be for the evolution of the automation of the scientific process.  It isn't my impression that there is currently a consensus on this point.  Obviously it is possible to leverage automation in very &lt;a href="http://www.sciencemag.org/cgi/content/short/324/5923/85"&gt;clever ways&lt;/a&gt; for private use.  But I think that exponential impact requires very low barriers to contribution (human or not) that can only be achieved with openness and transparency.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;I have been very impressed with the ideas and projects discussed at this conference.  Open sharing of nanotechnology data and integration of resources are clearly high priority items for many in this community.  &lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;As we have shown with our &lt;a href="http://onsbooks.wikispaces.com/Reaction+Attempts"&gt;Reaction Attempts&lt;/a&gt; and &lt;a href="http://onsbooks.wikispaces.com/ONS+Solubility+Challenge"&gt;ONS Solubility&lt;/a&gt; projects, abstracting meaningful semantic structure is necessarily field specific.  One of the exciting opportunities to result from this meeting is finding ways to interconnect our solubility dataset with the nanotechnology community resources.  I have met with a few people who would like to collaborate on this and I will be sure to report on our progress.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;div style="width:425px" id="__ss_5664754"&gt;&lt;strong style="display:block;margin:12px 0 4px"&gt;&lt;a href="http://www.slideshare.net/jcbradley/nanoinformatics-2010-smirpons-talk" title="Nanoinformatics 2010 SMIRP-ONS Talk"&gt;Nanoinformatics 2010 SMIRP-ONS Talk&lt;/a&gt;&lt;/strong&gt;&lt;object id="__sse5664754" width="425" height="355"&gt;&lt;param name="movie" value="http://static.slidesharecdn.com/swf/ssplayer2.swf?doc=bradleynanoinfo10-101104044736-phpapp02&amp;amp;stripped_title=nanoinformatics-2010-smirpons-talk&amp;amp;userName=jcbradley"&gt;&lt;param name="allowFullScreen" value="true"&gt;&lt;param name="allowScriptAccess" value="always"&gt;&lt;embed name="__sse5664754" src="http://static.slidesharecdn.com/swf/ssplayer2.swf?doc=bradleynanoinfo10-101104044736-phpapp02&amp;amp;stripped_title=nanoinformatics-2010-smirpons-talk&amp;amp;userName=jcbradley" type="application/x-shockwave-flash" allowscriptaccess="always" allowfullscreen="true" width="425" height="355"&gt;&lt;/embed&gt;&lt;/object&gt;&lt;div style="padding:5px 0 12px"&gt;View more &lt;a href="http://www.slideshare.net/"&gt;presentations&lt;/a&gt; from &lt;a href="http://www.slideshare.net/jcbradley"&gt;Jean-Claude Bradley&lt;/a&gt;.&lt;/div&gt;&lt;/div&gt;&lt;script src="http://b.scorecardresearch.com/beacon.js?c1=7&amp;amp;c2=7400849&amp;amp;c3=1&amp;amp;c4=&amp;amp;c5=&amp;amp;c6="&gt;&lt;/script&gt;&lt;/div&gt;&lt;script src="http://b.scorecardresearch.com/beacon.js?c1=7&amp;amp;c2=7400849&amp;amp;c3=1&amp;amp;c4=&amp;amp;c5=&amp;amp;c6="&gt;&lt;/script&gt;&lt;br /&gt;&lt;br /&gt;&lt;object width="480" height="400"&gt;&lt;param name="movie" value="http://www.scivee.tv/flash/embedCast.swf"&gt;&lt;param name="allowfullscreen" value="true"&gt;&lt;param name="allowscriptaccess" value="always"&gt;&lt;param name="flashvars" value="id=25485&amp;amp;type=3"&gt;&lt;embed src="http://www.scivee.tv/flash/embedCast.swf" allowfullscreen="true" allowscriptaccess="always" width="480" height="400" flashvars="id=25485&amp;amp;type=3"&gt;&lt;/embed&gt;&lt;/object&gt;&lt;script src="http://b.scorecardresearch.com/beacon.js?c1=7&amp;amp;c2=7400849&amp;amp;c3=1&amp;amp;c4=&amp;amp;c5=&amp;amp;c6="&gt;&lt;/script&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/14568785-8649661215538139465?l=usefulchem.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://usefulchem.blogspot.com/feeds/8649661215538139465/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=14568785&amp;postID=8649661215538139465' title='1 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/14568785/posts/default/8649661215538139465'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/14568785/posts/default/8649661215538139465'/><link rel='alternate' type='text/html' href='http://usefulchem.blogspot.com/2010/11/nanoinformatics-2010-conference-report.html' title='Nanoinformatics 2010 Conference Report'/><author><name>Jean-Claude Bradley</name><uri>http://www.blogger.com/profile/09157209601339686455</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='31' height='32' src='http://photos1.blogger.com/img/267/3447/320/headshot.jpg'/></author><thr:total>1</thr:total></entry><entry><id>tag:blogger.com,1999:blog-14568785.post-3682888686753776496</id><published>2010-11-04T09:15:00.000-04:00</published><updated>2010-11-04T09:15:57.715-04:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='onschallenge'/><title type='text'>Sozit Kurtu is Nov10 RSC ONS Award Winner</title><content type='html'>Sozit Kurtu, a chemistry student working under the supervision of Jean-Claude Bradley at Drexel University, is the November 2010 Royal Society of Chemistry Open Notebook Science Challenge Award winner. She wins a cash prize from the RSC.&lt;br /&gt;&lt;br /&gt;Sozit has performed important work in determining the accuracy of a density method of determining solubility and has explored the temperature dependence of the solubility of low melting point solutes in hexane. See her experiments here:&lt;br /&gt;&lt;a href="http://onschallenge.wikispaces.com/list+of+experiments"&gt;http://onschallenge.wikispaces.com/list+of+experiments&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;Four more RSC ONS Awards will be made during 2010-11. Submissions from students in the US and the UK are still welcome.&lt;br /&gt;For more information see:&lt;br /&gt;&lt;a href="http://onschallenge.wikispaces.com"&gt;http://onschallenge.wikispaces.com&lt;/a&gt;&lt;br /&gt;&lt;a href="http://onschallenge.wikispaces.com/RSCAwards2010"&gt;http://onschallenge.wikispaces.com/RSCAwards2010&lt;/a&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/14568785-3682888686753776496?l=usefulchem.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://usefulchem.blogspot.com/feeds/3682888686753776496/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=14568785&amp;postID=3682888686753776496' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/14568785/posts/default/3682888686753776496'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/14568785/posts/default/3682888686753776496'/><link rel='alternate' type='text/html' href='http://usefulchem.blogspot.com/2010/11/sozit-kurtu-is-nov10-rsc-ons-award.html' title='Sozit Kurtu is Nov10 RSC ONS Award Winner'/><author><name>Jean-Claude Bradley</name><uri>http://www.blogger.com/profile/09157209601339686455</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='31' height='32' src='http://photos1.blogger.com/img/267/3447/320/headshot.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-14568785.post-4462442149715066319</id><published>2010-10-26T11:02:00.000-04:00</published><updated>2010-10-26T11:02:56.424-04:00</updated><title type='text'>Elizabeth Brown's guest lecture for ChemInfo Retrieval</title><content type='html'>&lt;a href="http://www.linkedin.com/pub/elizabeth-brown/11/21b/69b"&gt;Elizabeth Brown&lt;/a&gt; from the Binghamton University Libraries presented on "Web 0.0/1.0/2.0/3.0 and Chemical Information" on October 21, 2010 as a guest lecturer for my fifth class on &lt;a href="http://cheminfo2010.wikispaces.com/"&gt;Chemical Information Retrieval&lt;/a&gt; this term.&lt;div&gt;&lt;br /&gt;&lt;object width="480" height="400"&gt;&lt;param name="movie" value="http://www.scivee.tv/flash/embedCast.swf"&gt;&lt;param name="allowfullscreen" value="true"&gt;&lt;param name="allowscriptaccess" value="always"&gt;&lt;param name="flashvars" value="id=25281&amp;amp;type=3"&gt;&lt;embed src="http://www.scivee.tv/flash/embedCast.swf" allowfullscreen="true" allowscriptaccess="always" width="480" height="400" flashvars="id=25281&amp;amp;type=3"&gt;&lt;/embed&gt;&lt;/object&gt;&lt;br /&gt;&lt;br /&gt;&lt;div style="width:425px" id="__ss_5477821"&gt;&lt;strong style="display:block;margin:12px 0 4px"&gt;&lt;a href="http://www.slideshare.net/ebrown/chemical-information-retrieval-class-10-21-2010" title="Chemical information retrieval class 10 21 2010"&gt;Chemical information retrieval class 10 21 2010&lt;/a&gt;&lt;/strong&gt;&lt;object id="__sse5477821" width="425" height="355"&gt;&lt;param name="movie" value="http://static.slidesharecdn.com/swf/ssplayer2.swf?doc=chemicalinformationretrievallecturedrexel10182010-101018103306-phpapp02&amp;amp;stripped_title=chemical-information-retrieval-class-10-21-2010&amp;amp;userName=ebrown"&gt;&lt;param name="allowFullScreen" value="true"&gt;&lt;param name="allowScriptAccess" value="always"&gt;&lt;embed name="__sse5477821" src="http://static.slidesharecdn.com/swf/ssplayer2.swf?doc=chemicalinformationretrievallecturedrexel10182010-101018103306-phpapp02&amp;amp;stripped_title=chemical-information-retrieval-class-10-21-2010&amp;amp;userName=ebrown" type="application/x-shockwave-flash" allowscriptaccess="always" allowfullscreen="true" width="425" height="355"&gt;&lt;/embed&gt;&lt;/object&gt;&lt;div style="padding:5px 0 12px"&gt;View more &lt;a href="http://www.slideshare.net/"&gt;presentations&lt;/a&gt; from &lt;a href="http://www.slideshare.net/ebrown"&gt;Elizabeth  Brown&lt;/a&gt;.&lt;/div&gt;&lt;/div&gt;Beth made an interesting analogy with art to illustrate the differences between these communication platforms.  What stuck me during her presentation was the similarity between the current state of the semantic web (Web3.0) and the state of computerized searching in chemistry when I was a graduate student in the early 90s.  I had the chance to do just one substructure search at the time and it had to be done through an expert librarian.  The search had to be carefully planned because of the expense.&lt;br /&gt;&lt;br /&gt;From what I recall, the perception at the time was that computerized searching was impractical and perhaps even unnecessary.  After all "the way" to search for chemical information was to spend a weekend in the library systematically going through Chemical Abstracts books.  This had "worked" for long time for the chemistry community and doing things differently was considered superfluous and even wasteful.&lt;br /&gt;&lt;br /&gt;Today, "the way" to search for chemical information is to use expensive databases to perform a targeted search and extract the information from mainly toll access peer reviewed journals. If you are off the academic grid you have to rely on free information on the web which is rarely associated with a chain of provenance. As my students will attest, even with access to the best tools, it still takes a lot of time to find the information and compare it for consistency.&lt;br /&gt;&lt;br /&gt;The unfamiliarity with computerized searching in the early 90s is now the common attitude towards the semantic web.  This is understandable because its availability is limited and people don't understand what to do with the tools that are available.  &lt;a href="http://onswebservices.wikispaces.com/"&gt;Web services that we provide&lt;/a&gt; (derived from other services from ChemSpider and other sources) are usable by anyone who can open up a Google Spreadsheet and copy and paste a URL but it will take time for people to understand how to incorporate these into their workflows.&lt;br /&gt;&lt;br /&gt;I think that in 10 years the semantic web will simply be part of the infrastructure.   Currently, when you type a word in a browser or word processing software the system knows enough to alert you to possible misspelling by underlining a word.  Similarly, in the near future, typing or specifying in some way a chemical compound will automatically pull in all relevant measured or calculated properties and provide suggestions in the context of a chemical reaction under consideration.  Access to information will be free and unencumbered and the chain of provenance will be clear.&lt;br /&gt;&lt;br /&gt;Instead of taking hours to find and process property data for single compounds, I think students in the future will be handling large libraries of compounds, looking for the best synthetic targets for their applications.&lt;/div&gt;&lt;script src="http://b.scorecardresearch.com/beacon.js?c1=7&amp;amp;c2=7400849&amp;amp;c3=1&amp;amp;c4=&amp;amp;c5=&amp;amp;c6="&gt;&lt;/script&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/14568785-4462442149715066319?l=usefulchem.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://usefulchem.blogspot.com/feeds/4462442149715066319/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=14568785&amp;postID=4462442149715066319' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/14568785/posts/default/4462442149715066319'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/14568785/posts/default/4462442149715066319'/><link rel='alternate' type='text/html' href='http://usefulchem.blogspot.com/2010/10/elizabeth-browns-guest-lecture-for.html' title='Elizabeth Brown&apos;s guest lecture for ChemInfo Retrieval'/><author><name>Jean-Claude Bradley</name><uri>http://www.blogger.com/profile/09157209601339686455</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='31' height='32' src='http://photos1.blogger.com/img/267/3447/320/headshot.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-14568785.post-7288098103452858104</id><published>2010-10-15T16:20:00.000-04:00</published><updated>2010-10-15T16:20:31.053-04:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Mendeley'/><category scheme='http://www.blogger.com/atom/ns#' term='onschallenge'/><category scheme='http://www.blogger.com/atom/ns#' term='open notebook science'/><category scheme='http://www.blogger.com/atom/ns#' term='web service'/><title type='text'>Dynamic links to private tagged Mendeley collections</title><content type='html'>My close collaborators and I have been using &lt;a href="http://www.mendeley.com/"&gt;Mendeley&lt;/a&gt; as a convenient way to share PDFs of journal articles.  Not all of us have access to the same libraries so links are not enough - we need the full documents.  We also use &lt;a href="http://www.dropbox.com/"&gt;Dropbox&lt;/a&gt; as a redundancy but Mendeley allows tagging and recording notes, which is very handy for everyone in the group.&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;Now that &lt;a href="http://dev.mendeley.com/"&gt;Mendeley is providing an API&lt;/a&gt;, &lt;a href="http://en.wikipedia.org/wiki/Andrew_S.I.D._Lang"&gt;Andrew Lang&lt;/a&gt; has written &lt;a href="http://onswebservices.wikispaces.com/mendeley"&gt;code&lt;/a&gt; that significantly leverages the information in our private ONS collection.  We can now create public links that return the most updated results for specific tags, including multiple tags (which I don't think you can do on Mendeley).  For example the following link returns all articles in the ONS collection tagged with "science2.0" and "chemistry":&lt;/div&gt;&lt;div&gt;&lt;a href="http://showme.physics.drexel.edu/onsc/mendeley/?tags=science2.0,chemistry"&gt;&lt;blockquote&gt;http://showme.physics.drexel.edu/onsc/mendeley/?tags=science2.0,chemistry&lt;/blockquote&gt;&lt;/a&gt;&lt;/div&gt;The results include available information from Mendeley, including the title, authors, journal citation, doi, url, tags and the abstract.  Because this information is public the PDFs can't be provided but the hyperlinks make it as convenient as possible.&lt;div&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://4.bp.blogspot.com/_R3jz5f_NkRo/TLiuv77qpBI/AAAAAAAAA5k/NUVNF-yvP5I/s1600/mendservice1.png"&gt;&lt;img style="display:block; margin:0px auto 10px; text-align:center;cursor:pointer; cursor:hand;width: 400px; height: 171px;" src="http://4.bp.blogspot.com/_R3jz5f_NkRo/TLiuv77qpBI/AAAAAAAAA5k/NUVNF-yvP5I/s400/mendservice1.png" border="0" alt="" id="BLOGGER_PHOTO_ID_5528360681162318866" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;br /&gt;At the end of the report the full list of all available tags for the ONS collection is provided.  A more refined or different search can be done immediately simply by checking boxes and hitting the submit button.&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://1.bp.blogspot.com/_R3jz5f_NkRo/TLixCID2BrI/AAAAAAAAA5s/7RVz3fqyOyk/s1600/mendservice2.png"&gt;&lt;img style="display:block; margin:0px auto 10px; text-align:center;cursor:pointer; cursor:hand;width: 400px; height: 203px;" src="http://1.bp.blogspot.com/_R3jz5f_NkRo/TLixCID2BrI/AAAAAAAAA5s/7RVz3fqyOyk/s400/mendservice2.png" border="0" alt="" id="BLOGGER_PHOTO_ID_5528363192678745778" /&gt;&lt;/a&gt;Because the tags are controlled by the users of the private collection, these links can be useful when discussing an ongoing project and referring to a very specific topic.  For example, we have been collecting examples of articles where a Ugi reaction is carried out and the product precipitates.  This link provides an updated report on that very narrow topic:&lt;br /&gt;&lt;a href="http://showme.physics.drexel.edu/onsc/mendeley/?tags=Ugi+precipitate"&gt;&lt;blockquote&gt;http://showme.physics.drexel.edu/onsc/mendeley/?tags=Ugi+precipitate&lt;/blockquote&gt;&lt;/a&gt;There are still 2 major limitations to this service:&lt;div&gt;&lt;br /&gt;1) The search is very slow (can take a minute or two) because there is no way currently to use the Mendeley API to selectively return results based on tags.  Every search requires initially returning all results for the collection (currently a few hundred).&lt;/div&gt;&lt;div&gt;&lt;br /&gt;2) Notes are currently not returned.  If the API is updated to include these the usefulness would increase dramatically.  For example in the results for the above query I took notes of the conditions involved in the Ugi precipitate for each paper. With the current format, one has to read each paper to find the relevant information.  &lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;Progress on our Mendeley related services will be posted on the &lt;a href="http://onswebservices.wikispaces.com/mendeley"&gt;ONSwebservices wiki&lt;/a&gt;.&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/14568785-7288098103452858104?l=usefulchem.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://usefulchem.blogspot.com/feeds/7288098103452858104/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=14568785&amp;postID=7288098103452858104' title='1 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/14568785/posts/default/7288098103452858104'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/14568785/posts/default/7288098103452858104'/><link rel='alternate' type='text/html' href='http://usefulchem.blogspot.com/2010/10/dynamic-links-to-private-tagged.html' title='Dynamic links to private tagged Mendeley collections'/><author><name>Jean-Claude Bradley</name><uri>http://www.blogger.com/profile/09157209601339686455</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='31' height='32' src='http://photos1.blogger.com/img/267/3447/320/headshot.jpg'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://4.bp.blogspot.com/_R3jz5f_NkRo/TLiuv77qpBI/AAAAAAAAA5k/NUVNF-yvP5I/s72-c/mendservice1.png' height='72' width='72'/><thr:total>1</thr:total></entry><entry><id>tag:blogger.com,1999:blog-14568785.post-3907782816473451421</id><published>2010-10-07T12:47:00.001-04:00</published><updated>2010-10-07T12:49:50.624-04:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='drug design'/><category scheme='http://www.blogger.com/atom/ns#' term='onschallenge'/><category scheme='http://www.blogger.com/atom/ns#' term='malaria'/><category scheme='http://www.blogger.com/atom/ns#' term='solubility'/><title type='text'>Drexel Chemistry Mini-Symposium on Bradley Lab</title><content type='html'>Every year the chemistry department at Drexel gives faculty the opportunity to present their research to incoming students in 10 minutes slots.  On September 30, 2010 I presented on "Open Notebook Science for Malaria Drug Discovery and Solubility Modeling".  I think such a short format is good for keeping student attention.  Recording it also provides a handy link to use for other purposes.  Most people just don't have time for 30-60 minute presentations.&lt;br /&gt;&lt;div style="width:425px" id="__ss_5324479"&gt;&lt;strong style="display:block;margin:12px 0 4px"&gt;&lt;a href="http://www.slideshare.net/jcbradley/bradley-drexel-minisymp-2010" title="Bradley Drexel MiniSymp 2010"&gt;Bradley Drexel MiniSymp 2010&lt;/a&gt;&lt;/strong&gt;&lt;object id="__sse5324479" width="425" height="355"&gt;&lt;param name="movie" value="http://static.slidesharecdn.com/swf/ssplayer2.swf?doc=bradleyminisymp10-100930100932-phpapp01&amp;amp;stripped_title=bradley-drexel-minisymp-2010&amp;amp;userName=jcbradley"&gt;&lt;param name="allowFullScreen" value="true"&gt;&lt;param name="allowScriptAccess" value="always"&gt;&lt;embed name="__sse5324479" src="http://static.slidesharecdn.com/swf/ssplayer2.swf?doc=bradleyminisymp10-100930100932-phpapp01&amp;amp;stripped_title=bradley-drexel-minisymp-2010&amp;amp;userName=jcbradley" type="application/x-shockwave-flash" allowscriptaccess="always" allowfullscreen="true" width="425" height="355"&gt;&lt;/embed&gt;&lt;/object&gt;&lt;div style="padding:5px 0 12px"&gt;View more &lt;a href="http://www.slideshare.net/"&gt;presentations&lt;/a&gt; from &lt;a href="http://www.slideshare.net/jcbradley"&gt;Jean-Claude Bradley&lt;/a&gt;.&lt;/div&gt;&lt;/div&gt;&lt;script src="http://b.scorecardresearch.com/beacon.js?c1=7&amp;amp;c2=7400849&amp;amp;c3=1&amp;amp;c4=&amp;amp;c5=&amp;amp;c6="&gt;&lt;/script&gt;&lt;script src="http://b.scorecardresearch.com/beacon.js?c1=7&amp;amp;c2=7400849&amp;amp;c3=1&amp;amp;c4=&amp;amp;c5=&amp;amp;c6="&gt;&lt;/script&gt;&lt;br /&gt;&lt;br /&gt;&lt;object width="480" height="400"&gt;&lt;param name="movie" value="http://www.scivee.tv/flash/embedCast.swf" /&gt;&lt;param name="allowfullscreen" value="true" /&gt;&lt;param name="flashvars" value="id=24821&amp;type=3" /&gt;&lt;embed src="http://www.scivee.tv/flash/embedCast.swf" allowfullscreen="true" width="480" height="400" flashvars="id=24821&amp;type=3"&gt;&lt;/embed&gt;&lt;/object&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/14568785-3907782816473451421?l=usefulchem.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://usefulchem.blogspot.com/feeds/3907782816473451421/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=14568785&amp;postID=3907782816473451421' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/14568785/posts/default/3907782816473451421'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/14568785/posts/default/3907782816473451421'/><link rel='alternate' type='text/html' href='http://usefulchem.blogspot.com/2010/10/drexel-chemistry-mini-symposium-on.html' title='Drexel Chemistry Mini-Symposium on Bradley Lab'/><author><name>Jean-Claude Bradley</name><uri>http://www.blogger.com/profile/09157209601339686455</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='31' height='32' src='http://photos1.blogger.com/img/267/3447/320/headshot.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-14568785.post-2792289905846251410</id><published>2010-10-07T11:22:00.001-04:00</published><updated>2010-10-07T15:24:38.240-04:00</updated><title type='text'>The Meaning of Data panel at a class on the Rhetoric of Science</title><content type='html'>&lt;div&gt;&lt;b&gt;Update: Lawrence Souder provided the &lt;/b&gt;&lt;a href="http://drop.io/Rhetoric_of_Science"&gt;&lt;b&gt;audio for the presentation and panel questions&lt;/b&gt;&lt;/a&gt;&lt;b&gt;.&lt;/b&gt;&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;On October 6, 2010 I had the pleasure to participate in a panel discussion at &lt;a href="http://www.pages.drexel.edu/~ls39/com400/"&gt;Lawrence Souder's class on the Rhetoric of Science&lt;/a&gt;.  I gave a brief presentation on "The Meaning of Data" to kick off the discussion.  I provided a scientist's perspective of data - with the basic idea that at best data are evidence.  Data cannot be treated as irrefutable facts since there is always some uncertainty and many assumptions must be made in their interpretation.  I also argued that, although uncertainty cannot be eliminated completely, transparency goes a long way to reducing it.&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;The students in the class did not have a science background and I think it was informative for them to explore a different perspective from their other exposure to science through popular media or even textbooks.  The term "fact" is thrown around a lot in these information sources to simplify but it doesn't reflect how scientists think about data.  We discussed how unbelievably wrong the scientific details in movies and TV shows such as NCIS and House can be.  Nevertheless, one student pointed out that these shows can be effective in attracting people to science - as long as it was understood that the details were probably incorrect.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;Because of market forces we recognized that science portrayed in the popular media would have a strong tendency to be exaggerated or oversimplified resulting in the phenomenon of "hype".  A related effect can distort the way scientists communicate with each other, where there is a perception that exposing ambivalence in the form of seemingly contradictory data may not be in the researcher's best interest.  This is a strong deterrent to the general adoption of transparency.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;We explored the possible evolution of scientific communication as new tools such as blogs and wikis become increasingly used by scientists.  Of course the issue of claims of priority was brought up and I discussed how this issue was handled in &lt;a href="http://friendfeed.com/jcbradley/8c2ae960/bipolar-electrodeposition-of-cds-scientific"&gt;my own research work&lt;/a&gt;.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;We debated whether these new forms of communication would alter the language used in scientific communication - even in traditional journals.  I think that with the advent of the semantic web many researchers will start to write in a way that is understandable to humans as well as machines, their new target audience.  One student remarked that the new generation coming up is very used to texting and that type of succinct communication is certainly in line with machine readability.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;The assigned reading for the class involved a study of the language used by the Nobel laureates who discovered the buckyball to describe their research:&lt;/div&gt;"&lt;a href="http://jbt.sagepub.com/content/21/3/303.abstract"&gt;In Praise of Carbon, In Praise of Science - The Epideictic Rhetoric of the 1996 Nobel Lectures in Chemistry by Christian Casper&lt;/a&gt;".  This paper demonstrates that both personality and the perceived target audience can dramatically affect the language and focus that scientists use to explain their research.&lt;br /&gt;&lt;div style="width:425px" id="__ss_5373297"&gt;&lt;strong style="display:block;margin:12px 0 4px"&gt;&lt;a href="http://www.slideshare.net/jcbradley/the-meaning-of-data" title="The Meaning of Data"&gt;The Meaning of Data&lt;/a&gt;&lt;/strong&gt;&lt;object id="__sse5373297" width="425" height="355"&gt;&lt;param name="movie" value="http://static.slidesharecdn.com/swf/ssplayer2.swf?doc=bradleysouderclass10-101006111213-phpapp01&amp;amp;stripped_title=the-meaning-of-data&amp;amp;userName=jcbradley"&gt;&lt;param name="allowFullScreen" value="true"&gt;&lt;param name="allowScriptAccess" value="always"&gt;&lt;embed name="__sse5373297" src="http://static.slidesharecdn.com/swf/ssplayer2.swf?doc=bradleysouderclass10-101006111213-phpapp01&amp;amp;stripped_title=the-meaning-of-data&amp;amp;userName=jcbradley" type="application/x-shockwave-flash" allowscriptaccess="always" allowfullscreen="true" width="425" height="355"&gt;&lt;/embed&gt;&lt;/object&gt;&lt;div style="padding:5px 0 12px"&gt;View more &lt;a href="http://www.slideshare.net/"&gt;presentations&lt;/a&gt; from &lt;a href="http://www.slideshare.net/jcbradley"&gt;Jean-Claude Bradley&lt;/a&gt;.&lt;/div&gt;&lt;/div&gt;&lt;script src="http://b.scorecardresearch.com/beacon.js?c1=7&amp;amp;c2=7400849&amp;amp;c3=1&amp;amp;c4=&amp;amp;c5=&amp;amp;c6="&gt;&lt;/script&gt;&lt;script src="http://b.scorecardresearch.com/beacon.js?c1=7&amp;amp;c2=7400849&amp;amp;c3=1&amp;amp;c4=&amp;amp;c5=&amp;amp;c6="&gt;&lt;/script&gt;&lt;script src="http://b.scorecardresearch.com/beacon.js?c1=7&amp;amp;c2=7400849&amp;amp;c3=1&amp;amp;c4=&amp;amp;c5=&amp;amp;c6="&gt;&lt;/script&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/14568785-2792289905846251410?l=usefulchem.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://usefulchem.blogspot.com/feeds/2792289905846251410/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=14568785&amp;postID=2792289905846251410' title='1 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/14568785/posts/default/2792289905846251410'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/14568785/posts/default/2792289905846251410'/><link rel='alternate' type='text/html' href='http://usefulchem.blogspot.com/2010/10/meaning-of-data-panel-at-class-on.html' title='The Meaning of Data panel at a class on the Rhetoric of Science'/><author><name>Jean-Claude Bradley</name><uri>http://www.blogger.com/profile/09157209601339686455</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='31' height='32' src='http://photos1.blogger.com/img/267/3447/320/headshot.jpg'/></author><thr:total>1</thr:total></entry><entry><id>tag:blogger.com,1999:blog-14568785.post-8274137570203164090</id><published>2010-10-06T13:21:00.000-04:00</published><updated>2010-10-06T13:21:41.754-04:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='class'/><category scheme='http://www.blogger.com/atom/ns#' term='cheminformatics'/><title type='text'>Cheminfo Retrieval Classes 1 and 2 in 2010</title><content type='html'>My first &lt;a href="http://cheminfo2010.wikispaces.com/"&gt;Chemical Information Retrieval class for the Fall of 2010&lt;/a&gt; took place on Sept 23, 2010.  This is the second time that I've taught the class as sole instructor and it was certainly convenient to have &lt;a href="http://getcheminfo.wikispaces.com/"&gt;last year's wiki&lt;/a&gt; to build upon.  The assignments are the same so it was helpful to be able to give students access to what students did last year as examples.&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;The key message from my introductory lecture was that it can be really difficult to find usable chemical information and that there are no shortcuts like relying on a true trusted source - those don't exist.  I showed a few examples of emerging models - Open Access, Open Notebook Science, Collaborative Competition (like pharma companies sharing some drug data openly) and other Open Science initiatives.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;I also announced that we would be doing something new in the Science3.0 theme (the semantic web).  One of the assignments involves collecting 5 values from the literature for each of 5 properties for a compound of the student's choice.  In addition to adding these values on the wiki, we will collect them in a format that is friendly to machines: a &lt;a href="https://spreadsheets0.google.com/ccc?key=t7MwL3EPPudLIjqf7w_d-UA&amp;amp;hl=en#gid=0"&gt;ChemInfo Validation Google Spreadsheet&lt;/a&gt;.  Andrew Lang has agreed to help with adapting our previous code for solubility to creating web services for this application.  For example, we can have a service that reports the mean and standard deviation for a particular property and chemical.  Another could produce statistics for a given data source or compare peer reviewed vs non peer reviewed sources, etc.  Since it will be possible to to call these web services from within a Google Spreadsheet or Excel it should enable much more sophisticated analysis of the data related to the "validity" of chemical information as it exists today.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;I didn't record the first lecture but I have the slides below:&lt;/div&gt;&lt;div style="width:425px" id="__ss_5271429"&gt;&lt;strong style="display:block;margin:12px 0 4px"&gt;&lt;a href="http://www.slideshare.net/jcbradley/cheminfo-retrieval-2010-class-1-5271429" title="Cheminfo Retrieval 2010 Class 1"&gt;Cheminfo Retrieval 2010 Class 1&lt;/a&gt;&lt;/strong&gt;&lt;object id="__sse5271429" width="425" height="355"&gt;&lt;param name="movie" value="http://static.slidesharecdn.com/swf/ssplayer2.swf?doc=cheminforet2010class1-100923152724-phpapp01&amp;amp;stripped_title=cheminfo-retrieval-2010-class-1-5271429&amp;amp;userName=jcbradley"&gt;&lt;param name="allowFullScreen" value="true"&gt;&lt;param name="allowScriptAccess" value="always"&gt;&lt;embed name="__sse5271429" src="http://static.slidesharecdn.com/swf/ssplayer2.swf?doc=cheminforet2010class1-100923152724-phpapp01&amp;amp;stripped_title=cheminfo-retrieval-2010-class-1-5271429&amp;amp;userName=jcbradley" type="application/x-shockwave-flash" allowscriptaccess="always" allowfullscreen="true" width="425" height="355"&gt;&lt;/embed&gt;&lt;/object&gt;&lt;div style="padding:5px 0 12px"&gt;View more &lt;a href="http://www.slideshare.net/"&gt;presentations&lt;/a&gt; from &lt;a href="http://www.slideshare.net/jcbradley"&gt;Jean-Claude Bradley&lt;/a&gt;.&lt;/div&gt;&lt;/div&gt;During the second lecture on September 30, 2010 I spent most of the time showing students how to use Beilstein Crossfire, SciFinder and ChemSpider to find values for chemical properties.  The recording for the second lecture is available below:&lt;br /&gt;&lt;br /&gt;&lt;object width="480" height="400"&gt;&lt;param name="movie" value="http://www.scivee.tv/flash/embedCast.swf"&gt;&lt;param name="allowfullscreen" value="true"&gt;&lt;param name="flashvars" value="id=24862&amp;amp;type=3"&gt;&lt;embed src="http://www.scivee.tv/flash/embedCast.swf" allowfullscreen="true" width="480" height="400" flashvars="id=24862&amp;amp;type=3"&gt;&lt;/embed&gt;&lt;/object&gt;&lt;script src="http://b.scorecardresearch.com/beacon.js?c1=7&amp;amp;c2=7400849&amp;amp;c3=1&amp;amp;c4=&amp;amp;c5=&amp;amp;c6="&gt;&lt;/script&gt;&lt;script src="http://b.scorecardresearch.com/beacon.js?c1=7&amp;amp;c2=7400849&amp;amp;c3=1&amp;amp;c4=&amp;amp;c5=&amp;amp;c6="&gt;&lt;/script&gt;&lt;script src="http://b.scorecardresearch.com/beacon.js?c1=7&amp;amp;c2=7400849&amp;amp;c3=1&amp;amp;c4=&amp;amp;c5=&amp;amp;c6="&gt;&lt;/script&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/14568785-8274137570203164090?l=usefulchem.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://usefulchem.blogspot.com/feeds/8274137570203164090/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=14568785&amp;postID=8274137570203164090' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/14568785/posts/default/8274137570203164090'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/14568785/posts/default/8274137570203164090'/><link rel='alternate' type='text/html' href='http://usefulchem.blogspot.com/2010/10/cheminfo-retrieval-classes-1-and-2-in.html' title='Cheminfo Retrieval Classes 1 and 2 in 2010'/><author><name>Jean-Claude Bradley</name><uri>http://www.blogger.com/profile/09157209601339686455</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='31' height='32' src='http://photos1.blogger.com/img/267/3447/320/headshot.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-14568785.post-1875248479185586930</id><published>2010-09-01T10:21:00.000-04:00</published><updated>2010-09-01T10:21:42.963-04:00</updated><title type='text'>Impact of Open Notebook Science Interview</title><content type='html'>Richard Poynder's interview of me on Information Today is now available: "&lt;a href="http://www.infotoday.com/IT/sep10/Poynder.shtml"&gt;The Impact of Open Notebook Science&lt;/a&gt;".  I appreciate the time he took to go into this much depth.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/14568785-1875248479185586930?l=usefulchem.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://usefulchem.blogspot.com/feeds/1875248479185586930/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=14568785&amp;postID=1875248479185586930' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/14568785/posts/default/1875248479185586930'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/14568785/posts/default/1875248479185586930'/><link rel='alternate' type='text/html' href='http://usefulchem.blogspot.com/2010/09/impact-of-open-notebook-science.html' title='Impact of Open Notebook Science Interview'/><author><name>Jean-Claude Bradley</name><uri>http://www.blogger.com/profile/09157209601339686455</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='31' height='32' src='http://photos1.blogger.com/img/267/3447/320/headshot.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-14568785.post-2896494517943224859</id><published>2010-08-26T10:21:00.001-04:00</published><updated>2010-08-26T10:22:03.360-04:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='drug design'/><category scheme='http://www.blogger.com/atom/ns#' term='open notebook science'/><category scheme='http://www.blogger.com/atom/ns#' term='open science'/><category scheme='http://www.blogger.com/atom/ns#' term='opal'/><title type='text'>Open Notebook Science in Drug Discovery at Opal Event</title><content type='html'>I presented on "Open Notebook Science in Drug Discovery" on August 24, 2010 at a panel on Industry and Academia part of the Opal Event "&lt;a href="http://www.opalevents.org/conferencehtml/current/drug_discovery_easing_the_bottleneck/drug_discovery_easing_the_bottleneck_agenda.php"&gt;Drug Discovery: Easing the Bottleneck&lt;/a&gt;".&lt;div&gt;&lt;div style="width:425px" id="__ss_5046934"&gt;&lt;strong style="display:block;margin:12px 0 4px"&gt;&lt;a href="http://www.slideshare.net/jcbradley/bradley-opal10" title="Open Notebook Science in Drug Discovery"&gt;Open Notebook Science in Drug Discovery&lt;/a&gt;&lt;/strong&gt;&lt;object id="__sse5046934" width="425" height="355"&gt;&lt;param name="movie" value="http://static.slidesharecdn.com/swf/ssplayer2.swf?doc=bradleyopal10-100824120207-phpapp02&amp;amp;stripped_title=bradley-opal10"&gt;&lt;param name="allowFullScreen" value="true"&gt;&lt;param name="allowScriptAccess" value="always"&gt;&lt;embed name="__sse5046934" src="http://static.slidesharecdn.com/swf/ssplayer2.swf?doc=bradleyopal10-100824120207-phpapp02&amp;amp;stripped_title=bradley-opal10" type="application/x-shockwave-flash" allowscriptaccess="always" allowfullscreen="true" width="425" height="355"&gt;&lt;/embed&gt;&lt;/object&gt;&lt;div style="padding:5px 0 12px"&gt;View more &lt;a href="http://www.slideshare.net/"&gt;presentations&lt;/a&gt; from &lt;a href="http://www.slideshare.net/jcbradley"&gt;Jean-Claude Bradley&lt;/a&gt;.&lt;/div&gt;&lt;/div&gt;I only had about 15 minutes to present so I could not go into much detail but I did want to highlight the most recent work Andrew Lang and I (also with &lt;a href="http://usefulchem.blogspot.com/2010/08/chemtaverna-workflows-of-ons-web.html"&gt;Peter Li&lt;/a&gt; from ChemTaverna) carried out involving &lt;a href="http://usefulchem.blogspot.com/2010/08/reaction-attempts-solvent-selector.html"&gt;solubility prediction and web services&lt;/a&gt;.  Most of the attendees were from industry and I appropriately used the recent &lt;a href="http://www.ncbi.nlm.nih.gov/pubmed/20622810"&gt;GSK malaria data sharing&lt;/a&gt; to introduce the talk.  It is clear that there is a role for Open Science in drug discovery and I think that industry involvement will continue to increase in this area.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;My co-panelist &lt;a href="http://www.ohio.edu/research/vpr-bio.cfm"&gt;Rathindra Bose&lt;/a&gt; from Ohio University presented on his group's development of a novel cancer treatment compound based on platinum.  He made the point that academic research complements that from industry by being able to explore more speculative hypotheses.  The dominant hypothesis for the mechanism of action of platinum based drugs is binding with DNA.  By exploring alternative scenarios, his group found an active platinum drug that does not bind with DNA.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;During the preceding session on the Emergence of Biologics in Drug Discovery, &lt;a href="http://www.linkedin.com/pub/albert-giovenella/a/b8a/707"&gt;Albert Giovanella&lt;/a&gt; from the University of Pennsylvania School of Medicine gave a particularly enlightening talk about comparing &lt;a href="http://en.wikipedia.org/wiki/Biologic"&gt;biologics&lt;/a&gt; with small molecule drugs.  Although biological drugs tend to have less toxicity, the overall cost to bring them to market is still quite high and their cost to the consumer may be so high as to limit their impact.  It looks like it will not be generally easy to translate new biomedical knowledge to a widespread impact on human health.&lt;/div&gt;&lt;script src="http://b.scorecardresearch.com/beacon.js?c1=7&amp;amp;c2=7400849&amp;amp;c3=1&amp;amp;c4=&amp;amp;c5=&amp;amp;c6="&gt;&lt;/script&gt;&lt;script src="http://b.scorecardresearch.com/beacon.js?c1=7&amp;amp;c2=7400849&amp;amp;c3=1&amp;amp;c4=&amp;amp;c5=&amp;amp;c6="&gt;&lt;/script&gt;&lt;script src="http://b.scorecardresearch.com/beacon.js?c1=7&amp;amp;c2=7400849&amp;amp;c3=1&amp;amp;c4=&amp;amp;c5=&amp;amp;c6="&gt;&lt;/script&gt;&lt;script src="http://b.scorecardresearch.com/beacon.js?c1=7&amp;amp;c2=7400849&amp;amp;c3=1&amp;amp;c4=&amp;amp;c5=&amp;amp;c6="&gt;&lt;/script&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/14568785-2896494517943224859?l=usefulchem.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://usefulchem.blogspot.com/feeds/2896494517943224859/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=14568785&amp;postID=2896494517943224859' title='1 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/14568785/posts/default/2896494517943224859'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/14568785/posts/default/2896494517943224859'/><link rel='alternate' type='text/html' href='http://usefulchem.blogspot.com/2010/08/open-notebook-science-in-drug-discovery.html' title='Open Notebook Science in Drug Discovery at Opal Event'/><author><name>Jean-Claude Bradley</name><uri>http://www.blogger.com/profile/09157209601339686455</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='31' height='32' src='http://photos1.blogger.com/img/267/3447/320/headshot.jpg'/></author><thr:total>1</thr:total></entry><entry><id>tag:blogger.com,1999:blog-14568785.post-4338671438708208499</id><published>2010-08-23T16:48:00.000-04:00</published><updated>2010-08-23T16:49:18.089-04:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='myexperiment'/><category scheme='http://www.blogger.com/atom/ns#' term='web service'/><category scheme='http://www.blogger.com/atom/ns#' term='solubility'/><title type='text'>ChemTaverna Workflows of ONS Web Services now on MyExperiment</title><content type='html'>I'm pleased to report that one of the collaborations initiated at the &lt;a href="http://usefulchem.blogspot.com/2010/08/berkeley-open-science-summit-2010-notes.html"&gt;Berkeley Open Science conference&lt;/a&gt; last month is progressing very well.  &lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;&lt;a href="http://www.cs.man.ac.uk/~carole/"&gt;Carole Goble&lt;/a&gt; introduced me to &lt;a href="http://www.myexperiment.org/users/221.html"&gt;Peter Li&lt;/a&gt; who runs the &lt;a href="http://www.mygrid.org.uk/projects/chemtaverna/"&gt;ChemTaverna&lt;/a&gt; project.  The idea was to use &lt;a href="http://www.taverna.org.uk/"&gt;Taverna&lt;/a&gt; to construct workflows using the web services developed by &lt;a href="http://en.wikipedia.org/wiki/Andrew_S.I.D._Lang"&gt;Andrew Lang&lt;/a&gt; for our Open Notebook Science projects: UsefulChem and the ONS Solubility Challenge.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;Peter quickly created several workflows to demonstrate what is possible.  Here is a &lt;a href="http://usefulchem.wikispaces.com/file/view/test8_UgiSolubility.t2flow/157905387/test8_UgiSolubility.t2flow"&gt;workflow&lt;/a&gt; that uses a &lt;a href="https://spreadsheets1.google.com/ccc?key=tR6lhYF_iqGdmceaAg-WRLg&amp;amp;hl=en#gid=0"&gt;Google Spreadsheet&lt;/a&gt; as input.  SMILES for amines, carboxylic acids, aldehydes and isonitriles are entered in the appropriate columns.  The workflow first creates a virtual library of Ugi products from all possible combinations of reactants.  Then each product is submitted to a web service that predicts the solubility in methanol, the most common solvent for Ugi reactions.  &lt;/div&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://3.bp.blogspot.com/_R3jz5f_NkRo/THKgQ_9PniI/AAAAAAAAA4g/7x3f8V04a2U/s1600/workflowbig.png"&gt;&lt;img style="display:block; margin:0px auto 10px; text-align:center;cursor:pointer; cursor:hand;width: 188px; height: 400px;" src="http://3.bp.blogspot.com/_R3jz5f_NkRo/THKgQ_9PniI/AAAAAAAAA4g/7x3f8V04a2U/s400/workflowbig.png" border="0" alt="" id="BLOGGER_PHOTO_ID_5508641508134198818" /&gt;&lt;/a&gt;The &lt;a href="https://spreadsheets0.google.com/ccc?key=tAuvy5vM2Py5ZGuKBzT_s2Q&amp;amp;hl=en#gid=0"&gt;resulting spreadsheet&lt;/a&gt; can then be sorted by predicted solubility to recommend products that are more likely to precipitate from the reaction mixture.  In this particular example Ugi products derived from boc-glycine are predicted to have a low solubility in methanol.  The least soluble &lt;a href="http://www.chemspider.com/Chemical-Structure.16505544.html"&gt;compound&lt;/a&gt; is predicted to have a solubility of only 0.07M&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://2.bp.blogspot.com/_R3jz5f_NkRo/THK45IxRTYI/AAAAAAAAA4o/LmMczxXe25U/s1600/ugilowsol.png"&gt;&lt;img style="display:block; margin:0px auto 10px; text-align:center;cursor:pointer; cursor:hand;width: 214px; height: 196px;" src="http://2.bp.blogspot.com/_R3jz5f_NkRo/THK45IxRTYI/AAAAAAAAA4o/LmMczxXe25U/s400/ugilowsol.png" border="0" alt="" id="BLOGGER_PHOTO_ID_5508668585973730690" /&gt;&lt;/a&gt;In this library, Ugi products derived from boc-methionine are predicted to be too soluble to precipitate.  For example this &lt;a href="http://www.chemspider.com/Chemical-Structure.16518100.html"&gt;Ugi product&lt;/a&gt; has a predicted solubility of 3.7 M.&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://2.bp.blogspot.com/_R3jz5f_NkRo/THK6bQw0WqI/AAAAAAAAA4w/nmYXcQ4bxPY/s1600/ugihighsol.png"&gt;&lt;img style="display:block; margin:0px auto 10px; text-align:center;cursor:pointer; cursor:hand;width: 236px; height: 196px;" src="http://2.bp.blogspot.com/_R3jz5f_NkRo/THK6bQw0WqI/AAAAAAAAA4w/nmYXcQ4bxPY/s400/ugihighsol.png" border="0" alt="" id="BLOGGER_PHOTO_ID_5508670271746497186" /&gt;&lt;/a&gt;(note: &lt;i&gt;ChemSpider has a tendency to draw the minor tautomer for some amides and carbamates&lt;/i&gt;)&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;There are a few issues to take into consideration in order to use this particular workflow:&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;1) This will only work on &lt;a href="https://launchpad.net/taverna/+milestone/2.1.2"&gt;Taverna Workbench 2.1.2&lt;/a&gt; with these &lt;a href="http://dbk-ed.mib.man.ac.uk/taverna/t2/plugins/"&gt;plug-ins&lt;/a&gt; installed.  At one point it will be made to work on &lt;a href="http://www.taverna.org.uk/download/workbench/"&gt;Taverna Workbench 2.2&lt;/a&gt; and uploaded onto &lt;a href="http://www.myexperiment.org/"&gt;MyExperiment&lt;/a&gt;.  The workflow used here is currently available &lt;a href="http://usefulchem.wikispaces.com/file/view/test8_UgiSolubility.t2flow/157905387/test8_UgiSolubility.t2flow"&gt;here&lt;/a&gt;.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;2) The SMILES in the input Google Spreadsheet must be written in the format of the &lt;a href="https://spreadsheets1.google.com/ccc?key=tR6lhYF_iqGdmceaAg-WRLg&amp;amp;hl=en#gid=0"&gt;current example&lt;/a&gt; (aldehyde, amine and isonitrile groups on the left and carboxylic acid groups on the right)&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;3) All of the Ugi products in the virtual library must already exist in ChemSpider.  Otherwise, the solubility predictions will fail because of missing descriptors as &lt;a href="http://usefulchem.blogspot.com/2010/07/general-transparent-solubility.html"&gt;discussed previously&lt;/a&gt;. &lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;Peter has uploaded simpler workflows onto MyExperiment that are compatible with the current version of Taverna Workbench (v2.2).  &lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;First, the generation of Ugi product libraries from reactant SMILES in a Google Spreadsheet is available &lt;a href="http://www.myexperiment.org/workflows/1474.html"&gt;here&lt;/a&gt;.&lt;/div&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://4.bp.blogspot.com/_R3jz5f_NkRo/THLDM_OQBTI/AAAAAAAAA44/wIemtNjZHrw/s1600/libworkflow.png"&gt;&lt;img style="display:block; margin:0px auto 10px; text-align:center;cursor:pointer; cursor:hand;width: 250px; height: 320px;" src="http://4.bp.blogspot.com/_R3jz5f_NkRo/THLDM_OQBTI/AAAAAAAAA44/wIemtNjZHrw/s400/libworkflow.png" border="0" alt="" id="BLOGGER_PHOTO_ID_5508679922124588338" /&gt;&lt;/a&gt;Another workflow handles the &lt;a href="http://www.myexperiment.org/workflows/1471.html"&gt;prediction of Abraham descriptors&lt;/a&gt;.&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://2.bp.blogspot.com/_R3jz5f_NkRo/THLEEdqsTvI/AAAAAAAAA5A/MEhNKrd9JDA/s1600/ADworkflow.png"&gt;&lt;img style="display:block; margin:0px auto 10px; text-align:center;cursor:pointer; cursor:hand;width: 235px; height: 305px;" src="http://2.bp.blogspot.com/_R3jz5f_NkRo/THLEEdqsTvI/AAAAAAAAA5A/MEhNKrd9JDA/s400/ADworkflow.png" border="0" alt="" id="BLOGGER_PHOTO_ID_5508680875189751538" /&gt;&lt;/a&gt;This workflow processes the &lt;a href="http://www.myexperiment.org/workflows/1467.html"&gt;prediction of solubility for a given solute and solvent&lt;/a&gt;.&lt;div&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://3.bp.blogspot.com/_R3jz5f_NkRo/THLFGF9OQGI/AAAAAAAAA5Q/6erg4oDZCUQ/s1600/solpredworkflow.png"&gt;&lt;img style="display:block; margin:0px auto 10px; text-align:center;cursor:pointer; cursor:hand;width: 190px; height: 400px;" src="http://3.bp.blogspot.com/_R3jz5f_NkRo/THLFGF9OQGI/AAAAAAAAA5Q/6erg4oDZCUQ/s400/solpredworkflow.png" border="0" alt="" id="BLOGGER_PHOTO_ID_5508682002696388706" /&gt;&lt;/a&gt;The main rationale for incorporating web services derived from our Open Notebook Science projects into Taverna is &lt;b&gt;leverage&lt;/b&gt;.  MyExperiment already benefits from a vigorous community of developers in the bioinformatics arena. With the growth of the ChemTaverna initiative, the integration of cheminformatics and bioinformatics workflows should become seamless.&lt;br /&gt;&lt;br /&gt;By making our &lt;a href="http://onswebservices.wikispaces.com"&gt;solubility and chemical reaction web services&lt;/a&gt; available in formats that are convenient for others to use it increases the opportunities that our work will be actually useful.  It also makes it easier for us to leverage the resources made available by others for our own applications in drug discovery and reaction design.&lt;br /&gt;&lt;br /&gt;Essentially this means that we have extended the reach of the information cascade triggered by the recording of an experiment in a laboratory notebook and a very simple &lt;a href="http://onsbooks.wikispaces.com/"&gt;abstraction process&lt;/a&gt; to represent that experiment in a semantically addressable format.&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/14568785-4338671438708208499?l=usefulchem.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://usefulchem.blogspot.com/feeds/4338671438708208499/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=14568785&amp;postID=4338671438708208499' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/14568785/posts/default/4338671438708208499'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/14568785/posts/default/4338671438708208499'/><link rel='alternate' type='text/html' href='http://usefulchem.blogspot.com/2010/08/chemtaverna-workflows-of-ons-web.html' title='ChemTaverna Workflows of ONS Web Services now on MyExperiment'/><author><name>Jean-Claude Bradley</name><uri>http://www.blogger.com/profile/09157209601339686455</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='31' height='32' src='http://photos1.blogger.com/img/267/3447/320/headshot.jpg'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://3.bp.blogspot.com/_R3jz5f_NkRo/THKgQ_9PniI/AAAAAAAAA4g/7x3f8V04a2U/s72-c/workflowbig.png' height='72' width='72'/><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-14568785.post-7822134275570705357</id><published>2010-08-18T11:45:00.001-04:00</published><updated>2010-08-26T15:49:17.760-04:00</updated><title type='text'>ONS Challenge Solubility Data used in COSMO-RS Tutorial</title><content type='html'>&lt;div&gt;&lt;b&gt;Update: Andreas Klamt pointed out to me that the best source of his software is the &lt;/b&gt;&lt;a href="http://www.cosmologic.de/index.php?cosId=4203&amp;amp;crId=2"&gt;&lt;b&gt;COSMOtherm&lt;/b&gt;&lt;/a&gt;&lt;b&gt; package from COSMOlogic.&lt;/b&gt;&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;It is really interesting to see how our solubility data from the &lt;a href="http://onschallenge.wikispaces.com/"&gt;ONS Challenge&lt;/a&gt; are being used.  In this example, our measurements for &lt;a href="http://old.oru.edu/cccda/sl/solubility/allsolvents.php?solute=vanillin"&gt;vanillin in various solvents&lt;/a&gt; are being compared to predicted values in a &lt;a href="http://www.scm.com/Doc/Doc2010.01/CRS/CRSGUI_tutorial/page43.html"&gt;tutorial for the commercial software package COSMO-RS&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://1.bp.blogspot.com/_R3jz5f_NkRo/TGv-WiJBgYI/AAAAAAAAA4Q/UFvlQFMGA1A/s1600/cosmo1.png"&gt;&lt;img style="display:block; margin:0px auto 10px; text-align:center;cursor:pointer; cursor:hand;width: 400px; height: 383px;" src="http://1.bp.blogspot.com/_R3jz5f_NkRo/TGv-WiJBgYI/AAAAAAAAA4Q/UFvlQFMGA1A/s400/cosmo1.png" border="0" alt="" id="BLOGGER_PHOTO_ID_5506774632465793410" /&gt;&lt;/a&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/14568785-7822134275570705357?l=usefulchem.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://usefulchem.blogspot.com/feeds/7822134275570705357/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=14568785&amp;postID=7822134275570705357' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/14568785/posts/default/7822134275570705357'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/14568785/posts/default/7822134275570705357'/><link rel='alternate' type='text/html' href='http://usefulchem.blogspot.com/2010/08/ons-challenge-solubility-data-used-in.html' title='ONS Challenge Solubility Data used in COSMO-RS Tutorial'/><author><name>Jean-Claude Bradley</name><uri>http://www.blogger.com/profile/09157209601339686455</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='31' height='32' src='http://photos1.blogger.com/img/267/3447/320/headshot.jpg'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://1.bp.blogspot.com/_R3jz5f_NkRo/TGv-WiJBgYI/AAAAAAAAA4Q/UFvlQFMGA1A/s72-c/cosmo1.png' height='72' width='72'/><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-14568785.post-8853874504409509127</id><published>2010-08-16T10:35:00.000-04:00</published><updated>2010-08-16T10:35:55.818-04:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='green solvent'/><category scheme='http://www.blogger.com/atom/ns#' term='green chemistry'/><title type='text'>Green Solvent Metric on Solvent Predictor</title><content type='html'>In the spirit of contributing to Peter Murray-Rust's initiative to collect &lt;a href="http://wwmm.ch.cam.ac.uk/blogs/murrayrust/?p=2529"&gt;Green Chemistry&lt;/a&gt; information, Andrew Lang and I have added a green solvent metric for 28 of the 72 solvents we include in our &lt;a href="http://usefulchem.blogspot.com/2010/08/reaction-attempts-solvent-selector.html"&gt;Solvent Selector&lt;/a&gt; service.  The scale represents the combined contributions for Safety, Health and Environment (SHE) as calculated by &lt;a href="http://www.sust-chem.ethz.ch/tools/ehs"&gt;ETH Zurich&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;For example consider the following &lt;a href="http://showme.physics.drexel.edu/onsc/models/solventselector.php?csids=70660|10481766"&gt;Ugi reaction solvent selection&lt;/a&gt;.  Using the default thresholds, 6 solvents are proposed and 5 have SHE values.  Assuming there are no additional selection factors, a chemist might start with ethyl acetate with a SHE value of 2.9 rather than acetonitrile with a value of 4.6.&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://2.bp.blogspot.com/_R3jz5f_NkRo/TGgZlqYRLQI/AAAAAAAAA4I/2FwUHbrCIf4/s1600/green1.png"&gt;&lt;img style="display:block; margin:0px auto 10px; text-align:center;cursor:pointer; cursor:hand;width: 400px; height: 231px;" src="http://2.bp.blogspot.com/_R3jz5f_NkRo/TGgZlqYRLQI/AAAAAAAAA4I/2FwUHbrCIf4/s400/green1.png" border="0" alt="" id="BLOGGER_PHOTO_ID_5505678679282363650" /&gt;&lt;/a&gt;&lt;br /&gt;Individual values of Safety, Health and Environment for each solvent are available from the &lt;a href="http://www.sust-chem.ethz.ch/tools/ehs"&gt;ETH tool&lt;/a&gt;. We are just including the sum of the three out of convenience.  &lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;Note that the &lt;a href="http://www.sust-chem.ethz.ch/tools/ehs/ehs_tool_license"&gt;license&lt;/a&gt; for using the data from this tool requires citing this reference: &lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;Koller, G., U. Fischer, and K. Hungerbühler, &lt;b&gt;(2000)&lt;/b&gt;. Assessing Safety, Health and Environmental Impact during Early Process Development. &lt;b&gt;Industrial &amp;amp; Engineering Chemistry Research&lt;/b&gt; 39: 960-972.&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/14568785-8853874504409509127?l=usefulchem.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://usefulchem.blogspot.com/feeds/8853874504409509127/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=14568785&amp;postID=8853874504409509127' title='1 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/14568785/posts/default/8853874504409509127'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/14568785/posts/default/8853874504409509127'/><link rel='alternate' type='text/html' href='http://usefulchem.blogspot.com/2010/08/green-solvent-metric-on-solvent.html' title='Green Solvent Metric on Solvent Predictor'/><author><name>Jean-Claude Bradley</name><uri>http://www.blogger.com/profile/09157209601339686455</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='31' height='32' src='http://photos1.blogger.com/img/267/3447/320/headshot.jpg'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://2.bp.blogspot.com/_R3jz5f_NkRo/TGgZlqYRLQI/AAAAAAAAA4I/2FwUHbrCIf4/s72-c/green1.png' height='72' width='72'/><thr:total>1</thr:total></entry><entry><id>tag:blogger.com,1999:blog-14568785.post-4273924170371068137</id><published>2010-08-06T13:09:00.000-04:00</published><updated>2010-08-06T13:09:48.194-04:00</updated><title type='text'>The Reaction Attempts Solvent Selector</title><content type='html'>The &lt;a href="http://onsbooks.wikispaces.com/ONS+Solubility+Challenge"&gt;ONS Solubility Challenge&lt;/a&gt; and the &lt;a href="http://onsbooks.wikispaces.com/Reaction+Attempts"&gt;Reaction Attempts&lt;/a&gt; project have now been integrated with code written by &lt;a href="http://en.wikipedia.org/wiki/Andrew_S.I.D._Lang"&gt;Andrew Lang&lt;/a&gt; to the point that recommendations for solvents are just a click away. &lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;First use the &lt;a href="http://showme.physics.drexel.edu/onsc/reactionattempts/"&gt;Reaction Attempts Explorer&lt;/a&gt; either using the drop-down menus or substructure search as &lt;a href="http://usefulchem.blogspot.com/2010/06/reaction-attempts-explorer.html"&gt;described previously&lt;/a&gt;.  When a reaction of interest is identified just click on the the link for "Optimal Solvent Prediction".  &lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://3.bp.blogspot.com/_R3jz5f_NkRo/TFwj4Zr7yBI/AAAAAAAAA34/dXpfXupVS6I/s1600/solventpred1.png"&gt;&lt;img style="display:block; margin:0px auto 10px; text-align:center;cursor:pointer; cursor:hand;width: 400px; height: 186px;" src="http://3.bp.blogspot.com/_R3jz5f_NkRo/TFwj4Zr7yBI/AAAAAAAAA34/dXpfXupVS6I/s400/solventpred1.png" border="0" alt="" id="BLOGGER_PHOTO_ID_5502312296614709266" /&gt;&lt;/a&gt;The service will then &lt;a href="http://showme.physics.drexel.edu/onsc/models/solventselector.php?csids=70660|21106059"&gt;provide a summary of solubility measurements and predictions&lt;/a&gt;, organized by the default criteria of minimum 0.3 M solubility of reactants, maximum 0.03 M solubility of the product and maximum solvent boiling point of 100 C.  Liquid reactants (or reactants with melting points within 15 C of room temperature) are excluded since these generally have a high enough solubility in most solvents.  &lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;In the case of the Ugi reaction in this example, only the solubility of boc-glycine and the product are considered.&lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://3.bp.blogspot.com/_R3jz5f_NkRo/TFwkAGOwo1I/AAAAAAAAA4A/GLYlC8n-IhE/s1600/solventpred2.png"&gt;&lt;img style="display:block; margin:0px auto 10px; text-align:center;cursor:pointer; cursor:hand;width: 400px; height: 344px;" src="http://3.bp.blogspot.com/_R3jz5f_NkRo/TFwkAGOwo1I/AAAAAAAAA4A/GLYlC8n-IhE/s400/solventpred2.png" border="0" alt="" id="BLOGGER_PHOTO_ID_5502312428831023954" /&gt;&lt;/a&gt;&lt;/div&gt;The results are color-coded.  In this case 14 solvents are coded green, indicating that all criteria were met.  The fifteenth solvent is coded yellow, indicating that one of the criteria was not met - in this case the boiling point of 205 C is outside of the limit of 100 C.  High boiling point solvents are not optimal for quickly obtaining the product as a dry solid after filtering.  This criterion can be changed in the input fields at the top of the page. It is also possible to change the number of times the product is washed there.  This will only change the estimated yield, which is based on carrying out the reaction at the concentration of the least soluble reactant, up to 1 M.&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;Three columns are generated for the product and each reactant.  The column on the right is the average of all measurements, as recorded in the &lt;a href="http://spreadsheets.google.com/pub?key=0Ak1R8T6wt4YQcGx3d3VmcDMwaGZxMHVkbkVtUkQxYVE"&gt;SolubilitiesSum Spreadsheet&lt;/a&gt;.  The middle column is a solubility prediction based on Abraham descriptors derived from experimental values, as described and used in the &lt;a href="http://precedings.nature.com/documents/4243/version/3"&gt;ONS Solubility Challenge book&lt;/a&gt;.  The column on the left contains predictions from the &lt;a href="http://usefulchem.blogspot.com/2010/07/general-transparent-solubility.html"&gt;Abraham001 model&lt;/a&gt;, which is based on calculated molecular descriptors only.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;The numbers in bold represent the best solubility value available for each solvent.  If a measurement is known, that will be the number used.  If no measurement is available, the experimental Abraham descriptor model is used.  If neither of these are available the predictions from the Abraham001 model are used by default. &lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;From the list of solvents in the green section we find ethanol and acetonitrile.  Both of these solvents were tried (as mixtures with methanol) in the optimization of this reaction (&lt;a href="http://www.jove.com/index/details.stp?ID=942"&gt;Bradley et al JoVE 2008&lt;/a&gt;) and provided good to intermediate results.  THF was found to give low yields for this reaction and it scores at #51 in the yellow section, with a high solubility of the product accounting for the missed criterion.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;One should keep in mind that this is just a tool to flag potentially interesting solvents.  Common chemical sense needs to be used as well.  For example, acetone and butanone are listed in the green section but these are incompatible with the Ugi reaction since they would compete with the aldehyde.  &lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;Note that the predictive models are way off in some cases.  For example the Abraham001 model dramatically underestimates the solubilities of boc-glycine in the green section, while the measured Abraham descriptor model does much better for these cases.  We will prioritize our next solubility measurements to try to improve the models - or at least understand what types of compounds are most likely to yield useful solubility estimates from these models.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;In addition to being called from the Reaction Attempts Explorer, the Solvent Selector can be used for any compounds that have ChemSpider IDs.  Simply separate the CSIDs with the pipe character:&lt;/div&gt;&lt;div&gt;&lt;a href="http://showme.physics.drexel.edu/onsc/models/solventselector.php?csids=70660|21106059"&gt;http://showme.physics.drexel.edu/onsc/models/solventselector.php?csids=70660|21106059&lt;/a&gt;&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;After modifying the criteria and hitting update, the new criteria are conveniently represented in the URL in this format, making sharing a specific search with anyone easy:&lt;/div&gt;&lt;div&gt;&lt;a href="http://showme.physics.drexel.edu/onsc/models/solventselector.php?csids=70660|21106059&amp;amp;limreact=1&amp;amp;limprod=0.01&amp;amp;bp=200&amp;amp;washes=0"&gt;http://showme.physics.drexel.edu/onsc/models/solventselector.php?csids=70660|21106059&amp;amp;limreact=1&amp;amp;limprod=0.01&amp;amp;bp=200&amp;amp;washes=0&lt;/a&gt;&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;It is even possible to use the service listing just one compound's CSID - this is useful for quickly comparing the measured solubilities with predictions from both models:&lt;/div&gt;&lt;div&gt;&lt;a href="http://showme.physics.drexel.edu/onsc/models/solventselector.php?csids=70660"&gt;http://showme.physics.drexel.edu/onsc/models/solventselector.php?csids=70660&lt;/a&gt;&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/14568785-4273924170371068137?l=usefulchem.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://usefulchem.blogspot.com/feeds/4273924170371068137/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=14568785&amp;postID=4273924170371068137' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/14568785/posts/default/4273924170371068137'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/14568785/posts/default/4273924170371068137'/><link rel='alternate' type='text/html' href='http://usefulchem.blogspot.com/2010/08/reaction-attempts-solvent-selector.html' title='The Reaction Attempts Solvent Selector'/><author><name>Jean-Claude Bradley</name><uri>http://www.blogger.com/profile/09157209601339686455</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='31' height='32' src='http://photos1.blogger.com/img/267/3447/320/headshot.jpg'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://3.bp.blogspot.com/_R3jz5f_NkRo/TFwj4Zr7yBI/AAAAAAAAA34/dXpfXupVS6I/s72-c/solventpred1.png' height='72' width='72'/><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-14568785.post-3599267176427363374</id><published>2010-08-02T18:53:00.000-04:00</published><updated>2010-08-02T18:53:43.954-04:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='open science summit'/><category scheme='http://www.blogger.com/atom/ns#' term='berkeley'/><category scheme='http://www.blogger.com/atom/ns#' term='#oss2010'/><title type='text'>Berkeley Open Science Summit 2010 Notes</title><content type='html'>I just returned from the &lt;a href="http://opensciencesummit.com/"&gt;Open Science Summit&lt;/a&gt; held at Berkeley July 29-31, 2010.  &lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;There certainly was an impressive list of &lt;a href="http://opensciencesummit.com/presenters/"&gt;presenters&lt;/a&gt; as well as attendees.  Many of the talks were quite good, although several on the last day were more about closed collaborations than Open Science.  During these presentations the assumption that patents are required to exploit discoveries in health care was repeated.  This was in sharp contrast to the second day's session on gene patents, where IP protection was shown to stifle innovation and the exploitation of discoveries.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;A refreshing exception to this pattern on the last day was Andrew Hessel's presentation on the &lt;a href="http://pinkarmy.org/"&gt;Pink Army Cooperative&lt;/a&gt;.  Andrew's strategy to cure cancer is based on the idea of customizing drugs for each individual affected by the disease.  Since each drug is only applicable to one individual, the approach of expensive clinical trials doesn't apply.  Since he is not interested in generating a profit from selling the drugs, IP protection also doesn't apply and allows him to make every part of the drug design process, including genetic analysis, publicly available.  It wasn't clear if such an approach would be legal in the US but he did mention going to another country if necessary.  Although he didn't currently have cancer, he did indicate that he might have need of this technology one day by pulling out a pack of cigarettes in the middle of his talk.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;Unfortunately my panel on Open Data was canceled at the last minute due to time management problems (see &lt;a href="http://ff.im/oD5sx"&gt;FF discussion&lt;/a&gt; on how it happened).  However, I did have a chance to generally catch up with old friends (&lt;a href="http://pubs.acs.org/cen/staff/biocd.html"&gt;Carmen Drahl&lt;/a&gt;, &lt;a href="http://network.nature.com/profile/joannascott"&gt;Joanna Scott&lt;/a&gt;, &lt;a href="http://cameronneylon.net/"&gt;Cameron Neylon&lt;/a&gt;, &lt;a href="http://people.kmi.open.ac.uk/jack/"&gt;Jack Park&lt;/a&gt;).  &lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;I also discussed some promising collaborations with several people:&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;1) &lt;a href="http://colabscience.com"&gt;&lt;b&gt;CoLab&lt;/b&gt;&lt;/a&gt;&lt;b&gt;.&lt;/b&gt;  I spoke at length with DJ Sprouse and Casey Stark about their system for scientific collaboration.  We will try to represent one solubility experiment from the &lt;a href="http://onschallenge.wikispaces.com/list+of+experiments"&gt;ONS Challenge notebook&lt;/a&gt; and one organic synthesis experiment from the &lt;a href="http://usefulchem.wikispaces.com/All+Reactions"&gt;UsefulChem notebook&lt;/a&gt; to see how the information can be represented within CoLab.  There may be some opportunities to visualize raw data in new ways - perhaps using non-Java tools to interact with JCAMP-DX spectra.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;2) &lt;b&gt;IPzero Principles&lt;/b&gt;. I continued a conversation with Lisa Green &lt;a href="http://ff.im/li1xA"&gt;started with John Wilbanks and Thinh Nguyen&lt;/a&gt; at Creative Commons about coming up with a series of simple recommendations for ensuring that an Open Notebook can effectively prevent the patenting of inventions within an area of interest to the Open Science community.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;3) &lt;b&gt;Open Chemistry Reactions&lt;/b&gt;.  I had the chance to discuss our &lt;a href="http://onsbooks.wikispaces.com/Reaction+Attempts"&gt;Reaction Attempts&lt;/a&gt; database with Peter Murray-Rust over breakfast on Saturday.  He also showed me how he is using &lt;a href="http://www.rsc.org/Publishing/ReSourCe/AuthorGuidelines/AuthoringTools/ExperimentalDataChecker/getOSCAR.asp"&gt;Oscar&lt;/a&gt; to extract chemical reaction information from various documents. &lt;a href="http://wwmm.ch.cam.ac.uk/blogs/murrayrust/?p=2512"&gt;Peter suggested&lt;/a&gt; that we pool together our data for a demonstration in September at the &lt;a href="http://www.scienceonlinelondon.org/"&gt;London Science Online Conference&lt;/a&gt;.  Reaction Attempts will cover the reactions done in the &lt;a href="http://usefulchem.wikispaces.com/All+Reactions"&gt;UsefulChem&lt;/a&gt; and the &lt;a href="http://usefulchem.blogspot.com/2010/05/synaptic-leap-experiments-on-reaction.html"&gt;Todd group&lt;/a&gt;'s Open Notebooks.  Peter will extract information from both patents and Acta Crystallographica.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;4) &lt;b&gt;ChemTaverna.&lt;/b&gt; I was pleased to learn from Carole Goble that &lt;a href="http://www.taverna.org.uk/"&gt;Taverna&lt;/a&gt; is extending its coverage to cheminformatics applications with the &lt;a href="http://www.mygrid.org.uk/projects/chemtaverna/"&gt;ChemTaverna&lt;/a&gt; project.  I had &lt;a href="http://usefulchem.blogspot.com/2010/07/general-transparent-solubility.html"&gt;just mentioned&lt;/a&gt; that we would be interested in revisiting Taverna for creating virtual libraries of organic compounds and filtering them based on predicted solubilities in various solvents. This would allow us to contribute cheminformatics workflows to &lt;a href="http://www.myexperiment.org/"&gt;MyExperiment&lt;/a&gt;.  Carole put me in touch with the project leader Peter Li at the University of Manchester.&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/14568785-3599267176427363374?l=usefulchem.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://usefulchem.blogspot.com/feeds/3599267176427363374/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=14568785&amp;postID=3599267176427363374' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/14568785/posts/default/3599267176427363374'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/14568785/posts/default/3599267176427363374'/><link rel='alternate' type='text/html' href='http://usefulchem.blogspot.com/2010/08/berkeley-open-science-summit-2010-notes.html' title='Berkeley Open Science Summit 2010 Notes'/><author><name>Jean-Claude Bradley</name><uri>http://www.blogger.com/profile/09157209601339686455</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='31' height='32' src='http://photos1.blogger.com/img/267/3447/320/headshot.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-14568785.post-1990253563769379118</id><published>2010-07-25T18:18:00.001-04:00</published><updated>2010-07-26T07:11:54.454-04:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Abraham'/><category scheme='http://www.blogger.com/atom/ns#' term='web service'/><category scheme='http://www.blogger.com/atom/ns#' term='model'/><category scheme='http://www.blogger.com/atom/ns#' term='prediction'/><category scheme='http://www.blogger.com/atom/ns#' term='solubility'/><title type='text'>General Transparent Solubility Prediction using Abraham Descriptors</title><content type='html'>Making solubility estimations for most organic compounds in a wide range of solvents freely available has always been a main long term objective for the &lt;a href="http://onschallenge.wikispaces.com/"&gt;Open Notebook Science Solubility Challenge&lt;/a&gt;.  With current expertise and technology, &lt;i&gt;it should be as easy to obtain a solubility estimate as it is now to get driving directions off the web&lt;/i&gt;.&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;Obviously this won't be attained purely by exhaustive measurements, although we have been focused on strategic measurements over the past two years.  In parallel, we have been constantly evaluating the various solubility models out there for suitability.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;Although there are several solubility models available for non-aqueous solvents, our additional requirement for transparent model building has proved surprisingly difficult to satisfy.  &lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;From this search, the Abraham solubility model [&lt;a href="http://www3.interscience.wiley.com/journal/122604742/abstract"&gt;Abraham2009&lt;/a&gt;] floated to the top, with an important factor being that Abraham has made available extensive compilations of descriptors for solutes and solvents.  In addition the algorithms used to convert solubility measurements to Abraham descriptors (a minimum of 5 different solvents per solute) has allowed us to &lt;a href="http://usefulchem.blogspot.com/2009/12/ons-solubility-book-edition-2-with.html"&gt;generate our own Abraham descriptors&lt;/a&gt; automatically simply by recording new measurements into our &lt;a href="https://spreadsheets0.google.com/ccc?key=plwwufp30hfq0udnEmRD1aQ&amp;amp;hl=en#gid=0"&gt;SolSum&lt;/a&gt; Google Spreadsheet. These can be obtained in &lt;a href="http://onsbooks.wikispaces.com/ONS+Solubility+Challenge"&gt;real time&lt;/a&gt; as well.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;This approach permitted us to provide predictions for a limited number of solutes in a wide range of solvents and we have included these predictions in the past two editions (&lt;a href="http://precedings.nature.com/documents/4243/version/2"&gt;2nd&lt;/a&gt; and &lt;a href="http://precedings.nature.com/documents/4243/version/3"&gt;3rd&lt;/a&gt;) of the ONS Challenge Solubility Book.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;Coming at the problem from a different approach, &lt;a href="http://en.wikipedia.org/wiki/Andrew_S.I.D._Lang"&gt;Andrew Lang&lt;/a&gt; has also been trying to predict solubility using only open molecular descriptors, mainly relying on the &lt;a href="http://en.wikipedia.org/wiki/Chemistry_Development_Kit"&gt;CDK&lt;/a&gt;.  Since our most commonly used solvent has been methanol, Andy recently generated a &lt;a href="http://usefulchem.blogspot.com/2010/07/methanol-solubility-prediction-model-4.html"&gt;web service&lt;/a&gt; to predict solubility in that solvent.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;By combining these two approaches, Andy has now created a &lt;a href="http://onschallenge.wikispaces.com/AbrahamDescriptorsModel001"&gt;modeling system&lt;/a&gt; that can not only generally predict solubility in a wide range (70+) of solvents - but it can also provide related data that can be used for modeling other phenomena such as intestinal absorption of a drug or crossing the blood-brain barrier.[&lt;a href="http://digital.library.unt.edu/ark:/67531/metadc5342/m1/1/"&gt;Stovall 2007&lt;/a&gt;] &lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;The idea is to use a &lt;a href="http://en.wikipedia.org/wiki/Random_forest"&gt;Random Forest&lt;/a&gt; approach to select freely available descriptors to &lt;a href="http://showme.physics.drexel.edu/onsc/models/AbrahamDescriptorsModel001.php"&gt;predict the Abraham descriptors&lt;/a&gt; for any solute.  A separate &lt;a href="http://showme.physics.drexel.edu/onsc/models/solubilityfromabrahamdescriptors.php"&gt;service then generates predicted solubilities&lt;/a&gt; for a wide range of solvents based on these Abraham descriptors. I'm using the term "freely available" because - although the CDK descriptors and &lt;a href="http://www.vcclab.org/"&gt;VCCLab&lt;/a&gt; services are open - the model requires 2 descriptors only available from &lt;a href="http://www.chemspider.com/"&gt;ChemSpider&lt;/a&gt; (ultimately from &lt;a href="http://www.acdlabs.com/home/"&gt;ACD/Labs&lt;/a&gt;).&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;Here is an example with benzoic acid.  As long as the common name resolves to a single entry on ChemSpider, it is enough to enter it and it automatically populates the rest of the fields, which are then used by the service to generate the Abraham descriptors.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://2.bp.blogspot.com/_R3jz5f_NkRo/TEybbH2L24I/AAAAAAAAA3Y/G3M2N4Ulu3I/s1600/Predict1.png"&gt;&lt;img style="display:block; margin:0px auto 10px; text-align:center;cursor:pointer; cursor:hand;width: 368px; height: 400px;" src="http://2.bp.blogspot.com/_R3jz5f_NkRo/TEybbH2L24I/AAAAAAAAA3Y/G3M2N4Ulu3I/s400/Predict1.png" border="0" alt="" id="BLOGGER_PHOTO_ID_5497940135377623938" /&gt;&lt;/a&gt;Hitting the prediction link above will automatically populate the second service and generate predicted solubilities for over 70 solvents.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://2.bp.blogspot.com/_R3jz5f_NkRo/TEybhqadldI/AAAAAAAAA3g/7E_kcTSSqkk/s1600/Predict2.png"&gt;&lt;img style="display:block; margin:0px auto 10px; text-align:center;cursor:pointer; cursor:hand;width: 400px; height: 204px;" src="http://2.bp.blogspot.com/_R3jz5f_NkRo/TEybhqadldI/AAAAAAAAA3g/7E_kcTSSqkk/s400/Predict2.png" border="0" alt="" id="BLOGGER_PHOTO_ID_5497940247735801298" /&gt;&lt;/a&gt;This approach of allowing people to access these components separately can be useful. It can be instructive to manually play with the Abraham descriptors directly to see how predicted solubilities are affected.  There are also situations where one has experimentally determined Abraham descriptors for a solute and bypassing the descriptor prediction step is required.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;However, for those who prefer to cut to the chase, a convenient &lt;a href="http://showme.physics.drexel.edu/onsc/models/solutesolventprediction.php"&gt;web service&lt;/a&gt; is available where the common name (or SMILES) of the solute is entered and the list of available solvents appears as a drop down menu.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://1.bp.blogspot.com/_R3jz5f_NkRo/TEygH0_HnZI/AAAAAAAAA3o/LPVD7W6zgPQ/s1600/Predict3.png"&gt;&lt;img style="display:block; margin:0px auto 10px; text-align:center;cursor:pointer; cursor:hand;width: 400px; height: 222px;" src="http://1.bp.blogspot.com/_R3jz5f_NkRo/TEygH0_HnZI/AAAAAAAAA3o/LPVD7W6zgPQ/s400/Predict3.png" border="0" alt="" id="BLOGGER_PHOTO_ID_5497945301455445394" /&gt;&lt;/a&gt;Now here is where I think the real payoff comes for accelerating science with openness.  Andy has also created a web service that returns the predicted solubility in molar as a number from common names (or SMILES) for solute and solvent via the URL.  For example click this for &lt;a href="http://showme.physics.drexel.edu/onsc/models/solutesolventprediction.php?solute=benzoic+acid&amp;amp;solvent=methanol"&gt;benzoic acid in methanol&lt;/a&gt;.  The advantage here is that solubility prediction can be easily integrated as a web service call from intuitive interfaces such as a &lt;a href="https://spreadsheets0.google.com/ccc?key=tMgJ3_SaFbGZ8H4WUNQrxGw&amp;amp;hl=en#gid=0"&gt;Google Spreadsheet&lt;/a&gt; to enable even non-programmers to make use of the data. Notice that the web service provided in the fourth column for the average of &lt;a href="http://old.oru.edu/cccda/sl/solubility/singlesolvent.php?solute=benzoic%20acid&amp;amp;solvent=methanol"&gt;measured solubility values&lt;/a&gt; enables an easy way to explore the accuracy of specific predictions.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://1.bp.blogspot.com/_R3jz5f_NkRo/TEy16uFQWKI/AAAAAAAAA3w/OzrK2Ajj0ks/s1600/predictdemo.png"&gt;&lt;img style="display:block; margin:0px auto 10px; text-align:center;cursor:pointer; cursor:hand;width: 400px; height: 124px;" src="http://1.bp.blogspot.com/_R3jz5f_NkRo/TEy16uFQWKI/AAAAAAAAA3w/OzrK2Ajj0ks/s400/predictdemo.png" border="0" alt="" id="BLOGGER_PHOTO_ID_5497969265519646882" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;div&gt;Such web services could also be integrated with data from ChemSpider or custom systems.  If those who use these services feed back their processed data to the open web, it could take us a step closer to automated reaction design.  For example consider the custom application to &lt;a href="http://onschallenge.wikispaces.com/Optimal+solvent+selection+for+reactions"&gt;select solvents for the Ugi reaction&lt;/a&gt;. Model builders could also use the web services for predicted and measured solubility directly.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;A while back we &lt;a href="http://usefulchem.blogspot.com/2007/08/combiugi-on-myexperiment.html"&gt;explored using Taverna for MyExperiment&lt;/a&gt; to create virtual libraries of SMILES.  Unfortunately we ran into issues with getting the applications developed on Macs to run on our PCs.  This might be worth revisiting as a means of filtering virtual libraries through different thresholds of predicted solubility.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;Andy has described his model in detail in a fully transparent way - the model itself, how it was generated and the entire dataset can be found &lt;a href="http://onschallenge.wikispaces.com/AbrahamDescriptorsModel001"&gt;here&lt;/a&gt;.  We would welcome improvements of the model as well as completely new models based on our dataset using only freely available tools.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;It should be noted that when I use term "general" it refers to the ability for the model to generate a number for most compounds listed in ChemSpider.  Obviously compounds that most closely resemble the training set are more likely to generate better estimates.  Because of our synthetic objectives using the &lt;a href="http://www.jove.com/index/Details.stp?ID=942"&gt;Ugi reaction&lt;/a&gt; we have mainly focused on collecting solubility data for carboxylic acids, aldehydes and amides either from new measurements or from the literature.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;Another important point concerns the main intended application of the model: organic synthesis.  Generally the range of interest for such applications is about 0.01 - 3M.  This might be very different for other applications - such as the aqueous solubility of a drug, where distinctions between much lower solubilities may be important.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;For a typical organic synthesis, a solubility of 0.001M or 0.005M will probably translate as effectively insoluble.  This might be a desired property for a product intended to be isolated by filtration.  On the other end of the scale knowing that a solubility is either 4M or 6M will not usually have an impact on reaction design.  It is enough to know that a reactant will have good solubility in a particular solvent.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;Given the above considerations for intended applications and the likelihood that the current model is far from optimized, the predictions should be used cautiously.  We suggest that the model is best used as a "flagging device".  For example, if a reaction is to be carried out at 0.5M, one may place a threshold at 0.4M for the predicted values of reactants during solvent selection, with the recognition that a predicted 0.4M may be an actual 0.55M.  A similar threshold approach can be used for the product, where in this case the lowest solubility is desired.  A practical example of this is the &lt;a href="http://onschallenge.wikispaces.com/Optimal+solvent+selection+for+reactions"&gt;shortlisting of solvents candidates for the Ugi reaction&lt;/a&gt;.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;Another example of flagging involves identifying the outliers in the model.  These can be inspected for experimental errors and possibly remeasured.  Alternatively outliers may shed light on the limitations of the model.  For example we have found that the solubility of solutes with melting points near room temperature can be greatly underestimated by the current model.  This may be an opportunity to develop other models which incorporate melting point or enthalpy of fusion.[&lt;a href="http://www.highbeam.com/doc/1P3-1121826431.html"&gt;Rohani 2008&lt;/a&gt;]&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;Although it is possible that better models and more data will improve the accuracy of the predictions, &lt;i&gt;this can be true only if the training set is accurate enough&lt;/i&gt;.  Based on conversations I've had with researchers who deal with solubility, reading modeling papers and our own experience with the ONS Challenge I am starting to suspect that much of the available data just isn't accurate enough for high precision modeling.  Models using data from the literature are especially vulnerable I think.  Take a look at this unsettling comparison between new measurements and literature values (not to mention the model) for common compounds.[&lt;a href="http://www.ncbi.nlm.nih.gov/pubmed/16584163"&gt;Loftsson 2006&lt;/a&gt;] Here is a subset:&lt;/div&gt;&lt;div&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://2.bp.blogspot.com/_R3jz5f_NkRo/TEyYf6w8QxI/AAAAAAAAA3Q/xX-pRkijVp8/s1600/loftsson.png"&gt;&lt;img style="display:block; margin:0px auto 10px; text-align:center;cursor:pointer; cursor:hand;width: 325px; height: 174px;" src="http://2.bp.blogspot.com/_R3jz5f_NkRo/TEyYf6w8QxI/AAAAAAAAA3Q/xX-pRkijVp8/s400/loftsson.png" border="0" alt="" id="BLOGGER_PHOTO_ID_5497936919230432018" /&gt;&lt;/a&gt;I have also made the point in detail for the &lt;a href="http://usefulchem.blogspot.com/2010/01/dangerous-data-lessons-from-my-cheminfo.html"&gt;aqueous solubility of EGCG&lt;/a&gt;. Could this be the reason that so many different solubility models using different physical chemistry principles have evolved and continue to co-exist?&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;The situation reminds me a lot of the discussions taking place in the molecular docking community.[&lt;a href="http://www.ncbi.nlm.nih.gov/pubmed/20345171"&gt;Bissantz 2010&lt;/a&gt;]  The differences in calculated binding energies are often small in comparison with the uncertainties involved.  But docking can still be used as one tool among others to find drug candidates by flagging a collection of compounds above a certain threshold binding energy.&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/14568785-1990253563769379118?l=usefulchem.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://usefulchem.blogspot.com/feeds/1990253563769379118/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=14568785&amp;postID=1990253563769379118' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/14568785/posts/default/1990253563769379118'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/14568785/posts/default/1990253563769379118'/><link rel='alternate' type='text/html' href='http://usefulchem.blogspot.com/2010/07/general-transparent-solubility.html' title='General Transparent Solubility Prediction using Abraham Descriptors'/><author><name>Jean-Claude Bradley</name><uri>http://www.blogger.com/profile/09157209601339686455</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='31' height='32' src='http://photos1.blogger.com/img/267/3447/320/headshot.jpg'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://2.bp.blogspot.com/_R3jz5f_NkRo/TEybbH2L24I/AAAAAAAAA3Y/G3M2N4Ulu3I/s72-c/Predict1.png' height='72' width='72'/><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-14568785.post-3672845895498356097</id><published>2010-07-21T10:59:00.000-04:00</published><updated>2010-07-21T10:59:23.943-04:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='reaction attempts'/><category scheme='http://www.blogger.com/atom/ns#' term='resveratrol'/><category scheme='http://www.blogger.com/atom/ns#' term='thesis'/><title type='text'>Resveratrol Thesis on Reaction Attempts</title><content type='html'>A few days ago &lt;a href="http://en.wikipedia.org/wiki/Andrew_S.I.D._Lang"&gt;Andrew Lang&lt;/a&gt; suggested to &lt;a href="http://www.facebook.com/group.php?gid=33870415433&amp;amp;v=wall#!/dustinsprouse"&gt;Dustin Sprouse&lt;/a&gt; that he submit his thesis to the &lt;a href="http://onsbooks.wikispaces.com/Reaction+Attempts"&gt;Reaction Attempts&lt;/a&gt; database.  Like many undergraduates Dustin put in a lot of time and effort in doing experiments and writing up his results but didn't have quite enough time to obtain all that would have been required for a traditional publication. &lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;A thesis is an unusual document within the context of scientific communication.  Unlike a peer reviewed paper, it may contain a large number of "failed experiments" and a substantial amount of speculation.  Although it is not quite as detailed as lab notebook, there is often plenty of raw data and details about how failed or ambiguous experiments proceeded.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;In Dustin's case we felt that there was enough information provided to include his thesis in Reaction Attempts.  In addition, his thesis was accepted by &lt;a href="http://precedings.nature.com/documents/4685/version/1"&gt;Nature Precedings&lt;/a&gt;, thus providing a convenient means of citation.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;The first component of the Reaction Attempts project is to quickly abstract the most basic information from synthetic organic chemistry reactions.  This includes the &lt;a href="http://www.chemspider.com/"&gt;ChemSpiderIDs&lt;/a&gt; and &lt;a href="http://en.wikipedia.org/wiki/Simplified_molecular_input_line_entry_specification"&gt;SMILES&lt;/a&gt; from the reactants and target products and brief notes about conditions and outcomes.  We are especially interested in failed or ambiguous experiments because these have almost no chance of being communicated and indexed in the traditional systems.  When attempting to carry out a reaction, it can be just as useful to know what doesn't work - and more specifically &lt;b&gt;how&lt;/b&gt; it doesn't work.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;The second component of the project is dissemination.  Because the information is encoded semantically, it can be automatically converted to both &lt;a href="http://onsbooks.wikispaces.com/Reaction+Attempts"&gt;human and machine readable formats&lt;/a&gt;.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;One human interface consists of a &lt;a href="http://precedings.nature.com/documents/4416/version/1"&gt;PDF book&lt;/a&gt; (also as a &lt;a href="http://www.lulu.com/product/paperback/reaction-attempts/10792133"&gt;hard copy&lt;/a&gt;), with the option of selected reactions specified by listing CSIDs of reactants in the URL. For example Dustin's reactions can be presented selectively &lt;a href="http://tinyurl.com/37xl89z"&gt;here&lt;/a&gt;.  We also have a &lt;a href="http://usefulchem.blogspot.com/2010/06/reaction-attempts-explorer.html"&gt;Reaction Explorer&lt;/a&gt;, where reactants or products can be selected from a dropdown menu or via a substructure search.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://4.bp.blogspot.com/_R3jz5f_NkRo/TEcIwLeGJ3I/AAAAAAAAA3I/gxwWXsBA080/s1600/DSp41.png"&gt;&lt;img style="display:block; margin:0px auto 10px; text-align:center;cursor:pointer; cursor:hand;width: 400px; height: 384px;" src="http://4.bp.blogspot.com/_R3jz5f_NkRo/TEcIwLeGJ3I/AAAAAAAAA3I/gxwWXsBA080/s400/DSp41.png" border="0" alt="" id="BLOGGER_PHOTO_ID_5496371494034745202" /&gt;&lt;/a&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;We also provide live &lt;a href="http://onsbooks.wikispaces.com/Reaction+Attempts"&gt;XML feeds&lt;/a&gt; so that others can create applications easily from machine readable data.  For example one could create reaction chains automatically, which will occur whenever we enter reactions from multi-step syntheses like Dustin's - based on the synthesis of &lt;a href="http://en.wikipedia.org/wiki/Resveratrol"&gt;resveratrol&lt;/a&gt;.  &lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;I know that &lt;a href="http://wwmm.ch.cam.ac.uk/wikis/wwmm/index.php/Main_Page"&gt;Peter Murray-Rust&lt;/a&gt; has been very active in automatically abstracting information from chemistry theses.  It would be interesting to see how that approach would work for this thesis, especially with the failed experiments.  Reducing a page or two of text into only the most salient bits of information manually required a level of judgement that I imagine would be tricky to do automatically.&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/14568785-3672845895498356097?l=usefulchem.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://usefulchem.blogspot.com/feeds/3672845895498356097/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=14568785&amp;postID=3672845895498356097' title='4 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/14568785/posts/default/3672845895498356097'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/14568785/posts/default/3672845895498356097'/><link rel='alternate' type='text/html' href='http://usefulchem.blogspot.com/2010/07/resveratrol-thesis-on-reaction-attempts.html' title='Resveratrol Thesis on Reaction Attempts'/><author><name>Jean-Claude Bradley</name><uri>http://www.blogger.com/profile/09157209601339686455</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='31' height='32' src='http://photos1.blogger.com/img/267/3447/320/headshot.jpg'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://4.bp.blogspot.com/_R3jz5f_NkRo/TEcIwLeGJ3I/AAAAAAAAA3I/gxwWXsBA080/s72-c/DSp41.png' height='72' width='72'/><thr:total>4</thr:total></entry><entry><id>tag:blogger.com,1999:blog-14568785.post-6228776030111750341</id><published>2010-07-11T14:35:00.000-04:00</published><updated>2010-07-11T14:35:34.924-04:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='discovery'/><category scheme='http://www.blogger.com/atom/ns#' term='planets'/><category scheme='http://www.blogger.com/atom/ns#' term='open notebook science'/><category scheme='http://www.blogger.com/atom/ns#' term='priority'/><category scheme='http://www.blogger.com/atom/ns#' term='astronomy'/><title type='text'>Secrecy in Astronomy and the Open Science Ratchet</title><content type='html'>Probably because of the visibility of the &lt;a href="http://en.wikipedia.org/wiki/Galaxy_zoo"&gt;GalaxyZoo&lt;/a&gt; project, I think several of my colleagues and I have been under the impression that astronomy is a somewhat more open field than chemistry or molecular biology.  It was easy to rationalize such a position because patents are not an issue, as they clearly are in fields which rely more on invention than discovery.  However, after reading "&lt;a href="http://books.google.com/books?id=s4WCjkloJCAC"&gt;The Case for Pluto&lt;/a&gt;" by Alan Boyle, I am left with a much different impression.&lt;br /&gt;&lt;br /&gt;This book does an excellent job of covering the recent debate over Pluto's designation as a true planet.  A key trigger for this debate has been the discovery of dwarf planets with sizes very close to that of Pluto.  However, these discoveries did not occur without controversy.&lt;br /&gt;&lt;br /&gt;The story of the controversy regarding the discovery of Haumea is a particularly good example (starts on p. 108 of the book - a good summary also on &lt;a href="http://en.wikipedia.org/wiki/Controversy_over_the_discovery_of_Haumea"&gt;Wikipedia&lt;/a&gt;).  Starting in December 2004 Michael Brown at Caltech discovered a series of new dwarf planets.  Instead of immediately reporting his team's discoveries, he worked in secrecy until July 20, 2005 when he posted an online abstract indicating the discoveries would be announced at a conference that September. However, on July 27, 2005 a Spanish team led by José Luis Ortiz Moreno filed a claim with the &lt;a href="http://www.cfa.harvard.edu/iau/mpc.html"&gt;Minor Planet Center&lt;/a&gt; for priority in discovering one of these dwarf planets. This forced Brown's hand in disclosing his team's other discoveries within days - much sooner than he had anticipated.&lt;br /&gt;&lt;br /&gt;Apparently this stirred up a great controversy in the community and officially no name was associated with the discovery, although the Spanish team's telescope at Sierra Nevada Observatory was recognized as the location of the discovery.  However, Brown was allowed to select the name Haumea for the dwarf planet.&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;Even though the Minor Planet Center accepted Moreno's submission, most reports seem to side with Brown.  The main argument is no less than &lt;i&gt;academic fraud&lt;/i&gt; on Moreno's part because he accessed &lt;i&gt;public&lt;/i&gt; telescope logs and found some of Brown's data.  It was as simple as Googling the identifier that Brown inserted in his &lt;i&gt;public&lt;/i&gt; abstract.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;If Moreno had hacked into a private computer from Brown's team I can understand fraud.  But is it fraud to access public databases?  We chemists do that all the time - reading abstracts from upcoming conferences to try to glean what our competitors are up to.  That hasn't stopped anyone from submitting a paper or patent.  &lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;Secrecy only works if everyone competing follows the same rules.  If there is a rule that planet discoveries must be made at conferences or by formal publication then this could not have happened.  Moreno's submission to the Minor Planet Center should have been rejected if such a rule existed.   If there is a rule that telescope logs should not be accessed then why make them public and indexed on Google?&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;Now there may exist field specific conventions.  I don't know what they are in the case of discoveries such as these but here is an interesting quote from &lt;a href="http://en.wikipedia.org/wiki/Michael_E._Brown"&gt;Michael Brown's Wikipedia page&lt;/a&gt;:&lt;/div&gt;&lt;blockquote&gt;When asked about this online activity, Ortiz responded with an email to Brown that suggested Brown was at fault for "hiding objects," and said that "the only reason why we are now exchanging e-mail is because you did not report your object."[3] Brown says that this statement by Ortiz contradicts the accepted scientific practice of analyzing one's research until one is satisfied that it is accurate, then submitting it to peer review prior to any public announcement. However, the MPC only needs precise enough orbit determination on the object in order to provide discovery credit, and Ortiz et al. not only provided the orbit, but "precovery" images of the body in 1957 plates.&lt;/blockquote&gt;It seems to me that there is a clash of what are the conventions in the field.  Certainly the Minor Planet Center did not recognize the convention of peer review before public disclosure.  They only required sufficient proof for the discovery.&lt;br /&gt;&lt;br /&gt;One way to look at this story is that Moreno acted more openly than Brown by disclosing information before peer review.  This action forced Brown to disclose scientific results much more quickly than he had anticipated.&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight:bold;"&gt;In a sense this is a type of Open Science Ratchet.  The actions of scientists that are most open set the pace for everyone else working on that particular project, regardless of their views on how secretive science should be.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;Imagine how the scenario would have played out if one of the groups had used an &lt;a href="http://en.wikipedia.org/wiki/Open_Notebook_Science"&gt;Open Notebook&lt;/a&gt;. On December 28, 2004 everyone with a stake in the search for planets would have had the opportunity to know that a very significant find had been made.  There were still details to work out - and the Brown group might not be the first to do all the calculations to completely characterize the discovery.  Certainly it would affect what other researchers did - even if they were completely opposed to the concept of Open Science.&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;Essentially secrecy in this context is an all-or-nothing gamble.  Everyone is free to not disclose their work until after peer reviewed publication.  In some cases the discoverer will get full credit for the discovery and the complete analysis.  But in other cases another group working in parallel will publish first and leave nothing to claim.  &lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;As scientists become more open, it is likely that their ability to claim sole priority for all aspects of a discovery will be reduced.  However, they will retain priority for the observations and calculations that they made first.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;The more open the science, the faster it happens.  And because of the Open Science Ratchet, a few Open Scientists scattered across various fields could have a larger hand than expected in speeding up science.&lt;br /&gt;&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/14568785-6228776030111750341?l=usefulchem.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://usefulchem.blogspot.com/feeds/6228776030111750341/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=14568785&amp;postID=6228776030111750341' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/14568785/posts/default/6228776030111750341'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/14568785/posts/default/6228776030111750341'/><link rel='alternate' type='text/html' href='http://usefulchem.blogspot.com/2010/07/secrecy-in-astronomy-and-open-science.html' title='Secrecy in Astronomy and the Open Science Ratchet'/><author><name>Jean-Claude Bradley</name><uri>http://www.blogger.com/profile/09157209601339686455</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='31' height='32' src='http://photos1.blogger.com/img/267/3447/320/headshot.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-14568785.post-5207584254876121748</id><published>2010-07-08T16:30:00.000-04:00</published><updated>2010-07-08T16:31:04.555-04:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='model'/><category scheme='http://www.blogger.com/atom/ns#' term='Ugi reaction'/><category scheme='http://www.blogger.com/atom/ns#' term='solubility'/><title type='text'>Methanol Solubility Prediction Model 4 for Ugi reactions in the literature</title><content type='html'>Since non-aqueous solubility measurements have not become part of the standard characterization of organic compounds, it is not surprising that all the &lt;a href="http://onsbooks.wikispaces.com/ONS+Solubility+Challenge"&gt;data&lt;/a&gt; we have for Ugi products originate from measurements that we made on our own compounds.&lt;br /&gt;&lt;br /&gt;Since methanol is our most common solvent, &lt;a href="http://en.wikipedia.org/wiki/Andrew_S.I.D._Lang"&gt;Andrew Lang&lt;/a&gt; has collected the measurements that we have with values from the literature for a range of compounds, including our Ugi products, to generate a &lt;a href="http://showme.physics.drexel.edu/onsc/models/SolubilityModel004.php"&gt;web service&lt;/a&gt; returning a predicted solubility based on a submitted SMILES string.  The model (&lt;a href="http://onschallenge.wikispaces.com/SolubilityModel004"&gt;Model 4&lt;/a&gt;) was derived from a &lt;a href="http://en.wikipedia.org/wiki/Random_forest"&gt;Random Forest&lt;/a&gt; algorithm, using molecular descriptors supplied by the &lt;a href="http://en.wikipedia.org/wiki/Chemistry_Development_Kit"&gt;CDK&lt;/a&gt; and &lt;a href="http://www.vcclab.org/"&gt;VCC&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;It would be nice to be able to test the model's ability to predict what will happen if a Ugi reaction is carried out in methanol.  Although the actual solubility of Ugi products in the literature is typically not reported, reading the experimental sections in papers can still provide some validation of the model in some cases.&lt;br /&gt;&lt;br /&gt;For example, consider the following Ugi products synthesized recently by Lezinska (&lt;a href="http://dx.doi.org/10.1016/j.tet.2010.06.062"&gt;Tetrahedron 2010&lt;/a&gt;)&lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://2.bp.blogspot.com/_R3jz5f_NkRo/TDYsUioXgWI/AAAAAAAAA3A/t8PH2jS76KU/s1600/ugiazide2.png"&gt;&lt;img style="margin: 0px auto 10px; display: block; text-align: center; cursor: pointer; width: 299px; height: 310px;" src="http://2.bp.blogspot.com/_R3jz5f_NkRo/TDYsUioXgWI/AAAAAAAAA3A/t8PH2jS76KU/s400/ugiazide2.png" alt="" id="BLOGGER_PHOTO_ID_5491625527030022498" border="0" /&gt;&lt;/a&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://1.bp.blogspot.com/_R3jz5f_NkRo/TDYsPte6oiI/AAAAAAAAA24/5fmWTo9qTx4/s1600/ugiazide.png"&gt;&lt;img style="margin: 0px auto 10px; display: block; text-align: center; cursor: pointer; width: 293px; height: 291px;" src="http://1.bp.blogspot.com/_R3jz5f_NkRo/TDYsPte6oiI/AAAAAAAAA24/5fmWTo9qTx4/s400/ugiazide.png" alt="" id="BLOGGER_PHOTO_ID_5491625444043825698" border="0" /&gt;&lt;/a&gt;Note that these images represent the azide group not following the octet rule.  It is necessary to represent the structure SMILES without charges because the CDK and VCC web services used by the model do not process charges correctly.  Stereochemistry also cannot be used and this can be removed from the SMILES simply by deleting slashes.  Thus for the two molecules above the SMILES to be submitted to the prediction web service are:&lt;br /&gt;&lt;br /&gt;O=C(NC1CCCCC1)C(Cc2ccc(C)cc2)N(c4ccccc4C(=O)c3ccccc3)C(=O)C(Cc5ccccc5)N=N#N&lt;br /&gt;AND&lt;br /&gt;O=C(NC1CCCCC1)C(C(=O)c2ccccc2)N(Cc3ccc(C)cc3)C(=O)C(C)CCN=N#N&lt;br /&gt;&lt;br /&gt;The predicted methanol solubilities are respectively &lt;a href="http://tinyurl.com/39kuo24"&gt;0.004 M&lt;/a&gt; and &lt;a href="http://tinyurl.com/3ypp77o"&gt;0.03 M&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;Now if we look at the details in the experimental section, both of these Ugi products were synthesized in methanol at a limiting reactant concentration of about 0.1 M.  Even though this is much more dilute than the usual 0.5-2.0 M generally recommended for Ugi reactions (&lt;a href="http://www3.interscience.wiley.com/journal/73500473/abstract"&gt;Domling 2000&lt;/a&gt;), the products still precipitate and can be filtered off.  This is consistent with the predicted solubilities above and the model would have suggested ahead of time that methanol might be a good solvent for isolation of the products by precipitation.&lt;br /&gt;&lt;br /&gt;So far these are just anecdotal results but it does illustrate that solubility models can be evaluated without explicit determination of solubility in the literature.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/14568785-5207584254876121748?l=usefulchem.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://usefulchem.blogspot.com/feeds/5207584254876121748/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=14568785&amp;postID=5207584254876121748' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/14568785/posts/default/5207584254876121748'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/14568785/posts/default/5207584254876121748'/><link rel='alternate' type='text/html' href='http://usefulchem.blogspot.com/2010/07/methanol-solubility-prediction-model-4.html' title='Methanol Solubility Prediction Model 4 for Ugi reactions in the literature'/><author><name>Jean-Claude Bradley</name><uri>http://www.blogger.com/profile/09157209601339686455</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='31' height='32' src='http://photos1.blogger.com/img/267/3447/320/headshot.jpg'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://2.bp.blogspot.com/_R3jz5f_NkRo/TDYsUioXgWI/AAAAAAAAA3A/t8PH2jS76KU/s72-c/ugiazide2.png' height='72' width='72'/><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-14568785.post-7166127689200592625</id><published>2010-06-25T19:26:00.000-04:00</published><updated>2010-06-25T19:26:50.666-04:00</updated><title type='text'>Reaction Attempts Explorer</title><content type='html'>Two months ago I &lt;a href="http://usefulchem.blogspot.com/2010/04/reaction-attempts-book-edition-1-and.html"&gt;reported&lt;/a&gt; on the &lt;a href="http://onsbooks.wikispaces.com/Reaction+Attempts"&gt;Reaction Attempts&lt;/a&gt; project and the availability of the summary as a physical or electronic (PDF) book.  The basic idea behind the project is to collect organic chemistry reaction attempts reported in Open Notebooks.  This would include not only successful experiments but also those which could be categorized as failed, ambiguous, in progress, etc.&lt;br /&gt;&lt;br /&gt;The book was organized with reactants listed alphabetically.  In this way one could browse through summaries of the types of reactions being attempted by different researchers on a reactant of interest.  There might be information there (what to do or what to avoid) of some use for a planned reaction.  At the very least one could contact the researcher to initiate a discussion about work that had not yet been published in the traditional system.&lt;br /&gt;&lt;br /&gt;&lt;a href="http://www.linkedin.com/in/andrewlang"&gt;Andrew Lang&lt;/a&gt; has just created a &lt;a href="http://showme.physics.drexel.edu/onsc/reactionattempts/"&gt;web-based tool&lt;/a&gt; to explore the Reaction Attempts database in much more sophisticated ways.&lt;br /&gt;&lt;br /&gt;Here are some scenarios of how one could use it.  On the left hand side of the page is a dropdown menu containing an alphabetically sorted list of all the reactants and products in the database.  Lets select furfurylamine.&lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://2.bp.blogspot.com/_R3jz5f_NkRo/TCUkUe8FFuI/AAAAAAAAA04/ieR2S5f0GYw/s1600/RAE1.PNG"&gt;&lt;img style="display: block; margin: 0px auto 10px; text-align: center; cursor: pointer; width: 400px; height: 115px;" src="http://2.bp.blogspot.com/_R3jz5f_NkRo/TCUkUe8FFuI/AAAAAAAAA04/ieR2S5f0GYw/s400/RAE1.PNG" alt="" id="BLOGGER_PHOTO_ID_5486831655341856482" border="0" /&gt;&lt;/a&gt;&lt;br /&gt;This immediately informs us that there are 230 reactions involving furfurylamine and it lists the schemes for all these reactions upon scrolling down.  That's still a bit hard to process so a second dropdown menu appears populated with a list of other reactants or products involved with furfurylamine.&lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://1.bp.blogspot.com/_R3jz5f_NkRo/TCUlBdPUCxI/AAAAAAAAA1A/OJ2OrgoWhZk/s1600/RAE2.PNG"&gt;&lt;img style="display: block; margin: 0px auto 10px; text-align: center; cursor: pointer; width: 347px; height: 199px;" src="http://1.bp.blogspot.com/_R3jz5f_NkRo/TCUlBdPUCxI/AAAAAAAAA1A/OJ2OrgoWhZk/s400/RAE2.PNG" alt="" id="BLOGGER_PHOTO_ID_5486832427979770642" border="0" /&gt;&lt;/a&gt;We now select boc-glycine and that narrows our search to 145 reactions.&lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://1.bp.blogspot.com/_R3jz5f_NkRo/TCUlzOqNfaI/AAAAAAAAA1I/sGtfYJ6O3Qg/s1600/RAE3.PNG"&gt;&lt;img style="display: block; margin: 0px auto 10px; text-align: center; cursor: pointer; width: 347px; height: 199px;" src="http://1.bp.blogspot.com/_R3jz5f_NkRo/TCUlzOqNfaI/AAAAAAAAA1I/sGtfYJ6O3Qg/s400/RAE3.PNG" alt="" id="BLOGGER_PHOTO_ID_5486833283059514786" border="0" /&gt;&lt;/a&gt;Selecting benzaldehyde from the third dropdown menu narrows the search further to 61 reactions.&lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://3.bp.blogspot.com/_R3jz5f_NkRo/TCUmhmC11XI/AAAAAAAAA1Q/bW-g9KkoaKA/s1600/RAE4.PNG"&gt;&lt;img style="display: block; margin: 0px auto 10px; text-align: center; cursor: pointer; width: 343px; height: 228px;" src="http://3.bp.blogspot.com/_R3jz5f_NkRo/TCUmhmC11XI/AAAAAAAAA1Q/bW-g9KkoaKA/s400/RAE4.PNG" alt="" id="BLOGGER_PHOTO_ID_5486834079610819954" border="0" /&gt;&lt;/a&gt;The final dropdown menu contains a short list of only isocyanides and thus all represent attempted Ugi reactions.  Selecting t-butyl isocyanide gives us 56 reactions.&lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://1.bp.blogspot.com/_R3jz5f_NkRo/TCUnArzHzhI/AAAAAAAAA1Y/_b1Qv0YC2kE/s1600/RAE5.PNG"&gt;&lt;img style="display: block; margin: 0px auto 10px; text-align: center; cursor: pointer; width: 400px; height: 348px;" src="http://1.bp.blogspot.com/_R3jz5f_NkRo/TCUnArzHzhI/AAAAAAAAA1Y/_b1Qv0YC2kE/s400/RAE5.PNG" alt="" id="BLOGGER_PHOTO_ID_5486834613731446290" border="0" /&gt;&lt;/a&gt;That means that these same 4 components were reacted together 56 times.  Looking at the various reaction summaries will show that some of these are duplicates for reproducibility and others vary concentration and solvent and the effect on yield is included.  This particular reaction was in fact the subject of a &lt;a href="http://www.jove.com/index/details.stp?ID=942"&gt;paper&lt;/a&gt; on the optimization of a Ugi reaction using an automated liquid handler.&lt;br /&gt;&lt;br /&gt;Now here is where the design of the Explorer comes in handy.  We might want to ask if the reaction proceeds as well with the other isocyanides.  All we have to do is switch the final dropdown menu to ask what happens when we go from t-butyl to n-butyl isonitrile.  There is a single attempt of this reaction and it is "failed" in the sense that no precipitate was obtained from the reaction mixture.  This doesn't mean that the reaction didn't take place - it might be that the Ugi product was too soluble.  We can quickly inspect that the concentration and solvent are in line with conditions that allowed precipitation of the t-butyl derivative.&lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://2.bp.blogspot.com/_R3jz5f_NkRo/TCUn4fPNHfI/AAAAAAAAA1g/ooS7DwuzUfM/s1600/RAE6.PNG"&gt;&lt;img style="display: block; margin: 0px auto 10px; text-align: center; cursor: pointer; width: 400px; height: 328px;" src="http://2.bp.blogspot.com/_R3jz5f_NkRo/TCUn4fPNHfI/AAAAAAAAA1g/ooS7DwuzUfM/s400/RAE6.PNG" alt="" id="BLOGGER_PHOTO_ID_5486835572432248306" border="0" /&gt;&lt;/a&gt;OK lets see what happens with n-pentyl isocyanide.&lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://3.bp.blogspot.com/_R3jz5f_NkRo/TCUq5KObepI/AAAAAAAAA1o/5ymn5qbdVd4/s1600/RAE7.PNG"&gt;&lt;img style="display: block; margin: 0px auto 10px; text-align: center; cursor: pointer; width: 400px; height: 328px;" src="http://3.bp.blogspot.com/_R3jz5f_NkRo/TCUq5KObepI/AAAAAAAAA1o/5ymn5qbdVd4/s400/RAE7.PNG" alt="" id="BLOGGER_PHOTO_ID_5486838882506603154" border="0" /&gt;&lt;/a&gt;It looks like it behaves just like n-butyl isocyanide: another single non-precipitation event. What about benzyl isocyanide?&lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://1.bp.blogspot.com/_R3jz5f_NkRo/TCUroeNsk6I/AAAAAAAAA1w/X-2u8OfspkE/s1600/RAE8.PNG"&gt;&lt;img style="display: block; margin: 0px auto 10px; text-align: center; cursor: pointer; width: 400px; height: 356px;" src="http://1.bp.blogspot.com/_R3jz5f_NkRo/TCUroeNsk6I/AAAAAAAAA1w/X-2u8OfspkE/s400/RAE8.PNG" alt="" id="BLOGGER_PHOTO_ID_5486839695326090146" border="0" /&gt;&lt;/a&gt;This time we do get the Ugi product from a single attempt. Note the lower yield compared to the t-butyl isocyanide under similar conditions.&lt;br /&gt;&lt;br /&gt;What about with cyclohexyl isocyanide?&lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://1.bp.blogspot.com/_R3jz5f_NkRo/TCUu13CioxI/AAAAAAAAA14/EnwQaeEbwwY/s1600/RAE9.PNG"&gt;&lt;img style="display: block; margin: 0px auto 10px; text-align: center; cursor: pointer; width: 400px; height: 201px;" src="http://1.bp.blogspot.com/_R3jz5f_NkRo/TCUu13CioxI/AAAAAAAAA14/EnwQaeEbwwY/s400/RAE9.PNG" alt="" id="BLOGGER_PHOTO_ID_5486843223863370514" border="0" /&gt;&lt;/a&gt;This time we hit an experiment in progress.  A precipitate was obtained but it was not characterized.  We can click on the link to the lab notebook page (&lt;a href="http://usefulchem.wikispaces.com/Exp232"&gt;EXP232&lt;/a&gt;) to learn more about how long it took for the precipitate to appear but there are not enough data to draw a definite conclusion about the successs of the reaction.  However, based on the results from the other precipitates in this series it is probably encouraging enough to repeat and characterize the product.&lt;br /&gt;&lt;br /&gt;There are other sources of information here.  Clicking on the image of the Ugi product takes us to its &lt;a href="http://www.chemspider.com/Chemical-Structure.24528191.html"&gt;ChemSpider entry&lt;/a&gt;.  In this case the only associated data relates to this reaction attempt.&lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://3.bp.blogspot.com/_R3jz5f_NkRo/TCUvf2FwRPI/AAAAAAAAA2A/LDo7BLUzvhg/s1600/RAE10.PNG"&gt;&lt;img style="display: block; margin: 0px auto 10px; text-align: center; cursor: pointer; width: 400px; height: 120px;" src="http://3.bp.blogspot.com/_R3jz5f_NkRo/TCUvf2FwRPI/AAAAAAAAA2A/LDo7BLUzvhg/s400/RAE10.PNG" alt="" id="BLOGGER_PHOTO_ID_5486843945162917106" border="0" /&gt;&lt;/a&gt;Lets look at another scenario: reactions involving aminoacetaldehyde dimethyl acetal.&lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://3.bp.blogspot.com/_R3jz5f_NkRo/TCUxtW6mc4I/AAAAAAAAA2I/KS97EGEOHZI/s1600/RAE11.PNG"&gt;&lt;img style="display: block; margin: 0px auto 10px; text-align: center; cursor: pointer; width: 400px; height: 161px;" src="http://3.bp.blogspot.com/_R3jz5f_NkRo/TCUxtW6mc4I/AAAAAAAAA2I/KS97EGEOHZI/s400/RAE11.PNG" alt="" id="BLOGGER_PHOTO_ID_5486846376336061314" border="0" /&gt;&lt;/a&gt;In this case we find the intersection of &lt;a href="http://www.chemspider.com/Chemical-Structure.24528191.html"&gt;two Open Notebooks&lt;/a&gt;.  The first reaction comes from &lt;a href="http://www.thesynapticleap.org/node/299"&gt;Michael Wolfle&lt;/a&gt; from the Todd group.&lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://4.bp.blogspot.com/_R3jz5f_NkRo/TCUymwTdn_I/AAAAAAAAA2Q/Hy8V_vOO7_k/s1600/RAE12.PNG"&gt;&lt;img style="display: block; margin: 0px auto 10px; text-align: center; cursor: pointer; width: 400px; height: 321px;" src="http://4.bp.blogspot.com/_R3jz5f_NkRo/TCUymwTdn_I/AAAAAAAAA2Q/Hy8V_vOO7_k/s400/RAE12.PNG" alt="" id="BLOGGER_PHOTO_ID_5486847362403770354" border="0" /&gt;&lt;/a&gt;The second comes from &lt;a href="http://usefulchem.wikispaces.com/Exp258"&gt;Khalid Mirza&lt;/a&gt; from the Bradley group.&lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://2.bp.blogspot.com/_R3jz5f_NkRo/TCUzE2H_DsI/AAAAAAAAA2Y/474SMsEY9ZE/s1600/RAE13.PNG"&gt;&lt;img style="display: block; margin: 0px auto 10px; text-align: center; cursor: pointer; width: 400px; height: 178px;" src="http://2.bp.blogspot.com/_R3jz5f_NkRo/TCUzE2H_DsI/AAAAAAAAA2Y/474SMsEY9ZE/s400/RAE13.PNG" alt="" id="BLOGGER_PHOTO_ID_5486847879362318018" border="0" /&gt;&lt;/a&gt;In order to learn more about the nature of the overlap we can use the substructure search capabilities of the &lt;a href="http://showme.physics.drexel.edu/onsc/reactionattempts/"&gt;Reaction Explorer&lt;/a&gt;.  Simply click on the image of the acetal and the &lt;a href="http://www.chemspider.com/Chemical-Structure.80987.html"&gt;ChemSpider entry&lt;/a&gt; pops up.  Now click on the copy button next to the SMILES for the compound.&lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://3.bp.blogspot.com/_R3jz5f_NkRo/TCUz7hwhicI/AAAAAAAAA2g/1P6LZx3gQlQ/s1600/RAE14.PNG"&gt;&lt;img style="display: block; margin: 0px auto 10px; text-align: center; cursor: pointer; width: 400px; height: 240px;" src="http://3.bp.blogspot.com/_R3jz5f_NkRo/TCUz7hwhicI/AAAAAAAAA2g/1P6LZx3gQlQ/s400/RAE14.PNG" alt="" id="BLOGGER_PHOTO_ID_5486848818788010434" border="0" /&gt;&lt;/a&gt;Paste the SMILES into the &lt;a href="http://www.daylight.com/dayhtml_tutorials/languages/smarts/"&gt;SMARTS&lt;/a&gt; box of the &lt;a href="http://showme.physics.drexel.edu/onsc/reactionattempts/"&gt;Reaction Explorer&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://2.bp.blogspot.com/_R3jz5f_NkRo/TCU0W0FXHKI/AAAAAAAAA2o/Jb6xwiDLXRI/s1600/RAE15.PNG"&gt;&lt;img style="display: block; margin: 0px auto 10px; text-align: center; cursor: pointer; width: 400px; height: 133px;" src="http://2.bp.blogspot.com/_R3jz5f_NkRo/TCU0W0FXHKI/AAAAAAAAA2o/Jb6xwiDLXRI/s400/RAE15.PNG" alt="" id="BLOGGER_PHOTO_ID_5486849287563713698" border="0" /&gt;&lt;/a&gt;We get 13 reaction attempts for this query - the two we found earlier and the rest corresponding to attempts by Michael Wolfle to synthesize praziquanamine. &lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://1.bp.blogspot.com/_R3jz5f_NkRo/TCU0yTYYNeI/AAAAAAAAA2w/aFwHG3VM2bg/s1600/RAE16.PNG"&gt;&lt;img style="display: block; margin: 0px auto 10px; text-align: center; cursor: pointer; width: 352px; height: 400px;" src="http://1.bp.blogspot.com/_R3jz5f_NkRo/TCU0yTYYNeI/AAAAAAAAA2w/aFwHG3VM2bg/s400/RAE16.PNG" alt="" id="BLOGGER_PHOTO_ID_5486849759821444578" border="0" /&gt;&lt;/a&gt;We learn that one connection between these two notebooks involves different attempts at &lt;a href="http://usefulchem.blogspot.com/2010/06/use-of-ons-to-protect-open-research.html"&gt;synthesizing praziquantel&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;Hopefully this demonstrates the value of abstracting organic chemistry reaction attempts from Open Notebooks into a machine readable format.  Contributions to the database require only the ChemSpider IDs of the reactants and product and a link to the relevant lab notebook page.  Reaction schemes are automatically generated by the system.  More on the Reaction Attempts project &lt;a href="http://onsbooks.wikispaces.com/Reaction+Attempts"&gt;here&lt;/a&gt;.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/14568785-7166127689200592625?l=usefulchem.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://usefulchem.blogspot.com/feeds/7166127689200592625/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=14568785&amp;postID=7166127689200592625' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/14568785/posts/default/7166127689200592625'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/14568785/posts/default/7166127689200592625'/><link rel='alternate' type='text/html' href='http://usefulchem.blogspot.com/2010/06/reaction-attempts-explorer.html' title='Reaction Attempts Explorer'/><author><name>Jean-Claude Bradley</name><uri>http://www.blogger.com/profile/09157209601339686455</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='31' height='32' src='http://photos1.blogger.com/img/267/3447/320/headshot.jpg'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://2.bp.blogspot.com/_R3jz5f_NkRo/TCUkUe8FFuI/AAAAAAAAA04/ieR2S5f0GYw/s72-c/RAE1.PNG' height='72' width='72'/><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-14568785.post-1203383614268658321</id><published>2010-06-07T14:35:00.000-04:00</published><updated>2010-06-07T14:35:31.669-04:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='telephone gambit'/><category scheme='http://www.blogger.com/atom/ns#' term='NSF'/><category scheme='http://www.blogger.com/atom/ns#' term='open notebook science'/><category scheme='http://www.blogger.com/atom/ns#' term='IGERT'/><title type='text'>IGERT NSF panel on Digital Science</title><content type='html'>On May 24, 2010 I was part of a panel in Washington for the &lt;a href="http://guest.cvent.com/EVENTS/Info/Agenda.aspx?i=d79c40c9-0396-4dbe-8198-3019ce31016b"&gt;NSF IGERT annual meeting&lt;/a&gt;.  As I &lt;a href="http://usefulchem.blogspot.com/2010/02/funding-agencies-and-open-science.html"&gt;mentioned previously&lt;/a&gt;, it is encouraging to find that funding agencies are paying more attention to the role of new forms of scholarship and dissemination of scientific information.&lt;br /&gt;&lt;br /&gt;My co-panelists included &lt;a href="http://scienceblogs.com/ethicsandscience/"&gt;Janet Stemwedel&lt;/a&gt;, who talked about the role of blogging in an academic career, &lt;a href="http://www.jove.com/index/Team.stp"&gt;Moshe Pritzker&lt;/a&gt;, who made a case for using video to communicate protocols in life sciences and &lt;a href="http://www.as.arizona.edu/people/faculty/impey.html"&gt;Chris Impey&lt;/a&gt;, who demonstrated applications of clickers and Second Life in the classroom. &lt;br /&gt;&lt;br /&gt;We only had 10 minutes each to speak so the presentations were basically highlights of what is possible.  Still, it was enough to stimulate a vigorous discussion with the audience.  There was a bit of controversy about the examples I used to demonstrate the limitations of peer review in chemistry.  People can misinterpret what we are trying to do with ONS - it certainly doesn't include bringing down the peer review system (not that we could anyway).  But we have to face the situation that peer review does not validate all the data and statements in a paper.  It operates at a much higher level of abstraction.  Providing transparency to the raw data should work in a synergistic way with the existing system. &lt;br /&gt;&lt;br /&gt;My favorite part of the conference was easily &lt;a href="http://www.sethshulman.com/"&gt;Seth Shulman'&lt;/a&gt;s talk on the "&lt;a href="http://www.sethshulman.com/seth_shulman_books/the_telephone_gambit.php"&gt;Telephone Gambit&lt;/a&gt;".  Ever since reading his book, I have been using the story of how carefully reading Bell's lab notebook has forced us to revise the generally accepted notion of how the telephone was invented.  Seth's presentation was truly captivating because he explained not only what was done but also what motives were at work to deceive and obfuscate. This cautionary tale is still very much relevant to science and invention today - and highlights how transparency can mitigate against this type of outcome.&lt;br /&gt;&lt;div style="width: 425px;" id="__ss_4319174"&gt;&lt;strong style="display: block; margin: 12px 0pt 4px;"&gt;&lt;a href="http://www.slideshare.net/jcbradley/igert-open-notebook-science-4319174" title="IGERT Open Notebook Science"&gt;IGERT Open Notebook Science&lt;/a&gt;&lt;/strong&gt;&lt;object id="__sse4319174" width="425" height="355"&gt;&lt;param name="movie" value="http://static.slidesharecdn.com/swf/ssplayer2.swf?doc=bradleyigert10-100526114337-phpapp02&amp;amp;stripped_title=igert-open-notebook-science-4319174"&gt;&lt;param name="allowFullScreen" value="true"&gt;&lt;param name="allowScriptAccess" value="always"&gt;&lt;embed name="__sse4319174" src="http://static.slidesharecdn.com/swf/ssplayer2.swf?doc=bradleyigert10-100526114337-phpapp02&amp;amp;stripped_title=igert-open-notebook-science-4319174" type="application/x-shockwave-flash" allowscriptaccess="always" allowfullscreen="true" width="425" height="355"&gt;&lt;/embed&gt;&lt;/object&gt;&lt;div style="padding: 5px 0pt 12px;"&gt;View more &lt;a href="http://www.slideshare.net/"&gt;presentations&lt;/a&gt; from &lt;a href="http://www.slideshare.net/jcbradley"&gt;Jean-Claude Bradley&lt;/a&gt;.&lt;/div&gt;&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/14568785-1203383614268658321?l=usefulchem.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://usefulchem.blogspot.com/feeds/1203383614268658321/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=14568785&amp;postID=1203383614268658321' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/14568785/posts/default/1203383614268658321'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/14568785/posts/default/1203383614268658321'/><link rel='alternate' type='text/html' href='http://usefulchem.blogspot.com/2010/06/igert-nsf-panel-on-digital-science.html' title='IGERT NSF panel on Digital Science'/><author><name>Jean-Claude Bradley</name><uri>http://www.blogger.com/profile/09157209601339686455</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='31' height='32' src='http://photos1.blogger.com/img/267/3447/320/headshot.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-14568785.post-6774953636093931247</id><published>2010-06-01T11:39:00.001-04:00</published><updated>2010-06-01T11:40:58.521-04:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='IP'/><category scheme='http://www.blogger.com/atom/ns#' term='open notebook science'/><category scheme='http://www.blogger.com/atom/ns#' term='patent'/><category scheme='http://www.blogger.com/atom/ns#' term='praziquantel'/><category scheme='http://www.blogger.com/atom/ns#' term='intellectual property'/><category scheme='http://www.blogger.com/atom/ns#' term='open science'/><title type='text'>Use of ONS to protect Open Research: the case of the Ugi approach to Praziquantel</title><content type='html'>As we were collecting reactions from &lt;a href="http://usefulchem.blogspot.com/2010/05/synaptic-leap-experiments-on-reaction.html"&gt;The Synaptic Leap for the Reaction Attempts&lt;/a&gt; project, &lt;a href="http://www.linkedin.com/in/andrewlang"&gt;Andrew Lang&lt;/a&gt; noticed that there might be a quick synthetic route to &lt;a href="http://en.wikipedia.org/wiki/Praziquantel"&gt;praziquantel&lt;/a&gt; via a &lt;a href="http://en.wikipedia.org/wiki/Ugi_reaction"&gt;Ugi reaction&lt;/a&gt;.  I researched it further and found a paper (&lt;a href="http://www.heterocycles.jp/library/abstract.php?article=7248"&gt;Kim et al 1998&lt;/a&gt;) where &lt;a href="http://www.chemspider.com/Chemical-Structure.9906289.html"&gt;Ugi product 1&lt;/a&gt; was indeed converted to racemic praziquantel via the Pictet-Spegler cyclization.&lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://3.bp.blogspot.com/_R3jz5f_NkRo/TAUh86qs5TI/AAAAAAAAA0w/AGtxEwRn3jo/s1600/praziugi.PNG"&gt;&lt;img style="display: block; margin: 0px auto 10px; text-align: center; cursor: pointer; width: 400px; height: 142px;" src="http://3.bp.blogspot.com/_R3jz5f_NkRo/TAUh86qs5TI/AAAAAAAAA0w/AGtxEwRn3jo/s400/praziugi.PNG" alt="" id="BLOGGER_PHOTO_ID_5477821852190303538" border="0" /&gt;&lt;/a&gt;&lt;br /&gt;Using Beilstein Crossfire the only synthesis of 1 I found involves a multi-step amidation strategy.  But this compound should be accessible in one step from commercially available starting materials via a Ugi reaction (shown above).  Since all the starting materials are liquids we have some flexibility with solvent choice.  Khalid first tried it in methanol &lt;a href="http://usefulchem.wikispaces.com/Exp258"&gt;EXP258&lt;/a&gt; a few weeks ago but did not get a precipitate.  He was going to monitor it by NMR next to see if the problem was high solubility of the Ugi product or with the reaction itself.&lt;br /&gt;&lt;br /&gt;It was therefore with great interest that I read &lt;a href="http://www.thesynapticleap.org/node/317"&gt;Mat Todd's report&lt;/a&gt; this morning on The Synaptic Leap that a &lt;a href="http://v3.espacenet.com/publicationDetails/biblio?CC=WO&amp;amp;NR=2009115333A1&amp;amp;KC=A1&amp;amp;FT=D&amp;amp;date=20090924&amp;amp;DB=EPODOC&amp;amp;locale=en_ep"&gt;German patent&lt;/a&gt; had been issued on this Ugi strategy to praziquantel.  (TSL didn't provide a means of leaving a comment so I edited the page - which made me the author of that post but actually Mat wrote it)&lt;br /&gt;&lt;br /&gt;I have often mentioned during my talks that Open Notebook Science could be used not only in a defensive manner to claim academic priority - but also as an offensive tactic to block patent applications.  A company attempting to prevent the commercial exploitation of rival inventions has a few options.  Where applicable, it can buy up an existing patent pool with the intention of sitting on it.  For new inventions, it can do research and try to file patents before their competitors.  But this is a costly process and it may make more sense to simply publish the inventions to create disclosed prior art, thereby blocking patent applications of their competitors.&lt;br /&gt;&lt;br /&gt;But - as I and many others have &lt;a href="http://usefulchem.blogspot.com/2010/04/bipolar-electrodeposition-of-cds.html"&gt;discussed&lt;/a&gt; - the current publication system is not optimally suited for the purpose of simply disclosing and communicating science.  Not only is it generally slow but the traditional article format requires a narrative of some sort - rarely can single experiments be published.  This means that much (if not most) of research done by an individual or group will never be disclosed.&lt;br /&gt;&lt;br /&gt;For these reasons I think that keeping an easily discoverable &lt;a href="http://en.wikipedia.org/wiki/Open_Notebook_Science"&gt;Open Notebook&lt;/a&gt; for projects designed to block patent submission by competitors makes a lot of sense - both economically and from a workflow perspective.  Since researchers already have to keep a lab notebook, making it public doesn't impose the added time that writing an article or patent will require.&lt;br /&gt;&lt;br /&gt;In this specific example of praziquantel we were too late.  But if we had recorded this experiment a few years ago it might have worked to block &lt;a href="http://v3.espacenet.com/publicationDetails/biblio?CC=WO&amp;amp;NR=2009115333A1&amp;amp;KC=A1&amp;amp;FT=D&amp;amp;date=20090924&amp;amp;DB=EPODOC&amp;amp;locale=en_ep"&gt;Domling's patent&lt;/a&gt;.  Now, it isn't clear to me that &lt;a href="http://usefulchem.wikispaces.com/Exp258"&gt;EXP258&lt;/a&gt; would have been enough to do that.  The strategy to make praziquantel via a Ugi reaction was clearly stated but the experiment was not conclusive.  However, since Domling reported that methanol worked I am sure that we would have had the "reduced to practice" evidence in the notebook shortly.&lt;br /&gt;&lt;br /&gt;Above I used a company as an example of a party motivated to disclose inventions to protect their interests.  In our case it would not be a company but rather the entire Open Science community.  It is in our best interest to keep our scientific territory as unencumbered by patents as possible.  Keeping Open Notebooks might be one of the simplest means of ensuring that.&lt;br /&gt;&lt;br /&gt;Consider a humanitarian organization that might want to manufacture praziquantel.  I haven't researched it but presumably the Domling patent was filed in a number of countries beside Germany.  In order to consider using the Ugi strategy, the organization would now have to deal with the patent holder.  This might be the factor that makes this route untenable.  Patents have proven to be problematic for humanitarian aid - even in the simple case of &lt;a href="http://news.bbc.co.uk/2/hi/europe/8610427.stm"&gt;providing food&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;But all is not lost.  In addition to offering a simple 2-step synthesis of praziqantel, the Ugi route offers an easy way to make large libraries of analogs.  Optimally we would like to work with someone who has experience with docking praziquantel.  It might be interesting to screen not only the praziquantel analogs but also the uncyclized Ugi products themselves.  When we did this for malarial enoyl reductase inhibitors (&lt;a href="http://usefulchem.wikispaces.com/D-EXP005"&gt;D-EXP005&lt;/a&gt;) we found that we did not need to cyclize to obtain compounds predicted to bind.  This ultimately led to &lt;a href="http://usefulchem.blogspot.com/2008/01/we-have-anti-malarial-activity.html"&gt;active compounds&lt;/a&gt;.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/14568785-6774953636093931247?l=usefulchem.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://usefulchem.blogspot.com/feeds/6774953636093931247/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=14568785&amp;postID=6774953636093931247' title='2 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/14568785/posts/default/6774953636093931247'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/14568785/posts/default/6774953636093931247'/><link rel='alternate' type='text/html' href='http://usefulchem.blogspot.com/2010/06/use-of-ons-to-protect-open-research.html' title='Use of ONS to protect Open Research: the case of the Ugi approach to Praziquantel'/><author><name>Jean-Claude Bradley</name><uri>http://www.blogger.com/profile/09157209601339686455</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='31' height='32' src='http://photos1.blogger.com/img/267/3447/320/headshot.jpg'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://3.bp.blogspot.com/_R3jz5f_NkRo/TAUh86qs5TI/AAAAAAAAA0w/AGtxEwRn3jo/s72-c/praziugi.PNG' height='72' width='72'/><thr:total>2</thr:total></entry><entry><id>tag:blogger.com,1999:blog-14568785.post-5935495743220103617</id><published>2010-05-16T20:26:00.000-04:00</published><updated>2010-05-16T20:26:52.747-04:00</updated><title type='text'>OpenSciNY Open Notebook Science Talk</title><content type='html'>On May 14, 2010 I presented on Open Notebook Science at the &lt;a href="http://opensciny.com/"&gt;OpenSciNY&lt;/a&gt; conference at the New York University Bobst Library.  I introduced the topic by telling a few &lt;a href="http://usefulchem.blogspot.com/2010/04/bipolar-electrodeposition-of-cds.html"&gt;stories&lt;/a&gt; about how new forms of communication are affecting how we think about concepts like "scientific precedent", "peer review", "scientific publishing" and "scientific scholarship".  At the end I spoke about archiving Open Notebook Science projects and showed the physical copies of both the &lt;a href="http://onsbooks.wikispaces.com/Reaction+Attempts"&gt;Reaction Attempts&lt;/a&gt; and &lt;a href="http://onsbooks.wikispaces.com/ONS+Solubility+Challenge"&gt;ONS Solubility Challenge&lt;/a&gt; books.&lt;br /&gt;&lt;br /&gt;&lt;a href="http://library.nyu.edu/mathematics/"&gt;Margaret Smith&lt;/a&gt; did a wonderful job of organizing the conference with a very interesting line-up of speakers: &lt;a href="http://opensciny.com/speakers/"&gt;Heather Joseph, Antony Williams, Elizabeth Brown and    David Hogg&lt;/a&gt;. We formed break-out sessions at the end to discuss with the attendees concepts around Open Science.  I was part of the session on Promoting Open Science.&lt;br /&gt;&lt;br /&gt;The tone at this and other similar conferences I have attended recently is probably best described as cautiously optimistic and focused on what is possible.  The Open Science movement - at least as far as it is reflected by the people I know - does not seem to be driven by zealots or idealists trying to get everyone to drink the cool-aid.  It is just a bunch of people who see opportunities to do things in better ways as new tools become available - and they can't find a credible reason not to do them.&lt;br /&gt;&lt;br /&gt;Check &lt;a href="http://friendfeed.com/opensciny"&gt;here on FriendFeed&lt;/a&gt; for updates about links to recordings, slides, etc.&lt;br /&gt;&lt;br /&gt;My presentation below:&lt;br /&gt;&lt;div style="width: 425px;" id="__ss_4096459"&gt;&lt;strong style="display: block; margin: 12px 0pt 4px;"&gt;&lt;a href="http://www.slideshare.net/jcbradley/opensciny-open-notebook-science" title="OpenSciNY Open Notebook Science"&gt;OpenSciNY Open Notebook Science&lt;/a&gt;&lt;/strong&gt;&lt;object id="__sse4096459" width="425" height="355"&gt;&lt;param name="movie" value="http://static.slidesharecdn.com/swf/ssplayer2.swf?doc=opensciny10-100514075135-phpapp02&amp;amp;stripped_title=opensciny-open-notebook-science"&gt;&lt;param name="allowFullScreen" value="true"&gt;&lt;param name="allowScriptAccess" value="always"&gt;&lt;embed name="__sse4096459" src="http://static.slidesharecdn.com/swf/ssplayer2.swf?doc=opensciny10-100514075135-phpapp02&amp;amp;stripped_title=opensciny-open-notebook-science" type="application/x-shockwave-flash" allowscriptaccess="always" allowfullscreen="true" width="425" height="355"&gt;&lt;/embed&gt;&lt;/object&gt;&lt;div style="padding: 5px 0pt 12px;"&gt;View more &lt;a href="http://www.slideshare.net/"&gt;presentations&lt;/a&gt; from &lt;a href="http://www.slideshare.net/jcbradley"&gt;Jean-Claude Bradley&lt;/a&gt;.&lt;/div&gt;&lt;/div&gt;&lt;br /&gt;&lt;br /&gt;&lt;object width="480" height="400"&gt;&lt;param name="movie" value="http://www.scivee.tv/flash/embedCast.swf"&gt;&lt;param name="allowfullscreen" value="true"&gt;&lt;param name="flashvars" value="id=17876&amp;amp;type=3"&gt;&lt;embed src="http://www.scivee.tv/flash/embedCast.swf" allowfullscreen="true" flashvars="id=17876&amp;amp;type=3" width="480" height="400"&gt;&lt;/embed&gt;&lt;/object&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/14568785-5935495743220103617?l=usefulchem.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://usefulchem.blogspot.com/feeds/5935495743220103617/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=14568785&amp;postID=5935495743220103617' title='2 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/14568785/posts/default/5935495743220103617'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/14568785/posts/default/5935495743220103617'/><link rel='alternate' type='text/html' href='http://usefulchem.blogspot.com/2010/05/opensciny-open-notebook-science-talk.html' title='OpenSciNY Open Notebook Science Talk'/><author><name>Jean-Claude Bradley</name><uri>http://www.blogger.com/profile/09157209601339686455</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='31' height='32' src='http://photos1.blogger.com/img/267/3447/320/headshot.jpg'/></author><thr:total>2</thr:total></entry><entry><id>tag:blogger.com,1999:blog-14568785.post-3900179893684179889</id><published>2010-05-07T17:17:00.000-04:00</published><updated>2010-05-07T17:17:34.360-04:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='thescientist'/><category scheme='http://www.blogger.com/atom/ns#' term='open notebook science'/><category scheme='http://www.blogger.com/atom/ns#' term='ELN'/><category scheme='http://www.blogger.com/atom/ns#' term='lab notebook'/><category scheme='http://www.blogger.com/atom/ns#' term='Wikispaces'/><title type='text'>The Scientist Article on Electronic Lab Notebooks</title><content type='html'>&lt;a href="http://www.amberldance.com/Amber%20Dance--science%20writing/Amber%20Dance--Science%20Writer.html"&gt;Amber Dance&lt;/a&gt; has written an article in The Scientist (2010-05-01) &lt;a href="http://www.the-scientist.com/article/display/57363/"&gt;Digital Upgrade: How to choose your lab’s next electronic lab notebook&lt;/a&gt;. This is basically a quick overview of different Electronic Lab Notebooks (ELNs) that should be helpful for people researching what is currently available in that space.&lt;br /&gt;&lt;br /&gt;There was some coverage of &lt;a href="http://en.wikipedia.org/wiki/Open_Notebook_Science"&gt;Open Notebook Science&lt;/a&gt; and &lt;a href="http://www.linkedin.com/in/stevekoch"&gt;Steve Koch&lt;/a&gt; and I were quoted.  Ironically my contribution appeared in the "Cons" section :)&lt;br /&gt;&lt;blockquote&gt;&lt;p&gt;&lt;strong&gt;Pros&lt;/strong&gt;  &lt;/p&gt;&lt;ul&gt;&lt;li&gt;The format is unconstrained—you can set up any categories, and as  many users and pages, as you want—and fast to set up.&lt;/li&gt;&lt;li&gt;Open notebooking attracts collaborators. Koch counts three  collaborations that wouldn’t have happened if he weren’t on OpenWetWare.  And his students build professional networks well before they author a  paper.&lt;/li&gt;&lt;/ul&gt;  &lt;p&gt;&lt;strong&gt;Cons&lt;/strong&gt;  &lt;/p&gt;&lt;ul&gt;&lt;li&gt;Wikis were not designed with scientific data in mind. For example,  it’s hard to make a table, Koch says.&lt;/li&gt;&lt;li&gt;Open notebook science “does limit where you can send your work,”  says Jean-Claude Bradley, a chemist at Drexel University in  Philadelphia, who also uses an open wiki notebook. His lab sticks to  journals that accept preprints.&lt;/li&gt;&lt;li&gt;Posting online voids international patent rights, although US  patents are still possible.&lt;/li&gt;&lt;/ul&gt;&lt;/blockquote&gt;In my opinion, one of the biggest "Pros" wasn't listed in that section: the free cost. (That was mentioned elsewhere though)  When you see the costs of some of these other commercial systems, that has to be a factor for many people trying to make a decision.&lt;br /&gt;&lt;br /&gt;If privacy is an issue wikis can certainly be made private, although I'm not sure if that is possible on OpenWetWare.  It can be done for $5/month on Wikispaces, the wiki we use for lab notebooks - although then it wouldn't be Open Notebook Science.&lt;br /&gt;&lt;br /&gt;Concerning Steve's Con of wikis being difficult to use to store data, that is true.  However combining the use of a wiki with Google Spreadsheets has completely resolved that issue for us.  With our ability to automatically &lt;a href="http://usefulchem.blogspot.com/2010/02/ons-solubility-book-edition-3-with.html"&gt;export an archive&lt;/a&gt; of the notebook (as HTML) and spreadsheets (as XLS) into an integrated archive, the two platforms operate essentially as if they were a single system.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/14568785-3900179893684179889?l=usefulchem.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://usefulchem.blogspot.com/feeds/3900179893684179889/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=14568785&amp;postID=3900179893684179889' title='2 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/14568785/posts/default/3900179893684179889'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/14568785/posts/default/3900179893684179889'/><link rel='alternate' type='text/html' href='http://usefulchem.blogspot.com/2010/05/scientist-article-on-electronic-lab.html' title='The Scientist Article on Electronic Lab Notebooks'/><author><name>Jean-Claude Bradley</name><uri>http://www.blogger.com/profile/09157209601339686455</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='31' height='32' src='http://photos1.blogger.com/img/267/3447/320/headshot.jpg'/></author><thr:total>2</thr:total></entry><entry><id>tag:blogger.com,1999:blog-14568785.post-3629059505233603392</id><published>2010-05-04T15:23:00.001-04:00</published><updated>2010-05-04T15:25:23.533-04:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='second life'/><category scheme='http://www.blogger.com/atom/ns#' term='ACS'/><title type='text'>Visualizing Chemistry in Second Life ACS Talk Recording</title><content type='html'>The American Chemical Society has processed the recording of the talk that Andrew Lang and I gave at the Spring 2010 ACS meeting in San Francisco on March 23, 2010:&lt;br /&gt;&lt;a href="http://www.softconference.com/acschem/player.asp?PVQ=GHIL&amp;amp;fVQ=FEDKEH&amp;amp;hVQ="&gt;Visualizing Chemistry in Second Life&lt;/a&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/14568785-3629059505233603392?l=usefulchem.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://usefulchem.blogspot.com/feeds/3629059505233603392/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=14568785&amp;postID=3629059505233603392' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/14568785/posts/default/3629059505233603392'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/14568785/posts/default/3629059505233603392'/><link rel='alternate' type='text/html' href='http://usefulchem.blogspot.com/2010/05/visualizing-chemistry-in-second-life.html' title='Visualizing Chemistry in Second Life ACS Talk Recording'/><author><name>Jean-Claude Bradley</name><uri>http://www.blogger.com/profile/09157209601339686455</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='31' height='32' src='http://photos1.blogger.com/img/267/3447/320/headshot.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-14568785.post-4889757841621910554</id><published>2010-05-03T11:59:00.000-04:00</published><updated>2010-05-03T11:59:50.980-04:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='chemspider'/><category scheme='http://www.blogger.com/atom/ns#' term='syntheticpages'/><category scheme='http://www.blogger.com/atom/ns#' term='reactionattempts'/><category scheme='http://www.blogger.com/atom/ns#' term='cssp'/><category scheme='http://www.blogger.com/atom/ns#' term='Ugi reaction'/><title type='text'>ChemSpider SyntheticPages</title><content type='html'>I recently &lt;a href="http://usefulchem.blogspot.com/2010/04/reaction-attempts-book-edition-1-and.html"&gt;mentioned&lt;/a&gt; the &lt;a href="http://onsbooks.wikispaces.com/Reaction+Attempts"&gt;Reaction Attempts&lt;/a&gt; project, which aims to collect organic chemistry experiments - especially those that are "failed", in progress or somehow incomplete.&lt;br /&gt;&lt;br /&gt;For reactions where the desired product has been obtained and fully characterized, &lt;a href="http://cssp.chemspider.com/"&gt;ChemSpider SyntheticPages&lt;/a&gt; also offers a very convenient publication vehicle.  As I &lt;a href="http://usefulchem.blogspot.com/2010/04/bipolar-electrodeposition-of-cds.html"&gt;mentioned previously&lt;/a&gt; there is a need for enabling the publication of single experiments, especially when these are unlikely to become part of a traditional article.&lt;br /&gt;&lt;br /&gt;We are in the process of submitting suitable reactions from the &lt;a href="http://usefulchem.wikispaces.com/All+Reactions"&gt;UsefulChem project&lt;/a&gt; to CS|SP.  This will require some re-formatting of procedures and characterization data as they currently appear in the lab notebook.&lt;br /&gt;&lt;br /&gt;Here is an example of &lt;a href="http://usefulchem.wikispaces.com/exp176"&gt;one of our Ugi reactions&lt;/a&gt;: &lt;a href="http://cssp.chemspider.com/Article.aspx?id=406&amp;amp;hw=0"&gt;SyntheticPage 406 (UCEXP176C)&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://1.bp.blogspot.com/_R3jz5f_NkRo/S97uHLQAFqI/AAAAAAAAA0g/7Pl2acT_iZ4/s1600/cssp1.PNG"&gt;&lt;img style="display: block; margin: 0px auto 10px; text-align: center; cursor: pointer; width: 400px; height: 291px;" src="http://1.bp.blogspot.com/_R3jz5f_NkRo/S97uHLQAFqI/AAAAAAAAA0g/7Pl2acT_iZ4/s400/cssp1.PNG" alt="" id="BLOGGER_PHOTO_ID_5467068804720760482" border="0" /&gt;&lt;/a&gt; &lt;br /&gt;A nice feature of these pages is the automatic rendering of 2D structures upon hovering on top of chemical names.&lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://2.bp.blogspot.com/_R3jz5f_NkRo/S97uM-aS7yI/AAAAAAAAA0o/300sM7NOIzE/s1600/cssp2.PNG"&gt;&lt;img style="display: block; margin: 0px auto 10px; text-align: center; cursor: pointer; width: 400px; height: 222px;" src="http://2.bp.blogspot.com/_R3jz5f_NkRo/S97uM-aS7yI/AAAAAAAAA0o/300sM7NOIzE/s400/cssp2.PNG" alt="" id="BLOGGER_PHOTO_ID_5467068904353492770" border="0" /&gt;&lt;/a&gt;&lt;br /&gt;Here are a &lt;a href="http://cssp.chemspider.com/About.aspx"&gt;few more reasons&lt;/a&gt; to use ChemSpider SyntheticPages:&lt;br /&gt;&lt;blockquote&gt;    * ChemSpider SyntheticPages takes you directly to a procedure. When you get a hit - you get a procedure.&lt;br /&gt;  * ChemSpider SyntheticPages provides information that may not generally be found elsewhere, such as frequently encountered problems, trouble-shooting tips, the number of times the reaction has been carried out, scale-variation etc.&lt;br /&gt;  * ChemSpider SyntheticPages is the only interactive chemistry database. Information is constantly updated and validated by comments from the user community (Peer Review in the Public Domain™).&lt;br /&gt;  * ChemSpider SyntheticPages can provide you with the most up-to-date method, we aim for 95% of submissions to be processed within 48 hours of submission.&lt;br /&gt;  * ChemSpider SyntheticPages is free of charge.&lt;/blockquote&gt;&lt;span style="font-style: italic;"&gt;[Disclaimer: I am a member of the &lt;a href="http://cssp.chemspider.com/Editors.aspx"&gt;editorial group&lt;/a&gt; at CS|SP]&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/14568785-4889757841621910554?l=usefulchem.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://usefulchem.blogspot.com/feeds/4889757841621910554/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=14568785&amp;postID=4889757841621910554' title='1 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/14568785/posts/default/4889757841621910554'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/14568785/posts/default/4889757841621910554'/><link rel='alternate' type='text/html' href='http://usefulchem.blogspot.com/2010/05/chemspider-syntheticpages.html' title='ChemSpider SyntheticPages'/><author><name>Jean-Claude Bradley</name><uri>http://www.blogger.com/profile/09157209601339686455</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='31' height='32' src='http://photos1.blogger.com/img/267/3447/320/headshot.jpg'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://1.bp.blogspot.com/_R3jz5f_NkRo/S97uHLQAFqI/AAAAAAAAA0g/7Pl2acT_iZ4/s72-c/cssp1.PNG' height='72' width='72'/><thr:total>1</thr:total></entry><entry><id>tag:blogger.com,1999:blog-14568785.post-1658902223914724146</id><published>2010-05-02T20:37:00.000-04:00</published><updated>2010-05-02T20:37:49.317-04:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='reactionattempts'/><category scheme='http://www.blogger.com/atom/ns#' term='open notebook science'/><category scheme='http://www.blogger.com/atom/ns#' term='praziquantel'/><category scheme='http://www.blogger.com/atom/ns#' term='synapticleap'/><title type='text'>The Synaptic Leap Experiments on Reaction Attempts</title><content type='html'>Andrew Lang and I &lt;a href="http://usefulchem.blogspot.com/2010/04/reaction-attempts-book-edition-1-and.html"&gt;recently reported&lt;/a&gt; on the first edition of the &lt;a href="http://onsbooks.wikispaces.com/Reaction+Attempts"&gt;Reaction Attempts book and database&lt;/a&gt;.  Part of the motivation for this was to structure the experiments from the &lt;a href="http://usefulchem.wikispaces.com/"&gt;UsefulChem project&lt;/a&gt; in both a machine readable format and one that could be browsed as a physical copy.  However, we also had in mind the easy integration of other open experiments, especially those labeled as "failed", since these are unlikely to be found by searching conventional reaction archives.&lt;br /&gt;&lt;br /&gt;As a demonstration, we have added a series of experiments from &lt;a href="http://www.thesynapticleap.org/"&gt;The Synaptic Leap&lt;/a&gt;, which &lt;a href="http://friendfeed.com/michaelwolfle"&gt;Michael Wolfle&lt;/a&gt; (working as a post-doc with &lt;a href="http://friendfeed.com/mattodd"&gt;Mat Todd&lt;/a&gt;) has posted.  All of these reactions involve intermediates in the synthesis of &lt;a href="http://en.wikipedia.org/wiki/Praziquantel"&gt;praziquantel&lt;/a&gt;, which is a major focus of the Todd group.  One group of these reactions involved the attempted synthesis of &lt;a href="http://friendfeed.com/enantiopure-pzq/326b04ec/resolution-of-praziquanamine"&gt;praziquanamine&lt;/a&gt; via a &lt;a href="http://en.wikipedia.org/wiki/Pictet%E2%80%93Spengler_reaction"&gt;Pictet-Spengler cyclization&lt;/a&gt;.  Most of these are &lt;a href="http://www.thesynapticleap.org/node/306"&gt;failed attempts&lt;/a&gt; and &lt;a href="http://www.thesynapticleap.org/node/300"&gt;one successful one&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;Adding these experiments to Reaction Attempts was very simple - since the minimum information required is the ChemSpiderIDs (CSIDs) of all the reactants and the product, which a hyperlink to more details.  We also added a few more details provided by Michael - such as the solvent, reaction conditions and outcome.&lt;br /&gt;&lt;br /&gt;Andy has provided a simple mechanism to pull up all Reaction Attempts for a given reactant with the following url structure:&lt;br /&gt;&lt;blockquote&gt;&lt;a href="http://showme.physics.drexel.edu/onsc/databook/ucdatabook.php?reactants=9099925"&gt;http://showme.physics.drexel.edu/onsc/databook/ucdatabook.php?reactants=9099925&lt;/a&gt;&lt;/blockquote&gt;The number at the end is the CSID for the reactant.  Multiple reactants can be pulled from the database by adding more CSIDs separated by commas.&lt;br /&gt;&lt;br /&gt;Successful runs in Reaction Attempts are identified with a green check mark:&lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://1.bp.blogspot.com/_R3jz5f_NkRo/S93s2Rqgh5I/AAAAAAAAA0Y/FghUItiLFhQ/s1600/slattempts1.PNG"&gt;&lt;img style="display: block; margin: 0px auto 10px; text-align: center; cursor: pointer; width: 400px; height: 208px;" src="http://1.bp.blogspot.com/_R3jz5f_NkRo/S93s2Rqgh5I/AAAAAAAAA0Y/FghUItiLFhQ/s400/slattempts1.PNG" alt="" id="BLOGGER_PHOTO_ID_5466785939896436626" border="0" /&gt;&lt;/a&gt;&lt;br /&gt;Again the main idea here is not to exhaustively abstract all pertinent information for an experiment.  Rather it is to connect up researchers who are working on similar reactions.  Since it requires so little effort to come up with the minimum required information we are hoping to get contributions from other sources.&lt;br /&gt;&lt;br /&gt;We will focus next on coming up with more sophisticated ways to retrieve information - such as substructure searching or by reaction type, solvent, etc.  We will also periodically publish hard copies of future &lt;a href="http://onsbooks.wikispaces.com/Reaction+Attempts"&gt;Reaction Attempts editions&lt;/a&gt;.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/14568785-1658902223914724146?l=usefulchem.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://usefulchem.blogspot.com/feeds/1658902223914724146/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=14568785&amp;postID=1658902223914724146' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/14568785/posts/default/1658902223914724146'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/14568785/posts/default/1658902223914724146'/><link rel='alternate' type='text/html' href='http://usefulchem.blogspot.com/2010/05/synaptic-leap-experiments-on-reaction.html' title='The Synaptic Leap Experiments on Reaction Attempts'/><author><name>Jean-Claude Bradley</name><uri>http://www.blogger.com/profile/09157209601339686455</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='31' height='32' src='http://photos1.blogger.com/img/267/3447/320/headshot.jpg'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://1.bp.blogspot.com/_R3jz5f_NkRo/S93s2Rqgh5I/AAAAAAAAA0Y/FghUItiLFhQ/s72-c/slattempts1.PNG' height='72' width='72'/><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-14568785.post-6453300441448317095</id><published>2010-04-30T12:32:00.000-04:00</published><updated>2010-04-30T12:32:10.934-04:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='integration'/><category scheme='http://www.blogger.com/atom/ns#' term='GoogleSpreadsheet'/><category scheme='http://www.blogger.com/atom/ns#' term='web service'/><category scheme='http://www.blogger.com/atom/ns#' term='NMR'/><title type='text'>NMR integration web service expanded</title><content type='html'>The &lt;a href="http://onschallenge.wikispaces.com/"&gt;ONS Challenge&lt;/a&gt; has extensively used a web service created by Andrew Lang to &lt;a href="http://usefulchem.blogspot.com/2009/05/streamlining-automated-solubility.html"&gt;automatically calculate solubility from NMR spectra&lt;/a&gt;.  One of the constraints of the service was that the &lt;a href="http://en.wikipedia.org/wiki/JCAMP"&gt;JCAMP-DX&lt;/a&gt; file had to be deposited in a special folder on a server at Drexel.&lt;br /&gt;&lt;br /&gt;Andy has now modified the script so that the JCAMP-DX file can be located anywhere on the internet.  I have prepared a modified &lt;a href="https://spreadsheets.google.com/ccc?key=0Ak1R8T6wt4YQdFU1QjY2S2lEeXV2S1h0eDlJdnRKZEE&amp;amp;hl=en"&gt;Google Spreadsheet&lt;/a&gt; to serve as a template for SAMS calculations (&lt;a href="http://usefulchem.blogspot.com/2009/03/semi-automated-measurement-of.html"&gt;Semi-Automated Measurement of Solubility&lt;/a&gt;).  Simply enter the url to the JCAMP-DX file in the appropriate column and fill in the ppm ranges and corresponding hydrogen numbers for the solvent and solute, and molecular weight and density data. (The predicted density of solids can be found on &lt;a href="http://chemspider.com/"&gt;Chemspider&lt;/a&gt;).  The concentration of the solute will then be automatically calculated based on an &lt;a href="http://usefulchem.blogspot.com/2009/03/semi-automated-measurement-of.html"&gt;assumption of volume additivity&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;The web service (&lt;a href="http://usefulchem.blogspot.com/2009/05/baseline-correction-for-automated.html"&gt;which handles baseline correction&lt;/a&gt;) could be used for any other purpose involving the integration of spectra.  Just make a copy of the &lt;a href="https://spreadsheets.google.com/ccc?key=0Ak1R8T6wt4YQdFU1QjY2S2lEeXV2S1h0eDlJdnRKZEE&amp;amp;hl=en"&gt;Google Spreadsheet&lt;/a&gt; and modify.&lt;br /&gt;&lt;br /&gt;Note that the JCAMP-DX files must be in XY format. If your instrument saves spectra in a compressed format they must be converted to XY.  The desktop version of Robert Lancashire's &lt;a href="http://sourceforge.net/projects/jspecview/"&gt;JSpecView&lt;/a&gt; can be used to carry out the conversion.&lt;br /&gt;&lt;br /&gt;This template spreadsheet also features a service in a cell to display the NMR spectrum by simply clicking on the link inside the cell.  This is very handy because it obviates the need to create an HTML file which must normally accompany the JCAMP-DX file for viewing.  Being able to quickly view a spectrum from a particular row within the Google Spreadsheet makes tracking data provenance very intuitive and errors easy to spot.&lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://1.bp.blogspot.com/_R3jz5f_NkRo/S9sEWoRlKXI/AAAAAAAAA0Q/RbjgtQAxiZU/s1600/sams1.JPG"&gt;&lt;img style="display: block; margin: 0px auto 10px; text-align: center; cursor: pointer; width: 364px; height: 213px;" src="http://1.bp.blogspot.com/_R3jz5f_NkRo/S9sEWoRlKXI/AAAAAAAAA0Q/RbjgtQAxiZU/s400/sams1.JPG" alt="" id="BLOGGER_PHOTO_ID_5465967359559018866" border="0" /&gt;&lt;/a&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/14568785-6453300441448317095?l=usefulchem.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://usefulchem.blogspot.com/feeds/6453300441448317095/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=14568785&amp;postID=6453300441448317095' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/14568785/posts/default/6453300441448317095'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/14568785/posts/default/6453300441448317095'/><link rel='alternate' type='text/html' href='http://usefulchem.blogspot.com/2010/04/nmr-integration-web-service-expanded.html' title='NMR integration web service expanded'/><author><name>Jean-Claude Bradley</name><uri>http://www.blogger.com/profile/09157209601339686455</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='31' height='32' src='http://photos1.blogger.com/img/267/3447/320/headshot.jpg'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://1.bp.blogspot.com/_R3jz5f_NkRo/S9sEWoRlKXI/AAAAAAAAA0Q/RbjgtQAxiZU/s72-c/sams1.JPG' height='72' width='72'/><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-14568785.post-523153569515697161</id><published>2010-04-28T16:47:00.000-04:00</published><updated>2010-04-28T16:47:56.873-04:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='reaction attempts'/><category scheme='http://www.blogger.com/atom/ns#' term='lulu'/><category scheme='http://www.blogger.com/atom/ns#' term='open notebook science'/><category scheme='http://www.blogger.com/atom/ns#' term='Ugi reaction'/><category scheme='http://www.blogger.com/atom/ns#' term='usefulchem'/><category scheme='http://www.blogger.com/atom/ns#' term='book'/><title type='text'>Reaction Attempts Book Edition 1 and UsefulChem Archive</title><content type='html'>I am pleased to report that Andrew Lang and I have published the first edition of the &lt;a href="http://onsbooks.wikispaces.com/Reaction+Attempts"&gt;Reaction Attempts book&lt;/a&gt;.  It currently contains most of the Ugi reactions from the UsefulChem project and is associated with an April 27, 2010 snapshot archive of the entire UsefulChem project, including NMR spectra, spreadsheets, images and the entire lab notebook from Wikispaces.&lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://2.bp.blogspot.com/_R3jz5f_NkRo/S9ib-bhquzI/AAAAAAAAA0I/i176-NQyG_A/s1600/reactionattemptsbook1.JPG"&gt;&lt;img style="display: block; margin: 0px auto 10px; text-align: center; cursor: pointer; width: 400px; height: 303px;" src="http://2.bp.blogspot.com/_R3jz5f_NkRo/S9ib-bhquzI/AAAAAAAAA0I/i176-NQyG_A/s400/reactionattemptsbook1.JPG" alt="" id="BLOGGER_PHOTO_ID_5465289644657261362" border="0" /&gt;&lt;/a&gt;&lt;br /&gt;At 582 pages the printing cost from LuLu amounts to $26.28.  Not meant to replace electronic searches, it should prove to be a handy reference book for the lab to quickly browse through what was attempted for a given reactant, what the outcome was and the researcher involved.&lt;br /&gt;&lt;br /&gt;We are hoping to include reaction attempts from other groups in future editions.  More details can be found in the preface, reproduced below:&lt;br /&gt;&lt;blockquote&gt;&lt;h2&gt;Reaction Attempts First Edition&lt;/h2&gt;&lt;h2&gt;&lt;span style="font-weight: normal;"&gt;Data Source: the UsefulChem project&lt;/span&gt;&lt;/h2&gt;&lt;h3&gt;Introduction&lt;/h3&gt;Open  Notebook Science (ONS) refers to the practice of making the full  contents of a laboratory notebook and all associated raw data files  available in near real time.[1]  This represents an opportunity for  everyone to benefit from work in progress in an open research group.   However, in order to make use of the information, it must be easily  discoverable.  A simple strategy to increase discoverability is  redundancy over multiple communication platforms.&lt;br /&gt;&lt;br /&gt;In another  project - the Open Notebook Science Solubility Challenge[2] - we  published non-aqueous solubility data in the form of physical and  downloadable (PDF) books.[3] Although it is possible to search the  solubility database using web query interfaces, exploration of a Google  Spreadsheet, an XML feed, etc.[4], having a physical copy in the  laboratory has proved to be very convenient in several instances.  A  similar format for reactions will also be useful.&lt;br /&gt;&lt;h3&gt;The UsefulChem  Project&lt;/h3&gt;UsefulChem started in 2005 as an organic chemistry Open  Notebook Science project with a main goal of discovering new  anti-malarial agents that can be prepared by simple and cheap  syntheses.[5]  Most of the reactions on UsefuChem are Ugi reactions,  which involve the mixing of an amine, aldehyde, carboxylic acid and  isonitrile in a solvent at room temperature generally for a few hours to  days.[6]  The multicomponent design of the Ugi reaction and the simple  reaction conditions make it ideal for exploring large virtual libraries  and selecting compounds of interest to make.[7]&lt;br /&gt;&lt;br /&gt;Isolation of the  Ugi products can be immensely simpler, cheaper and readily scalable if  they precipitate in pure form from the reaction mixture.  To this end,  much of the research in the UsefulChem project focuses on reaction  conditions that lead to this outcome.[8]  This is in fact the origin of  the ONS Solubility Challenge discussed above.[9]&lt;br /&gt;&lt;br /&gt;&lt;h3&gt;The Reaction  Attempts Database&lt;/h3&gt;In order to look for patterns in the reaction  conditions which led to Ugi product precipitation, the CombiUgiResults  Google Spreadsheet was set up.[10]  Reactions indexed there can be  sorted by precipitation outcome, solvent, reactant, concentration, etc.  and links to the laboratory notebook pages can be followed for full  details.  However, this sheet is designed specifically for Ugi reactions  and contains columns specifically for the aldehyde, amine, carboxylic  acid and isonitrile.&lt;br /&gt;&lt;br /&gt;In order to enable the tracking of other  types of reactions, the information in the CombiUgiResults sheet was  reformatted into two other sheets: ReactionAttempts[11] (containing  reagents and reactants) and RXIDsReactionAttempts[12] (containing  reaction conditions and results, such as solvent, concentration of  limiting reactant, appearance of a precipitate, yield, etc.).  The two  sheets are connected via the use of a common ReactionID. This format  permits the representation of any type of reaction, with an unlimited  number of reactants and products.[13]&lt;br /&gt;&lt;br /&gt;By definition, any Open  Notebook Science project in a work in progress.  The listing of a  reaction in this database only means that the researcher attempted or is  in the process of attempting it.  Whatever the situation, a link to the  laboratory notebook page is provided, where the most recent information  is available.  The philosophy used here is that partial information is  always better than no information at all.  Thus a researcher  investigating the prior use a particular reactant in a Ugi reaction  might find the report that a precipitate was obtained in methanol  helpful for designing their own reactions, even if the characterization  of the precipitate is still pending.  At the very least, knowing that a  certain researcher has at least attempted a similar reaction is enough  information for initiating a discussion, which may lead to valuable  insights.&lt;br /&gt;&lt;h3&gt;Reaction Attempts on Chemspider&lt;/h3&gt;Although SMILES[14]  are provided in the spreadsheets, the primary key to identify compounds  is the ChemSpider ID (CSID)[15]. This allows us to render molecule  images in the book automatically.  In the case of the ONS Solubility  Challenge book[3], use of the CSID enables a convenient way to calculate  various descriptors for displaying values in the book.&lt;br /&gt;&lt;br /&gt;In  addition, the compounds in the Reaction Attempts database are indexed on  ChemSpider as two Data Sources: ReactantsAttemptedReactions and  ProductsAttemptedReactions[13].  In this way a substructure search for  either reactants or products will identify indexed molecules.  Clicking  on the Syntheses tab in the ChemSpider record for a selected molecule  will then reveal a list of hyperlinks to the relevant laboratory  notebook pages.&lt;br /&gt;&lt;h3&gt;Organization of the Book&lt;/h3&gt;In keeping with  the layout of the ONS Solubility Challenge Book, the reactants are  listed in alphabetical order.  Each entry displays the list of reactions  where the reactant was used.  This includes a scheme with all reactants  and product as well as key metadata: the researcher, reaction type,  solvent, limiting reactant concentration, observation of a precipitate,  comments and a reference (links to the laboratory notebook page).&lt;br /&gt;&lt;br /&gt;In  this edition, only Ugi reactions are included.  The reaction schemes  are laid out in the following order: carboxylic acid, amine, aldehyde  and isonitrile.  This should allow for easy comparison between schemes  within a given record.  Reactions where the Ugi product was isolated and  characterized are marked with a green check and the percent yield is  noted.  Since the Ugi products do not have simple common names, they are  not included as separate entries.  However, all reactions where the  synthesis of a specific Ugi product was attempted can be found by  looking up the entries for any of the four reactants.&lt;br /&gt;&lt;br /&gt;Although  this compilation is not exhaustive, it does cover the vast majority of  reactions in the UsefulChem project to date.  Future editions will  include other reactions from UsefulChem and other sources.&lt;br /&gt;&lt;h3&gt;Archive&lt;/h3&gt;This edition is linked to the UsefulChem data archive (ZIP)[16],  (DVD)[17] and interactive hosted archive format[18], ReactionAttempts  (XLS)[19] and RXIDsReactionAttempts(XLS)[20] taken on 2010-04-27.&lt;h3&gt;References&lt;/h3&gt;1.  Open Notebook Science Wikipedia Entry  &lt;a href="http://en.wikipedia.org/wiki/Open_Notebook_Science"&gt;http://en.wikipedia.org/wiki/Open_Notebook_Science&lt;/a&gt;&lt;br /&gt;2. Open Notebook  Science Solubility Challenge Wiki &lt;a href="http://onschallenge.wikispaces.com/"&gt;http://onschallenge.wikispaces.com&lt;/a&gt;&lt;br /&gt;3.  Bradley, J.-C. First Edition of ONS Solubility Challenge Book  UsefulChem Blog (2009)&lt;br /&gt;&lt;a href="http://usefulchem.blogspot.com/2009/12/first-edition-of-ons-solubility.html"&gt;http://usefulchem.blogspot.com/2009/12/first-edition-of-ons-solubility.html&lt;/a&gt;&lt;br /&gt;4.  Open Notebook Science Solubility Challenge List of Experiments page  &lt;a href="http://onschallenge.wikispaces.com/list+of+experiments"&gt;http://onschallenge.wikispaces.com/list+of+experiments&lt;/a&gt;&lt;br /&gt;5. UsefulChem  Wiki &lt;a href="http://usefulchem.wikispaces.com/"&gt;http://usefulchem.wikispaces.com&lt;/a&gt;&lt;br /&gt;6. Ugi Reaction Wikipedia Entry  &lt;a href="http://en.wikipedia.org/wiki/Ugi_reaction"&gt;http://en.wikipedia.org/wiki/Ugi_reaction&lt;/a&gt;&lt;br /&gt;7. Dömling, A., &amp;amp; Ugi,  I. (2000). Multicomponent Reactions with Isocyanides. Angewandte Chemie  International English Edition, 39(18), 3168-3210. &lt;a href="http://www3.interscience.wiley.com/journal/73500473/abstract"&gt;http://www3.interscience.wiley.com/journal/73500473/abstract&lt;/a&gt;.&lt;br /&gt;8.  UsefulChem List of Experiments  &lt;a href="http://usefulchem.wikispaces.com/All+Reactions"&gt;http://usefulchem.wikispaces.com/All+Reactions&lt;/a&gt;&lt;br /&gt;9. Bradley, J.-C. Open  Notebook Science Challenge UsefulChem Blog (2008)&lt;br /&gt;&lt;a href="http://usefulchem.blogspot.com/2008/09/open-notebook-science-challenge.html"&gt;http://usefulchem.blogspot.com/2008/09/open-notebook-science-challenge.html&lt;/a&gt;&lt;br /&gt;10.  CombiUgiResults Google Spreadsheet &lt;a href="http://spreadsheets.google.com/ccc?key=plwwufp30hfpUERhse9y5Kw"&gt; http://spreadsheets.google.com/ccc?key=plwwufp30hfpUERhse9y5Kw&lt;/a&gt;&lt;br /&gt;11.  ReactionAttempts Google Spreadsheet&lt;br /&gt;&lt;a href="http://spreadsheets.google.com/ccc?key=0Ak1R8T6wt4YQdG9NejNLcDNUMkVBVURGM01TR0NxdXc"&gt;http://spreadsheets.google.com/ccc?key=0Ak1R8T6wt4YQdG9NejNLcDNUMkVBVURGM01TR0NxdXc&lt;/a&gt;&lt;br /&gt;12.  RXIDsReactionAttempts Google Spreadsheet&lt;br /&gt;&lt;a href="http://spreadsheets.google.com/ccc?key=0Ak1R8T6wt4YQdGVENVFMWjdzaGd2REJTTnA4RG5vblE"&gt;http://spreadsheets.google.com/ccc?key=0Ak1R8T6wt4YQdGVENVFMWjdzaGd2REJTTnA4RG5vblE&lt;/a&gt;&lt;br /&gt;13.  Bradley, J.-C. Reaction Attempts on ChemSpider UsefulChem Blog (2010)&lt;br /&gt;&lt;a href="http://usefulchem.blogspot.com/2010/03/reaction-attempts-on-chemspider.html"&gt;http://usefulchem.blogspot.com/2010/03/reaction-attempts-on-chemspider.html&lt;/a&gt;&lt;br /&gt;14.  SMILES Wikipedia Entry  &lt;a href="http://en.wikipedia.org/wiki/Simplified_molecular_input_line_entry_specification"&gt;http://en.wikipedia.org/wiki/Simplified_molecular_input_line_entry_specification&lt;/a&gt;&lt;br /&gt;15.  ChemSpider Web Site &lt;a href="http://www.chemspider.com/"&gt;http://www.chemspider.com/&lt;/a&gt;&lt;br /&gt;16. UC archive Drexel  server (ZIP)  &lt;a href="http://showme.physics.drexel.edu/usefulchem/archives/usefulchem2010-04-27.zip"&gt;http://showme.physics.drexel.edu/usefulchem/archives/usefulchem2010-04-27.zip&lt;/a&gt;&lt;br /&gt;17.  UC archive on lulu.com (DVD) &lt;a href="http://www.lulu.com/product/dvd/usefulchem-archive/10791847"&gt; http://www.lulu.com/product/dvd/usefulchem-archive/10791847&lt;/a&gt;&lt;br /&gt;18. UC interactive hosted format  &lt;a href="http://showme.physics.drexel.edu/usefulchem/archives/usefulchem2010-04-27/All%20Reactions.html"&gt;http://showme.physics.drexel.edu/usefulchem/archives/usefulchem2010-04-27/All%20Reactions.html&lt;/a&gt;&lt;br /&gt;19.  Bradley, J.-C.; Lang, A.. Reaction Attempts Reactants and Products.  UsefulChem. 2010-04-27. &lt;div&gt;URL:&lt;a href="http://spreadsheets.google.com/pub?key=toMz3Kp3T2EAUDF3MSGCquw&amp;amp;output=xls"&gt;http://spreadsheets.google.com/pub?key=toMz3Kp3T2EAUDF3MSGCquw&amp;amp;output=xls&lt;/a&gt;.  Accessed: 2010-04-27. &lt;/div&gt;&lt;div&gt;(Archived by WebCite® at  &lt;a href="http://www.webcitation.org/5pIsFEbT9"&gt;http://www.webcitation.org/5pIsFEbT9&lt;/a&gt;)&lt;br /&gt;20. Bradley, J.-C.; Lang, A..  Reaction Attempts RXIDs. UsefulChem. 2010-04-27. &lt;/div&gt;&lt;div&gt;URL:&lt;a href="http://spreadsheets.google.com/pub?key=teD5QLZ7shgvDBSNp8DnonQ&amp;amp;output=xls"&gt;http://spreadsheets.google.com/pub?key=teD5QLZ7shgvDBSNp8DnonQ&amp;amp;output=xls&lt;/a&gt;.  Accessed: 2010-04-27. &lt;/div&gt;&lt;div&gt;(Archived by WebCite® at  &lt;a href="http://www.webcitation.org/5pIs2eh62"&gt;http://www.webcitation.org/5pIs2eh62&lt;/a&gt;)&lt;/div&gt;&lt;/blockquote&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/14568785-523153569515697161?l=usefulchem.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://usefulchem.blogspot.com/feeds/523153569515697161/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=14568785&amp;postID=523153569515697161' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/14568785/posts/default/523153569515697161'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/14568785/posts/default/523153569515697161'/><link rel='alternate' type='text/html' href='http://usefulchem.blogspot.com/2010/04/reaction-attempts-book-edition-1-and.html' title='Reaction Attempts Book Edition 1 and UsefulChem Archive'/><author><name>Jean-Claude Bradley</name><uri>http://www.blogger.com/profile/09157209601339686455</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='31' height='32' src='http://photos1.blogger.com/img/267/3447/320/headshot.jpg'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://2.bp.blogspot.com/_R3jz5f_NkRo/S9ib-bhquzI/AAAAAAAAA0I/i176-NQyG_A/s72-c/reactionattemptsbook1.JPG' height='72' width='72'/><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-14568785.post-8072857411061458852</id><published>2010-04-20T19:44:00.000-04:00</published><updated>2010-04-20T19:44:47.551-04:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='nature precedings'/><category scheme='http://www.blogger.com/atom/ns#' term='lulu'/><category scheme='http://www.blogger.com/atom/ns#' term='onschallenge'/><category scheme='http://www.blogger.com/atom/ns#' term='usefulchem'/><title type='text'>ONS Books Wiki</title><content type='html'>I recently &lt;a href="http://usefulchem.blogspot.com/2010/03/nature-precedings-as-archiving-tool-for.html"&gt;reported on our use of Nature Precedings&lt;/a&gt; to archive different editions of the ONS Solubility Challenge book.  One of the advantages is that Precedings automatically alerts visitors if more recent editions exist. &lt;br /&gt;&lt;br /&gt;However, today I learned that there is a glitch to this system: it is not possible to link individual versions on Precedings to a corresponding book edition on LuLu.  That means that if you find yourself on the Nature Precedings entry and want to order the book from LuLu it isn't obvious at all how to do so.&lt;br /&gt;&lt;br /&gt;To resolve this issue once and for all I just created a wiki page (&lt;a href="http://onsbooks.wikispaces.com/"&gt;ONSbooks.wikispaces.com&lt;/a&gt;) to track every edition of the book.  This is actually better because I can also provide links to all the available data archives and blog posts corresponding to each edition.&lt;br /&gt;&lt;br /&gt;This is also the page where we will keep track of every edition of other Open Notebook Science books.  The next one to be published shortly is for the &lt;a href="http://usefulchem.wikispaces.com/"&gt;UsefulChem&lt;/a&gt; project.&lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://2.bp.blogspot.com/_R3jz5f_NkRo/S847oBB1unI/AAAAAAAAA0A/iyLc-VMw4MM/s1600/onsbooks.JPG"&gt;&lt;img style="display: block; margin: 0px auto 10px; text-align: center; cursor: pointer; width: 400px; height: 200px;" src="http://2.bp.blogspot.com/_R3jz5f_NkRo/S847oBB1unI/AAAAAAAAA0A/iyLc-VMw4MM/s400/onsbooks.JPG" alt="" id="BLOGGER_PHOTO_ID_5462368956703947378" border="0" /&gt;&lt;/a&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/14568785-8072857411061458852?l=usefulchem.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://usefulchem.blogspot.com/feeds/8072857411061458852/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=14568785&amp;postID=8072857411061458852' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/14568785/posts/default/8072857411061458852'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/14568785/posts/default/8072857411061458852'/><link rel='alternate' type='text/html' href='http://usefulchem.blogspot.com/2010/04/ons-books-wiki.html' title='ONS Books Wiki'/><author><name>Jean-Claude Bradley</name><uri>http://www.blogger.com/profile/09157209601339686455</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='31' height='32' src='http://photos1.blogger.com/img/267/3447/320/headshot.jpg'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://2.bp.blogspot.com/_R3jz5f_NkRo/S847oBB1unI/AAAAAAAAA0A/iyLc-VMw4MM/s72-c/onsbooks.JPG' height='72' width='72'/><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-14568785.post-5434468992441974845</id><published>2010-04-08T14:15:00.000-04:00</published><updated>2010-04-08T14:16:05.010-04:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='wald'/><category scheme='http://www.blogger.com/atom/ns#' term='open notebook science'/><category scheme='http://www.blogger.com/atom/ns#' term='science careers'/><title type='text'>Scientists Embrace Openness Article in Science Careers</title><content type='html'>&lt;a href="http://www.chelseawald.com/"&gt;Chelsea Wald&lt;/a&gt; just published an article in Science Careers: &lt;a href="http://sciencecareers.sciencemag.org/career_magazine/previous_issues/articles/2010_04_09/caredit.a1000036"&gt;Scientists Embrace Openness (April 9, 2010)&lt;/a&gt;.   She interviewed several people in the Open Science movement including &lt;a href="http://bobcat.genomecenter.ucdavis.edu/mediawiki/index.php/Main_Page"&gt;Jonathan Eisen&lt;/a&gt;, &lt;a href="http://www.linkedin.com/in/stevekoch"&gt;Steve Koch&lt;/a&gt;, &lt;a href="http://openwetware.org/wiki/User:Anthony_Salvagno"&gt;Anthony Salvagno&lt;/a&gt;, &lt;a href="http://two.ucdavis.edu/%7Ecboettig/index.html"&gt;Carl Boettiger&lt;/a&gt; and myself.&lt;br /&gt;&lt;br /&gt;The article covers Open Notebook Science, Open Data and associated themes.  I think it presents a view of the most commonly discussed advantages and disadvantages very well.&lt;br /&gt;&lt;br /&gt;One section was particularly relevant to an issue I recently &lt;a href="http://usefulchem.blogspot.com/2010/04/bipolar-electrodeposition-of-cds.html"&gt;posted about&lt;/a&gt; - (and discussed on &lt;a href="http://ff.im/ixztk"&gt;FriendFeed&lt;/a&gt;):&lt;br /&gt;&lt;blockquote&gt;Open Notebook Science advocates claim that being open may &lt;em&gt;protect&lt;/em&gt;  a scientist's ideas rather than exposing them to theft. Newton's  decision to conceal his findings within an anagram made it harder for  him to prove priority over rival Gottfried Leibniz. Open Notebook  scientists say all they need to do is point to their open notebooks to  show that they had an idea or found a result first. &lt;/blockquote&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/14568785-5434468992441974845?l=usefulchem.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://usefulchem.blogspot.com/feeds/5434468992441974845/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=14568785&amp;postID=5434468992441974845' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/14568785/posts/default/5434468992441974845'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/14568785/posts/default/5434468992441974845'/><link rel='alternate' type='text/html' href='http://usefulchem.blogspot.com/2010/04/scientists-embrace-openness-article-in.html' title='Scientists Embrace Openness Article in Science Careers'/><author><name>Jean-Claude Bradley</name><uri>http://www.blogger.com/profile/09157209601339686455</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='31' height='32' src='http://photos1.blogger.com/img/267/3447/320/headshot.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-14568785.post-7971542005611526058</id><published>2010-04-06T18:18:00.000-04:00</published><updated>2010-04-06T18:18:49.070-04:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='zazzle'/><category scheme='http://www.blogger.com/atom/ns#' term='onschallenge'/><category scheme='http://www.blogger.com/atom/ns#' term='open notebook science'/><category scheme='http://www.blogger.com/atom/ns#' term='t-shirt'/><title type='text'>ONS t-shirts from Zazzle</title><content type='html'>Inspired by &lt;a href="http://ff.im/fFcQR"&gt;Graham Steel&lt;/a&gt;, I just received my t-shirt with an Open Notebook Science Logo and a picture of our crystal on the cover of our&lt;a href="http://www.lulu.com/product/paperback/open-notebook-science-challenge-solubilities-of-organic-compounds-in-organic-solvents/6124961"&gt; ONS Solubility Challenge book&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://1.bp.blogspot.com/_R3jz5f_NkRo/S7uuuRe-eyI/AAAAAAAAAzw/lk8vF_dMe3c/s1600/zazzleonspic1.JPG"&gt;&lt;img style="display: block; margin: 0px auto 10px; text-align: center; cursor: pointer; width: 325px; height: 400px;" src="http://1.bp.blogspot.com/_R3jz5f_NkRo/S7uuuRe-eyI/AAAAAAAAAzw/lk8vF_dMe3c/s400/zazzleonspic1.JPG" alt="" id="BLOGGER_PHOTO_ID_5457147483480029986" border="0" /&gt;&lt;/a&gt;I was going to set up an ONS store but Zazzle does not permit zero  royalties (don't see the logic there).  But making up t-shirts on &lt;a href="http://www.zazzle.com/"&gt;Zazzle&lt;/a&gt; is super simple - just grab a  logo of your choice from the &lt;a href="http://onsclaims.wikispaces.com/"&gt;ONSclaims  wiki&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;Any other pic is your choice - this is the crystal from &lt;a href="http://usefulchem.wikispaces.com/Exp150"&gt;UCEXP150C&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://4.bp.blogspot.com/_R3jz5f_NkRo/S7uxv4sXBXI/AAAAAAAAAz4/XRkLperFkHE/s1600/xtal150C.JPG"&gt;&lt;img style="display: block; margin: 0px auto 10px; text-align: center; cursor: pointer; width: 272px; height: 239px;" src="http://4.bp.blogspot.com/_R3jz5f_NkRo/S7uxv4sXBXI/AAAAAAAAAz4/XRkLperFkHE/s400/xtal150C.JPG" alt="" id="BLOGGER_PHOTO_ID_5457150809719899506" border="0" /&gt;&lt;/a&gt;&lt;br /&gt;You can also order all kinds of other personalized things, including coffee cups.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/14568785-7971542005611526058?l=usefulchem.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://usefulchem.blogspot.com/feeds/7971542005611526058/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=14568785&amp;postID=7971542005611526058' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/14568785/posts/default/7971542005611526058'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/14568785/posts/default/7971542005611526058'/><link rel='alternate' type='text/html' href='http://usefulchem.blogspot.com/2010/04/ons-t-shirts-from-zazzle.html' title='ONS t-shirts from Zazzle'/><author><name>Jean-Claude Bradley</name><uri>http://www.blogger.com/profile/09157209601339686455</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='31' height='32' src='http://photos1.blogger.com/img/267/3447/320/headshot.jpg'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://1.bp.blogspot.com/_R3jz5f_NkRo/S7uuuRe-eyI/AAAAAAAAAzw/lk8vF_dMe3c/s72-c/zazzleonspic1.JPG' height='72' width='72'/><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-14568785.post-1038235817542340176</id><published>2010-04-02T17:41:00.001-04:00</published><updated>2010-04-02T17:42:35.281-04:00</updated><title type='text'>Bipolar Electrodeposition of CdS: Scientific Results in Limbo?</title><content type='html'>There has been a lot of discussion about the fear of getting "scooped" as a reason to be weary of using new scientific publication vehicles.&lt;br /&gt;&lt;br /&gt;These conversations can be somewhat frustrating since people don't necessarily use the same definition of that term.  Even dictionaries don't use the same language.  For example, &lt;a href="http://dictionary.reference.com/browse/scooping"&gt;Dictionary.com&lt;/a&gt; has:&lt;br /&gt;&lt;blockquote&gt;to get the better of (other publications, newscasters, etc.) by obtaining and publishing or broadcasting a news item, report, or story first: &lt;span style="font-style: italic;"&gt;They scooped all the other dailies with the story of the election fraud.&lt;/span&gt; &lt;/blockquote&gt;&lt;a href="http://en.wiktionary.org/wiki/scoop"&gt;Wiktionary&lt;/a&gt; has:&lt;br /&gt;&lt;blockquote&gt;To learn something, especially something worthy of a news article, before (someone else). &lt;span style="font-style: italic;"&gt;The paper across town scooped them on the City Hall scandal.&lt;/span&gt; &lt;/blockquote&gt;Depending on the definition used, one could argue that in the story I'm going to tell I got scooped or I did the scooping.  Some people use the term to imply that a malicious act has taken place.  The classic scenario is that one would blog about their research and a nefarious individual would appropriate their results and submit as their own for publication in a peer reviewed journal.&lt;br /&gt;&lt;br /&gt;That isn't scooping - it is fraud - and I want to be clear that this is not what I am suggesting happened here.&lt;br /&gt;&lt;br /&gt;Two months ago I was asked to review an article for the ACS journal Langmuir.  &lt;a href="http://docs.google.com/View?id=ddzgbfwh_202ddsgxddp"&gt;Before 2005&lt;/a&gt; one of my main research areas was bipolar electrodeposition and so I still get asked periodically to review papers in that field.&lt;br /&gt;&lt;br /&gt;Not only was this paper in that area but it reported on exactly the same experimental design we had previously reported: the bipolar electrodeposition of cadmium sulfide.  The solvent, reagents and substrate were different but it was the same material made by the same process.&lt;br /&gt;&lt;br /&gt;Although 2 of our papers were listed in the references the key report was not.  I noted this in my review but I was surprised to find that the paper was published without that correction.  I contacted the editor of Langmuir to find out what happened.  I thought perhaps the authors disagreed with some technical issue in our report.&lt;br /&gt;&lt;br /&gt;But what actually happened is that the authors requested that the reference be included in the supplementary data section instead of the regular reference section of the article because it was not a peer-reviewed article.  The editor thought that was a reasonable request and complied.&lt;br /&gt;&lt;br /&gt;I was quite surprised by this because Langmuir - or ACS journals in general - do not have a &lt;a href="http://pubs.acs.org/paragonplus/submission/langd5/langd5_authguide.pdf"&gt;formal policy on requiring references to be peer-reviewed&lt;/a&gt;.  In fact, a quick &lt;a href="http://pubs.acs.org/action/doSearch?action=search&amp;amp;searchText=%22unpublished+results%22&amp;amp;qsSearchArea=searchText&amp;amp;type=within&amp;amp;publication=40026039"&gt;search for "unpublished results" on Langmuir&lt;/a&gt; reveals many articles which use that as an acceptable reference.&lt;br /&gt;&lt;br /&gt;I could understand not wanting to cite a blog post with unsubstantiated claims but the &lt;a href="http://dspace.library.drexel.edu/retrieve/2023/article.pdf"&gt;document in question&lt;/a&gt; is very thorough - it includes a systematic review of prior art, detailed experimental description and characterization data.&lt;br /&gt;&lt;br /&gt;This is actually an example of a "SMIRP Knowledge Product".  It is a publication device that I used to make public single experimental results from work that was recorded by my research group in the &lt;a href="http://linkinghub.elsevier.com/retrieve/pii/S1535553504000747"&gt;SMIRP&lt;/a&gt; Knowledge Management System we used at the time as a laboratory notebook.&lt;br /&gt;&lt;br /&gt;The system was built on interlinked modules designed to produce "Knowledge Products" based on a combination of manual and automated workflows.  For example, the module generating reviews of prior art was based on "Knowledge Filters" uncovering the novelty of the experiment in question by filtering precedents for relevant aspects.&lt;br /&gt;&lt;br /&gt;In the case of "&lt;a href="http://dspace.library.drexel.edu/retrieve/2023/article.pdf"&gt;Bipolar Electrodeposition of Cadmium Sulfide onto a Tip of a Carbon Nanotube&lt;/a&gt;", the relevant knowledge filters were "Bipolar Electrodeposition", "Electrodeposition onto Carbon Nanotubes" and "Electrodeposition Approaches to Synthesize Cadmium Sulfide".  Other modules generated the experimental description, results, discussion, conclusion and reference sections.  In this way, not only could the experiment be fully documented but its context within the field could be extremely well defined in a systematic way.&lt;br /&gt;&lt;br /&gt;With a workflow to create these knowledge products we still needed a way to communicate them.  Back in 2003 options were far more limited than they are now.  But luckily (or so we thought) at this time Elsevier was running the &lt;a href="http://chem-bla-ics.blogspot.com/2007/06/preprint-servers-cps-failed-how-will.html"&gt;Chemistry Preprint Server&lt;/a&gt;.  They offered a place to upload documents such as these and provided a way of citing them.  We used the recommended citation format aggressively, including peer-reviewed articles such as &lt;a href="http://www.springerlink.com/content/wx74r643t4362186/"&gt;this one from Springer&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;However, &lt;a href="http://preprint.chemweb.com/chemistry/0312001"&gt;attempting to access these documents today&lt;/a&gt; using the official links gives this as a result:&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://3.bp.blogspot.com/_R3jz5f_NkRo/S7ZFQIxuRpI/AAAAAAAAAzo/4QHXGfTHfmA/s1600/cpstimeout.JPG"&gt;&lt;img style="display: block; margin: 0px auto 10px; text-align: center; cursor: pointer; width: 400px; height: 65px;" src="http://3.bp.blogspot.com/_R3jz5f_NkRo/S7ZFQIxuRpI/AAAAAAAAAzo/4QHXGfTHfmA/s400/cpstimeout.JPG" alt="" id="BLOGGER_PHOTO_ID_5455624142142195346" border="0" /&gt;&lt;/a&gt;In what is probably one of the worst scientific publisher PR moves in recent memory, Elsevier broke all the hyperlinks they told their authors to use for citations.  If you do some research you will find that the documents are still available from &lt;a href="http://www.sciencedirect.com/preprintarchive"&gt;http://www.sciencedirect.com/preprintarchive&lt;/a&gt; but you have to register to even perform a search to find them!  This requirement removes them from indexing by Google.  Coupled with the broken citation links these documents are now very far removed from likely discovery.&lt;br /&gt;&lt;br /&gt;The story would have ended there were it not for redundancy.  I also uploaded copies to &lt;a href="http://dspace.library.drexel.edu/retrieve/2023/article.pdf"&gt;Drexel's institutional repository&lt;/a&gt; (DSpace), which are happily very well indexed by &lt;a href="http://www.google.com/search?q=%22bipolar+electrodeposition+of+cadmium+sulfide%22&amp;amp;hl=en&amp;amp;client=firefox-a&amp;amp;hs=EO8&amp;amp;rls=org.mozilla:en-US:official&amp;amp;filter=0"&gt;Google&lt;/a&gt; - and perhaps more critically - by &lt;a href="http://scholar.google.com/scholar?hl=en&amp;amp;q=%22bipolar+electrodeposition+of+cadmium+sulfide%22&amp;amp;btnG=Search&amp;amp;as_sdt=800000000000&amp;amp;as_ylo=&amp;amp;as_vis=1"&gt;Google Scholar&lt;/a&gt;.  I had not fully appreciated the value of institutional repositories until I noticed that they are treated by some important databases as collections of scholarly works.&lt;br /&gt;&lt;br /&gt;&lt;span style="font-style: italic;"&gt;So what are the lessons for all the stakeholders?&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;For those who have scientific results that can be published as articles and MUST be published in ACS journals - send your manuscripts in.  If you post them on your institutional repository first &lt;span style="font-weight: bold;"&gt;they may end up in limbo -they DO qualify as publications preventing you from submitting them as manuscripts to ACS journals - and they may NOT qualify as publications when you try to cite them in ACS journals&lt;/span&gt;.&lt;br /&gt;&lt;br /&gt;But what about scientific result that cannot be published as manuscripts.  The Knowledge Products are unlikely to be accepted by regular journals for several reasons.  First they communicate only a single experimental result.  Articles generally require narratives.  Second, if some of them do get published in traditional journals, there will be copyright conflicts.  The Knowledge Filters for the review of prior art will be identical for similar experiments.  For example the Knowledge Product for the "&lt;a href="http://dspace.library.drexel.edu/retrieve/2023/article.pdf"&gt;Bipolar Electrodeposition of CdS on one Tip of a Carbon Nanotube&lt;/a&gt;" Will have identical prior art to the "&lt;a href="http://idea.library.drexel.edu/bitstream/1860/257/1/article.pdf"&gt;Bipolar Electrodeposition of Cd on one Tip of a Carbon Nanotube&lt;/a&gt;" except for the section of the electrodeposition of Cd or CdS.  And no - I don't think it is a good use of my time to move words around for every document to get around copyright issues.&lt;br /&gt;&lt;br /&gt;Some of the Knowledge Products were incorporated into full articles when it made sense.  But many, including the one under discussion here was not.  So publishing this work as part of a full article was never even an option.  There are so many scientific results like this that fall into that kind of limbo.  Even today there are no really good publication vehicles for these types of results - besides institutional repositories.  &lt;a href="http://plosone.org/"&gt;PLoS ONE&lt;/a&gt; might come to mind as an option but I don't think it fits their mandate to publish single experiments like this.  And if they did it would be extremely expensive if they did not waive author fees every time.  &lt;a href="http://cssp.chemspider.com/"&gt;ChemSpider Synthetic Pages&lt;/a&gt; and similar initiatives might work for organic chemistry but this is materials science.&lt;br /&gt;&lt;br /&gt;Considering all of these difficulties over the years is really the main motivation behind our migration away from a login based system like SMIRP to our adoption of &lt;a href="http://en.wikipedia.org/wiki/Open_Notebook_Science"&gt;Open Notebook Science&lt;/a&gt; based on a wikis and blogs, which are very efficiently indexed in real time by Google and thus easily discoverable without additional formatting work.&lt;br /&gt;&lt;br /&gt;For publishers and authors, do you really think it is in your best interest to have a &lt;a href="http://pubs.acs.org/doi/abs/10.1021/la100292u"&gt;statement in the introduction&lt;/a&gt; about prior art say "&lt;span style="font-weight: bold;"&gt;To our knowledge, reports of bipolar electrodeposition of compounds have  not been previously published.&lt;/span&gt;" when a simple &lt;a href="http://www.google.com/search?q=bipolar+electrodeposition+of+cds&amp;amp;ie=utf-8&amp;amp;oe=utf-8&amp;amp;aq=t&amp;amp;rls=org.mozilla:en-US:official&amp;amp;client=firefox-a"&gt;Google search&lt;/a&gt; shows that is not the case for the compound you are electrodepositing?  I suppose the argument is that the term "published" is used with the technical interpretation of being "published under peer-review". It would have been better to at least make that explicit to avoid confusion.  But the bottom line is that someone wanting to perform bipolar electrodeposition of cadmium sulfide will quickly find both reports and will learn two ways of doing it.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/14568785-1038235817542340176?l=usefulchem.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://usefulchem.blogspot.com/feeds/1038235817542340176/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=14568785&amp;postID=1038235817542340176' title='6 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/14568785/posts/default/1038235817542340176'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/14568785/posts/default/1038235817542340176'/><link rel='alternate' type='text/html' href='http://usefulchem.blogspot.com/2010/04/bipolar-electrodeposition-of-cds.html' title='Bipolar Electrodeposition of CdS: Scientific Results in Limbo?'/><author><name>Jean-Claude Bradley</name><uri>http://www.blogger.com/profile/09157209601339686455</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='31' height='32' src='http://photos1.blogger.com/img/267/3447/320/headshot.jpg'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://3.bp.blogspot.com/_R3jz5f_NkRo/S7ZFQIxuRpI/AAAAAAAAAzo/4QHXGfTHfmA/s72-c/cpstimeout.JPG' height='72' width='72'/><thr:total>6</thr:total></entry><entry><id>tag:blogger.com,1999:blog-14568785.post-8425911014809808926</id><published>2010-04-01T14:00:00.001-04:00</published><updated>2010-04-01T14:02:37.016-04:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='chemtiles'/><category scheme='http://www.blogger.com/atom/ns#' term='beer'/><title type='text'>Beer Chemistry Quiz on ChemTiles</title><content type='html'>During the &lt;a href="http://getcheminfo.wikispaces.com/"&gt;Chemical Information Retrieval course&lt;/a&gt; I taught in the Fall of 2009, Alex Bilinski did a project on the chemistry of beer.  He created a set of images with information that is either true or false in any context.&lt;br /&gt;&lt;br /&gt;I just added these images of beer chemistry to the &lt;a href="http://chemtiles.wikispaces.com/"&gt;ChemTiles game&lt;/a&gt;, which I am actively using in my current&lt;a href="http://chem241.wikispaces.com/"&gt; Organic Chemistry I (CHEM241) course&lt;/a&gt;.  Andrew Lang has &lt;a href="http://usefulchem.blogspot.com/2009/04/chemtiles-game.html"&gt;made it very easy&lt;/a&gt; to add content to the game by simply uploading to a &lt;a href="http://www.flickr.com/groups/orgogame/"&gt;Flickr group&lt;/a&gt;.  The category is determined by the Flickr tag.&lt;br /&gt;&lt;br /&gt;I won't be testing my organic students on beer but they might find it fun to play that category.  Alex wrote a fascinating report "&lt;a href="http://getcheminfo.wikispaces.com/abilinski"&gt;&lt;span style="font-weight: bold;"&gt;Beer Flavor Compounds and Detection Methods&lt;/span&gt;&lt;/a&gt;" that can be used as a study guide for the quiz.&lt;br /&gt;&lt;br /&gt;My students are currently competing on the topics of Lewis structures, hybridization, Newman projections and nomenclature.  They simply need to sign in by entering "contest1" for the group name.  The student in my class with the &lt;a href="http://lxsrv7.oru.edu/%7Ealang/chem/groupscores.php?group=contest1"&gt;highest score for the contest1 group&lt;/a&gt; at the end of class (10:50 AM) on April 9, 2010 will win an organic chemistry textbook.  As I have done in &lt;a href="http://drexel-coas-elearning.blogspot.com/2009/06/spring-2009-term-post-mortem-of-chem.html"&gt;previous classes&lt;/a&gt; I'll run a few of these contests over the term with increasing amounts of course content.&lt;br /&gt;&lt;br /&gt;It turns out that the ChemTiles game is very convenient to play on a &lt;a href="http://phones.verizonwireless.com/motorola/droid/#/home"&gt;Droid phone&lt;/a&gt; (and presumably on an iPhone although I have not checked that yet).  For many students this might be a preferred way to review material before tests when on the move.  I'll find out at the end of the term.&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://2.bp.blogspot.com/_R3jz5f_NkRo/S7TcmGBrUvI/AAAAAAAAAzY/0OKITFttQrc/s1600/chemtilesoptions.PNG"&gt;&lt;img style="display: block; margin: 0px auto 10px; text-align: center; cursor: pointer; width: 400px; height: 224px;" src="http://2.bp.blogspot.com/_R3jz5f_NkRo/S7TcmGBrUvI/AAAAAAAAAzY/0OKITFttQrc/s400/chemtilesoptions.PNG" alt="" id="BLOGGER_PHOTO_ID_5455227595663299314" border="0" /&gt;&lt;/a&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://1.bp.blogspot.com/_R3jz5f_NkRo/S7TcrxAbl-I/AAAAAAAAAzg/fX53uXHOxMA/s1600/beerex1.PNG"&gt;&lt;img style="display: block; margin: 0px auto 10px; text-align: center; cursor: pointer; width: 400px; height: 218px;" src="http://1.bp.blogspot.com/_R3jz5f_NkRo/S7TcrxAbl-I/AAAAAAAAAzg/fX53uXHOxMA/s400/beerex1.PNG" alt="" id="BLOGGER_PHOTO_ID_5455227693100144610" border="0" /&gt;&lt;/a&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/14568785-8425911014809808926?l=usefulchem.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://usefulchem.blogspot.com/feeds/8425911014809808926/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=14568785&amp;postID=8425911014809808926' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/14568785/posts/default/8425911014809808926'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/14568785/posts/default/8425911014809808926'/><link rel='alternate' type='text/html' href='http://usefulchem.blogspot.com/2010/04/beer-chemistry-quiz-on-chemtiles.html' title='Beer Chemistry Quiz on ChemTiles'/><author><name>Jean-Claude Bradley</name><uri>http://www.blogger.com/profile/09157209601339686455</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='31' height='32' src='http://photos1.blogger.com/img/267/3447/320/headshot.jpg'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://2.bp.blogspot.com/_R3jz5f_NkRo/S7TcmGBrUvI/AAAAAAAAAzY/0OKITFttQrc/s72-c/chemtilesoptions.PNG' height='72' width='72'/><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-14568785.post-1912150961440188595</id><published>2010-03-28T17:07:00.001-04:00</published><updated>2010-03-29T11:53:41.445-04:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='#acs'/><category scheme='http://www.blogger.com/atom/ns#' term='second life'/><category scheme='http://www.blogger.com/atom/ns#' term='visualization'/><category scheme='http://www.blogger.com/atom/ns#' term='#acs_sf'/><category scheme='http://www.blogger.com/atom/ns#' term='chemistry'/><title type='text'>Back from ACS San Francisco Meeting 2010</title><content type='html'>Last week I co-hosted a symposium with &lt;a href="http://www.redbrick.dcu.ie/%7Enoel/"&gt;Noel O'Boyle&lt;/a&gt; and &lt;a href="http://www.linkedin.com/in/andrewlang"&gt;Andrew Lang&lt;/a&gt; on "&lt;a href="http://abstracts.acs.org/chem/239nm/program/divisionindex.php?act=presentations&amp;amp;val=Visual+Analysis+of+Chemical+Data&amp;amp;ses=Visual+Analysis+of+Chemical+Data&amp;amp;prog=90"&gt;Visual Analysis of Chemical Data&lt;/a&gt;" at the American Chemical Society meeting in San Francisco (March 22-23, 2010).  The ACS recorded almost every talk in our symposium and I'll provide the link here when available (they tell me mid April).&lt;br /&gt;&lt;br /&gt;&lt;a href="http://www.linkedin.com/in/ldorland"&gt;Liz Dorland&lt;/a&gt; kicked off the symposium with a great keynote presentation covering effective visualization in a number of fields and the special challenges faced in chemistry.  There were several talks about QSAR and I particularly enjoyed &lt;a href="http://uk.linkedin.com/in/edmundchampness"&gt;Edmund Chapness&lt;/a&gt; who incorporated the visualization of confidence in predictions with an intuitive colored molecule map.  &lt;a href="http://hutchison.chem.pitt.edu/Welcome.html"&gt;Geoff Hutchison&lt;/a&gt; gave an informative overview of &lt;a href="http://avogadro.openmolecules.net/wiki/Main_Page"&gt;Avogadro&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;Perhaps the biggest revelation was the "&lt;a href="http://tripod.nih.gov"&gt;iTunes for Cheminformatics&lt;/a&gt;" project by NIH researcher &lt;a href="http://www.ncgc.nih.gov/about/ajadhav.html"&gt;Ajit Jadhav&lt;/a&gt; (leading the team which includes &lt;a href="http://www.rguha.net/"&gt;Rajarshi Guha&lt;/a&gt;).  The alpha version will be available for testing on April 5, 2010 and many of us are eagerly anticipating being able to give it a spin.  From what I understand the system will automatically be able to identify scaffolds (fragments) in a collection of molecules and make it easy to search for and filter assay results.&lt;br /&gt;&lt;br /&gt;&lt;a href="http://pubs.acs.org/cen/staff/biocd.html"&gt;Carmen Drahl&lt;/a&gt; covered in minute by minute detail announcements about new drug candidates on &lt;a href="http://search.twitter.com/search?q=%23acsmedi"&gt;Twitter&lt;/a&gt;.  Following the FriendFeed feed for the conference flagged a very interesting post about a &lt;a href="http://www.eurekalert.org/pub_releases/2010-03/acs-fm030810.php"&gt;Cold Fusion Symposium&lt;/a&gt; that was being held.  In spite of the notorious lack of wireless availability at ACS conferences, attendees seem to be making due with accessing their social networks via their cell phone devices.&lt;br /&gt;&lt;br /&gt;Andrew Lang and I spoke about Visualizing Chemistry in Second Life - our slides are here - hopefully I'll be able to post the recordings as well soon.&lt;br /&gt;&lt;div style="width: 425px;" id="__ss_3572261"&gt;&lt;strong style="display: block; margin: 12px 0pt 4px;"&gt;&lt;a href="http://www.slideshare.net/jcbradley/visualizing-chemistry-in-second-life-acs-sp2010" title="Visualizing Chemistry in Second Life ACS SP2010"&gt;Visualizing Chemistry in Second Life ACS SP2010&lt;/a&gt;&lt;/strong&gt;&lt;object width="425" height="355"&gt;&lt;param name="movie" value="http://static.slidesharecdn.com/swf/ssplayer2.swf?doc=bradleylangacs2010-100327124200-phpapp02&amp;amp;stripped_title=visualizing-chemistry-in-second-life-acs-sp2010"&gt;&lt;param name="allowFullScreen" value="true"&gt;&lt;param name="allowScriptAccess" value="always"&gt;&lt;embed src="http://static.slidesharecdn.com/swf/ssplayer2.swf?doc=bradleylangacs2010-100327124200-phpapp02&amp;amp;stripped_title=visualizing-chemistry-in-second-life-acs-sp2010" type="application/x-shockwave-flash" allowscriptaccess="always" allowfullscreen="true" width="425" height="355"&gt;&lt;/embed&gt;&lt;/object&gt;&lt;div style="padding: 5px 0pt 12px;"&gt;View more &lt;a href="http://www.slideshare.net/"&gt;presentations&lt;/a&gt; from &lt;a href="http://www.slideshare.net/jcbradley"&gt;Jean-Claude Bradley&lt;/a&gt;.&lt;/div&gt;&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/14568785-1912150961440188595?l=usefulchem.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://usefulchem.blogspot.com/feeds/1912150961440188595/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=14568785&amp;postID=1912150961440188595' title='2 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/14568785/posts/default/1912150961440188595'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/14568785/posts/default/1912150961440188595'/><link rel='alternate' type='text/html' href='http://usefulchem.blogspot.com/2010/03/back-from-acs-san-francisco-meeting.html' title='Back from ACS San Francisco Meeting 2010'/><author><name>Jean-Claude Bradley</name><uri>http://www.blogger.com/profile/09157209601339686455</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='31' height='32' src='http://photos1.blogger.com/img/267/3447/320/headshot.jpg'/></author><thr:total>2</thr:total></entry><entry><id>tag:blogger.com,1999:blog-14568785.post-8724960493746138637</id><published>2010-03-27T10:34:00.000-04:00</published><updated>2010-03-27T10:35:32.317-04:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='onschallenge'/><category scheme='http://www.blogger.com/atom/ns#' term='teaching'/><category scheme='http://www.blogger.com/atom/ns#' term='open notebook science'/><category scheme='http://www.blogger.com/atom/ns#' term='open access'/><category scheme='http://www.blogger.com/atom/ns#' term='screencasting'/><category scheme='http://www.blogger.com/atom/ns#' term='open data'/><category scheme='http://www.blogger.com/atom/ns#' term='wikis'/><category scheme='http://www.blogger.com/atom/ns#' term='open science'/><category scheme='http://www.blogger.com/atom/ns#' term='usefulchem'/><title type='text'>Education 2.0: Leveraging Collaborative Tools for Teaching</title><content type='html'>On March 25, 2010 I presented at the &lt;a href="http://www.drexel.edu/irt/news/events/eLearningConf/"&gt;Drexel E-Learning 2.0 Conference&lt;/a&gt; on "&lt;span style="font-weight: bold;"&gt;Education 2.0: Leveraging Collaborative Tools for Teaching&lt;/span&gt;".  It was an opportunity to update my slides with what &lt;a href="http://drexel-coas-elearning.blogspot.com/2010/01/post-mortem-of-cheminfo-retrieval-class.html"&gt;I did and learned from the Chemical Information Retrieval course&lt;/a&gt; I taught over the Fall 2009 term.&lt;br /&gt;&lt;br /&gt;I described using a wiki to organize course content and to allow students to contribute useful resources.  Their assignments were also designed to be useful to other students in the class as well as to the general library and chemistry community.&lt;br /&gt;&lt;br /&gt;I covered using wikis and other collaborative tools to mentor students doing laboratory research with Open Notebook Science.  At the end I provided a quick overview of using games and Second Life for educational purposes.&lt;br /&gt;&lt;div style="width: 425px;" id="__ss_3551004"&gt;&lt;strong style="display: block; margin: 12px 0pt 4px;"&gt;&lt;a href="http://www.slideshare.net/jcbradley/education-20-leveraging-collaborative-tools-for-teaching" title="Education 2.0: Leveraging Collaborative Tools for Teaching"&gt;Education 2.0: Leveraging Collaborative Tools for Teaching&lt;/a&gt;&lt;/strong&gt;&lt;object width="425" height="355"&gt;&lt;param name="movie" value="http://static.slidesharecdn.com/swf/ssplayer2.swf?doc=edu20drexel10-100325101655-phpapp01&amp;amp;stripped_title=education-20-leveraging-collaborative-tools-for-teaching"&gt;&lt;param name="allowFullScreen" value="true"&gt;&lt;param name="allowScriptAccess" value="always"&gt;&lt;embed src="http://static.slidesharecdn.com/swf/ssplayer2.swf?doc=edu20drexel10-100325101655-phpapp01&amp;amp;stripped_title=education-20-leveraging-collaborative-tools-for-teaching" type="application/x-shockwave-flash" allowscriptaccess="always" allowfullscreen="true" width="425" height="355"&gt;&lt;/embed&gt;&lt;/object&gt;&lt;div style="padding: 5px 0pt 12px;"&gt;View more &lt;a href="http://www.slideshare.net/"&gt;presentations&lt;/a&gt; from &lt;a href="http://www.slideshare.net/jcbradley"&gt;Jean-Claude Bradley&lt;/a&gt;.&lt;/div&gt;&lt;/div&gt;&lt;br /&gt;&lt;object width="480" height="400"&gt;&lt;param name="movie" value="http://www.scivee.tv/flash/embedCast.swf"&gt;&lt;param name="allowfullscreen" value="true"&gt;&lt;param name="flashvars" value="id=16903&amp;amp;type=3"&gt;&lt;embed src="http://www.scivee.tv/flash/embedCast.swf" allowfullscreen="true" flashvars="id=16903&amp;amp;type=3" width="480" height="400"&gt;&lt;/embed&gt;&lt;/object&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/14568785-8724960493746138637?l=usefulchem.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://usefulchem.blogspot.com/feeds/8724960493746138637/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=14568785&amp;postID=8724960493746138637' title='1 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/14568785/posts/default/8724960493746138637'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/14568785/posts/default/8724960493746138637'/><link rel='alternate' type='text/html' href='http://usefulchem.blogspot.com/2010/03/education-20-leveraging-collaborative.html' title='Education 2.0: Leveraging Collaborative Tools for Teaching'/><author><name>Jean-Claude Bradley</name><uri>http://www.blogger.com/profile/09157209601339686455</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='31' height='32' src='http://photos1.blogger.com/img/267/3447/320/headshot.jpg'/></author><thr:total>1</thr:total></entry><entry><id>tag:blogger.com,1999:blog-14568785.post-1056954732406011068</id><published>2010-03-20T12:10:00.002-04:00</published><updated>2010-03-20T12:14:27.065-04:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='reactionattempts'/><category scheme='http://www.blogger.com/atom/ns#' term='onschallenge'/><category scheme='http://www.blogger.com/atom/ns#' term='XML'/><category scheme='http://www.blogger.com/atom/ns#' term='Ugi reaction'/><category scheme='http://www.blogger.com/atom/ns#' term='usefulchem'/><title type='text'>Reaction Attempts on ChemSpider</title><content type='html'>Just as we have done with the &lt;a href="http://onschallenge.wikispaces.com/list+of+experiments"&gt;Open Notebook Science Solubility Challenge&lt;/a&gt;, we are adding more structure to the &lt;a href="http://usefulchem.wikispaces.com/"&gt;UsefulChem&lt;/a&gt; project.&lt;br /&gt;&lt;br /&gt;This is a little bit more difficult because the &lt;a href="http://usefulchem.wikispaces.com/All+Reactions"&gt;UC notebook&lt;/a&gt; represents mainly chemical reactions, while the &lt;a href="http://onschallenge.wikispaces.com/list+of+experiments"&gt;ONSC data&lt;/a&gt; are simply solubility measurements.  Since most of the UC reactions are Ugi reactions, we have been keeping summary data in the &lt;a href="http://spreadsheets.google.com/ccc?key=plwwufp30hfpUERhse9y5Kw"&gt;CombiUgi&lt;/a&gt; Google Spreadsheet, which is completely specialized for this reaction and variations in our reaction conditions.  This lets us search or sort by reactant, concentration, solvent, etc.  However, we cannot do substructure searching directly using the CombiUgi sheet and we cannot add other types of reactions.&lt;br /&gt;&lt;br /&gt;In order to enable substructure searching and add other reactions, Antony Williams has created 2 new data sources in &lt;a href="http://www.chemspider.com/"&gt;ChemSpider&lt;/a&gt;: Attempted Reactions - Reactants and Attempted Reactions - Products.  The data represented in the CombiUgi sheet has been restructured into 2 new Google Spreadsheets: &lt;a href="https://spreadsheets.google.com/ccc?key=0Ak1R8T6wt4YQdGVENVFMWjdzaGd2REJTTnA4RG5vblE&amp;amp;hl=en"&gt;RXIDs Reaction Attempts&lt;/a&gt; and &lt;a href="https://spreadsheets.google.com/ccc?key=0Ak1R8T6wt4YQdG9NejNLcDNUMkVBVURGM01TR0NxdXc&amp;amp;hl=en"&gt;Reaction Attempts&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;Both of these sheets use a common Reaction ID to tie together an unlimited number of reactants and products (Reaction Attempts) and other pertinent reaction conditions (RXIDs Reaction Attempts), such as the concentration of the limiting reagent, the solvent, yield, notes, etc.&lt;br /&gt;&lt;br /&gt;Currently only the data in the Reaction Attempts sheet has been imported into ChemSpider.  But this alone gives us new functionality: we can perform substructure searches for either reactants or products.&lt;br /&gt;&lt;br /&gt;For example lets say we want to search for all reaction attempts using aromatic carboxylic acids.  First we simply do a substructure search on ChemSpider drawing benzoic acid and selecting Attempted Reactions - Reactants as the Data Source.&lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://3.bp.blogspot.com/_R3jz5f_NkRo/S6TklyMtpUI/AAAAAAAAAzA/Rb7DVrz5Eqk/s1600-h/aromaticacidsearch.PNG"&gt;&lt;img style="display: block; margin: 0px auto 10px; text-align: center; cursor: pointer; width: 400px; height: 345px;" src="http://3.bp.blogspot.com/_R3jz5f_NkRo/S6TklyMtpUI/AAAAAAAAAzA/Rb7DVrz5Eqk/s400/aromaticacidsearch.PNG" alt="" id="BLOGGER_PHOTO_ID_5450732786806531394" border="0" /&gt;&lt;/a&gt;&lt;br /&gt;This pulls up 8 compounds that were used as a reactant at least once.&lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://1.bp.blogspot.com/_R3jz5f_NkRo/S6TlBDOG_wI/AAAAAAAAAzI/htjCWz_ACpE/s1600-h/aromaticacidresult.PNG"&gt;&lt;img style="display: block; margin: 0px auto 10px; text-align: center; cursor: pointer; width: 400px; height: 336px;" src="http://1.bp.blogspot.com/_R3jz5f_NkRo/S6TlBDOG_wI/AAAAAAAAAzI/htjCWz_ACpE/s400/aromaticacidresult.PNG" alt="" id="BLOGGER_PHOTO_ID_5450733255232257794" border="0" /&gt;&lt;/a&gt;&lt;br /&gt;Clicking on one of these hits brings us to the ChemSpider entry. Selecting the Syntheses tab in the Data Sources shows links to the lab notebook pages where this compound was used.&lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://2.bp.blogspot.com/_R3jz5f_NkRo/S6Tm_DvbLZI/AAAAAAAAAzQ/I9f03ACJvoY/s1600-h/4-hydroxydata.PNG"&gt;&lt;img style="display: block; margin: 0px auto 10px; text-align: center; cursor: pointer; width: 400px; height: 308px;" src="http://2.bp.blogspot.com/_R3jz5f_NkRo/S6Tm_DvbLZI/AAAAAAAAAzQ/I9f03ACJvoY/s400/4-hydroxydata.PNG" alt="" id="BLOGGER_PHOTO_ID_5450735420035509650" border="0" /&gt;&lt;/a&gt;&lt;br /&gt;The system is configured to accept reactions with fully characterized products to reactions where products were not isolated or even reactions in progress.  I'm not using the term "failed reaction" because the term has no meaning without the context of the objective of the reaction.  In our Ugi reactions we are typically looking for the product to precipitate out.  By our criteria, reactions where no precipitate was observed after a few days would be classified as "failed".  However it may well be that product was formed but did not precipitate.  Even when product is obtained, some might consider 30% isolated yields to be failures, while others would not. Context is everything in qualifying success.&lt;br /&gt;&lt;br /&gt;But even with a clear definition of success, many reactions are simply neither successful or failures.  Reactions in progress fall into that category. The student may have even completed the reaction but not yet analyzed the results.  But that doesn't matter so much if the raw monitoring data has been provided.&lt;br /&gt;&lt;br /&gt;The general structure of this database means that we can add not only our reactions but those of anybody.  Even in cases where someone does not have an Open Notebook, just providing a link to contact information of the researcher could be very useful to start a conversation.  In that case the system would function more as a social networking platform - connecting researchers who work on similar molecules.&lt;br /&gt;&lt;br /&gt;I don't think people are willing to do extensive write-ups for what they consider to be "failed experiments".  However, if all that is requested is the list of reactants and target products that may not be such a burden if it potentially means connecting up with another researcher who can help or even start a new collaboration.&lt;br /&gt;&lt;br /&gt;Currently ChemSpider does not take into account the information in the RXIDs Reaction Attempts sheet but we hope to be able to make use of that at some point.  That would let us do more sophisticated searches like - search for any reaction attempt where an aromatic carboxylic acid was reacted with an aliphatic amine in methanol.&lt;br /&gt;&lt;br /&gt;Andrew Lang has also provided the information of the 2 spreadsheets as XML:&lt;br /&gt;&lt;a href="http://showme.physics.drexel.edu/onsc/Services/OData.svc/Reactions/" target="_blank"&gt;http://showme.physics.drexel.&lt;wbr&gt;edu/onsc/Services
