Macs in Chemistry

Insanely Great Science

iOS12 adoption

 

The adoption of iOS is being monitored by mixpanel and it currently stands at around 20%. Rather slower uptake than iOS11 but it looks like there is a clean transition from iOS11 to iOS12.

ios12adoption

The comparison with Android OS adoption is interesting.

androidOSadoption


Comments

Reenabling Extensions after Safari 12 update

 

The latest update to Safari (Version 12) brings a range of features intended to improve online security and privacy. Unfortunately one of the consequences is that only Safari Extensions available from the App Store are enabled and you will get a message that Safari no longer supports unsafe extensions and you are directed to the App Store.

extension

Whilst I'm sure that extensions from major developers will migrate to the App Store I suspect that those Extensions provided by scientists may well not make the transition. This is a shame because some are very useful. However you can build the extension yourself to get around the problem.

This tutorial shows how to extract the code from an existing extension and then build it using Extension Builder.



Comments

Installing Cheminformtics packages on a Mac

 

A while back I wrote a very popular page describing how to install a wide variety of chemiformatics packages on a Mac, since there have been some changes with Homebrew which have meant that a few of the scientific applications are no longer available so I've decided to rewrite the page on installing the missing packages using Anaconda.

I've also included a list of quick demos so you can everything is working as expected.

Full details are here

Packages include:

  • OpenBabel
  • RDKit
  • brew install cdk
  • chemspot
  • indigo
  • inchi
  • opsin
  • osra
  • pymol
  • oddt

In addition to gfortran and a selection of developers tools.

Comments

iChemLabs and SciFinder-n

 

Just got this update from iChemLabs the developers of ChemDoodle.

"iChemLabs customized one of the leading chemistry sketchers and graphics drawing tools for the new SciFindern interface,” said Kevin Theisen, President, iChemLabs, LLC. “Our collaboration with CAS has been very successful in helping researchers develop and visualize better chemical structures more rapidly within SciFindern, and we look forward to continuing to provide SciFindern users with this best-in-class experience.”


Comments

Open Source Cheminformatics Tookits

 

When I wrote the article entitled A few thoughts on scientific software one of the responses I got was that people did not know about the existence of open-source chemistry toolkits so I thought I'd publish a page that hopefully prevent stop people reinventing the wheel. Here are four open-source toolkits that I'm aware of and if I've missed any, my apologies and send me details. Listing of Open-source cheminformatics toolkits


Comments

MayaChemtools

 

MayaChemTools now includes a collection of python scripts for PyMol

The command line Python scripts based on PyMOL provide functionality for the following tasks:

Aligning macromolecules Splitting macromolecules into chains and ligands Listing information about macromolecules Calculation of physicochemical properties Comparison of marcromolecules based on RMSD Conversion between different ligand file formats Visualizing X-ray electron density and cryo-EM density Visualizing macromolecules in terms of chains, ligands, and ligand binding pockets

MayaChemTools is a growing collection of Perl and Python scripts, modules, and classes to support a variety of day-to-day computational discovery needs.


Comments

Optibrium and Intellegens Collaborate

Optibrium and Intellegens Collaborate to Apply Novel Deep Learning Methods to Drug Discovery

Partnership combines Intellegens’ proprietary AI technology with Optibrium’s expertise in predictive modelling and compound design. Optibrium provides elegant software solutions for small molecule design, optimisation and data analysis. By leveraging Intellegens’ AlchemiteTM technology, the partnership will create a “next generation” predictive modelling platform that is capable of delivering more accurate predictions and enabling better decision-making when it comes to the optimisation of compounds.

Read more.


Comments

MGMS Young Modellers’ Forum 2018

 

Molecular Graphics and Modelling Society Young Modellers’ Forum 2018.

To encourage young molecular modellers at the beginning of their careers, the MGMS invites PhD students who wish to present their work on any aspect of computational chemistry, cheminformatics, or computational biology at the 2018 Young Modellers’ Forum. Other members of the modelling community are are strongly encouraged to attend this event as it is your opportunity to see these talented young modellers and to assist us in the evaluation of the prizes. There is also the chance to discuss the talks afterwards in the pub

Abstract submission 5th October 2018

Date: Friday, 30th November, 2018 Venue: Room QA063, Queen Ann Court, The Old Naval College, Greenwich Location: Details of how to get to the campus can be found at http://www2.gre.ac.uk/about/travel/greenwich.


Comments

RSC Chemical Information and Computer Applications Group

 

The website for RSC Chemical Information and Computer Applications Group (CICAG) has undergone an update http://www.rsccicag.org now includes more information on forthcoming events and awards, together with the latest CICAG newsletter. Please feel free to share.

The Chemical Information and Computer Applications Group (CICAG) is one of the RSC’s many member-led Interest Groups, which exist to benefit RSC members and the wider chemical science community, and to meet the requirements of the RSC’s strategy and charter.

CICAG works to support users of chemical information, data and computer applications and advance excellence in the chemical sciences. Inform RSC members and others of the latest developments in these rapidly evolving areas and promote the wider recognition of excellence in chemical information and computer applications at this level

aifirst_announcement-


Comments

Virtual Chemical Libraries

 

A very interesting paper on Virtual Chemical Libraries by W. Patrick Walters DOI describing how it is now possible to generate virtual libraries of molecules of billions of compounds. These vast virtual libraries result in a number of practical challenges in particular their use in virtual screening.

If we consider a virtual screen with a false positive rate of 1% (an optimistic estimate for even the best virtual screening methods), a virtual screen on a library of 1 million molecules would yield 10,000 false positive hits. (A “false positive” is an inactive molecule which is predicted to be active).

Another consideration with very large virtual libraries is the time and CPU resource required for processing, whilst substructure and 2D similarity searches are very fast and can make use of hashed fingerprints. 3D or docking searches are orders of magnitude slower and require either storage of multiple conformations of the ligand or conformation generation on the fly. Realistically these require access to large compute clusters, cloud based resources are now relatively accessible but require significant expertise to access efficiently and securely.

Even the fastest docking programs require 2 seconds per molecule to dock an ensemble of conformations into a protein binding site. At this rate, approximately 15,327 CPU days would be required to dock 680 million molecules.

With this in mind it perhaps appropriate to flag that D3R Grand Challenge 4 has just opened, Full details are published on the Drug Design Data Resource site.


Comments

Papers Update

 

The latest update to Papers 3 (version 3.4.16)

  • Adds support for Microsoft Word version 16
  • Adds Citations support for Scrivener
  • Resolves issues with PDF annotations

There are more reference management apps here.


Comments

Chembience

 

Chembience is a Docker based platform intended for the fast development of chemoinformatics-centric web applications and micro-services based on RDkit. It supports a clean separation of your scientific web service implementation work from any infrastructure related configuration requirements.

chembience

At its current development stage, Chembience supports three base types of application (App) containers: (1) a Django/Django REST framework-based App container which is specifically suited for the development of web-based Python applications, (2) a Python shell-based App container which allows for the execution of script-based python applications, and (3), a Jupyter-based App container which let you run Jupyter notebooks (currently only a Python kernel is supported).


Comments

Augmented Reality in Chemistry

 

The use of augmented and virtual reality in chemistry is slowly starting to gain traction. The initial use of virtual reality in drug discovery is well documented but usually confined to highly specialised hardware which has limited it's exposure to a wider audience. However as described by Jonas Boström at the recent Chemistry on Mobile Devices Meeting Virtual reality smartphone apps making chemistry look and feel cool. This project aims to enhance the learning experience for school chemistry lessons by providing virtual reality viewing of molecules using inexpensive Google Cardboard viewers available online.

Virtual reality smartphone apps are making chemistry look and feel cool. This project aims to enhance the learning experience for school chemistry lessons by providing virtual reality viewing of molecules using inexpensive Google Cardboard viewers.

EduChemVR have a number of apps for download to allow users to interact with macromolecules or learn stereochemistry.

The power of the latest generation of smart phones has enabled scientists to also explore augmented reality. Augmented reality is now being used in a number of situations. To enhance publications as demonstrated by Alistair Crow, if you want to know how to do this instructions are available here. Many people have probably used the superb ChemTube3D website created by Nick Greeves at the University of Liverpool which is an invaluable education resource, this is also accessible via a Smartphone app.

ChemTube3D contains interactive 3D animations and structures, with supporting information for some of the most important topics covered during an undergraduate chemistry degree

More recently some of the pages have been enhanced to provide access to virtual reality models, if you would like to develop similar pages there is an AppleScript droplet to batch convert Jmol files into files suitable for AR.

More recently Mark Costner has released MoleculAR: an augmented reality (AR) app to view molecules in 3D.

The images of molecules for use with the MoleculAR augmented reality app are available on GitHub and there is a more detailed explanation here.


Comments

Mixfile format

 

An interesting post on Mixtures & cheminformatics on designing a new file format to handle mixtures of chemicals, in particular things like "LDA within a solvent mixture of THF and hexanes, in a ratio of 1 to 7".

LDAsoln

The format hasn’t been locked down yet, but it is very simple: it’s JSON-based, in order to make it easy to read & write with any software platform, and have high human readability. It’s hierarchical, making it possible to describe mixtures-of-mixtures, which happens frequently. Each component is expected to provide a structure and quantity whenever these are known, with name being also highly encouraged. Other information like canonical identifiers, database links, cross references, etc., can easily be encapsulated – the Mixfile is intended to be an inclusive container of information – but they do not necessarily impart much-if-any special meaning to the software that interprets them.

More info can be found on the GitHub page https://github.com/cdd/mixtures.


Comments

D3R Grand Challenge 4

 

I've written a couple of tutorials on docking here and here that have been popular pages.

dockedligand.png

The tools used for docking are being regularly updated and so the D3R Grand Challenge 4, a new blinded prediction challenge for protein-ligand poses and affinities is an invaluable data point for comparison of the current state of play.

The Grand Challenge 4 (GC4) will open on September 4, with the following submission deadlines:

  • Stage 1a, cross-docking challenge: October 4
  • Stage 1b, self-docking challenge: October 19
  • Stage 2. affinity ranking and free energies: December 4

Challenge components will include:

  • Affinity ranking of ~450 Cathepsin S inhibitors from the same large dataset drawn from in GC3
  • Affinity ranking of ~150 beta secretase 1 (BACE) inhibitors
  • Pose prediction of 20 BACE inhibitors
  • Free energy prediction challenges suitable for alchemical free energy methods, for both Cathepsin and BACE

Full details will be published on the Drug Design Data Resource site.

Comments

OMEGA v3.0.1 released

 

OpenEye have just announced the release of OMEGA v3.0.1 This upgrade fixes several bugs and adds a number of internal improvements.

Major bug fixes

  • A bug that caused memory leaks in OMEGA classic, dense, pose, and rocs modes, has been fixed. Previously, a substantial memory leak was experienced when running OMEGA on a large database.
  • OMEGA macrocycle no longer uses excessive memory for molecules with terminal heavy atoms.

OMEGA performs rapid conformational expansion of drug-like molecules, yielding a throughput of tens of thousands of compounds per day per processor. OMEGA is very effective at reproducing bioactive conformations, and provides an optimal balance between speed and performance when used on large compound databases.


Comments

Deep Replay

 

This looks rather neat, Deep Replay

Deep Replay is a package designed to allow you to replay in a visual fashion the training process of a Deep Learning model in Keras.

part1

To install Deep Replay just type:

pip install deepreplay

Comments

Chemfp

 

Just got this message which I thought readers might be interested in

chemfp 1.5 is now available from http://dalkescientific.com/releases/chemfp-1.5.tar.gz and from PyPI (the Python package index) through "pip install chemfp".

The software is available in source code form under the MIT license. For more information see the home page at http://chemfp.com/ or the documentation page at https://chemfp.readthedocs.io/en/chemfp-1.5/ .

Chemfp is a set of command-line tools and a Python library for working with cheminformatics fingerprints. It can use OEChem/OEGraphSim, RDKit, or Open Babel to create fingerprints in the FPS format, and it implements a high-speed Tanimoto search.

As far as I can tell, chemfp 1.5 is the fastest free/open source fingerprint search system for the CPU. (Some proprietary/commercial toolkits are faster, including the commercial version of chemfp, and GPU-based search is usually faster than the CPU.)

The main changes for this release are:

  • 10% faster performance for k-nearest search
  • fixed a bug in symmetric k-nearest neighbor when multiple fingerprints have no bits set
  • improved the use of chemfp as a baseline benchmark for similarity search tools

Similarity search performance benchmark

Concerning the last point, I have assembled a data set which can be used to benchmark similarity search performance for several different search types, fingerprint types, and scoring functions. This includes pre-computed fingerprints and expected search results, as well as timing numbers for several different versions of chemfp.

My hope is that it evolves into a standard benchmark that help evaluate search tools - bearing in mind that performance is only one of many factors that go into selecting a tool.

The benchmark files are at https://bitbucket.org/dalke/chemfp_benchmark . Those files which fall under copyright are distributed under the MIT license.

Many thinks to ChEMBL, OpenEye, PubChem, Open Babel, RDKit, and Daniel Lemire for providing the data and resources for putting this benchmark together.

Best regards,

Andrew dalke@dalkescientific.com


Comments

Aug 15 1998 Apple launches the iMac

 

On August 15, 1998, Apple launched the first iMac into the world, the multi-colored gumdrop-shaped iMac proved to be the perfect launchpad for a revitalised Apple.

iMac

The first iMac had fairly modest specs, a 233 - 700MHz PowerPC 750 G3 processor, 128GB of storage, a 15-inch CRT, a CD-ROM drive, and an ATI graphics card. Since then Apple has regularly upgraded the iMac

The latest Pro version boasts up to 18-core 2.3 GHz Intel Xeon W processors (Turbo Boost up to 4.3GHz), 32GB of 2666MHz DDR4 memory (four SO-DIMM slots, user configurable to 128GB), up to 4TB SSD storage 27-inch (diagonal) Retina 5K display and Radeon Pro Vega graphics 64 card with 16GB of high bandwidth memory, and of course it is available in space grey.

iMacPro

Happy 20th birthday.


Comments

REALizer KNIME workflow from BioSolveIT

 

BioSolveIT have added to their collection of KNIME workflows.

The "REALizer" helps you to post-process the results from searches in the REAL Space, leading you to those compounds of biggest interest.


Comments

Fortran on a Mac update

 

As I've noted on several occasions I'm not a big Fortran user but looking at the website stats the Fortran on a Mac page is now the third most regularly read page on the site and page views seem to be increasing.

I was recently sent a new link and I have added it to the Fortran on a Mac page.

Sourcery Institute a variety of resources for Fortran programmers, Sourcery institute tap for Homebrew formulae not in homebrew/homebrew-core, a Coarray Fortran Jupyter notebook kernel, forks of flang and gcc and OpenCoarrays a transport layer for coarray Fortran compilers.

Comments

An Applescript droplet to generate Augmented Reality files from JMol

 

Augmented reality is finding new applications in science, in particular the ability to enhance publications or lecture notes, and viewers can set up a free account with Augment to provide easy access.

I was asked recently if it might be possible to generate an AppleScript droplet that you could simply drop a chemical structure file onto to generate the desired files needed for the Augment, and this is an ideal use case for a droplet.

This script uses Jmol to generate the Wavefront .obj and .mtl files which can be used

You read more about the script and download it here.

Nick Greeves has tweeted an example of its use here and a demo page here.

Comments

Updated INSENSITIVE

 

Insensitive (Incredible Nuclear Spin EvolutioN Simulation Tool Intended for Visual Education) is an application to simulate the NMR experiment based on the quantum mechanical density matrix formalism.

It is available for Mac OS X 10.6 and above and iOS 5.1.1 and above. Full details can be found in Concepts In Magnetic Resonance, 2011, 38A (2), 17-24 DOI.

The NMR experiment is usually described by a choice of three models that operate on different levels of abstraction: the vector model, the product operator formalism and the density matrix approach. The transition between these models poses a didactic challenge for teacher and student alike. A new computer program is presented, which simulates a spin system on the textbook level and compares the three approaches, with the possibility to manipulate the system at every step. It closes a gap between NMR education and professional simulation tools. Some algorithms are explained, which are used in the simulation to extract information from the density matrix.

1


Comments

ACS awards for Computers in Chemistry

 

Nominations are now open for the Computers in Chemistry division of the ACS awards.

More details here http://www.acscomp.org/awards.


Comments

New ChEMBL interface

 

Just having a look at the new ChEMBL interface, quite like the easy way to embed records into web pages

<object data="https://www.ebi.ac.uk/chembl/beta/embed/#mini_report_card/Compound/CHEMBL1471" width="100%" height="300"></object>

and it is displayed as shown below.

Will doing some more investigations later this week.

Comments

Intelligently Automating Machine Learning, Artificial Intelligence, and Data Science

 

A timely tutorial and example workflow.

we have put together a more comprehensive workflow, serving as a blueprint for anyone to build her or his own version of a Guided Analytics application to combine just the right amount of automation and interaction for a specific set of problems.

Full details here


Comments

LabMathX

 

LabMathX is a MacOSX program for scientific analysis, calculations and Visualisation that includes support for older hardware.

LMXHarmonic

  • LabMathX is Scriptable with AppleScript. Check the Dictionary with Apple's Script Editor.
  • LabMathX Supports Services and Can Be Accessed From the Services Menu in Other Services-Aware Applications.
  • LabMathX and Its Plug-ins Are Written in Objective C Under Cocoa.
Comments

Apps at discount prices

 

A summer promotion is offering 12 applications at discount prices, pick and choose the ones you want.

Here is the list of participating apps:

iOS apps:

  • Mindnode by IdeasOnCanvas GmbH (AUT) → now 10,99€/$9.99 (30% OFF)
  • Notebooks by Alfons Schmid (AUT) → now 4,49€/$3.99 (40% OFF)
  • Inko by Creaceed SPRL (BEL) → now 14,99€/$13.99 (30% OFF)
  • Prizmo Go by Creaceed SPRL (BEL) → now 3,49€/$2.99 (40% OFF)
  • Grafio by Ten Touch Ltd. (BGR) → now 8,99€/$7.99 (20% OFF)
  • PocketCAS by Daniel Alm (DEU) → now 4,99€/$3.99 (50% OFF)
  • Money by Jumsoft (LTU) → now 1,09€/$0.99 (65% OFF on Standard IAP)

Mac apps:

  • Mindnode by IdeasOnCanvas GmbH (AUT) → now 29,99€/$26.99 (30% OFF)
  • Notebooks by Alfons Schmid (AUT) → now 9,99€/$8.99 (50% OFF)
  • Prizmo by Creaceed SPRL (BEL) → now 38,99€/$32.99 (30% OFF)
  • Remote Buddy by IOSPIRIT GmbH (DEU) → now 19,99€/$17.99 (20% OFF)
  • PocketCAS by Daniel Alm (DEU) → now 9,99€/$8.99 (50% OFF)
  • Findings by Findings Software SAS (FRA) → now 32,99€/$29.99 (40% OFF)
  • PDF Watermarker by seense (FRA) → now 8,99€/$7.99 (60% OFF)
  • Money by Jumsoft (LTU) → now 16,99€/$14.99 (40% OFF on Standard IAP)
  • Studies by The Mental Faculty B.V. (NLD) → now 21,99€/$19.99 (30% OFF)
  • Workspaces by Apptorium (POL) → now 6,99€/$5.99 (35% OFF)
  • FiveNotes by Apptorium (POL) → now 3,49€/$2.99 (40% OFF)

Comments

RSC CICAG webite

 

The Chemical Information and Computer Applications Group (CICAG) is one of the RSC’s many member-led Interest Group. The new website is now live, http://www.rsccicag.org

Why not have a browse around and let us know what else you would like to see included.

RSC_LOGO_CI+CAG_A4_PRINT


Comments

Wolfram|Alpha

 

Wolfram|Alpha has been updated

Across thousands of domains--with more continually added--Wolfram|Alpha uses its vast collection of algorithms and data to compute answers and generate reports for you. The Wolfram|Alpha App plugs directly into the Wolfram|Alpha supercomputing cloud, computing answers to your questions quickly, efficiently, and without draining your battery.

There are more iPhone/iPad science apps on the Mobile Science Website.


Comments

A few thoughts on scientific software

 

Whilst this website is aimed at providing a resource for Mac using chemists regular readers will know that much of the content is platform agnostic and includes much code/software that will be of interest to all scientists.

software

I recently got a rather sad email

It seems that Third Street Software quietly disappeared, breaking the syncing for Sente (reference management).

I've also heard about a couple of other smaller software developers who are finding life very tough and it started me thinking about the status of scientific software, after exchanging emails with a number of people in the industry (many thanks for their input) I thought I'd collect a few thoughts on my blog.

You can read it here https://www.macinchem.org/reviews/scientificsoftware/software.php.

Comments

BBEdit updated

 

BBEdit 12.1.5 contains fixes for reported issues. This update does not contain any new features.

The full release notes are available here https://www.barebones.com/support/bbedit/notes-12.1.5.html.


Comments

KNIME update

 

What’s New in KNIME Analytics Platform 3.6.

  • KNIME Deep Learning
  • Constant Value Column Filter
  • Numeric Outliers
  • Column Expressions
  • Scorer (JavaScript)
  • Git Nodes
  • Call Workflow (Table Based)
  • KNIME Server Connection
  • Text Processing
  • Usability Improvements
  • Connect/Unconnect nodes using keyboard shortcuts
  • Zooming
  • Replacing and connecting nodes with node drop
  • Node repository search
  • Usability improvements in the KNIME Explorer
  • Copy from/Paste to JavaScript Table view/editor
  • Miscellaneous
  • Performance: Column Store (Preview)
  • Making views beautiful: CSS changes
  • KNIME Big Data Extensions
  • Create Local Big Data Environment
  • KNIME H2O Sparkling Water Integration
  • Support for Apache Spark v2.3
  • Big Data File Handling Nodes (Parquet/ORC)
  • Spark PCA
  • Spark Pivot
  • Frequent Item Sets and Association Rules
  • Previews
  • Create Spark Context via Livy
  • Database Integration
  • Apache Kafka Integration
  • KNIME Server

  • Management (Client Preferences)

  • Job View (Preview)
  • Distributed Executors (Preview)
  • General release notes

  • JSON Path library update

  • Java Snippet Bundle Imports

I suspect it will be the KNIME Deep learning that will catch the eye, the ability to set up deep learning models using drag and drop. Use regular Tensorflow models within KNIME Analytics Platform and seamlessly convert from Keras to Tensorflow for efficient network execution

deeplearning

The new Create Local Big Data Environment node creates a fully functional local big data environment including Apache Spark, Apache Hive and HDFS. It allows you to try out the nodes of the KNIME Big Data Extensions without a Hadoop cluster.


Comments

Resuts from Avogadro Survey

 

The results of the Avogadro 2018 Community Survey are now in.

Avogadro is an advanced 3D molecule editor and visualizer designed for cross-platform use in computational chemistry, molecular modeling, bioinformatics, materials science, and related areas. It offers flexible high quality rendering and a powerful plugin architecture.

The results are well worth browsing though but here are a few things I've picked out

  • The most common way people hear about Avogadro by word of mouth.
  • Most people install downloaded binaries
  • Many users can code, mainly Python
  • Most tasks performed centre around initial molecule building and editing

avogadro

You can download from sourceforge here https://sourceforge.net/projects/avogadro/files/latest/download


Comments

The use of augmented reality in chemistry

 

A couple more examples of the use of augmented reality to display chemistry

This also looks interesting.
Touching proteins with virtual bare hands

….A more accessible and intuitive visualization of the three-dimensional configuration of the atomic geometry in the models can be achieved through the implementation of immersive virtual reality (VR). While bespoke commercial VR suites are available, in this work, we present a freely available software pipeline for visualising protein structures through VR. New consumer hardware, such as the HTC Vive and the Oculus Rift utilized in this study, are available at reasonable prices….

https://doi.org/10.1007/s10822-018-0123-0


Comments

ChEMBL 24 predictive models

 

Recently ChEMBL was updated to version 24 the update contains:

  • 2,275,906 compound records
  • 1,828,820 compounds (of which 1,820,035 have mol files)
  • 15,207,914 activities
  • 1,060,283 assays
  • 12,091 targets
  • 69,861 documents

In addition today they released the predictive models built on the updated database, they can be downloaded from the ChEMBL ftp server ftp://ftp.ebi.ac.uk/pub/databases/chembl/target_predictions

There are 1569 models.


Comments

Tips & Tricks for Using KNIME

 

The Knime blog has a post containing lots of user submitted tips and tricks

Ever sat next to a friend or colleague at the computer and were awed when you suddenly realised the way they do certain tasks is much better? We recently asked KNIME users to share their tips and tricks on using KNIME. In this series of posts we’ll be showing you how the experts use KNIME in the hopes that by sharing ideas you’ll discover some handy techniques.


Comments

AI in Chemistry meeting report

 

RSC-BMCS / RSC-CICAG Artificial Intelligence in Chemistry Friday, 15th June 2018 - Royal Society of Chemistry at Burlington House, London, UK
Post-event Report on Speaker Presentations, written by Bursary Awardees

http://www.maggichurchouseevents.co.uk/bmcs/Downloads/Archive/AI%20-%20post-event%20report.pdf

Comments

Added MestraNova to Mobile apps

 

I've just added MestReNova to the mobile science site.

MestRe Nova is an iPad app for viewing/manipulation NMR spectra

There are an increasing number of spectroscopy apps available.


Comments

1 million page views

 

I was looking at the website stats and I just noticed that last month the site passed the 1 million page views since it was changed to the current format. I'm delighted (and slightly surprised) that the site has proved to be so popular.

The top 5 most popular pages are:

Comments

EzMol

 

EzMol - An easy to use simple molecular graphics program

EzMol aims to fill a quite different role to that delivered by superb programs such as PyMol and Chimera. EzMol is designed at the occasional user and provides a step-by-step wizard to rapidly generate an image for inspection and publication. For example, residue selection, colouring and labelling using a paint-box approach so no typing of commands

You can read more here DOI.


Comments

Mnova 12.0.3 (minor release)

 

Update

IUPAC Name

  • Able to name molecules with atoms in non-standard valence
  • Implement skeletal replacement (“a”) nomenclature for heteropolycyclic ring systems
  • Naming of branched ring assemblies
  • Correct names of several suffix groups
  • Able to name ring assemblies of 3-6 identical cyclic systems

MS

  • Precursors m/z values displayed in the MSn extracted spectra title

Full details here http://resources.mestrelab.com/whats-new-mnova-12-0-3/

There is a review of Mnova here


Comments

Updated Conda

 

I've been checking a few things since I updated. One thing that was immediately apparent was the similarity maps in RDKit are much nicer! As you can see from the output of the HERG prediction.

hergactiverdkit

Feel like I got something for free.


Comments

Add mathematical equations to your document in Pages, Numbers, and Keynote

 

I previously mentioned that there is LaTeX and MathML support in Pages and iBooks Author. This has now been extended to Numbers and keynote.

Add mathematical equations to your document in Pages, Numbers, and Keynote https://support.apple.com/en-us/HT207569.

You can include mathematical expressions and equations in your Pages, Numbers, or Keynote document when you use LaTeX commands or MathML elements.

ipad-pro-ios11-pages4-1-equation


Comments

Electronic Lab Notebooks

 

This looks useful a comparison of electronic lab notebooks

The Electronic Lab Notebook Matrix has been created to aid HMS researchers in the process of identifying a usable Electronic Lab Notebook solutions to meet their specific research needs. Through this resource, researchers can compare and contrast the numerous solutions available today, and also explore individual options in-depth.

Comments

Updating conda

 

I've been putting off doing any updates until I finished a substantial piece of work, but now I have time so wish me luck.

conda update -n root conda

conda update --all

Comments

Accessing a Jupyter Notebook HERG model from Vortex

 

A recent paper "The Catch-22 of Predicting hERG Blockade Using Publicly Accessible Bioactivity Data" DOI described a classification model for HERG activity. I was delighted to see that all the datasets used in the study, including the training and external datasets, and the models generated using these datasets were provided as individual data files (CSV) and Python Jupyter notebooks, respectively, on GitHub https://github.com/AGPreissner/Publications).

The models were downloaded and the Random Forest Jupyter Notebooks (using RDKit) modified to save the generated model using pickle to store the predictive model, and then another Jupyter notebook was created to access the model without the need to rebuild the model each time. This notebook was exported as a python script to allow command line access, and Vortex scripts created that allow the user to run the model within Vortex and import the results and view the most significant features.

All models and scripts are available for download.

Full details are here…

hergactiveVortex


Comments

Schrödinger Software Release 2018-2

 

Schrödinger have announced a major update their software suite.

Full details are here https://www.schrodinger.com/newfeatures


Comments

BiosolveIT update SeeSAR and more.

 

BioSolveIT have announced significant changes and improvements in SeeSAR resulting in another major release to version 8. The biggest change is that they now provide full protein visualization support. While the focus of the tool is for the most part still on the defined binding site, you can now...: see the whole protein in all its glory! As always, a major update means that HYDE scores must be re-calculated to stay in line with the changes made in the underlying structures. We certainly believe that these enhancements are well worth it:

  • improved alignment
  • full protein support in the seqence view
  • search&find specific amino acids, waters or other protein components
  • all protein visualization controls bundled
  • enhanced pharmacophore handling
  • fragment growing for covalent binders

For details see: https://www.biosolveit.de/SeeSAR/changes.html

They also have two new tools:

REALSpaceNavigator is the world's largest, ultra-fast searchable chemical space developed in collaboration with Enamine Ltd. It comprises roughly 3.8 billion compounds today, which will be delivered on demand in less than 4 weeks with an exceptional success rate of 80% and above.

PepSee is a software tool for interactive, visual compound prioritization as well as the design of next-generation peptide therapeutics. Peptide design ideally supports a multi-parameter optimization to maximize the likelihood of success. PepSee visualizes the relevant parameters at hand, side by side with the sequence data. Color-coded display stimulates SAR exploration. The main features of PepSee comprise:

  • comfortable sequence & data import (from Excel, FASTA, PLN, Text, even PDF)
  • automated as well as manual sequence alignment
  • various data coloring and plotting options
  • organizing and annotating your compounds
  • interactive design of novel peptides
Comments

Data Creator Updated

 

One of the things that I’m occasionally asked for is a test data set that can be used to evaluate an application. Whilst I keep a couple of data sets that I can use perhaps DataCreator will provide a more comprehensive solution. Data Creator is an application that has been designed to fill this important niche, Data Creator can be used to build very large data sets using field types defined by the user and then filled with random realistic content.

Data Creator can create sample tables (rows and columns) as you like and fill them with pseudo-random proper content (rows of content) with a single click. You can select which kind of fields (columns) you like (name of animals, colors, fruits, english surname, german names and so on with over 50 different kind of data) and have all the contents filled for how many rows you like in a click. It can export to Comma separated value, Tab separated values, html tables, even web pages ready to click or in any custom format you like.

The latest update brings a couple of bug fixes and

  • New type 'Decimal Number in Range' to many requested format such as currency (example: $ 1.99)
  • Improved error detection of data formatting
  • Optimized for macOS 10.12 Sierra

There is a review of DataCreator here.


Comments

A Review of MNova NMR

 

MNova NMR is Mestrelab Research’s NMR analysis program that can be used to quickly view, process and analyse both 1D and 2D spectra, as well as to easily produce publication quality assignments and images. The software can be downloaded from Mestrelab’s website (45-day free trial licences are available).

You can read the review here

Picture1

Comments

Mobile Science Apps

 

I just checked the most upvoted apps on the Mobile Science site

https://www.macinchem.org/mobilescience/upvoted/.

ChemDoodle still tops the list but Medicinal Chemistry Toolkit and Elemental are picking up votes as is WolframAlpha. The newly updated Findings lab notebook also remains popular.

The virtual reality macromolecule viewer Learning MacroMols VR is also popular.


Comments

A quick look at CypReact

 

Sometimes you just want to know which enzymes are likely to be involved in the metabolism of a molecule, CypReact DOI takes a structure (SMILES or sdf input) and predicts if the molecule will react with any one of the nine of the most important human cytochrome P450 (CYP450) enzymes [CYP1A2, CYP2A6, CYP2B6, CYP2C8, CYP2C9, CYP2C19, CYP2D6, CYP2E1, or CYP3A4]. Read more here..

emendmetab


Comments

How Do You Build and Validate 1500 Models and What Can You Learn from Them?

 

Greg Landrum's ICCS 2018 presentation on slideshare


Comments

Scaling Python with Dask webinar

 

This looks to be an interesting webinar on Dask

https://know.anaconda.com/Scaling-Python-Dask-Webinar.html Wednesday, May 30th at 2:00PM CDT.

Dask is a flexible parallel computing library for analytic computing.

Dask is composed of two components:

  • Dynamic task scheduling optimized for computation. This is similar to Airflow, Luigi, Celery, or Make, but optimized for interactive computational workloads.
  • “Big Data” collections like parallel arrays, dataframes, and lists that extend common interfaces like NumPy, Pandas, or Python iterators to larger-than-memory or distributed environments. These parallel collections run on top of the dynamic task schedulers.

Comments

Unix commands for helping deal with very large files

 

I'm regularly handling very large files containing millions for chemical structures and whilst BBEdit is my usual tool for editing text files in practice it becomes rather cumbersome for really large files (> 2 GB). In these cases I've compiled a useful list of UNIX commands that make life easier.

The page is part of the Hints and Tutorials section and can be viewed here.

Whilst I use them when dealing with large chemical structure files they are equally useful when dealing with any large text or data files.

Updated

A suggestion from a reader. Sometimes rather than one large file download sites provide the data as a large number of individual files. We can keep track of the number of files using this simple command.

MacPro:~ Chris$ ls | wc -1
177248

If anyone has any additional suggestions please feel free to submit them.




Comments

Implementing AB-MPS scoring

 

Whilst the rule of 5 (Ro5) has provided a useful way to describe small molecule drug space it is also clear that there are a significant number of molecular classes that exist beyond the rule of 5 boundaries (bRo5). In a review of the AbbVie compound collection DOI they were able to identify key findings that might explain the success (or failure) of bRo5 projects. From an analysis of a variety of calculated physicochemical properties they proposed a simple multiparametric scoring function (AB-MPS) was devised that correlated preclinical PK results with cLogD, number of rotatable bonds, and number of aromatic rings.

AB-MPS = Abs(cLogD-3) + NAR + NRB

Now implemented as a Vortex script.


Comments

Chemical Information and Computer Applications Group (CICAG) website

 

The new RSC CICAG website is now live http://www.rsccicag.org why not have a look and provide suggestions and feedback.

RSC_LOGO_CI+CAG_A4_PRINT

The Chemical Information and Computer Applications Group (CICAG) is one of the RSC’s many member-led Interest Groups, which exist to benefit RSC members and the wider chemical science community.

Also provides links to the social media feeds (Twitter, LinkedIn etc.)



Comments

Intel® Distribution for Python

 

Anyone fancy taking this for a test drive and providing some information on performance?

Get real performance results and download the free Intel Distribution for Python that includes everything you need for blazing-fast computing, analytics, machine learning, and more. Use Intel Python with existing code, and you’re all set for a significant performance boost.

The core computing packages, Numpy, SciPy, and scikit-learn, are accelerated under the hood with powerful, multithreaded native performance libraries such as Intel® Math Kernel Library, Intel® Data Analytics Acceleration Library, and others, to deliver native code-like performance results to Python. We leverage Intel® hardware capabilities using multiple cores and the latest Intel® Advanced Vector Extensions (Intel® AVX) instructions, including Intel® AVX-512. The Intel Python team reimplemented select algorithms to dramatically improve their performance. Examples include NumPy FFT and random number generation, SciPy FFT, and more.

Available for Windows, Linux and macOS.

Minimum System Requirements

  • Processors: Intel Atom® processor or Intel® Core™ i3 processor
  • Disk space: 1 GB
  • Operating systems: Windows* 7 or later, macOS, and Linux
  • Python* versions: 2.7.X, 3.5.X, 3.6
  • Included development tools: Conda, conda-env, Jupyter Notebook (IPython)

Comments

Diversity Genie

 

Diversity Genie is a desktop software tool which allows to analyze and manipulate chemical data. Its capabilities include:

  • mapping molecules and their properties with sammon embedding.

  • filtering and converting sets of molecules in SDF, SMILES, and InChI formats.

  • plotting histograms, scatter plots, and ROC curves.

  • Computing well-known molecular properties and merging CSV files.

  • Creating machine learning models using powerful gradient boosting methods.

Diversity Genie 3 is completely free to use by academia and for personal non-commercial use. You can download Mac OSX, Windows and Linux builds at

http://www.diversitygenie.com/index.html

screenshot1


Comments

CCP4 release 7.0 update 056 now available

 

Collaborative Computational Project No. 4 (CCP4) exists to produce and support a world-leading, integrated suite of programs that allows researchers to determine macromolecular structures by X-ray crystallography, and other biophysical techniques.

Details of the latest update are here https://twitter.com/ccp4_mx/status/991256632729403392


Comments

Google Sumer of code, Open Chemistry Projects

 

The details of some of the projects taking part in the Google Summer of Code are now online here https://summerofcode.withgoogle.com/organizations/6513013473935360/ under the Open Chemistry header.

Really interesting work includes 3-D coordinate generation, standardising fingerprint APIs, a framework for molecular validation, and standardization and molecular dynamics in Avogadro.

Good luck to all that are taking part!!


Comments

deMon2k code version 5 released

 

deMon (density of Montréal) is a software package for density functional theory (DFT) calculations. It uses the linear combination of Gaussian-type orbital (LCGTO) approach for the self-consistent solution of the Kohn-Sham (KS) DFT equations. The calculation of the four-center electron repulsion integrals is avoided by introducing an auxiliary function basis for the variational fitting of the Coulomb potential.

The user guide provides installation instructions and requires a Fortran compiler, BASH and MPI.


Comments

ChemDoodle 9.0 released

 

I just saw that ChemDoodle 9.0 has been released and I plan to have a detailed look later this month.

ChemDoodle 9 is a major revision of every aspect of the software. We spent over 2 years overhauling and improving the cheminformatics engine, interface, drawing controls, image and chemical file types, graphics, and operating system compatibility. In addition to the new features, the entire codebase has been refactored for the current best standards to take advantage of the latest performance, memory and security features of the operating system.

What is new in ChemDoodle 9

  • A new user manual discusses all the new features in detail over several pages, too many to list here. (click to load manual, section 1.2)
  • Drawing and Graphics – Tons of new systems for making your graphics quicker. Auto-placement of attributes (charges/radicals/stereocenters/etc.). An improved text tool that can create both atom text and formatted captions. Draw chiral carbon nanotubes in addition to zigzag and armchair. New dynamic brackets and structure highlights. Better drawing tools for advanced figures.
  • Chemistry – State-of-the-art implementation of the most recent CIP rules. A clearer and more powerful warning system. Advanced implicit hydrogen handling including the analysis of advanced aromatic resonance systems. Full support for the latest elements as defined by IUPAC and much more!
  • Interface – A brand new customizable cursor system, improved IUPAC name-to-structure interface. Improved color palettes, now with Rasmol, CPK and Custom color sets. HTTPS support for PubChem is now implemented for access in MolGrabber. Improved color choosers including alpha support and high resolution improvements across the entire application.
  • Chemical Files – The Nature style sheet has been added. SMILES interpretation has seen significant work, with a focus on very advanced cheminformatics techniques. Added support for the RCSB MacroMolecular Transmission Format (MMTF). More support for ChemDraw, MDL CT, MRV and ISIS/Sketch files.
  • Images – TIFF images can now be exported with custom DPI settings. GIF image output can now have semi-transparent pixels merged with white. Added viewBox attribute for SVG. When saving files, you can now use alternate extensions and other image file chooser improvements. Control which image file types are shown in the save image choosers.
  • Vector Art – New glassware graphics have been added as well as dozens of new BioArt.
  • Customizability – The keyboard and tools shortcuts are now fully customizable by the user. The user settings folder location can now be controlled. * Custom attribute names and values are now persisted through restarts. Windows – Full support for high-DPI screens, without the manual scaling required in the past. The OLE plugin has been rebuilt for the most current compliance with Windows libraries.
  • macOS – Improved and full Retina support. Native file choosers.

Comments

Jupyter and Fortran

 

Well after my last post about Swift and Jupyter a reader sent me link to the use of both Julia and Fortran programming languages in a Jupyter Notebook.

fortranJupyter

More information in this lecture Project Jupyter: Architecture and Evolution of an Open Platform for Modern Data Science by Fernando Perez.

Project Jupyter, evolved from the IPython environment, provides a platform for interactive computing that is widely used today in research, education, journalism and industry. The core premise of the Jupyter architecture is to provide tools for human-in-the-loop interactive computing. It provides protocols, file formats, libraries and user-facing tools optimized for the task of humans interactively exploring problems with the aid of a computer, combining natural and programming languages in a common computational narrative.


Comments

Swift 4.1 in a Jupyter Notebook

 

I'm a great fan of Jupyter Notebooks but I only ever use python.

The Jupyter Notebook is an open-source web application that allows you to create and share documents that contain live code, equations, visualizations and narrative text

A recent post by Ray Yamamoto Hilton caught my eye who recently put together a little experiment to demonstrate using Swift 4.1 from within Jupyter Notebooks.

You can download a demo notebook here.

swiftjupyter


Comments

Amber 18 and AmberTools 18released

 

Amber is a suite of biomolecular simulation programs. It began in the late 1970's, and is maintained by an active develpment community

Amber 18 ajor new features include:

  • Free energy calculations on GPUs
  • GPU support for 12-6-4 ion potentials
  • Domain decomposition for CPU-parallelism
  • Nudged elastic band calculations for pmemd (CPU and partial GPU implementation)
  • Constant redox potential calculations, to supplement constant pH simulations
  • Support and significant performance improvements for the latest Maxwell, Pascal and Volta GPUs from NVIDIA.
  • New pmemd.gem code for advanced force fields, including AMOEB

AmberTools 18 new features include

  • CUDA-enabled pbsa solver; extensions for membrane modeling with PB *lambda-dynamics method for constant pH simulations *packmol_memgen tool for building lipids and bilayers *New ("middle") integration algorithms in sander *Build tools based on CMake *Continued updates and extensions to cpptraj: *ability to obtain energies from snapshots of PME simulations *Pairlist and other speedups *improved scripting abilities

Instructions for installing Amber under Mac OSX are here http://ambermd.org/Installation.php

You will need to install gfortran, whilst you can download the binary it might be worth considering using Homebrew as described here


Comments

NWChem updated

 

Just catching up.

NWChem 6.8 is now available on Github https://github.com/nwchemgit/nwchem.

NWChem provides many methods for computing the properties of molecular and periodic systems using standard quantum mechanical descriptions of the electronic wavefunction or density. Its classical molecular dynamics capabilities provide for the simulation of macromolecules and solutions, including the computation of free energies using a variety of force fields. These approaches may be combined to perform mixed quantum-mechanics and molecular-mechanics simulations.

Instructions for compiling NWChem on various platforms including Mac OSX https://github.com/nwchemgit/nwchem/wiki/Compiling-NWChem.


Comments

STK: A Python Toolkit for Supramolecular Assembly

 

I bookmarked this paper a while back but have only just had time to read it through, STK: A Python Toolkit for Supramolecular Assembly. STK is a tool for the automated assembly, molecular optimization and property calculation of supramolecular materials. It has a simple Python API and integration with third party computational codes.

The source code of the program can be found at https://github.com/lukasturcani/stk and the detailed documentation is here.

Additional linking functional groups can be defined as SMARTS and STK can be extended by adding additional optimisation force-fields.

molecular_cage


Comments

Top 12 unix commands for data scientists.

 

A really useful post on KDnuggets.

With the beautiful intuitive interface it is sometimes easy to forget that Mac OS X has unix underpinnings and that the Terminal gives access to whole set of invaluable tools.

This post is a short overview of a dozen Unix-like operating system command line tools which can be useful for data science tasks. The list does not include any general file management commands (pwd, ls, mkdir, rm, ...) or remote session management tools (rsh, ssh, ...), but is instead made up of utilities which would be useful from a data science perspective, generally those related to varying degrees of data inspection and processing. They are all included within a typical Unix-like operating system as well.

If you regularly have to deal with very large data files some of these commands will be invaluable, for example:

head outputs the first n lines of a file (10, by default) to standard output. The number of lines displayed can be set with the -n option.

head -n 5 my file.txt

Read more here.


Comments

Review of MOE 2018.01

 

The 2018.01 release of Chemical Computing Group's Molecular Operating Environment (MOE) software includes a number of new features, enhancements and changes. I written a review that highlights a number of the features.

2Doverlay

Read more here….


Comments

Roundtrip editing with ChemDraw 17.1

 

Whenever there is an update to ChemDraw I always hold my breath to see if round-trip editing (i.e. the ability to copy and paste from a chemical drawing package into Word for example and then be able to copy and paste the structure back from Word into the chemical drawing application) has been broken.

Fortunately this blog post provides an invaluable update to the current situation.

Comments

RDKit code changes

 

I just saw this on the RDKit email circulation list and since I know a number of readers use RDKit I thought I'd mention it.

When we do the beta for the 2018.03.1 release we're going to switch the C++ backend to use modern C++ (=C++11). For people who can't switch to use that code, we will continue to provide bug fixes for the 2017.09 release for at least another 6 months.

This should only affect people who need to build the RDKit C++ code themselves. If you use a binary version of the RDKit like the ones available inside of Anaconda Python or KNIME, this change should have no impact upon you.

It looks like we're almost there. Hopefully we will be able to do a beta of the 2018.03 release by the end of the week.


Comments

Updated Literature search script

 

I've updated the Vortex script to run text based queries of PubMed.

If you regularly use the E-utilities API you might want to read this.

After May 1, 2018, NCBI will limit your access to the E-utilities unless you have one of these keys. Obtaining an API key is quick, and simple, and will allow you to access NCBI data faster. If you don’t have an API key, E-utilities will still work, but you may be limited to fewer requests than allowed with an API key.

After May 1, 2018, any computer (IP address) that submits more than 3 E-utility requests per second will receive an error message. This limit applies to any combination of requests to EInfo, ESearch, ESummary, EFetch, ELink, EPost, ESpell, and EGquery.

If you write software of scripts that access the E-utilities API then the users will need to get their own api key. Calls will have this format

https://www.ncbi.nlm.nih.gov/entrez/eutils/einfo.fcgi?db=pubmed&api_key=ABCD123

I've updated this script to reflect this change, and I've highlighted where you need to add your api key in the script. I've also tried to ensure that any query string should be encoded to make it URL safe and I've extended the search range up to 2018.

AIsearchresults


Comments

iRASPA: GPU-accelerated visualization software for materials scientists

 

A recent publication DOI describes a new application for materials science.

A new macOS software package, iRASPA, for visualisation and editing of materials is presented. iRASPA is a document-based app that manages multiple documents with each document containing a unique set of data that is stored in a file located either in the application sandbox or in iCloud drive. The latter allows collaboration on a shared document (on High Sierra). A document contains a gallery of projects that show off the main features, a CloudKit-based access to the CoRE MOF database (approximately 8000 structures), and local projects of the user. Each project contains a scene of one or more structures that can initially be read from CIF, PDB or XYZ-files, or made from scratch. Main features of iRASPA are: structure creation and editing, pictures and movies, ambient occlusion and high-dynamic range rendering, collage of structures, (transparent) adsorption surfaces, cell replicas and supercells, symmetry operations like space group and primitive cell detection, screening of structures using user-defined predicates, and GPU-computation of helium void fraction and surface areas in a matter of seconds. Leveraging the latest graphics technologies like Metal, iRASPA can render hundreds of thousands of atoms (including ambient occlusion) with stunning performance.

AppSnapshot

iRASPA is available from Mac app store.


Comments

SeeSAR updated

 

A new version of SeeSAR is available (7.3), this update includes.

  • Easy mode switching: from the molecules table to the editor or the inspirator and back in just one click...
  • Automated workflows: in the settings you can now decide about which calculations should happen automatically
  • Menus re-organized: buttons are grouped for better overview and almost all table entries obtained a convenient context menu, simply right-click to give it a try
  • Excel export: this is one of the rather hidden Easter Eggs. Besides SDF you may save tables now as XLSX (including the 2D depiction)
  • Saved settings: user settings (the layout, background color, etc.) are now saved separately from project settings (filters and visualization features)

Full release notes are available.


Comments

RDkit in Samson

 

I've posted about Samson a couple of times and it just keeps getting better and better.

SAMSON is a novel software platform for computational nanoscience. Rapidly build models of nanotubes, proteins, and complex nanosystems. Run interactive simulations to simulate chemical reactions, bend graphene sheets, (un)fold proteins. SAMSON's generic architecture makes it suitable for material science, life science, physics, electronics, chemistry, and even education. SAMSON is developed by the NANO-D group at INRIA, and means "Software for Adaptive Modeling and Simulation Of Nanosystems.

A recent blog post highlights the use of RDKit in Samson.

In this post I will present you the RDKit-SMILES Manager module that I integrated in the SAMSON platform. As some of you know, RDKit is an open source toolkit for cheminformatics which is widely used in the bioinformatics research. One of its features is the conversion of molecules from their SMILES code to a 2D and 3D structures. Thanks to the new SAMSON Element, it is now possible to use these features in the SAMSON platform. SMILES code files (.smi) or text files (.txt) containing several SMILES codes can be read using the import button.

The new module allows you to import a file containing SMILES strings, generate 2D depictions, and by right-clicking on these images, you can open, generate the 3D structure in SAMSON or save the image as png or svg.

GenAll

It is also possible to run substructure searching using SMARTS.


Comments

Rodeo: A Python IDE for Data Scientists

 

Just added Rodeo a python IDE built for analysing data to the page of data analysis tools.

rodeo-overview-shot


Comments

Introducing IBM Watson Services for Core ML

 

This should be an interesting development for those developing scientific apps for iOS, the ability to access IBM Watson capabilities.

With Watson Services for Core ML, it’s easy to build apps that access powerful Watson capabilities right from iPhone and iPad, so you can provide dynamic, intelligent insights that improve over time. And with the IBM Cloud Developer Console for Apple, you can quickly tap into Watson Services for Core ML and other services on IBM Cloud

To get you started there is a project on GitHub https://github.com/watson-developer-cloud/visual-recognition-coreml.

Classify images with Watson Visual Recognition and Core ML. The images are classified offline using a deep neural network that is trained by Visual Recognition.

There is a database of Mobile apps for science.


Comments

Chemistry WebVR:- This is so cool

 

Jonas Bostrom who spoke at the Chemistry on Mobile Devices Meeting just sent me a link to EduChem VR - WebVR highlighting the use of virtual reality in chemistry.

"Chemistry WebVR" is web-based platform to learn about organic chemistry. You can experience important concepts like stereochemistry, molecular geometries, atom orbitals or reactions mechanisms in a virtual reality. It is userfriendly and works direct in your smartphone browser. The target is University courses and advanced high-school levels.

There is a demo of a SN2 reaction here and if you explore you will see a link to sign up as a beta tester.


Comments

mmpdb: An Open Source Matched Molecular Pair Platform for Large Multi-Property Datasets

 

An interesting paper on chemrxiv DOI

Matched Molecular Pair Analysis (MMPA) enables the automated and systematic compilation of medicinal chemistry rules from compound/property datasets. Here we present mmpdb, an open source Matched Molecular Pair (MMP) platform to create, compile, store, retrieve, and use MMP rules. mmpdb is suitable for the large datasets typically found in pharmaceutical and agrochemical companies and provides new algorithms for fragment canonicalization and stereochemistry handling. The platform is written in Python and based on the RDKit toolkit. It is freely available from https://github.com/rdkit/mmpdb


Comments

NMR solvent peaks

 

I just noticed this mentioned on Twitter and so I've added it to the Mobile Science site.

NMR Solvent peaks is a conveniently-searchable version of the ungainly table of NMR data most organic chemists keep a copy of nearby. Instead of searching through the table for a peak near your unidentified peak, just enter your solvent and the peak's multiplicity and location and you'll have a short list of candidate impurities

There is also a web-based version and a twitter feed for submitting bugs and finding out about updates.

There are a number of other NMR apps available


Comments

WWDC 2018

 

The Apple Worldwide Developers Conference takes place in San Jose, CA, June 4–8. The opportunity to buy tickets to WWDC18 is offered by random selection. Registration is open until Thursday, March 22, 2018 at 10:00 a.m. PDT

To register, you must be a member of the Apple Developer Program or Apple Developer Enterprise Program as of March 13, 2018 at 10:00 a.m. PDT, and agree to the WWDC18 Registration and Attendance Policy. Your membership must be current, valid, and in good standing from this date until the end of WWDC18.


Comments

Flagging Potential Kinase Inhibitors

 

Most of kinase inhibitors bind in the region of the ATP binding site using the hydrogen bonding interactions of the hinge region shown in the schematic below. We can use the knowledge of these hinge binding motifs to flag potential kinase inhibitors.

schematicatpbinding

Read more ….


Comments

BBEdit 12.1.2 Released

 

BBEdit 12.1.2 is a minor update to my favourite text editor.

From the release notes.

There's a new item in the Application preferences, as part of the software update settings: "Early Access". You can use this to turn on (or off) notification of pre-release maintenance updates for the version of BBEdit that you're using. (Note that even if you turn on Early Access, you will not receive notice of pre-release versions of feature updates or major upgrades.)

A new setting in the "Editing" preferences allows you to control whether tick marks appear in the scroll bar for Live Search matches. Turning this off can be useful if you're working in very large files and have so many results that the application stalls while trying to update the marks.

There are also a number of bug fixes including.

Fixed bug in which the Markdown tokenizer was confused by empty URL references (e.g. ) in such a way that editing in certain subsequent parts of the file would cause syntax coloring to get out of whack. This change also fixes a bug in the Markdown syntax coloring in which links with an empty description or URL were not properly recognized and colored.

BBEdit 12.1.2 requires Mac OS X 10.11.6 or later, and is compatible with macOS 10.13 "High Sierra"

I use BBEdit extensively for Markdown editing but there are a number of alternatives.


Comments

Top 20 programming languages

 

Red Monk have published their Programming Language Rankings. The data source used for these queries is the GitHub Archive.

  1. JavaScript
  2. Java
  3. Python
  4. PHP
  5. C#
  6. C++
  7. CSS
  8. Ruby
  9. C
  10. Swift
  11. Objective-C
  12. Shell
  13. R
  14. TypeScript
  15. Scala
  16. Go
  17. PowerShell
  18. Perl
  19. Haskell
  20. Lua

Swift (+1): Finally, the apprentice is now the master. Technically, this isn’t entirely accurate, as Swift merely tied the language it effectively replaced – Objective C – rather than passing it. Still, it’s difficult to view this run as anything but a changing of the guard. Apple’s support for Objective C and the consequent opportunities it created via the iOS platform have kept the language in a high profile role almost as long as we’ve been doing these rankings. Even as Swift grew at an incredible rate, Objective C’s history kept it out in front of its replacement. Eventually, however, the trajectories had to intersect, and this quarter’s run is the first occasion in which this has happened. In a world in which it’s incredibly difficult to break into the Top 25 of language rankings, let alone the Top 10, Swift managed the chore in less than four years. It remains a growth phenomenon, even if its ability to penetrate the server side has not met expectations.


Comments

Three-Dimensional Printing of Ellipsoidal Structures Using Mercury

 

A recent paper on ChemRxiv

A description of how to use the Mercury software from the CCDC to print 3-dimensional crystal structures that depict the anisotropic displacement parameters, matching the commonly used ellipsoidal depiction used in scientific papers. Details on how to convert a cif file into a 3D printing data file is included in the main paper, and details on the preparation of that data file for printing on a number of different 3D printers is included in the ESI.

DOI

There is more on 3D printing here .




Comments

Vortex update

Dotmatics have announced the impending release of the latest update to Vortex

The focus appears to be on the enhancement of the Vortex bioinformatics tools reviewed previously.






Comments

Script Debugger 7 released

 

A new version of Script Debugger has been released.

Script Debugger is an integrated development environment focused entirely on AppleScript. This focus allows it to deliver a suite of tools that make AppleScript development amazingly productive. You can use it to write and edit code, analyze target applications, debug scripts, and more.

SDFeatureSteppingII


Comments

Second Major DeepChem Release

 

A major update the DeepChem has been announced.

This major version release finishes consolidating the DeepChem codebase around our TensorGraph API for constructing complex models in DeepChem. We've made a variety of improvements to TensorGraph's saving/loading features and added a number of new tutorials improving our documentation of TensorGraph. We've also removed a number of older deprecated submodules and models in favor of the new, standardized TensorGraph implementations.

In addition, we've implemented a number of new deep models and algorithms, including DRAGONNs, Molecular Autoencoders, MIX+GANs, continuous space A3C, MCTS for RL, Mol2Vec and more. We've also continued improving our core graph convolutional implementations.

Also remember the RSC-BMCS / RSC-CICAG Artificial Intelligence in Chemistry Meeting registration is now open.


Comments

SAMSON 0.7.0 is available

 

SAMSON has been updated with a number of cool features, I particularly like the embedded Jupyter console.

SAMSON is a platform for computational nanoscience.

Python scripting is now available! Most of the SAMSON API is exposed in Python, and a Jupyter console embedded in SAMSON allows you to create models and run simulations, generate movies, perform analysis and reporting, etc., directly from scripts.

Python

What’s more, Python makes it even easier to integrate and pipeline SAMSON and SAMSON Elements with well-known packages from diverse fields, e.g. TensorFlow, PyRosetta, RDKit, ASE, etc., to name a few.

Comments

Data Aanlysis tools

 

I've just added the simple lightweight CSV editor Table Tool to the Data Analysis tools page.

The Data Analysis tools page contains a listing of over 100 applications, tools and libraries that can be used for data analysis under Mac OSX.


Comments

OMEGA v3.0.0 released

 

Conformational analysis is a critical component of molecular modelling and I've always viewed OMEGA from OpenEye as the standard to which all other software packages should be compared.

OMEGA's knowledge-based approach produces high-quality conformers, superior to those of many other methods. It has also been found to be the fastest of commercially available conformer generators. Benchmarking Conformer Ensemble Generators, Friedrich, N.-O. de Bruyn Kops, C. Fachsenberg, F. Sommer, K., Rarey, M. Kirchmair, J. J. Chem. Inf. Model. 2017, 57, 2719-2728. DOI.

OMEGA’s capability has been expanded for molecules containing large rings by adding a method specifically tuned to sample macrocyclic conformational space. The approach is based on a rewritten version of the original OMEGA distance geometry algorithm.

OMEGA-release-image-1

In this update support for macOS El Capitan (10.11), macOS Sierra (10.12), and macOS High Sierra (10.13) has been added.


Comments

Microsoft Quantum Development Kit Samples and Libraries under MacOSX

 

Well this is well out of my comfort zone but I thought I'd mention it.

Welcome to the Microsoft Quantum Development Kit! This repository contains the libraries and samples provided with the Quantum Development Kit https://github.com/microsoft/quantum.

The Microsoft Quantum Development Kit has been tested under MacOSX, Ubuntu Linux, but may work on other distributions. The Python interoperability feature has been developed for the Anaconda distribution of Python 3.6. Please see the README file provided with the Python sample for more details

Thank you for your interest in Microsoft Quantum Development Kit preview. The development kit contains the tools you'll need to build your own quantum computing programs and experiments.

So off you go…..


Comments

Google Summer of Code:- Open Chemistry

 

There are a number of interesting projects being undertaken in this years Google Summer of Code.

If you know of any students that might be interested then perhaps point them to the Open Chemistry Project.

The Open Chemistry project is a collection of open source, cross platform libraries and applications for the exploration, analysis and generation of chemical data. The organization is an umbrella of leading projects developed by long-time collaborators and innovators in open chemistry such as the Avogadro, Open Babel, and cclib projects. These three alone have been downloaded over 700,000 times and cited in over 2,000 academic papers. Our goal is to improve the state of the art, and facilitate the open exchange of chemical data and ideas while utilizing the best technologies from quantum chemistry codes, molecular dynamics, informatics, analytics, and visualization.

There is a list of the GSoC Ideas 2018 here but of course students can add their own.


Comments

MOE update 2018.01 released

 

The latest update to Chemical Computing Group's Molecular Operating Environment (MOE) software includes a variety of new features, enhancements

Windows XP (finally!) and macOS 10.6 have been removed from the list of officially supported platforms. Supported Windows platforms are Vista/7/8/10, and the minimum supported macOS is 10.7 (Lion).

Amber14:EHT Forcefield. The Amber14 parameter set is now supported in MOE. The new parameters consist of improvements to nucleic acids; otherwise, protein and small molecule parameters (and charges) are unchanged. The forcefield can be selected in the MOE | Footer.

TCR-MHC Protein Complex Database. A new MOE Project database containing T-Cell Receptor (TCR) – Major Histocompatibility Complex (MHC) x-ray structures has been added to MOE. The database can be accessed with MOE | Protein | Search | TCR-MHC | TCR-MHC which will launch the MOE Project Search panel.

Several applications have been parallelized to run in the moe -mpu environment:

  • Descriptor calculations with the SVL function QuaSAR_DescriptorMDB.
  • Energy minimization in the Database Viewer DBV | Compute | Molecule | Energy Minimize.
  • Conformational search using MDB input files in MOE | Compute | Conformations | Search.
  • Rotamer library generation with DBV | Compute | Build Rotamer Library.
  • Project database creation with the SVL run file dbupdate.svl and the scripts $MOE/bin/projupdate and $MOE/bin/projupdate.bat.

I plan to review the latest version of MOE in the near future.


Comments

CDD Vault is Now an ELN

 

CDD Vault ELN is an extension to CDD Vault for archiving and selectively sharing experimental text, data. CDD Vault ELN helps you capture and collaborate around unstructured information (conversations, notes, documents, images, files) and structured data (experimental results, plots, SAR).

You can easily capture and link to a variety of objects in CDD Vault ELN including:

  • Images
  • File attachments
  • Links to CDD Vault & other resources
  • Tables
  • Structures

Comments

Awesome Python Chemistry

 

A curated list of awesome Python frameworks, libraries, software and resources related to Chemistry.

https://github.com/lmmentel/awesome-python-chemistry

A blog post giving more details http://lukaszmentel.com/blog/awesome-python-chemistry/index.html.


Comments

Vida updated

 

VIDA v4.4.0 has been released. This upgrade adds several new features and fixes many previous issues.

  • A new ribbon style that produces ribbons with a smoother appearance has been introduced into VIDA.

VIDA4.4.0-release-image-600px

  • Improvements to the Builder/Sketcher, including:
  • closing the Sketcher window prompts for Save, Save as New, Discard, or Cancel
  • closing the Builder closes the Sketcher window
  • an additional “Save As New” option in the toolbar and Builder context menu
  • hitting Return now finishes adding typed-in molecules from the Sketcher
  • Significant improvements to the Extension Manager. In addition, extensions can be centrally deactivated.

VIDA is built on top of the OpenEye Toolkits v2017.Oct libraries to ensure that it and ancillary programs take full advantage of the state-of-the-art improvements in all underlying programming libraries. Support for macOS El Capitan (10.11), macOS Sierra (10.12), and macOS High Sierra (10.13) has been added.


Comments

KNIME tutorial

 

Don't forget to sign up for your chance to hear a webinar by Greg Landrum, Knime's VP for Life Sciences, this Wednesday, He will be talking about processing malaria HTS results using Knime and will give a tutorial on workflows developed for ligand-based virtual screening, based on results of a phenotypic HTS against malaria.

Wed, Feb 21, 2018 3:00 PM - 4:00 PM GMT

Register Here.


Comments

The Royal Society of Chemistry Chemical Information and Computer Applications Group (CICAG) Winter Newsletter is now available Online

 

The Winter 2017-18 edition of the CICAG Newsletter has been published and can be downloaded from the Newsletters webpage.

Features in this edition which may be of interest include: * Details of CICAG's upcoming Artificial Intelligence in Chemistry meeting * 30th Anniversary celebration of the Catalyst Science Discovery Centre and a look at the scientific history and achievements of the area * Tony Kent Strix Award and Annual Lecture 2017 and eLucidate from UKeiG * Other CICAG planned and proposed meetings along with other upcoming conferences and events * Meeting reports * Book reviews * News from Infochem and CAS * A review of the latest chemical information news and developments

PhD Student and Post-Doc Conference Bursaries

Did you know that most CICAG sponsored meetings have a number of bursaries available for PhD and post-doctoral students? Normally up to a value of £250, these awards help to cover registration and travel costs. Preference will be given to members of the RSC (and meeting co-sponsors if applicable), especially those who are selected to give posters.

RSC-BMCS / RSC-CICAG Artificial Intelligence in Chemistry Friday, 15th June 2018 Royal Society of Chemistry at Burlington House, London, UK. Twitter hashtag - #RSC_AIChem


Comments

Google summer of code chemistry ideas

 

The Open Chemistry project have collected together project ideas for GSoC 2018. The projects cover a wide range of projects in chemistry

The full listing is available here and includes projects that make use of a number of open source toolkits such as Open Babel, RdKit and cclib.


Comments

Molecular Materials Informatics Apps

 

Molecular Materials Informatics, Inc have been busy recently with updates to many of their applications

The following mobile apps have all been updated

PolyPharma Poly-pharmacology of molecular structures: use structure activity relationships to view predicted activities against biological targets, physical properties, and off-targets to avoid. Calculations are done using Bayesian models and other kinds of calculations that are performed on the device.

Green Lab Notebook allows recording of multistep chemical reactions, using molecular structure, name and stoichiometry as the primary components. When quantities are provided, interconversions are calculated automatically, and green chemistry metrics are shown.

SAR Table app is designed for creating tables containing a series of related structures, their activity/property data, and associated text. Structures are represented by scaffolds and substituents, which are combined together to automatically generate a construct molecule. The table editor has many convenience features and data checking cues to make the data entry process as efficient as possible.

MolPrime is a chemical structure drawing tool based on the unique sketcher from the Mobile Molecular DataSheet (MMDS).

Approved Drugs app contains over a thousand chemical structures and names of small molecule drugs approved by the US Food & Drug Administration (FDA). Structures and names can be browsed in a list, searched by name, filtered by structural features, and ranked by similarity to a user-drawn structure. The detail view allows viewing of a 3D conformation as well as tautomers. Structures can be exported in a variety of ways, e.g. email, twitter, clipboard.

Green Solvents reference card for chemical solvents, with data regarding their "greenness": safety, health and environmental effects.

For the desktop the OS X Molecular DataSheet (XMDS) is an interactive cheminformatics tool for viewing and editing molecular structures, chemical reactions and data. It is designed to be instantly intuitive to anyone who has used a Mac, a spreadsheet and any chemical structure sketcher.

xmds


Comments

BBEdit 12 is now 64bit

 

To call BBEdit a text editor is a great injustice, it is the Swiss army knife of text editors and I use it constantly.

The latest update has a major change, BBEdit is now 64-bit this comes with several advantages as the release notes describe

BBEdit is now built as a 64-bit application. This works around various reported bugs in the OS and has other beneficial side effects: the application starts more quickly on a "cold" launch; 64-bit color pickers and contextual-menu plug-ins are now available; and our customers are even more handsome and athletic than before.

Beginning with this version, you can open documents that are much larger than was previously possible. In the Before Time, documents whose in-memory size (about twice the on-disk size) exceeded roughly 1.5GB would fail to open and report an out-of-memory error, as would documents whose internal structure required generation of large quantities of syntax coloring and/or code folding information (such as complicated XML documents). Beginning with this version, you can perform many large-scale operations on very large files without running out of memory or needing to clear Undo state. Support for the Touch Bar has been added to various windows (applicable only to computers that have a Touch Bar, of course):

There are many more updates and fixes described in detail in the release notes.

BBEdit 12 requires macOS 10.11.6 ("El Capitan") or later, and is compatible with macOS 10.13 "High Sierra".

If you are using macOS 10.13 "High Sierra", please make sure that you have updated to the latest available OS version (10.13.3 or later).

Comments

MayaChem Tools

 

MayaChemTools is a fabulous collection of Perl and Python scripts, modules, and classes to support a variety of day-to-day computational discovery needs.

The core set of command line Perl scripts available in the current release of MayaChemTools has no external dependencies and provide functionality for the following tasks:

  • Manipulation and analysis of data in SD, CSV/TSV, sequence/alignments, and PDB files
  • Listing information about data in SD, CSV/TSV, Sequence/Alignments, PDB, and fingerprints files
  • Calculation of a key set of physicochemical properties, such as molecular weight, hydrogen bond donors and acceptors, logP, and topological polar surface area
  • Generation of 2D fingerprints corresponding to atom neighborhoods, atom types, E-state indices, extended connectivity, MACCS keys, path lengths, topological atom pairs, topological atom triplets, topological atom torsions, topological pharmacophore atom pairs, and topological pharmacophore atom triplets
  • Generation of 2D fingerprints with atom types corresponding to atomic invariants, DREIDING, E-state, functional class, MMFF94, SLogP, SYBYL, TPSA and UFF
  • Similarity searching and calculation of similarity matrices using available 2D fingerprints
  • Listing properties of elements in the periodic table, amino acids, and nucleic acids
  • Exporting data from relational database tables into text files

The command line Python scripts based on RDKit provide functionality for the following tasks:

  • Calculation of molecular descriptors
  • Comparison 3D molecules based on RMSD and shape
  • Conversion between different molecular file formats
  • Enumeration of compound libraries and stereoisomers
  • Filtering molecules using SMARTS, PAINS, and names of functional groups
  • Generation of graph and atomic molecular frameworks
  • Generation of images for molecules
  • Performing structure minimization and conformation generation based on distance geometry and forcefields
  • Picking and clustering molecules based on 2D fingerprints and various clustering methodologies
  • Removal of duplicate molecules

These invaluable scripts can be used in other applications, I've written a Vortex Script that uses them.


Comments

Artificial Intelligence in Chemistry

 

I mentioned the first announcement of a meeting to be held next year.

RSC-BMCS / RSC-CICAG Artificial Intelligence in Chemistry Friday, 15th June 2018 Royal Society of Chemistry at Burlington House, London, UK.
Twitter hashtag - #RSC_AIChem

AI-web-image-1

A number of the speakers have now been confirmed.

Confirmed Speakers

Keynote: What I learned about machine learning - revisited Bob Sheridan, Merck

Presentation title to be confirmed Nadine Schneider, Novartis

Scaling de novo design, from single target to disease portfolio Wilhem van Hoorn, Exscientia

Presentation title to be confirmed Marwin Segler, Benevolent AI

Molecular de novo design through deep learning Ola Engkvist, AstraZeneca

I also notice that there are a number of EPSRC funding opportunities

Artificial Intelligence - UKRI CDTs EPSRC is expected to support 10-20 doctoral training positions.

The call is now open for around 15 Centres for Doctoral Training (CDTs) focused on areas relevant to Artificial Intelligence (AI) across UKRI's remit. This call opens against the background of Professor Dame Wendy Hall and Jérôme Pesenti's review, Growing the artificial intelligence industry in the UK, and the Government's Industrial Strategy White Paper, Building a Britain fit for the Future. This investment in AI skills will be kick-started by support for over 100 studentships that will be funded during 2018/19 via the Research Councils current mechanisms and schemes.

Universities are invited to apply against two priority areas:

Enabling Intelligence, a priority area within Engineering and Physical Sciences Research Council's (EPSRC) main CDT call
Applications and Implications of Artificial Intelligence (AIAI), a new priority area relevant to all Research Councils.

More info..



Comments

Screenlamp:- A toolkit for ligand-based virtual screening

 

A recent publication "Enabling the hypothesis-driven prioritization of ligand candidates in big databases: Screenlamp and its application to GPCR inhibitor discovery for invasive species control" {DOI](http://dx.doi.org/10.1007/s10822-018-0100-7) describes a very interesting software tool for virtual screening.

While the advantage of screening vast databases of molecules to cover greater molecular diversity is often mentioned, in reality, only a few studies have been published demonstrating inhibitor discovery by screening more than a million compounds for features that mimic a known three-dimensional (3D) ligand. Two factors contribute: the general difficulty of discovering potent inhibitors, and the lack of free, user-friendly software to incorporate project-specific knowledge and user hypotheses into 3D ligand-based screening. The Screenlamp modular toolkit presented here was developed with these needs in mind.

The Screenlamp homepage gives more details and installation instructions. Screenlamp is written in Python (3.6) and can be downloaded from GitHub https://github.com/psa-lab/screenlamp.

Certain submodules within screenlamp require external software to sample low-energy conformations of molecules and to generate pair-wise overlays. The tools that are currently being used in the pre-built, automated screening pipeline are OpenEye OMEGA and OpenEye ROCS to accomplish those tasks. However, screenlamp does not strictly require OMEGA and ROCS, and you are free to use any open source alternative that provided that the output files are compatible with screenlamp tools, which uses the MOL2 file format.

Screenlamp is research software and has been made available to other researchers under a permissive Apache v2 open source license.


Comments

Wolfram|Alpha Updated

 

Wolfram|Alpha has been updated.

Wolfram|Alpha. Building on 25 years of development led by Stephen Wolfram, Wolfram|Alpha has rapidly become the world's definitive source for instant expert knowledge and computation.

There are more apps on the MobileScience website.


Comments

Spark V10.5 released

 

Cresset have just announced the latest release of Spark a scaffold hopping and bioisostere replacement tool.

Figure7_The-Spark-GUI_new-600x380

Highlights

  • New wizards to support ligand growing and linking, macrocyclization and water replacement experiments
  • Enhanced Spark database update functionality
  • New pharmacophore constraints
  • Enhancements in search algorithm and advanced options.

Comments

Findings2 released

 

A new version of the very popular electronic notebook Findings has been released. You can try it out for free with no time limit. It allows the creation of up to 20 entries. Purchase Findings Pro to allow the creation of unlimited entries.

screenshot-large-v2

Remember there is a mobile version of Findings for you iPhone or iPad.


Comments

SeeSAR version 7.2 released

 

SeeSAR has been updated.

Get fresh inspiration from this huge update of SeeSAR! We realized, on the one hand, that the functionality of the editor was growing and growing, making it more and more complicated to use. On the other hand, access to the full functionality of ReCore demands a different kind of user interface. So we "took the bull by the horns" and, akin to the editor, created the new Inspirator which you can use to do:

  • Core replacement This feature is the same but with a much improved UI. You are able to directly select and visualize the bonds that will be clipped to carve out a core fragment for replacement. The clipped bonds now remain in place (even while you define sphere constraints) up until you define a new query. Also the display of results is much enhanced, as you can see the new core fragments highlighted in 2D as well as in 3D. For reference, your query molecule stays visible as well.
  • Fragment linking and merging You may of course launch the Inspirator with more than just one molecule. In this case, you can define bonds to clip on different molecules, thereby requesting linker fragments that will connect the remaining pieces. Note that it is not mandatory to clip a terminal part of each molecule to create the query, you may replace a core part in one and connect it to another fragment at the same time.
  • Fragment growing This was possibly the most frequently requested functionality in ReCore: Cut just one bond and grow onto this bond using a fragment library of typical side chains. In this way, you can, for example, reach out to nearby subpockets. The new growing algorithm can very quickly scan through a (for now) ready-made library of typical fragments. You may of course define sphere constraints at the same time in order to target particular locations in the bi

You can download SeeSAR here and use it for free for 7 days.


Comments

Xplor-NIH for molecular structure determination from NMR

 

A discussion on the new developments of Xplor-NIH DOI. Xplor-NIH is a popular software package for biomolecular structure determination from nuclear magnetic resonance (NMR) and other data sources.

Most of Xplor-NIH's code is now being developed directly in the Python language, and thus is directly accessible for modification by the end-user without recompilation, while code paths which require high performance, such as those executed at every timestep of molecular dynamics, are coded in C++. The Python interface to Xplor-NIH provides an extensible toolbox for developing further functionality. Precompiled packages for most popular Unix and Unix-like operating systems (such as Linux and Mac OS X), as well as documentation and support are available directly from http://nmr.cit.nih.gov/xplor-nih/.


Comments

MedChemStructures Genius

 

The idea behind MedChem Structures Genius is that the chemical structure can be used as a visual and semantical mark to gain information on drug molecules (mode of action, side effects, bioavailability,…). This app, aimed at both students and professionnals, allows learning to recognize chemical drug structures and link them to their INN and their pharmacological class. The quiz allows self evaluation. Only small molecules and peptides and biochemical molecules are listed (no biologics, vaccines, …). The drug classification has been adapted from the ATC WHO classification.

392x696bb

There are many more science apps on the Mobile Science site.




Comments

GROMACS updated

 

The official release of GROMACS 2018 is now available.

GROMACS is one of the major software packages for the simulation of biological macromolecules.

Highlights from this update include:-

  • PME long-ranged interactions can now run on a single GPU, which means many fewer CPU cores are needed for good performance.
  • Optimized SIMD support for recent CPU architectures: AMD Zen, Intel Skylake-X and Skylake Xeon-SP.

  • The AWH (Accelerated Weight Histogram) method is now supported, which is an adaptive biasing method used for overcoming free energy barriers and calculating free energies (see http://dx.doi.org/10.1063/1.4890371).

  • A new dual-list dynamic-pruning algorithm for the short-ranged interactions, that uses an inner and outer list to permit a longer-lived outer list, while doing less work overall and making runs less sensitive to the choice of the “nslist” parameter.
  • A physical validation suite is added, which runs a series of short simulations, to verify the expected statistical properties, e.g. of energy distributions between the simulations, as a sensitive test that the code correctly samples the expected ensemble.
  • Conserved quantities are computed and reported for more integration schemes - now including all Berendsen and Parrinello-Rahman schemes.

Comments

Fortran on a Mac

 

I was sent a few updates over the Christmas break and so I've updated the Fortran on a Mac page.


Comments

SeeSAR for Parallelized Fragment Growing & Pocket Exploration

 

I see that SeeSAR now supports a parallelized 'real' fragment growing.

SeeSAR is a software tool for interactive, visual compound prioritisation as well as compound evolution. Structure-based design work ideally supports a multi-parameter optimization to maximise the likelihood of success, rather than affinity alone. Having the relevant parameters at hand in combination with real-time visual computer assistance in 3D is one of the strengths of SeeSAR. Stimulating exploration with SeeSAR, we have embarked on pursuing a new cheminformatics compute paradigm of "Propose & Validate".

d0029fff-17c1-46ea-8d52-6f925b77101d-medium

You can download SeeSAR here and use it for free for 7 days.

Comments

Behind the Scenes in Real-Life Software Design By Stephen_Wolfram · 48 videos

 

I just stumbled across a fascinating series of lectures. These are recordings of the live discussions behind the ongoing software development led by Stephen Wolfram.

Of particular interest might be the discussion on incorporating chemistry into the Wolfram language.

https://www.twitch.tv/videos/181269427?collection=F82InZg17BQFzw.


Comments

UCSF ChimeraX

 

A recent publication DOI describes an update to the popular molecule viewer UCSF Chimera

UCSF ChimeraX is next-generation software for the visualization and analysis of molecular structures, density maps, 3D microscopy, and associated data. It addresses challenges in the size, scope, and disparate types of data attendant with cutting-edge experimental methods, while providing advanced options for high-quality rendering (interactive ambient occlusion, reliable molecular surface calculations, etc.) and professional approaches to software design and distribution.

The application can be downloaded here http://www.rbvi.ucsf.edu/chimerax/download.html

It is important to note that ChimeraX is not backward compatible with Chimera and does not read Chimera session files. It has been tested on MacOS X 10.12. The ChimeraX user interface is implemented in Qt, offering a native-like look and feel on each platform. ChimeraX is largely implemented using Python, an interpreted programming language. To manipulate these very large datasets interactively, ChimeraX uses memory-efficient data structures combined with high-performance algorithms implemented in C++. MacroMolecular Crystallographic Interchange Format (mmCIF) is the preferred format for atomic data in ChimeraX, mmCIF replaces the aged and more limited PDB format and offers a number of advantages.

sym


Comments

Python support in Excel

 

The most popular suggestion on the "How can we improve Excel for Windows" forum is Python as an Excel scripting language with over 4500 votes and it has elicited a comment from the MSFT excel team.

Thanks for the continued passion around this topic. We’d like to gather more information to help us better understand the needs around Excel and Python integration.

Followed by a survey.

Of course one would hope that they also add it to the Mac version of Excel.

Comments

Suggestions for a Laser Pointer

 

I give a course that consists of a full day of lectures, in the past I've had to use a selection of laser pointers/batteries because they don't last.

So I'm looking for a laser pointer that will last for several hours, and be bright enough to show up on the large flat screens used in many lecture theatres these days.

Any suggestions welcome.


Comments

Predicting the Conformational Energy of Small Molecules

 

An interesting publication in JCIM, Atom Types Independent Molecular Mechanics Method for Predicting the Conformational Energy of Small Molecules, DOI.

We report herein our effort to incorporate lone pairs into our model to extend its applicability domain to any saturated small molecules. The developed model H-TEQ 2 has been validated on a wide variety of molecules from polyaromatic molecules to carbohydrates and molecules with high heteroatoms/carbon ratios.


Comments

Deep Learning Cheat Sheet (using Python Libraries)

 

Just came across this really invaluable resource.

  • Deep Learning Cheat Sheet (using Python Libraries)
  • PySpark Cheat Sheet: Spark in Python
  • Data Science in Python: Pandas Cheat Sheet
  • Cheat Sheet: Python Basics For Data Science
  • A Cheat Sheet on Probability
  • Cheat Sheet: Data Visualization with R
  • New Machine Learning Cheat Sheet by Emily Barry
  • Matplotlib Cheat Sheet
  • One-page R: a survival guide to data science with R
  • Cheat Sheet: Data Visualization in Python
  • Stata Cheat Sheet
  • Common Probability Distributions: The Data Scientist’s Crib Sheet
  • Data Science Cheat Sheet
  • 24 Data Science, R, Python, Excel, and Machine Learning Cheat Sheets
  • 14 Great Machine Learning, Data Science, R , DataViz Cheat Sheets



Comments

YANK

 

YANK is a GPU-accelerated Python framework for exploring algorithms for alchemical free energy calculations.

Features

  • Modular Python framework to facilitate development and testing of new algorithms
  • GPU-accelerated via the OpenMM toolkit
  • Alchemical free energy calculations in both explicit and implicit solvent
  • Hamiltonian exchange among alchemical intermediates with Gibbs sampling framework
  • General Markov chain Monte Carlo framework for exploring enhanced sampling methods
  • Built-in equilibration detection and convergence diagnostics
  • Support for AMBER prmtop/inpcrd files
  • Support for absolute binding free energy calculations
  • Support for transfer free energies (such as hydration or partition free energies)

Install using conda

$ conda config --add channels omnia --add channels conda-forge
$ conda install yank

conda will install dependencies from binary packages automatically, including difficult-to-install packages such as OpenMM, numpy, and scipy. YANK runs on Python 3.5, and Python 3.6


Comments

Mac in Chemistry Annual website review

 

At the end of each year I have a look at the website analytics to see which items were the most popular.

Over the year there were 60,000 unique visitors with 25% visiting the site on multiple occasions. The US provided 30% of the visitors and the UK 10% with Germany, Canada and Japan around 5%. As might be expected 60% of the visitors were using a Mac, but 25% of the visitors were Windows users and 10% iOS. Looking at the last month's Mac visitors, 53% were using Mac OS X 10.13, 25% 10.12 and 12% 10.11.

Safari and Chrome (each 41%) were the most used web browsers with the once dominant Internet Explorer down at 2%.

The most viewed blog pages in 2017 were

The most popular web pages were (other than the main page)

The continued popularity of the Fortran on a Mac web page is interesting, I'm not a big Fortran user but if anyone knows of items that could be added to the page I'd be delighted to hear about them. I've done a couple of updates to the Cheminformatics on a Mac page and I think I'll need to add a section on Bioconda in the future.

Interestingly the Scientific Applications under High Sierra page was of only transient popularity. It seem this update to Mac OSX was relatively benign with very few issues.

2017 also saw the 2000th download of iBabel, iBabel is a GUI (graphical user interface) for the open source cheminformatics toolkit OpenBabel. It also provides an interface to a variety of tools built using OpenBabel and a molecule viewer. I'm planning to do an update to iBabel to take advantage of some of the updates to OpenBabel but if you have any suggestions I'd happy to see if I can include them.

3dviewers

2017 also saw the migration of the website from http to https, a change that went pretty seamlessly with only a couple of minor glitches.

The Twitter feed is increasing in popularity with 390 followers. The most popular tweets were

Creating a Bioconda recipe
RSC meeting on AI in Chemistry

The RSS feed still has around 100 followers

Comments