Coder Social home page Coder Social logo

ju-leon / bachelor-thesis Goto Github PK

View Code? Open in Web Editor NEW
0.0 2.0 1.0 143.68 MB

Analysing the influence of the spacial structure of catalyst molecules on their activation barrier using neural networks.

License: GNU General Public License v3.0

Jupyter Notebook 99.85% Python 0.15% Shell 0.01%
machine-learning chemical-engineering material-science keras catalyst feature-engineering

bachelor-thesis's Introduction

Bachelor Thesis - March 2021 - Karlsruhe Institute of Technology

Rotationally invariant neural networks for homogenous catalysis

Leon Jungemeyer

While there are many standart chemical feature extractor, none of the state-of-the-art feature extractors today allow for a conversion back from feature space to chemical space. This makes interpretations of predictions impossible. This thesis aims at extracting features from catalyst molecules that allow for a conversion back from feature space to chemical space, allowing for an in depth analysis on what parts of the chemical structure is responsible for the prediction.

title-vid

Abstract

Metal Catalysts are crucial for a variety of applications, from water splitting to CO2 reduction. Catalysts derived from Vaska’s complex can be constructed by combining different ligands around an iridium atom. Depending on the structure of the resulting complex, the activation barrier will vary.

Due to the large space of possible ligand combinations and the complexity of quantum chemistry methods, experimental or virtual exploration of activation energies is impractical. Machine learning can bridge the gap and allow for fast prediction of a catalysts activation barrier. The methods proposed here are capable of performing high accuracy predictions of the activation barrier based on the 3D shape of the molecule.

Two different ways of encoding a catalysts into a format understandable to a neural network are explored. The first is using elliptic fourier descriptors to generate fully rotationally invariant features. The second is using a combination of spherical harmonics and radial basis functions to generate a representation of the molecule. Both feature generators proposed allow for a semi bi-directional mapping between feature space and chemical space.

A neural network is then trained to predict the activation barrier from these features. The best neural network achieves an accuracy of 0.53kcal/mol and r^2 = 0.96 on a train fraction of 80%.

In a last step, the neural network is used to analyze which parts of the catalyst are important to the prediction of the activation barrier. With this information an intuition on how the chemical structure has to be changed in order to lower the activation barrier can be learned.

Overview

The first step for any machine learning task is generating features. The features generated here are using a combination of spherical harmonis and radial basis functions to create a basis that can describe the density space surrounding a central atom.

The features are genrated for each species in the dataset. Here the features for phosphorus are visulaized:

Visulaization of generted feautres

The features are then used in a neural network. The network predicts the activation barrier from the generated features. The accuracies reached are the highes accuracies on this dataset to date.

mae-compare

In a last step, the netowrk is analysed. This allows to learn about the influence of certain parts of the molecule on the prediction. In the following graph, the influence of nitrogen on the prediction is visualised. The negative density indicated removing nitrogen in this are will decrease the prediction of the model. In many cases, this intuition correlates with the ground truth.

elem2-N

Documentation

The full Bachelor's thesis can be found under releases.

bachelor-thesis's People

Contributors

gabegomes avatar ju-leon avatar pascalfriederich avatar

Watchers

 avatar  avatar

Forkers

xilingyi

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.