Sunday, September 30, 2018

Google vs. Google

Has this ever happened to you? (It has to me.) I search Google® Scholar for, let’s say, hydrodesulfurization. Then I decide to narrow the search to hydrodesulfurization AND catalysis.

On my screen are TWO Google® search boxes. Which one do I choose?

If I am not careful, I choose the box nearer the top of the screen. I am surprised to find loads of stuff that I do not want. Only then do I realize that I have been diverted from Google® Scholar to Google® regular.

TIP: What happens in Google® Scholar does not stay in Google® Scholar … unless you consciously choose the Google® Scholar search box.

Jean Steinhardt served as Librarian, Aramco Services, Engineering Division, for 13 years. He now heads Jean Steinhardt Consulting LLC, producing the same high quality research that he performed for Aramco.

Follow Jean’s blog at: http://desulf.blogspot.com/  for continuing tips on effective online research
Email Jean at research@jeansteinhardtconsulting.com  with questions on research, training, or anything else
Visit Jean’s Web site at http://www.jeansteinhardtconsulting.com/  to see examples of the services we can provide


Sunday, September 23, 2018

Introducing! Operando computational modeling in heterogeneous catalysis

I found the following article as the result of a Google Scholar email alert. Besides being a good review article, it has the additional benefit of being absolutely free. The link to the source appears at the end of the post.

TIP: Study the article’s introduction. It is better than the abstract at explaining the point of the review. Keep it in mind when you write your next article.

///////
Towards operando computational modeling in heterogeneous catalysis

Lukáš Grajciar ORCID (a), Christopher J. Heard ORCID (a), Anton A. Bondarenko (b), Mikhail V. Polynski ORCID (b), Jittima Meeprasert ORCID (c), Evgeny A. Pidko ORCID *(b)(c) and  Petr Nachtigall ORCID *(a)
a Department of Physical and Macromolecular Chemistry, Faculty of Science, Charles University in Prague, 128 43 Prague 2, Czech Republic. E-mail: lukas.grajciar@natur.cuni.cz ; petr.nachtigall@natur.cuni.cz ; heardc@natur.cuni.cz
b TheoMAT group, ITMO University, Lomonosova 9, St. Petersburg, 191002, Russia
c Inorganic Systems Engineering group, Department of Chemical Engineering, Faculty of Applied Sciences, Delft University of Technology, Van der Maasweg 9, 2629 HZ Delft, The Netherlands. E-mail: e.a.pidko@tudelft.nl
First published on 11th September 2018
DOI: 10.1039/C8CS00398J (Review Article) Chem. Soc. Rev., 2018, Advance Article

Abstract
An increased synergy between experimental and theoretical investigations in heterogeneous catalysis has become apparent during the last decade. Experimental work has extended from ultra-high vacuum and low temperature towards operando conditions. These developments have motivated the computational community to move from standard descriptive computational models, based on inspection of the potential energy surface at 0 K and low reactant concentrations (0 K/UHV model), to more realistic conditions. The transition from 0 K/UHV to operando models has been backed by significant developments in computer hardware and software over the past few decades. New methodological developments, designed to overcome part of the gap between 0 K/UHV and operando conditions, include (i) global optimization techniques, (ii) ab initio constrained thermodynamics, (iii) biased molecular dynamics, (iv) microkinetic models of reaction networks and (v) machine learning approaches. The importance of the transition is highlighted by discussing how the molecular level picture of catalytic sites and the associated reaction mechanisms changes when the chemical environment, pressure and temperature effects are correctly accounted for in molecular simulations. It is the purpose of this review to discuss each method on an equal footing, and to draw connections between methods, particularly where they may be applied in combination.

Lukáš Grajciar
Lukáš Grajciar received his MSc and PhD degrees in chemistry from the Charles University in Prague in 2009 and 2013, respectively, developing and applying dispersion-corrected DFT methods for adsorption in zeolites and metal–organic frameworks. At his postdoctoral position at Jena University in Germany, he became involved in development of high-performance algorithms for ab initio treatment of large molecules and periodic system within the TURBOMOLE program, including implementation of a new tool for global structure optimization of clusters in confinement. Currently, he is a researcher at the Charles University in Prague, investigating reactivity of zeolites using biased ab initio molecular dynamics.

1. Introduction
Most of the chemicals produced nowadays are obtained using processes based on catalysis. The on-going search for optimal process conditions and the most suitable catalyst is driven by various concerns, including (i) environmental impact, (ii) resource utilization, (iii) safety and (iv) overall process economy. While this has traditionally been the domain of experimental investigations, the input from computational investigations has been steadily increasing over the last 40 years. An increased synergy between theory and experiment has become apparent during the last decade, in particular, in the field of heterogeneous catalysis.
By definition a heterogeneous catalyst shifts the reference reaction onto a different free energy surface where the energy of critical transition states with respect to relevant intermediates becomes lower. Mechanisms of chemical reactions were traditionally explored within the concept of the potential energy surface (PES), considering simplified models of a catalytic system working under idealized conditions of, basically, infinite dilution. Such a heterogeneous catalysis model represents ultra-high vacuum conditions, for which calculations provide information at 0 K; we will refer to this model as the 0 K/UHV model. Strictly speaking, such a description corresponds to rather unrealistic reaction conditions and its validity decreases with increasing temperature and pressure. A great number of mechanisms have been proposed based on calculations with such a simplistic model and results were often at least in qualitative agreement with available experimental data. Computational results obtained with 0 K/UHV model correspond reasonably well with experimental data obtained for well-defined surfaces under UHV conditions. However, the overlap of such calculated data and catalytic experiments carried out under realistic conditions is rather small, and a good agreement between 0 K/UHV theory and catalytic experiments was often just fortuitous.

The success of the simple PES concept applied within the 0 K/UHV approximation can be expected only when the following assumptions hold: (i) the structure of the active site under realistic conditions is known (or correctly guessed), (ii) both the structure of the active site and the reaction mechanism do not depend on the surface coverage of individual reaction intermediates, (iii) the reaction mechanism found under nearly UHV conditions is not different from that at the realistic composition of the surrounding gas or liquid phase and (iv) temperature effects, including the transition from PES to free energy surface (FES), can be safely neglected. Unfortunately, all such assumptions are rarely satisfied at once. If the temperature is relatively low it follows that reactants, products and/or reaction intermediates are adsorbed on the surface; and in contrast, one can expect that the reaction proceeds on a clean catalyst surface only at elevated temperature.

A deeper atomistic insight into the reaction mechanisms, the catalyst structure/activity relationship and catalyst stability/transformation during the reaction greatly increases our chances to find the optimal catalyst for a particular process. The most detailed experimental evidence about the catalyst at the molecular level can be obtained by a combination of characterization techniques under UHV conditions. More and more information becomes available from experimental investigations gathered under the conditions of a model catalytic reaction – in situ conditions – and also under conditions where the applied catalytic process takes place – operando conditions. For details of experimental in situ and operando conditions see, e.g., ref. 1–3. A great development of in situ and in particular operando experimental techniques for studying catalytic reactions in the last 20 years has brought an increasing amount of information about the state of the catalysts under realistic conditions.4,5

Among the most important findings emerging from such studies is the evidence of the dynamic nature of the catalyst surface, whose structure constantly changes under the catalytic reaction conditions. For example, in oxidation catalysis by supported metal nanoparticles, in situ and operando techniques revealed the formation of ultra-thin oxide layers covering the metal nanoparticles in an oxidizing atmosphere, which provide the active sites for the target catalytic reactions. Obviously, such an active site model could not be proposed based on the UHV surface science experiments or computations carried out in the 0 K/UHV regime. A problem of how the structure of the catalyst depends on the realistic chemical environment and temperature that are relevant for a particular process is thus the key for a proper understanding of catalysis at the molecular level and for a design of improved catalysts.6–8

Similar to the shift of experimental investigations in catalysis from UHV to operando conditions, theoretical investigations in the field of catalysis are moving more and more from 0 K/UHV models to computational operando investigations. In analogy with the experimental operando conditions, a computational operando model is defined by the following conditions: the structure of the active catalyst surface and the reaction coordinates must reflect realistic conditions during the reaction and a complex reaction network must be established (see Fig. 1 and corresponding text for more details). However, a transition from the 0 K/UHV to operando model dramatically influences the complexity of the problem and increases computational demands. A number of methods have been developed in the past few decades that ease the 0 K/UHV ? operando transition and it is the goal of this review to discuss the current state of the computational investigations of catalysis, with the goal to enable the long-sought after paradigm of catalysis by design.

Fig. 1  Schematic of the various computational methods applied to heterogeneous catalysis, which lie between an idealised UHV model and a realistic, operando model. The traffic light key depicts the quality of each method with respect to catalyst model complexity (Cat), reaction coordinate accuracy (RCN) and reaction network complexity (RxN). Adapted with permission from Piccini et al., Journal of Physical Chemistry C, 2015, 119, 6128–6137, Copyright 2015, American Chemical Society, Vilhelmsen et al., Journal of Chemical Physics, 2014, 141, 044711, Copyright 2014, American Institute of Physics, Chen et al., Journal of Catalysis, 2018, 358, 179–186, Copyright 2018, Elsevier, Pavan et al., Journal of Chemical Physics, 2015, 143, 184304, Copyright 2015, American Institute of Physics, Heard et al., ACS Catalysis, 2016, 6, 3277–3286, Copyright 2016, American Chemical Society. 

A huge gap between the 0 K/UHV models on one side and operando models on the other side cannot be overcome by a single computational method that would explicitly account for the whole complexity of the underlying phenomena. A multiscale modeling approach can be followed to construct a composite methodology that includes all the crucial physical phenomena. In our opinion, the following five methods appear to be the most important for bridging this gap: (i) global optimization techniques, (ii) ab initio constrained thermodynamics, (iii) biased MD simulations, (iv) microkinetic models of reaction networks. The fifth class of methods is a conceptually different approach that does not necessarily imply the explicit account of the complex physics of a catalyst system and yet holds great promise as a tool to enable catalysis by design. This class is the broad family of machine learning methods. The latest development of each of these five techniques is addressed individually in the following five sections of this review.

A transition from the 0 K/UHV to operando model is schematically depicted in Fig. 1. The 0 K/UHV model corresponds to the situation at the lower left corner, corresponding to vanishing partial pressures of reaction components (expressed in terms of chemical potentials) and low temperature. The operando model corresponds to the upper right corner. Going from bottom to top of the figure the reaction environment (in terms of chemical potentials and temperature) becomes more realistic. Any model improvement results in the increased complexity of the problem (from left to right), mostly in the number of configurations that are considered. Basics of the 0 K/UHV model include the following approximations: (i) idealized catalyst surface (denoted as Cat in Fig. 1), (ii) idealized reaction coordinates with minimum number of reactants on the PES at 0 K (reaction coordinate environment – RCE) and (iii) elementary reaction steps are considered (reaction network – RxN). All these approximations must be lifted to move forward to an operando model.

Methods presented in Fig. 1 from left to right start with Hessian-based thermal corrections, followed by a global optimization approach, ab initio constrained thermodynamics and biased MD; microkinetic modeling and machine learning techniques are taken off this order since they can be used at any level of the 0 K/UHV ? operando transition. The order presented in Fig. 1 is motivated by the fact that if all extensions are applied for a particular system, they would be applied in the order presented in the figure, with the exception of Hessian-based thermal corrections. Hessian-based thermal corrections allow a proper transition from potential- to free-energy surfaces while the complexity of the system remains unchanged; they can be used either to improve the 0 K/UHV model or in combination with global optimization or ab initio constrained dynamics (improving the quality of partition functions). It is important to note that it is common to apply just one or two extensions (or even three in some cases) and by no means does it have to be the first methods from left to right. For example, it is rather common to combine Hessian-based thermal corrections directly with microkinetics. It depends on the particular problem under investigation as to which of the extensions is crucial. Global optimization techniques mostly help in finding relevant configurations when these are difficult or impossible to obtain from relevant experimental data. Ab initio thermodynamics is critically important for the investigation of catalyst surfaces that are changed in the reaction environment. Biased molecular dynamics (MD) techniques become essential for the localization of transition states in complex environments when these are strongly affected by the surrounding molecules. Microkinetic modeling of the reaction network is essential for situations in which a large number of reaction intermediates exist. Last but not least, machine learning techniques are emerging as a useful tool in rationalization of the system descriptors and finding important correlations in large data sets.

Each of the methods presented in Fig. 1 is designed to overcome part of the gap between 0 K/UHV and operando conditions. Each method is discussed in the following sections and each of the methods has been reviewed separately in recent years in a comprehensive way. It is the purpose of this review to discuss them on an equal footing with respect to the gap between 0 K/UHV and operando. It should be stressed that the simultaneous application of all these extensions is computationally prohibitive in a general sense. But it should be noted that it is often not necessary to apply all these model extensions for a particular catalytic system; instead it is important to identify which of the extensions is critical for the problem investigated.

Free full text source: https://pubs.rsc.org/en/content/articlehtml/2018/cs/c8cs00398j
///////

Monday, September 10, 2018

Relationships + Synergy … Aramco + KAUST

Thanks to a Google® Scholar email alert (key word: Aramco), I found the following Open Access article.
The first thing that jumped out at me was the subject of the article … bioinformatics.
Authors of the article are affiliated with KAUST, King Abduliziz University, and Saudi Aramco.

KAUST and King Abduliziz University I understand. The two universities are engaged in a wide range of academic pursuits.  But why Saudi Aramco?

I don’t know, but for what it’s worth, I can speculate.

Saudi Aramco and the Saudi universities, including KAUST and King Abduliziz University, have deep and synergistic relationships with each other.
Aramco employs massive computing resources to characterize its reservoirs in order to enhance its ability to extract oil from those resources.
The big data acquired as a result of these characterizations must be analyzed. This analysis requires DeepLearning in order to make the data useful.
DeepLearning algorithms created in the bioinformatics field can be adapted for use in the reservoir characterization effort.

Accordingly, it makes sense for Aramco to make available to bioinformatics researchers their vast computing resources. The researchers will benefit from the use of Aramco’s resources, and Aramco will benefit from the results of their research.

So what do you think … does the Aramco/KAUST relationship + synergy theory seem plausible?

TIP #1: Relationship + synergy applies to everything, like ExxonMobil, Chevron, Shell, BP, and the various universities they are associated with.

TIP #2: The logic for the relationship may not be linear. It may be as indirect as the relationship described above.

Here are some details from the article.  The link at the end of this description will take you to the free full text of the article.

///////
Bioinformatics, 2018, 0–0 doi: 10.1093/bioinformatics/xxxxx Advance Access Publication Date: 01 September 2018 Original Paper Sequence analysis
DeepGSR: An optimized deep-learning structure for the recognition of genomic signals and regions
Manal Kalkatawi 1, 2
Arturo Magana-Mora 1, 3
Boris Jankovic 1
Vladimir B. Bajic 1
1 King Abdullah University of Science and Technology, Computational Bioscience Research Center, Thuwal 23955-6900, Saudi Arabia.
2 King Abdulaziz University, Faculty of Computing and Information Technology, Jeddah 21589, Saudi Arabia.
3 Saudi Aramco, EXPEC-ARC, Drilling Technology Team, Dhahran, 31311, Saudi Arabia.
Abstract
Motivation: Recognition of different genomic signals and regions (GSRs) in DNA is crucial for understanding genome organization, gene regulation, and gene function, which in turn generate better genome and gene annotations. Although many methods have been developed to recognize GSRs, their pure computational identification remains challenging. Moreover, various GSRs usually require a specialized set of features for developing robust recognition models. Recently, deep-learning (DL) methods have been shown to generate more accurate prediction models than ‘shallow’ methods without the need to develop specialized features for the problems in question. Here, we explore the potential use of DL for the recognition of GSRs. Results: We developed DeepGSR, an optimized DL architecture for the prediction of different types of GSRs. The performance of the DeepGSR structure is evaluated on the recognition of polyadenylation signals (PAS) and translation initiation sites (TIS) of different organisms: human, mouse, bovine, and fruit fly. The results show that DeepGSR outperformed the state-of-the-art methods, reducing the classification error rate of the PAS and TIS prediction in the human genome by up to 29% and 86%, respectively. Moreover, the cross-organisms and genome-wide analyses we performed, confirmed the robustness of DeepGSR and provided new insights into the conservation of examined GSRs across species. Availability and implementation: DeepGSR is implemented in Python using Keras API; it is available as open-source software and can be obtained at https://doi.org/10.5281/zenodo.1117159G.
Contact: vladimir.bajic@kaust.edu.sa  or manal.kalkatawi@kaust.edu.sa  
Supplementary information: Supplementary data are available at Bioinformatics online.
1 Introduction © The Author(s) 2018. Published by Oxford University Press. This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License

source: https://academic.oup.com/bioinformatics/advance-article/doi/10.1093/bioinformatics/bty752/5089227
Free Full text: https://watermark.silverchair.com/bty752.pdf?token=AQECAHi208BE49Ooan9kkhW_Ercy7Dm3ZL_9Cf3qfKAc485ysgAAAlcwggJTBgkqhkiG9w0BBwagggJEMIICQAIBADCCAjkGCSqGSIb3DQEHATAeBglghkgBZQMEAS4wEQQMzsxvWuEXhCUDWHXKAgEQgIICCkJuVRgDn_yeWX5NyUSMOHBHpFSgahMs2YN8jxYEd87_rSm5_UlMoMFxCjPMXynMYpSs4P10Ef4-gOmXldMvy0Dgm9bigMVMGHpgwFqar436r8RoDxZpov5fHYpNeNUcyFSQIlOlW-z4DMFhfXZWYSkdnulBRAIK1-U7vaNhsuuJB4takgTZvPvTrj1PIODBRZHeljlvQcVJ4lyD2bfiXLDv681QwxbJ029_xXDjQTwYaOoHO_TNFBdDx5z8-ZV_0EGemIaImcZ3JhbdDWoD7146HGZVOkLqjML_q7zy4i7E32Gh-0GsbdOO9yfWktXb9SVqydwWt4CIQis0YUPvcSPER50DYWgTw2h2ysutmkFL_guDPL_VQ-Ud7kJXXxcWBDIijUaDTLO_TD6wVXbgTXdIRKRmebEFLmZkF5BUqXkw00Xy-qKmUNPCsmEIsn6RBfRN6Or5L4-jHAi0saVoHICJYcL3b0Jo5NvOWzH9nXCTPB9QFKx4QxcSdn2mUQT0CudzznCy4kGS0K1y8YCmPZm1enbgybdP3jDZev6v3w8hv8htsupOACYwqHb8arhqGKxrT3VzkRnO_ng_R-bxPRXLUGYAUUpt8kSyZF2Fuz7BpiGySwmHWaLl9ni4R3jEopJRLq6lqUZZhLkKvdTS5sSRHFNKdbvXmrFzY6Y_T2tq1yMSKv7xWuiCNg
///////