Search CORE

1,721,151 research outputs found

Crystal structure of Bacillus subtilis S-adenosylmethionine : tRNA ribosyltransferase-isomerase

Author: Grimm Clemens
Ficner Ralf
Klebe Gerhard
Sgraja Tanja
Reuter Klaus
Haebel Peter
Publication venue
Publication date: 01/01/2006
Field of study

The enzyme S-adenosylmethionine:tRNA ribosyltransferase-isomerase (QueA) is involved in the biosynthesis of the hypermodified tRNA nucleoside queuosine. It is unprecedented in nature as it uses the cofactor S-adenosylmethionine as the donor of a ribosyl group. We have determined the crystal structure of Bacilhis subtilis QueA at a resolution of 2.9 angstrom. The structure reveals two domains representing a 6-stranded beta-barrel and an alpha beta alpha-sandwich, respectively. All amino acid residues invariant in the QueA enzymes of known sequence cluster at the interface of the two domains indicating the localization of the substrate binding region and active center. Comparison of the B. subtilis QueA structure with the structure of QueA from Thermotoga maritima suggests a high domain flexibility of this enzyme. (c) 2006 Elsevier Inc. All rights reserved

GRO.publications

GRO.publications (Univ. Göttingen)

Development of a database and knowledge-based predictive methods for investigating water molecules in protein structures and their role in protein-ligand binding

Author: Günther Judith (128660856)
Publication venue
Publication date: 10/08/2011
Field of study

Die vorliegende Arbeit befasst sich mit dem Aufbau der ersten Datenbank zur Charakterisierung von Wassermolekülen in Proteinstrukturen. Diese wurde als Modul der Rezeptor-Ligand-Datenbank Relibase+ konzipiert und erfasst alle Röntgenstrukturen der Proteindatenbank PDB. Diese Datenbasis wurde anschließend genutzt, um wissensbasierte Methoden zu entwickeln und zu validieren, welche die Vorhersage von Wasserbindestellen erlauben. Von besonderem Interesse sind dabei Wassermoleküle, die im Bindungsepitop zwischen Rezeptor und Ligand vergraben sind, da diese das vielen rationalen Wirkstoffdesignmethoden zugrundeliegende, einfache Schlüssel-Schloss-Prinzip durchkreuzen. Kapitel 2 beleuchtet in einem Literaturüberblick den heutigen Kenntnisstand zu Wassermolekülen in Proteinstrukturen vor dem Hintergrund der zugrundeliegenden experimentellen Methoden. Kapitel 3 beschreibt die Konzeption der Wasserdatenbank und Anwendungsbeispiele zu den darin implementierten Werkzeugen. Die Methoden zur vergleichenden Analyse von Solvatationsstrukturen erlauben es, sowohl die strukturelle Ähnlichkeit von Liganden wie auch die sequentielle Verwandtschaft von Proteinen als Referenz für eine räumliche Strukturüberlagerung heranzuziehen. Obwohl wiederkehrende Solvatationsmuster (Konservierung von Wassermolekülen) vor allem durch die von der Proteinoberfläche exponierten physikochemischen Eigenschaften bestimmt sind, sollte auch der Einfluß des Liganden nicht vernachlässigt werden. Darüber hinaus können auch relativ schwache Wechselwirkungen wie CH-Wasserstoffbrücken eine Rolle spielen. Des weiteren implementiert die Wasserdatenbank einen Algorithmus zur Erkennung fehlzugewiesener Wassermoleküle in Röntgenstrukturen, welcher literaturbekannte Methoden verbessert (siehe Kapitel 7). Durch eine kombinierte Beurteilung der Koordinationsgeometrie, des kristallographischen Temperaturfaktors sowie der aus den Bindungslängen zu umgebenden Atomen errechneten elektrostatischen Valenz kann abgeschätzt werden, ob sich hinter einem als Wassermolekül deklarierten Teilchen möglicherweise eher ein Natrium (oder ein Magnesium-) Ion verbirgt. Kapitel 4 untersucht Solvatationsstrukturen mit Mitteln der Statistik und weist auf, welchen Randbedingungen erfolgsversprechende prädiktive Ansätze im Drug Design genügen müssen. Da vergrabene Wasserpositionen bei der Ligandenbindung nur selten gegenüber der ligand-freien Struktur verschoben und sehr viel häufiger konserviert werden, markieren die Wasserbindestellen in der ligand-freien Struktur mögliche Wasserpositionen in einem Komplex. Der Konservierungsgrad beim Vergleich zweier sequenzidentischer Bindetaschen, die unterschiedliche Liganden binden, hängt jedoch signifikant von der Ähnlichkeit der beiden Ligandenstrukturen ab. Kapitel 5 beschreibt die Vorhersage konservierter Wasserpositionen mit Hilfe eines GA/knn-Algorithmus in unterschiedlichen Szenarien. Die entwickelten Deskriptoren erlauben die Diskriminierung von solvatisierten und nicht solvatisierten Positionen auf der Proteinoberfläche mit einer Vorhersagegenauigkeit von 82%, was alle literaturbekannten Methoden übertrifft. Eine deutliche Steigerung gegenüber bisherigen Verfahren konnte auch für die Klassifikation von konservierten und nicht konservierten Wasserpositionen beim Vergleich mehrerer sequenzidentischer Proteinstrukturen erreicht werden (Vorhersagegenauigkeit 78%). Hierzu war es essentiell, zum Aufbau der Wissensbasis möglichst viele Strukturvergleiche heranzuziehen und den nachweisbaren, individuellen Einfluss der kristallographischen Autoren zu berücksichtigen. Bei der Diskriminierung von konservierten und nicht konservierten Wasserpositionen in Ligandenbindetaschen wurde eine Vorhersagegenauigkeit von 73% erreicht, wobei vor allem der Einfluss des einzelnen gebundenen Liganden als limitierender Faktor erscheint. Der Ansatz, eine Auswahl von Wassermolekülen bei einem Docking-Experiment fest vorzuplazieren, kann (abhängig von den Eigenschaften der einzelnen Bindetasche) daher ausgesprochen ungeeignet sein. Kapitel 6 befasst sich daher mit der Weiterentwicklung des in FlexX implementierten Particle Concepts. Dieser Ansatz erlaubt eine flexible Plazierung von Wassermolekülen abhängig vom jeweiligen Liganden. Durch die Implementierung einer um Wassermoleküle erweiterten Version der Scoringfunktion DrugScore konnte die energetische Reihung der generierten Lösungsvorschläge deutlich verbessert werden. Die neue Scoring-Funktion ist nicht nur der ursprünglich in dem Particle Concept implementierten, empirischen Funktion von Boehm überlegen, sondern zeigt auch gegenüber der ursprünglichen DrugScore-Version eine um 15% verbesserte Erkennung nativähnlicher Bindungsmoden (RMSD<= 1.0A) auf dem ersten Rang.This study is dedicated to the development of the first database for characterizing water molecules in protein structures. It was designed as a module for the receptor-ligand database Relibase+ and comprises all the X-ray structures of the protein databank PDB. The data collection has subsequently been utilized for the development and validation of knowledge-based methods for the prediction of water sites. Of particular interest are water molecules buried in the protein-ligand interface, since these water molecules defy the simple lock and key principle that most rational drug design methods rely on. Chapter 2 reviews the knowledge on water molecules in protein structures gathered to date against the background of the underlying experimental methods. Chapter 3 describes the conception of the water database as well as some of the application examples for the tools implemented. The tools developed for comparative analysis of solvation patterns allow different references to be used. Both the structural similarity of ligands as well as the sequential relationships of proteins can serve as the reference for superimposition of the respective structures in three-dimensional space. Although recurrent solvation patterns (conservation of water molecules) are predominantly determined by physicochemical properties exposed on the protein surface, the influence of the ligand should not be underestimated. Moreover, apart from classical hydrogen bonds, weaker interactions such as CH-hydrogen bonds can play a relevant role. The water database also includes a tool for the detection of crystallographically misassigned water molecules, which enhances known methods (see chapter 7). The method estimates as to whether or not some particles assigned as water molecules might instead represent a sodium (or magnesium) ion. The algorithm combines a set of descriptors, which include the coordination geometry of the particle, its B-factor and its electrostatic valence, which is derived from the contact lengths to the atoms in the local neighbourhood. In Chapter 4, hydration structures in proteins are examined by means of statistical methods. This analysis reveals important conditions that predictive methods in rational drug design have to meet in order to appear promising. Water molecules buried in the protein-ligand interface are mostly conserved with respect to the ligand-free structure, while a significant shift of their positions upon ligand binding is only rarely observed. Thus, water sites in a ligand-free structure provide an indication as to where feasible water sites in a protein-ligand-complex can be found. However, the degree of conservation amongst water sites from two sequence-identical protein pockets depends significantly on the structural similarity of the two bound ligands. Chapter 5 deals with the prediction of conserved water molecules in different scenarios by means of a GA/knn algorithm. Using the descriptors developed with the water database, a prediction accuracy of 82% was achieved for the discrimination of crystallographically determined water sites from non-solvated positions on a protein surface. In this application scenario, the approach outperformed all previously reported methods. A similar improvement when compared with existing methods was also achieved for the classification of conserved versus non-conserved water molecules in a comparison of different structures of the same protein (prediction accuracy 78%). To this end, firstly, it was necessary to consider as many structural comparisons as possible for a reference protein when compiling the knowledge base and, secondly, to account for the proven bias introduced by the individual crystallographers who author the structures, respectively. For the discrimination of water molecules conserved upon ligand binding versus others that are non-conserved, a prediction accuracy of 73% was obtained. In this scenario, it is primarily the influence of the individual bound ligand that limits the performance of the algorithm. Depending on the respective protein binding pocket, the approach of pre-placing selected water molecules in fixed positions in the setup of, e.g., a virtual screening, can thus be inappropriate. Therefore, chapter 6 focuses on a methodical enhancement of the Particle Concept implemented in FlexX. This approach allows for flexible placement of water molecules during the build-up of the individual ligand in the binding pocket. Implementation of an enhanced version of the scoring function DrugScore, which takes water molecules into account, improved the energy ranking of the generated solutions significantly. The new scoring scheme not only outperforms the originally implemented empirical function by Boehm, it also yields a by 15% improved recognition of near native binding modes on top rank (RMSD<= 1.0A) when compared with the standard DrugScore version

open_UMR (Universität Marburg)

data_UMR - Forschungsdatenrepositorium (Philipps-Universität Marburg)

Erweiterung der Toolbox für die Computergestützte Analyse in der rationalen Wirkstoffentwicklung: Verwendung von Biomolekularer Solvatisierung zur Vorhersage thermodynamischer, kinetischer und struktureller Eigenschaften von Protein-Liganden-Komplexen

Author: Hüfner Tobias
Publication venue
Publication date: 25/05/2020
Field of study

Most biomolecular interactions occur in aqueous environment. Therefore, one must consider the interactions between proteins and water molecules when developing a drug molecule against a target protein. The study of these interactions is challenging using experimental techniques alone, therefore computer simulations are commonly used to study the molecular details of protein-water or ligand-water interactions. In the first study presented in this doctoral dissertation (Chapter 2), the development, parameterization and testing of an approach is presented that can be used to calculate the solvation contribution in protein-ligand binding thermodynamics. The approach uses an extensive amount of molecular dynamics trajectories in conjunction with GIST calculations in order to obtain models that can predict relative protein-ligand solvation thermodynamics. In order to validate the approach, the model system thrombin is investigated using a set of 53 ligands with experimentally characterized protein-ligand structures and ITC profiles. We found that the binding thermodynamics of 186 congeneric pairs of ligands can be accurately described using our solvation-based models. The relative free energy of binding for these 186 pairs can be calculated from the desolvation free energy of the ligand molecules alone. Furthermore, complete thermodynamic profiles for protein-ligand binding reactions (i.e. free energy, enthalpy and entropy of binding) are accurately predicted by incorporating GIST solvent data from the unbound ligand as well as the protein-ligand complex. In Chapter 3, the aforementioned approach is applied to develop a strategy that enables to equip drug molecules with a desired set of solvation thermodynamics properties. For this purpose, the thrombin ligands (same ligand series as in previous Chapter 2) and the corresponding GIST integrals are decomposed into smaller building block molecules. In the next step, the solvation thermodynamics for the building blocks in the ligand molecule as well as the solvation thermodynamics for the isolated building block in aqueous solution are calculated. We found greatly varying solvation thermodynamics for the different building blocks, demonstrating their potential to design ligands with a wide range of solvation characteristics. Also, we found that the building block decomposition of ligand molecules and the corresponding GIST integrals can be readily used to understand remote solvent structuring effects. These effects occur in the unbound ligand molecule and describe the enhanced solvent structuring on a building block in the ligand molecule due to the presence of another building block at a distal site of the ligand. Furthermore, we demonstrated that the fluorination of building blocks leads to an increased unfavorable desolvation free energy and thus disfavors binding for the presented dataset. The research presented in Chapter 2 and Chapter 3 was accomplished with the computer program Gips that was developed as part of this doctoral dissertation. In the following Chapter 4, the mechanism and time scale of desolvation is being analyzed for the protein-ligand dissociation reaction of trypsin and thrombin in complex with benzamidine and N amidinopiperidine. The analysis is carried out using umbrella sampling free energy calculations and LoCorA calculations. The LoCorA approach is a method for the analysis of residence times of water molecules on the surface of amino acids. It was found that water molecules reside approximately 1.3 ns in the binding pocket of thrombin, whereas in trypsin they are residing one order of magnitude shorter (0.3 ns). This difference is explained with special solvent channels that connect the interior of the binding pocket to bulk solvent environment. The solvent channels are present in thrombin but not in trypsin. Furthermore, the selectivity profiles of benzamidine and N amidinopiperidine are related to a solvent-mediated free energy barrier that is present in thrombin but not trypsin. Also due to the presence of the solvent channels, the water molecules show similar residence time for both complexes in the case of thrombin but differing residence times in the case of the two trypsin complexes. The LoCorA approach is implemented in the computer program LoCorA (same name as the approach itself), which was developed as part of this doctoral dissertation. In the course of this doctoral dissertation, further computational studies were carried out in combination with experimental ones. These can be found in chapter 5 of this dissertation. Each of these studies is preceded by a separate abstract and a statement concerning the author contribution.Die meisten biomolekularen Interaktionen finden im wässrigen Medium statt. Daher ist es wichtig die Interaktionen zwischen Proteinen und Wassermolekülen in der Wirkstoff-Forschung zu berücksichtigen. Die Untersuchung dieser Interaktionen mittels experimenteller Methoden ist anspruchsvoll, daher werden häufig Computer-Simulationen verwendet um die molekularen Details von Protein-Wasser oder Ligand-Wasser-Interaktionen zu studieren. Im zweiten Kapitel der vorliegenden Doktorarbeit wird die Entwicklung, Parametrisierung und Erprobung eines Ansatzes vorgestellt, der zur Berechnung der Solvatations-Beiträge in Protein-Ligand Bindungsreaktionen verwendet werden kann. Der Ansatz verwendet eine umfassende Menge an Trajektorien aus Moleküldynamik-Simulationen in Kombination mit GIST Berechnungen um Modelle zu erhalten, mit welchen die relativen Beiträge zur Protein-Ligand Solvatations-Thermodynamik vorhergesagt werden können. Um den Ansatz zu validieren wurde das Model System Thrombin mit einem Satz von 53 Liganden mit bekannter Kristallstruktur und ITC Profilen untersucht. Dabei wurde herausgefunden, dass die Bindungs-Thermodynamik von insgesamt 186 Paaren von Liganden genau vorhergesagt werden kann. Die relative Freie Energie der Bindung für diese 186 Paare kann dabei schon alleinig aus der Desolvatation des freien Liganden ermittelt werden. Im Weiteren werden vollständige thermodynamische Profile für Protein-Ligand Bindungsreaktionen korrekt vorhergesagt. Im dritten Kapitel wird der zuvor vorgestellte Ansatz verwendet um eine Strategie zu entwickeln die es ermöglicht Wirkstoffe mit gewünschter Solvatations-Thermodynamik auszustatten. Für diesen Zweck werden die Thrombin-Liganden (gleiche Liganden Serie wie im vorrangegangenen Kapitel 2) in kleinere molekulare Bausteine zerlegt. Im nächsten Schritt wird die Solvatations-Thermodynamik eines jeden Bausteins im Liganden ebenso wie für den isolierten Baustein in wässriger Lösung berechnet. Dabei wurden sehr diverse Eigenschaften für die verschieden Bausteine gefunden, was deren Potential zum Entwurf von Liganden mit einer großen Bandbreite von Solvatations-Charakteristika ermöglicht. Ebenso wurden Fernstrukturierungseffekte von Wassermolekülen entdeckt. Diese Effekte konnten nur durch die Zerlegung der Liganden und der korrespondierenden GIST-Integrale in einzelne Bausteine ermöglicht werden. Die Fernstrukturierungseffekte treten im ungebundenen Liganden auf und beschreiben die verstärkte Strukturierung von Solvens-molekülen auf einer Baueinheit bedingt durch das Vorhandensein einer anderen Baueinheit auf einer entfernten Seite des Liganden. Im Weiteren wurde gezeigt, dass die Fluorierung von Baueinheiten zu erhöhten unvorteilhaften Desolvatationseigenschaften führt. Die Fluorierung führt daher zu einer reduzierten Bindungsaffinität. Die Forschungsarbeiten aus Kapitel 2 und 3 wurden mit Hilfe des Computerprograms Gips durchgeführt, welches im Zuge dieser Doktorarbeit entwickelt wurde. In Kapitel 4 wird der Mechanismus und die Zeitskala der Desolvatation für eine Protein-Ligand Dissoziationsreaktion für die von Trypsin und Thrombin im Komplex mit Benzamidin und N amidinopiperidin untersucht. Die Untersuchung wird durchgeführt mittels „Umbrella Sampling“ und LoCorA Rechnungen. LoCorA ist eine Methode zur Analyse von Besetzungszeiten von Wassermolekülen auf der Oberfläche von Aminosäuren. Damit wurde herausgefunden, dass Wassermoleküle ungefähr 1.3 ns in der apo Bindetasche von Thrombin verweilen, wohingegen sie in der apo Bindetasche von Trypsin um eine Größenordnung kürzer verweilen (0.3 ns). Dieser Unterschied wird mit Solvens-Kanälen im Falle von Thrombin, und mit einem Solvens-Reservoir im Falle von Trypsin erklärt. Die Solvens-Kanäle bedingen, dass Wassermoleküle die gleichen Besetzungszeiten für beide Komplexe zeigen im Falle von Thrombin. Durch das Fehlen dieser Kanäle in Trypsin gibt es hier jedoch unterschiedliche Besetzungszeiten für die beiden Komplexe. Der LoCorA Ansatz ist implementiert in das Computerprogram LoCorA (gleicher Name wie der Ansatz selbst), welches im Zuge dieser Doktorarbeit entwickelt wurde. Weitere Studien die im Zuge dieser Doktorarbeit durchgeführt und mit experimentellen Untersuchungen kombiniert wurden, sind in Kapitel 5 dieser Dissertation zu finden. Zu jeder dieser Studien ist eine separate Zusammenfassung und Erläuterung bezüglich der Eigenanteile vorangestellt zu finden

open_UMR (Universität Marburg)

data_UMR - Forschungsdatenrepositorium (Philipps-Universität Marburg)

Konzepte zur Modulierung von Protein-Protein Interaktionen: Datenanalyse, Strukturuntersuchungen und Strategien zum Finden niedermolekularer Modulatoren

Author: Block Peter
Publication venue
Publication date: 10/08/2011
Field of study

(1) Analyzing protein-protein interactions at the atomic level is critical for our understanding of the principles governing the interactions involved in protein-protein recognition. For this purpose descriptors explaining the nature of different protein-protein complexes are desirable. In this work, we introduce Epic Protein Interface Classification (EPIC) as a framework handling the preparation, processing, and analysis of protein-protein complexes for classification with machine learning algorithms. We applied four different machine learning algorithms: Support Vector Machines (SVM), C4.5 Decision Trees, K Nearest Neighbors (KNN), and Naïve Bayes (NB) algorithm in combination with three feature selection methods, Filter (Relief F), Wrapper, and Genetic Algorithms (GA) to extract discriminating features from the protein-protein complexes. To compare protein-protein complexes to each other, we represented the physicochemical characteristics of their interfaces in four different ways, using two different atomic contact vectors (ACVs), DrugScore pair potential vectors (DPV) and SFCscore descriptor vectors (SDV). We classified two different datasets: (A) 172 protein-protein complexes comprising 96 monomers, forming contacts enforced by the crystallographic packing environment (crystal contacts), and 76 biologically functional homodimer complexes; (B) 345 protein-protein complexes containing 147 permanent complexes and 198 transient complexes. We were able to classify up to 94.8% of the packing enforced/functional and up to 93.6% of the permanent/transient complexes correctly. Furthermore, we were able to extract relevant features from the different protein-protein complexes and introduce an approach for scoring the importance of the extracted features. (2) Since protein-protein interactions play pivotal role in the communication on the molecular level in virtually every biological system and process, the search and design for modulators of such interactions is of utmost interest. In recent years many inhibitors for specific protein-protein interactions have been developed, however, in only a few cases, small and druglike molecules are able to interfere the complex formation of proteins. On the other hand, there a several small molecules known to modulate protein-protein interactions by means of stabilizing an already assembled complex. To achieve this goal, a ligand is binding to a pocket, which is located rim-exposed at the interface of the interacting proteins, e.g. as the phytotoxin Fusicoccin, which stabilizes the interaction of plant H+-ATPase and 14-3-3 protein by nearly a factor of 100. To suggest alternative leads, we performed a virtual screening campaign to discover new molecules putatively stabilizing this complex. Furthermore, we screen a dataset of 198 transient recognition protein-protein complexes for cavities, which are located rim-exposed at their interfaces. We provide evidence for high similarity between such rim-exposed cavities and usual ligand accommodating active sites of enzymes. This analysis suggests that rim-exposed cavities at protein-protein interfaces are druggable targets. Therefore, the principle of stabilizing protein-protein interactions seems to be a promising alternative to the approach of the competitive inhibition of such interactions by small molecules. (3) AffinDB is a database of affinity data for structurally resolved protein-ligand complexes from the PDB. It is freely accessible at http://www.agklebe.de/affinity. Affinity data are collected from the scientific literature, both from primary sources describing the original experimental work of affinity determination and from secondary references which report affinity values determined by others. AffinDB currently contains over 730 affinity entries covering more than 450 different protein-ligand complexes. Besides the affinity value, PDB summary information and additional data are provided, including the experimental conditions of the affinity measurement (if available in the corresponding reference); 2D drawing, SMILES code, and molecular weight of the ligand; links to other databases, and bibliographic information. AffinDB can be queried by PDB code or by any combination of affinity range, temperature and pH-value of the measurement, ligand molecular weight, and publication data (author, journal, year). Search results can be saved as tabular reports in text files. The database is supposed to be a valuable resource for researchers interested in biomolecular recognition and the development of tools for correlating structural data with affinities, as needed, for example, in structure-based drug design.Diese Arbeit beschreibt eine Klassifizierung von Protein-Protein Komplexen mit Algorithmen aus dem Bereich des Maschinellen Lernens (ML). Es wird die Vorhersagequalität von vier verschiedenen ML Algorithmen verglichen: Support Vector Maschinen, C4.5 Entscheidungsbäume, K-Nächste-Nachbarn und Naïve Bayes. In Kombination mit so genannten Feature-Selektionsverfahren konnte ein Datensatz von 345 Protein-Protein Komplexen (147 permanente und 198 transiente Komplexe) in einer „Leave-One-Out Cross-Validierung“ zu 93,6% richtig vorhergesagt werden. Des Weiteren wurde eine Klassifizierung von Protein-Protein Kristallkontakten gegenüber funktionellen Protein-Protein Komplexen durchgeführt. Ein Datensatz von 172 Protein-Protein Komplexen (76 funktionelle Komplexe gegenüber 96 Kristallkontakten) konnte zu 94,8% richtig klassifiziert werden. Mit Hilfe der Auswertung und Optimierung anhand Genetischer Algorithmen ließ sich ein Verfahren entwickeln, das es ermöglicht, eine quantitative Aussage über die Relevanz einzelner Deskriptoren zu treffen. Dazu werden alle so genannten Individuen des Genetischen Algorithmus evaluiert und die relative Häufigkeit der einzelnen Deskriptoren ins Verhältnis zur der Vorhersagerate gesetzt. Durch diese Analyse konnte gezeigt werden, dass beispielsweise das Verhältnis von hydrophober zu hydrophiler Oberfläche zwischen den Protein-Protein Komplexen eine für die Diskriminierung entscheidende Rolle spielt. Der zweite Teil dieser Arbeit konzentriert sich auf die Suche und das Design von Stabilisatoren für Protein-Protein Interaktionen. Obwohl ein Großteil der heute eingesetzten Arzneistoffe allosterisch fungierende Effektoren bzw. Agonisten oder Antagonisten unterschiedlichster Rezeptoren sind, ist die funktionelle Regulierung biologischer Systeme prinzipiell auch durch die Modulierung von Protein-Protein Interaktionen möglich. Das ehrgeizige Ziel kleine, arzneistoffähnliche Moleküle zu entwerfen, die in der Lage sind Protein-Protein Interaktionen zu inhibieren, führte allerdings bis heute nur in Einzelfällen zu Erfolg. Eine Modulierung von Protein-Protein Interaktionen muss allerdings nicht zwangsläufig durch eine kompetitive Inhibierung erfolgen. Mittels einer gezielten Stabilisierung, bei der kleine Moleküle im Randbereich der Proteinkontaktfläche eines Protein-Protein Komplexes binden, kann man ebenfalls die gewünschte Modulierung erzielen. Dieses Phänomen wird eindrucksvoll durch die Bindung von Fusicoccin, das die Interaktion zwischen einer pflanzlichen H+-ATPase und einem 14-3-3 Protein um nahezu den Faktor 100 verstärkt, beschrieben. Auf der Suche nach niedermolekularen Verbindungen, die ebenso wie Fusicoccin den Protein-Protein Komplex stabilisieren, wurden Datenbanken mit ca. 2 Millionen käuflich erwerbbaren Molekülen durchmustert, deren Anzahl ich durch verschiedene Filterschritte reduzieren ließ. Mit verschiedenen Dockingprogrammen wurden über 160.000 Kandidatenmoleküle in die Bindetasche des H+-ATPase/14-3-3-Komplexes eingepasst. Die zahlreichen generierten Dockingposen wurden anhand geeigneter Pharmakophorfilter selektiert. Dazu ließen sich Methoden entwickeln, die effizient mit großen Datenmengen umgehen können. Diejenigen Moleküle mit pharmakophorerfüllenden Eigenschaften und Geometrien wurden anschließend mit verschiedenen Bewertungsfunktionen evaluiert. Für einen intuitiven Einblick in die Beiträge einzelner Atome des Liganden zu dessen Gesamtbewertung, wurde eine etablierte Bewertungsfunktion in ihrer Funktionalität erweitert. Des Weiteren konnte an einem Datensatz mit 198 Protein-Protein Komplexen gezeigt, dass nahezu alle der untersuchten Komplexe taschenförmige Vertiefungen im Randbereich ihrer Kontaktfläche aufweisen. Eine nähere Betrachtung der Taschen zeigt, dass einige eine ähnliche Gestalt zu Bindetaschen in globulären Proteinen aufweisen, die bekanntermaßen kleine Moleküle binden. Diese Erkenntnis lässt vermuten, dass auch Bindetaschen im Randbereich von Protein-Protein Komplexen ein vielversprechendes Target für die Bindung kleiner Moleküle darstellen. Eine Modulierung von Protein-Protein Interaktionen im Sinne einer Stabilisierung durch niedermolekulare Verbindungen erscheint somit als interessante Alternative zur Inhibierung solcher Interaktionen. Ein weiterer Teil dieser Arbeit entstand im Verlaufe eines Projektes zur Entwicklung einer verbesserten Bewertungsfunktion in silico generierter Dockingposen, wie sie bei der Auswahl möglicher Liganden zum Binden in die Fusicoccin Bindetasche des H+-ATPase/14-3-3-Komplexes eingesetzt wurden. Für die Entwicklung empirischer Bewertungsfunktionen, bei denen vorhergesagte gegenüber gemessenen Affinitäten regressionsbasiert korreliert werden, sind große und diverse Datensätze von Protein-Ligand Kristallkomplexen und deren ermittelter Affinität essentiell. In diesem Zusammenhang konnte die webbasierte Datenbank AffinDB entwickelt werden. AffinDB ist im Internet unter http://www.agklebe.de/affinity frei verfügbar

open_UMR (Universität Marburg)

data_UMR - Forschungsdatenrepositorium (Philipps-Universität Marburg)

Author Instructions

Author: Instructions Author
Publication venue
Publication date: 04/11/2013
Field of study

Crossref

Cartographic Perspectives (E-Journal - North American Cartographic Information Society, NACIS)

Going Beyond Counting First Authors in Author Co-citation Analysis

Author: Zhao Dangzhi
Publication venue
Publication date: 01/01/2005
Field of study

The present study examines one of the fundamental aspects of author co-citation analysis (ACA) - the way co-citation counts are defined. Co-citation counting provides the data on which all subsequent statistical analyses and mappings are based, and we compare ACA results based on two different types of co-citation counting - the traditional type that only counts the first one among a cited work's authors on the one hand and a non-traditional type that takes into account the first 5 authors of a cited work on the other hand. Results indicate that the picture produced through this non-traditional author co-citation counting contains more coherent author groups and is therefore considerably clearer. However, this picture represents fewer specialties in the research field being studied than that produced through the traditional first-author co-citation counting when the same number of top-ranked authors is selected and analyzed. Reasons for these effects are discussed

E-LIS

Systematische Korrelation der Eigenschaft von Struktur, Thermodynamik und der Solvathülle von Hydrophoben Substituenten in Hydrophoben Taschen unter Verwendung von Thermolysin als Fallbeispiel

Author: Krimmer Stefan Günter
Publication venue
Publication date: 10/11/2017
Field of study

Water molecules participate besides protein and ligand as an additional binding partner in every in vivo protein–ligand binding process. The displacement of water molecules from apolar surfaces of solutes is considered the driving force of the hydrophobic effect. It is generally assumed that the mobility of the water molecules increases through the displacement, and, as a consequence, entropy increases. This explanation, which is based on experiments with simple model systems, is, however, insufficient to describe the hydrophobic effect as part of the highly complex protein–ligand complex formation process. For instance, the displacement of water molecules from apolar surfaces that already exhibit an increased mobility before their displacement can result in an enthalpic advantage. Furthermore, it has to be considered that by the formation of the protein–ligand complex a new solvent-exposed surface is created, around which water molecules have to rearrange. The present thesis focuses on the impact of the latter effect on the thermodynamic and kinetic binding properties of a given ligand. A congeneric ligand series comprised of nine ligands binding to the model protein thermolysin (TLN) was analyzed to determine the impact of the rearrangement of water molecules around the surface of a newly formed protein–ligand complex on the thermodynamic binding properties of a ligand. The protein–ligand complexes were characterized structurally by X-ray crystallography and thermodynamically by isothermal titration calorimetry (ITC). The only structural difference between the ligands was their strictly apolar P2’ substituent, which changed in size from a methyl to a phenylethyl group. The P2’ group interacts with the flat, apolar, and well-solvated S2’ pocket of TLN. Depending on the bound ligand, the solvent-exposed surface of the protein–ligand complex changes. The ITC measurements revealed strong thermodynamic differences between the different ligands. The structural analysis showed ligand-coating water networks pronounced to varying degrees. A pronounced water network clearly correlated with a favorable enthalpic and less favorable entropic term, and overall resulted in an affinity gain. Based on these results, new P2’ substituents were rationally designed with the aim to achieve stronger stabilization of the adjacent water networks and thereby further increase ligand affinity. First, the quality of the putative water networks was validated using molecular dynamics (MD) simulations. Subsequently, the proposed ligands were synthesized, crystallized in complex with TLN, and analyzed thermodynamically. Additionally, a kinetic characterization using surface plasmon resonance (SPR) was performed. The crystallographically determined water networks adjacent to the P2’ substituents were in line with their predictions conducted by MD simulations. The ligands showed increasingly pronounced water networks as well as a slight enthalpy-drive affinity increase compared to the ligands from the initial study. The ligand with the highest affinity showed an almost perfect water network as well as a significantly reduced dissociation constant. To analyze the influence of the ligand-coating water networks on the kinetic binding properties of a ligand, seventeen congeneric TLN ligands exhibiting different P2’ groups were kinetically (by SPR) and crystallographically characterized. The different degree of the water network stabilization showed only a minor influence on the binding kinetic properties. By contrast, the strength of the interaction between the ligand and Asn112 proved crucial for the magnitude of the dissociation rate constant. A strong interaction resulted in a considerably prolonged residence time of the ligand by hindering TLN to undergo a conformational transition that is necessary for ligand release. In the last study, the reason for the exceptionally high affinity gain for addressing the deep, apolar S1’ pocket of TLN with apolar ligand portions was investigated. Therefore, a congeneric TLN ligand series substituted with differently large apolar P1’ substituents (ranging from a single hydrogen atom to an iso-butyl group) was analyzed. The exchange of the hydrogen atom at the P1’ position with a single methyl group already results in a 100-fold affinity increase of the ligand. To elucidate the molecular mechanism behind this considerable affinity gain, the solvation state of the S1’ pocket was carefully analyzed. The results strongly indicate that the S1’ pocket is completely free of the presence of any water molecules. Thus, the huge affinity gain was attributed to the absence of an energetically costly desolvation step. The data presented in this thesis show that to describe the thermodynamic signature of the hydrophobic effect it is necessary to explicitly consider the change of the thermodynamic properties of every involved water molecule. Solely considering the buried apolar surface area and assigning an entropic term to it is not sufficient. The increasing stabilization of the water network adjacent to the protein-bound ligand represents a promising approach — quite independent of specific properties of the target protein — to optimize the thermodynamic profile of a given ligand. This approach also allows fine-tuning of the kinetic binding parameters.Wassermoleküle nehmen neben Protein und Ligand als zusätzlicher Bindungspartner an jedem in vivo Protein–Ligand Bindungsprozess teil. Die Verdrängung von Wassermolekülen von apolaren Oberflächen gelöster Moleküle in die umgebende flüssige Phase wird als treibende Kraft des hydrophoben Effekts angesehen. Die gängige Annahme ist, dass die Beweglichkeit der Wassermoleküle durch ihre Verdrängung erhöht wird, was einen Anstieg der Entropie zur Folge hat. Diese Erklärung, die auf Experimenten mit einfachen Modellsystemen beruht, reicht jedoch nicht aus, um den hydrophoben Effekt als Teil eines hochgradig komplexen Protein–Ligand Komplexbildungsprozesses zu beschreiben. So kann zum Beispiel die Verdrängung von Wassermolekülen von apolaren Oberflächen, die bereits vor der Verdrängung eine erhöhte Beweglichkeit aufweisen, zu einem enthalpischen Vorteil führen. Desweiteren muss berücksichtigt werden, dass durch die Bildung des Protein–Ligand Komplexes eine neue, zur Wasserphase hin exponierte Oberfläche ausgebildet wird, um die sich Wassermoleküle neu anordnen müssen. Diese Arbeit behandelt den Einfluss dieses Effektes auf die thermodynamische und kinetische Bindungseigenschaft eines Liganden. Um den Einfluss der Neuanordnung von Wassermolekülen um die Oberfläche eines neu gebildeten Protein–Ligand Komplexes auf das thermodynamische Bindungsprofil eines Liganden zu ermitteln, wurde eine homologe Serie aus neun Liganden, die an das Modellprotein Thermolysin (TLN) binden, analysiert. Die Protein–Ligand Komplexe wurden strukturell mittels Röntgenkristallographie und thermodynamisch mittels isothermaler Titrationskalorimetrie (ITC) charakterisiert. Der einzige strukturelle Unterschied zwischen den Liganden stellte deren strikt apolarer P2‘ Substituent dar, dessen Größe von einer Methylgruppe bis zu einer Phenylethylgruppe variiert wurde. Die P2‘ Gruppen adressieren die flache, apolare und gut solvatisierte S2‘ Tasche von TLN. Je nach gebundenem Ligand ändert sich die Form der wasserexponierten Oberfläche des Protein–Ligand Komplexes. Die ITC Messungen ergaben starke thermodynamische Unterschiede zwischen den einzelnen Liganden. Die strukturelle Analyse zeigte eine unterschiedlich starke Ausprägung der die gebundenen Liganden überziehenden Wassernetzwerke. Ein ausgeprägtes Wassernetzwerk korrelierte deutlich mit einem günstigen enthalpischen und einem ungünstigen entropischen Beitrag, und insgesamt mit einem Affinitätsanstieg. Auf diesen Ergebnissen aufbauend wurden zusätzliche P2‘ Substituenten rational entworfen mit dem Ziel eine stärkere Stabilisierung der angrenzenden Wassernetzwerke zu erreichen und dadurch die Ligandaffinität weiter zu erhöhen. Zunächst wurde die Qualität der putativen Wassernetzwerke mittels molekulardynamischer (MD) Simulationen validiert. Anschließend wurden die vorgeschlagenen Liganden synthetisiert, im Komplex mit TLN kristallisiert und thermodynamisch analysiert. Eine kinetische Charakterisierung mittels Oberflächenplasmonresonanzspektroskopie (SPR) wurde ebenfalls durchgeführt. Die an die P2‘ Substituenten angrenzenden, kristallographisch bestimmten Wassernetzwerke entsprachen den Vorhersagen der MD Simulationen. Die Liganden zeigten ausgeprägtere Wassernetzwerke sowie eine leichte, enthalpiegetriebene Affinitätserhöhung im Vergleich zu den Liganden der ersten Studie. Der Ligand mit der höchsten Affinität zeigte ein nahezu perfektes Wassernetzwerk sowie eine signifikant reduzierte Dissoziationskonstante. Um den Einfluss der ligandüberziehenden Wassernetzwerke auf die kinetischen Bindungseigenschaften eines Liganden zu untersuchen, wurden 17 homologe TLN Liganden mit unterschiedlichen P2‘ Gruppen kinetisch mittels SPR und kristallographisch charakterisiert. Die Wassernetzwerkstabilisierung zeigte nur einen geringen Einfluss auf die Bindungskinetik. Im Gegensatz dazu erwies sich die Stärke der Interaktion zwischen Ligand und Asn112 als bestimmend für die Größe der Dissoziationskonstante. Eine starke Wechselwirkung führte zu einer deutlichen Verlängerung der Aufenthaltszeit des Liganden durch die Erschwerung einer konformativen Änderung von TLN, die jedoch notwendig für die Freisetzung des Liganden ist. In der letzte Studie wurde die Ursache für den außergewöhnlich großen Affinitätsanstieg bei einer Adressierung der tiefen, apolaren S1‘ Tasche von TLN mit apolaren Ligandgruppen untersucht. Dafür wurde eine homologe TLN Ligandserie mit unterschiedlich großen, apolaren P1‘ Gruppen (von einem Wasserstoffatom bis zu einem Isobutylrest) analysiert. Bereits der Austausch des Wasserstoffatoms an der P1‘ Position durch eine Methylgruppe führte zu einem 100-fachen Affinitätsanstieg des Liganden. Um den molekularen Mechanismus hinter dieser Affinitätssteigerung aufzuklären, wurde die S1‘ Tasche genau auf ihren Solvatationszustand hin untersucht. Die Ergebnisse sprechen deutlich dafür, dass die S1‘ Tasche komplett frei von Wassermolekülen ist. Die Affinitätssteigerung wurde deshalb auf das Fehlen eines energetisch kostspieligen Desolvatationsschrittes zurückgeführt. Die im Rahmen dieser Arbeit diskutierten Daten legen dar, dass für die thermodynamische Beschreibung des hydrophoben Effekts die Änderung des thermodynamischen Zustandes jedes einzelnen involvierten Wassermoleküls explizit berücksichtigt werden muss. Die alleinige Berücksichtigung der vergrabenen Oberfläche und die Zuweisung eines entropischen Beitrags ist nicht ausreichend. Die Erhöhung der Stabilisierung der Wasserstruktur, die angrenzend an einen proteingebundenen Liganden ausgebildet wird, stellt ein generelles und planbares — und vom Zielprotein relativ unabhängiges — Konzept dar, um das thermodynamische Profil eines gegebenen Liganden zu optimieren. Zu gewissem Maße kann dadurch auch Einfluss auf die Bindungskinetik genommen werden

data_UMR - Forschungsdatenrepositorium (Philipps-Universität Marburg)

Synthese von D Phe D DiPhe Pro basierenden Liganden und deren biophysikalischen Charakterisierung zur Selektivitätsstudie von Thrombin und Trypsin sowie ein synthetischer Beitrag zur Darstellung von Inhibitoren der Aldose Reduktase und Carboanhydrase II

Author: Ngo Trong Khang
Publication venue
Publication date: 01/01/2020
Field of study

Im ersten Teil der vorliegenden Arbeit (Kapitel 1-7) wird ein Syntheseprojekt einer kongenerischen Serie von D-Phe/D-DiPhe-Pro-basierenden Inhibitoren für Thrombin und Trypsin sowie deren biophysikalischen Charakterisierung vorgestellt. Beide Proteine sind strukturell hochgradig verwandte Serinproteasen, die Unterschiede in ihrer Substraterkennung aufweisen. Thrombin ist Arg-spezifisch, während Trypsin Peptidketten nach Lys und Arg spaltet. Sie erkennen die basischen Substratkopfgruppen über Asp189, das sich am Boden der S1-Tasche befindet. Die synthetisierten Inhibitoren werden sowohl kristallographisch für beide Serinproteasen analysiert als auch thermodynamisch über isothermale Titrationskalorimetrie (ITC) untersucht, um ihre Bindungseigenschaften in der S1-Tasche zu beschreiben und zu vergleichen. Ein enzymkinetischer Fluoreszenzassay unterstützt zudem die biophysikalische Charakterisierung dieser Verbindungen gegenüber beiden Proteinen. Insbesondere steht dabei die Untersuchung von Protonierungseffekten in der S1-Tasche im Vordergrund, um die Selektivitätseigenschaften von Thrombin und Trypsin genauer aufzuklären. Obwohl beide Proteasen eine sehr ähnlich aufgebaute S1-Tasche haben, deuten die Ergebnisse auf einen Unterschied in den elektrostatischen Eigenschaften von Asp189 hin, welcher mit der Substratselektivität korrelieren könnte. Es ist überraschend, dass ein Ligand aus dieser Serie im protonierten Zustand mit seiner P1-Kopfgruppe an Trypsin bindet, während dieser im Komplex mit Thrombin unprotoniert bleibt. Für diesen Liganden wird ein leichter Unterschied bezüglich seiner Solvatationsmerkmale beobachtet. Unsere Daten zeigen, dass diese Unterschiede durch die spezifische Natriumbindestelle verursacht werden können, die nur in Thrombin vorhanden ist, aber in Trypsin fehlt. Während die Unterschiede außerdem durch das geladene Glu192 am Rand der S1-Tasche in Thrombin ausgeprägt werden können, werden sie durch das ungeladene Gln192 in Trypsin nicht beeinflusst. Dabei spielt die Berücksichtigung der verschiedenen pKa-Werte der beteiligten funktionellen Gruppen ebenfalls eine wesentliche Rolle. Weiterhin werden in diesem Teil verschiedene Fragmente sowie D-Phe-Pro-Derivate im Komplex mit beiden Proteinen vorgestellt, deren Bindungseigenschaften in der S1-Tasche charakterisiert und verglichen werden. Im zweiten Teil dieser Arbeit (Kapitel 8-12) wird die Synthese von Aldose-Reduktase-Inhibitoren mit einem 2-Arylcarbamoylphenoxyessigsäure-Gerüst durch die Einführung von verschiedenen Benzylgruppen mit m-ständigen elektronenziehenden Gruppen (hier: Sulfoxid und Boronsäure) vorgestellt. Sie dienen zur Untersuchung der transienten Spezifitätstasche in der Aldose-Reduktase. Elektronenarme aromatische Systeme können eine Öffnung der Spezifitätstasche auslösen und zu bevorzugten Wechselwirkungen mit Trp111 führen. Um den Öffnungsmechanismus dieser transienten Bindetasche genauer zu verstehen, sollten diese synthetisierten Verbindungen auch in verschiedenen Mutationsstudien eingesetzt werden. Unsere ersten kristallographischen Ergebnisse sowie ITC-Experimente zeigen bisher keine eindeutige Interpretation dieses Mechanismus. Unterschiedliche Desolvatationseigenschaften der Inhibitoren können einen Einfluss auf den Mechanismus haben. Der dritte Teil dieser Arbeit (Kapitel 13-17) präsentiert eine weitere synthetische Untersuchung einer Serie von p-alkylierten Benzensulfonamiden und deren Etherderivaten als humane Carboanhydrase II (hCAII)-Inhibitoren, die schrittweise in ihrer Alkylkettenlänge variiert wurden. Während die Sulfonamidgruppe anionisch an Zn2+ bindet, adressiert die aliphatische Alkylkette die hydrophobe Tasche im aktiven Zentrum. Die Bindungsprozesse dieser Verbindungen an diesem Protein wurden kristallographisch, kinetisch und thermodynamisch über ITC analysiert. Es konnten Konformationsänderungen festgestellt werden, deren Effekte die Bedeutung der Formkomplementarität für die Struktur-Bindungsbeziehung aufdecken. Zusätzlich wird in diesem Teil die Synthese von perfluorierten Derivaten vorgestellt, die ein anderes Bindungsmuster in hCAII aufweisen. Sowohl eine kinetische als auch thermodynamische Untersuchung wurde ebenfalls analog zur Studie der nicht-fluorierten Serie durchgeführt. Die Acidität der Sulfonamidgruppe ist durch die stark elektronenziehenden Effekte der Fluorsubstituenten beeinflusst, die vermutlich zum unterschiedlichen Bindungsverhalten des aromatischen Systems im aktiven Zentrum des Proteins führen.The first part of this thesis (chapters 1-7) presents a synthetic and biophysical study of a congeneric series of D-Phe/D-DiPhe-Pro based inhibitors for thrombin and trypsin. Both proteins are structurally highly related serine proteases which show differences in their preferred substrate recognition. Thrombin is Arg-specific, whereas trypsin cleaves peptide chains after Lys and Arg. They recognize the basic substrate head group via Asp189, located at the bottom of the S1 pocket. The synthesized inhibitors are crystallographically analyzed for both serine proteases and thermodynamically investigated by isothermal titration calorimetry (ITC) to describe and compare their binding properties in the S1 pocket. An enzyme-kinetic fluorescence assay also supports the biophysical characterization of these compounds toward both proteins. In order to elucidate selectivity characteristics of thrombin and trypsin more precisely, the protonation effects were investigated in the S1 pocket. Although both proteases have a highly similar S1 pocket, our results suggest a difference in the electrostatic properties of Asp189 which may correlate with their selectivity in substrate recognition. It is surprising that one ligand of the series binds to trypsin in protonated state at its P1 head group while it remains unprotonated in case of thrombin. For this ligand, a slight difference in the residual solvation structures is observed. Our data show that these differences may be caused by the specific sodium binding site which is only present in thrombin, but absent in trypsin. This may be further pronounced by the charged Glu192 at the rim of the S1 pocket in thrombin and is not affected by the uncharged Gln192 in trypsin. Here, the consideration of different pKa values of the involved functional groups also play an essential role. Furthermore, various fragments as well as D-Phe-Pro derivatives are presented in both proteins whose binding properties in the S1 pocket are characterized and compared. The second part of this thesis (chapters 8-12) is a synthetic study of aldose reductase inhibitors with a (2-arylcarbamoyl-phenoxy)-acetic acid scaffold by varying benzyl moieties with different m-substituted withdrawing groups (here: sulfoxide and boronic acid). They are prepared for the investigation of the transient specificity pocket in aldose reductase. Electron-deficient aromatic systems can trigger an opening of the specificity pocket thus lead to favored interactions with Trp111. In order to understand the opening mechanism of this transient binding pocket, these synthesized compounds should be also used in mutagenesis studies. Our first crystallographic results and ITC experiments did not reveal any clear interpretation regarding this mechanism so far. Different desolvation properties of the inhibitors may also have an impact on this mechanism. The third part of this thesis (chapters 13-17) is another synthetic study of a series of p-alkylated benzenesulfonamides and their ether derivatives as human carbonic anhydrase II (hCAII) inhibitors which are varied in their alkyl chain length. While the sulfonamide group binds anionically to Zn2+, the aliphatic alkyl chain addresses the hydrophobic pocket in the active site. The binding events of these compounds in hCAII were investigated crystallographically, kinetically and thermodynamically by ITC. The results demonstrate the conformational changes and reveal effects of the importance of shape complementarity for the structure-binding relationship. Additionally in this part, the synthesis of perfluorinated derivatives are presented which exhibit a different binding pattern in hCAII. A kinetic and thermodynamic study was also performed analogously according to the investigation of the non-fluorinated series. The acidity of the sulfonamide group is influenced by the strongly electron-withdrawing effects of the fluorine substituents which lead to putative different binding characteristics of the aromatic system in the active site of this protein

HZB Repository

data_UMR - Forschungsdatenrepositorium (Philipps-Universität Marburg)

Lead structure search using a crystallographic fragment screening to inhibit the class II chaperone IpgC of Shigella flexneri

Author: Gárdonyi Marina
Publication venue
Publication date: 05/12/2023
Field of study

Die Shigellenruhr ist eine entzündliche Erkrankung des Dickdarms, welche durch die Bakteriengattung Shigella ausgelöst wird. Sie tritt vorwiegend in Entwicklungsländern auf und fordert jährlich eine hohe Zahl an Todesopfern. Zur Pathogenitätsentwicklung müssen die Bakterien in die Epithelzellen des Dickdarms eindringen. Dazu verwenden sie ein Typ-III-Sekretionssystem. IpgC – ein Klasse-II-Chaperon – tritt mit einer Vielzahl an relevanten Proteinen in Wechselwirkung und ist somit für den Befall der Epithelzellen essentiell. Im Zentrum dieser Arbeit stand daher die Suche nach einer Leitstruktur zur Inhibierung des für die Ausbildung der Shigella-Pathogenität notwendigen Chaperons IpgC. Das Protokoll zur heterologen Produktion und Reinigung von IpgC wurde aus vorherigen Arbeiten übernommen und geringfügig verändert. Der erste Schritt bestand darin, eine Kristallisationsbedingung für IpgC zu etablieren. Dazu kam zunächst eine am C-Terminus um vier Aminosäuren verkürzte Proteinvariante zum Einsatz. Mit dieser Variante konnte eine Kristallisationsbedingung gefunden werden, welche jedoch Kristalle entstehen ließ, die höchstwahrscheinlich aufgrund ihres sehr hohen Lösungsmittelgehalts ein zu geringes Streuvermögen besaßen. Neben dem Versuch, eine besser kristallisierbare Proteinvariante unter Anwendung des Prinzips der Oberflächenentropiereduktion zu ermitteln, wurde eine weitere, am N-Terminus um zusätzliche neun Aminosäuren verkürzte Variante für Kristallisationsexperimente verwendet. Mit dieser Proteinvariante konnten schließlich Kristalle mit hohem Streuvermögen gezüchtet werden. Trotz der Tatsache, dass diese Kristalle überwiegend verzwillingt waren, gelang es, die Struktur von IpgC zu bestimmen. Mit einer Auflösung von 1.58 Å wurde die erste hochauflösende Kristallstruktur für IpgC erhalten. Im Vergleich zu den bis dato in der PDB deponierten IpgC-Strukturen konnte neben einer deutlich besseren Auflösung eine bislang noch nicht beobachtete Dimer-Anordnung des Proteins erhalten werden. Dies könnte in Zukunft noch eine wichtige Rolle spielen, denn auch in Lösung liegt IpgC als Dimer vor. Mit der gefundenen Kristallisationsbedingung war es möglich, ein kristallographisches Fragment-Screening unter Verwendung einer hauseigenen Fragment-Bibliothek (96 strukturell verschiedene Fragmente) durchzuführen. Es konnten zehn Fragment-Hits identifiziert werden. Diese hatten an insgesamt fünf unterschiedlichen Stellen gebunden. Die meisten Fragmente hatten in einer Tasche im Interface des Dimers (Interface-Tasche 1) gebunden. Im Anschluss an das kristallographische Fragment-Screening wurden die 96 Fragmente erneut für die Durchführung eines Thermal-Shift-Assays verwendet. Dieses Assay hatte zum Ziel, die im kristallographischen Fragment-Screening gefundenen Hits zu bestätigen. Es konnten jedoch keine Fragmente gefunden werden, die den Schmelzpunkt des Proteins erhöhten, was auf einen stabilisierenden Effekt hingedeutet hätte. Stattdessen wurden 26 Fragmente identifiziert, die den Schmelzpunkt herabsetzten. Darunter befanden sich einige Fragmente, die beim kristallographischen Fragment-Screening als Hit identifiziert worden waren und im Interface des Dimers gebunden hatten. Dies könnte darauf hindeuten, dass einige der im Interface gebundenen Fragmente das Dimer durch das Aufbrechen von H-Brücken destabilisieren. Die im Interface vorgefundenen Fragmente wurden als guter Startpunkt zur Entwicklung von Wirkstoffen angesehen. Mittels in silico-Docking und unter Verwendung des Programms SeeSAR sollten potentielle Follow-up-Verbindungen ermittelt werden, die auf den Fragment-Hits aufbauen, jedoch eine verbesserte Affinität besitzen. Durch die Verwendung ausschließlich kommerziell erhältlicher Verbindungen konnte eine aufwändige Synthese der Verbindungen vermieden werden. Aufgrund vielversprechender Docking-Resultate und aufbauend auf drei Fragment-Hits wurden 17 Follow-up-Verbindungen ausgewählt. Anhand von Soaking-Experimenten und anschließender Kristallstrukturbestimmung sollten aus diesen 17 Follow-up-Verbindungen die Substanzen ermittelt werden, die tatsächlich mit IpgC einen Komplex eingehen. Zwei Verbindungen konnten eindeutig als Hit identifiziert werden. Eine weitere Verbindung stellt einen zusätzlichen potentiellen Hit dar. Zwei dieser Verbindungen (U08 und U11) hatten – wie vorhergesagt – in Interface-Tasche 1 gebunden. Eine dieser beiden Verbindungen (U11) bestätigte den bei J20 beobachteten Bindungsmodus und füllte den hydrophoben Bereich der Bindetasche im Vergleich zu J20 deutlich besser aus. Die dritte Verbindung (U09) nahm wider Erwarten eine vollkommen andere Bindestelle ein. Genau diese Bindestelle ist jedoch von großer Bedeutung. Es handelt sich um die bislang einzige bekannte Bindestelle eines Interaktionspartners, nämlich um eine Bindestelle von IpaB. Mit den Follow-up-Verbindungen U09 und U11 konnten zwei für zukünftige Experimente bedeutende Liganden ermittelt werden. Es wurden ebenfalls Experimente mit einigen Interaktionspartnern von IpgC durchgeführt. Spa13 – Bestandteil des Typ-III-Sekretionssystems – sollte in dieser Arbeit kloniert, rekombinant produziert und gereinigt werden, um letztlich sowohl die dreidimensionale Struktur von Spa13 als auch die Struktur des IpgC-Spa13-Komplexes röntgenkristallographisch bestimmen zu können. Es konnte ein Protokoll zur heterologen Produktion und Reinigung von Spa13 etabliert werden. Neben der Produktion als GST-Fusionsprotein wurde Spa13 als MBP-Fusionsprotein produziert. Mithilfe des GST-Tags wurde nur unlösliches Protein erhalten. Mithilfe des MBP-Tags konnte Spa13 jedoch in löslicher Form produziert und gereinigt werden. Allerdings stellte sich heraus, dass Spa13 in hochaggregierter Form vorlag, wodurch eine vollständige Darstellung erschwert wurde. Aus diesem Grund konnte Spa13 letztendlich nicht vollständig rein und monodispers erhalten werden. Auch der Versuch, Spa13 zusammen mit seinem Interaktionspartner IpgC zu produzieren, führte nicht zum Erfolg. Um dennoch Informationen über die Interaktion zwischen IpgC und Spa13 zu erhalten, wurde ein Peptid-Microarray durchgeführt. Dabei wurden Sequenzabschnitte von Spa13 auf einer Membran fixiert. Der Nachweis der Interaktion dieser Peptide mit IpgC sollte mithilfe geeigneter Antikörper erfolgen. Auf diese Weise sollte das Bindungsepitop von Spa13 ermittelt werden. Die Ergebnisse waren jedoch nicht aufschlussreich. Nur eine der drei Positivkontrollen war tatsächlich erfolgreich. Für zwei der Spa13-Peptide konnte ein sehr schwaches Signal erhalten werden. Diese Peptid-Sequenzen konnten in ersten Versuchen mittels MST jedoch nicht als IpgC-Bindungsepitop bestätigt werden. Dies lag z.T. an der eingeschränkten Löslichkeit der Spa13-Peptide. Die Bindungsepitope von MxiE und IpaC konnten bereits in vorherigen Arbeiten ermittelt werden. Darauf aufbauend wurde in dieser Arbeit versucht, die Struktur von IpgC mit den Bindungsepitopen von MxiE und IpaC zu ermitteln. Dazu wurden Kokristallisationsexperimente durchgeführt. Im Falle der Kokristallisation von IpgC mit dem MxiE-Peptid stellte sich jedoch nach Erhalt ausreichend streuender Kristalle mehrfach heraus, dass das Peptid nicht gebunden hatte. Die Kokristallisation von IpgC mit dem IpaC-Peptid gelang mithilfe der für IpaB bekannten Kokristallisationsbedingung. Hier konnte zwar ein Datensatz bis zu einer Auflösung von 2.62 Å gesammelt werden, mit diesem gelang jedoch aus bislang ungeklärten Gründen die Strukturbestimmung nicht.Shigellosis is an inflammatory disease of the colon caused by a bacterial infection with Shigella. To develop pathogenicity, the bacteria have to penetrate the epithelial cells of the colon. IpgC – a class II chaperone – is essential for the active infection of endothelial cells due to its multiple interactions with invasion proteins. The focus of this thesis was to find a lead structure for the inhibition of IpgC function. The first step was to find a crystallization condition for IpgC. Initially a protein variant shortened by four amino acids at the C-terminus was used. With this variant a crystallization condition could be found which however resulted in crystals that had a too low scattering power, most likely due to high solvent content. Another variant, which was additionally shortened by nine amino acids at the N terminus, was used for the crystallization experiments. With this protein variant crystals with high scattering power could finally be obtained. With a resolution of 1.58 Å a first high-resolution crystal structure of IpgC was obtained. Compared to already deposited IpgC-structures, a previously unobserved dimer arrangement of the protein was obtained. This could be of importance in the future, because IpgC also exists in solution as a dimer. With the same crystallization condition, it was possible to perform a crystallographic fragment screening using an in-house fragment library comprising 96 structurally diverse fragments. Ten fragment hits were identified which had bound to the protein at a total of five different sites. Most of the fragments had bound in a pocket in the interface of the dimer. The 96 fragments were again used to perform a thermal shift assay. No fragments were found that increased the melting point of the protein that would have indicated a stabilizing effect. Instead, 26 fragments were identified that lowered the melting point of the protein. These included some of the crystallographic fragment hits which had bound to the dimer interface. This was explained by the fact that some of these fragments destabilize the dimer by breaking hydrogen bonds between chain A and chain B of the protein. The fragments found in the dimer interface were considered as a good starting point for further drug development. In silico docking was used in order to identify potential follow-up compounds. By using only commercially available compounds for docking, synthesis of the compounds could initially be avoided. Based on good docking results and using three different fragment hits as a starting point, 17 follow-up compounds were selected. On the basis of soaking experiments and subsequent crystal structure determination, the follow-up compounds that actually form a complex with IpgC were to be determined. Two compounds could be clearly identified as hits. Another compound represents a further potential hit. Two of these compounds (U08 and U11) had bound in interface pocket 1, as expected. One of these two follow-up compounds (U11) confirmed the binding mode observed for fragment J20 and filled the hydrophobic region of the binding pocket much better compared to J20. Contrary to expectations based on docking, the third compound (U09) occupied a completely different binding site. It is the only known binding site of an interaction partner so far, namely of IpaB. Spa13 – part of the type III secretion system – was to be cloned, recombinantly produced and purified in order to determine the three-dimensional structure of the IpgC-Spa13 complex by X ray crystallography. A protocol for heterologous production and purification of Spa13 was established. Using an MBP tag, Spa13 could be produced and partially purified in soluble form. However, Spa13 was found to be present in a highly aggregated form that made purification difficult. The coproduction of Spa13 together with its interaction partner IpgC was not successful either. To still obtain information about the interaction of IpgC with Spa13, a peptide microarray was performed. This was to determine the binding epitope of Spa13. However, the results were not conclusive. Only one of three positive controls was positive. For two of the Spa13 peptides a very weak signal could be obtained. However, these peptide sequences could not be confirmed as binding epitopes in initial experiments by MST. Co-crystallization experiments were performed to determine the crystal structure of IpgC together with the binding epitopes of MxiE and IpaC. Concerning the co-crystallization of IpgC with the binding epitope of MxiE crystals with a good scattering power could be obtained several times, but in each case the respective structure revealed that the peptide had not bound. Using the published condition for co-crystallization of IpgC with the IpaB binding epitope, the co-crystallization of IpgC with the binding epitope of IpaC was successful. Here, however, the collected data set to 2.62 Å resolution could not be resolved for unexplained reasons

open_UMR (Universität Marburg)

data_UMR - Forschungsdatenrepositorium (Philipps-Universität Marburg)

Entwicklung und Verbesserung von Werkzeugen und Algorithmen für das Problem der Atomtyp-Zuweisung sowie für die Bewertung von Protein-Ligand-Komplex-Geometrien

Author: Neudert Gerd (1019441682)
Publication venue
Publication date: 19/03/2012
Field of study

In context of the present work, a scoring function for protein-ligand complexes has been developed, not aimed at affinity prediction, but rather a good recognition rate of near native geometries. The developed program DSX makes use of the same formalism as the knowledge-based scoring function DrugScore, hence using the knowledge from crystallographic databases and atom-type specific distance-dependent distribution functions. It is based on newly defined atom-types. Additionally, the program is augmented by two novel potentials which evaluate the torsion angles and (de-)solvation effects. Validation of DSX is based on a literature-known, comprehensive data-set that allows for comparison with other popular scoring functions. DSX is intended for the recognition of near-native binding modes. In this important task, DSX outperforms the competitors, but is also among the best scoring functions regarding the ranking of different compounds. Another essential step in the development of DSX was the automatical assignment of the new atom types. A powerful programming framework was implemented to fulfill this task. Validation was done on a literature-known data-set and showed superior efficiency and quality compared to similar programs where this data was available. The front-end fconv was developed to share this functionality with the scientific community. Multiple features useful in computational drug-design workflows are also included and fconv was made freely available as Open Source Project. Based on the developed potentials for DSX, a number of further applications was created and impemented: The program HotspotsX calculates favorable interaction fields in protein binding pockets that can be used as a starting point for pharmacophoric models and that indicate possible directions for the optimization of lead structures. The program DSFP calculates scores based on fingerprints for given binding geometries. These fingerprints are compared with reference fingerprints that are derived from DSX interactions in known crystal structures of the particular target. Finally, the program DSX_wat was developed to predict stable water networks within a binding pocket. DSX interaction fields are used to calculate the putative water positions.Im Rahmen der vorliegenden Arbeit wurde eine neue Bewertungsfunktion für Protein-Ligand-Komplexe entwickelt, die nicht auf eine Affinitätsvorhersage abzielt, sondern auf die Erkennung nah-nativer Geometrien, welche dann rechenintensiveren Methoden zur Energiebestimmung zugeführt werden können. Das entwickelte Programm DSX beruht auf dem Formalismus der wissensbasierten Funktion DrugScore und nutzt somit das vorhandene Wissen aus kristallografischen Datenbanken, insbesondere die daraus abgeleiteten, Atomtyp-spezifischen Distanzverteilungsfunktionen. Es basiert jedoch auf neu definierten Atomtypen und wurde um zwei neue Arten von Potentialen zur Bewertung von Torsionswinkeln und (De-)Solvatationseffekten erweitert. Die Validierung von DSX wurde mit einem umfangreichen, Literatur-bekannten Datensatz durchgeführt, der den Vergleich mit einer Vielzahl anderer Bewertungsfunktionen erlaubt. DSX nimmt hierbei die Spitzenposition bei der angestrebten Erkennung nah-nativer Bindungsmodi ein und gehört auch beim Ranking unterschiedlicher Verbindungen zu den besten Bewertungsfunktionen. Neben der Definition neuer Atomtypen war ein entscheidender Schritt zur Entwicklung von DSX deren automatische Zuweisung. Zu diesem Zweck wurde ein leistungsfähiges Programmier-Framework entwickelt und anhand eines Literatur-bekannten Datensatzes validiert. Effizienz und Qualität der Atomtyp-Zuweisung übertreffen die Programme, für die Vergleichsdaten zur Verfügung stehen. Um Wissenschaftler ohne Programmierkenntnisse von der Atomtyp-Zuweisung profitieren zu lassen, wurde das Frontend fconv entwickelt. Es wurde um eine Vielzahl im computergestützten Wirkstoffdesign häufig benötigter Funktionen erweitert und der wissenschaftlichen Gemeinschaft als Open Source Projekt zur Verfügung gestellt. Basierend auf den für DSX entwickelten Potentialen wurde eine Reihe weiterer Anwendungen entwickelt und implementiert: Das Programm HotspotsX ermöglicht die Berechnung günstiger Interaktionsfelder in Protein-Bindetaschen, welche als Startpunkt für Pharmakophormodelle sowie als Ideengeber für die Leitstrukturoptimierung genutzt werden können. Das Programm DSFP ist eine Bewertungsfunktion, die keinen einfachen Gesamtscore für einen gegebenen Komplex berechnet, sondern bindetaschenabhängige Fingerprints erstellt. Diese können dann anhand der aus bekannten Kristallstrukturen abgeleiteten Referenzfingerprints bewertet werden. Zur Vorhersage stabiler Wassernetzwerke in Bindetaschen wurde das Programm DSX_wat entwickelt. Hierfür wurde ein heuristischer Ansatz gewählt, um aus DSX-Interaktionsfeldern exklusive Sets möglicher Wasserpositionen zu bestimmen

data_UMR - Forschungsdatenrepositorium (Philipps-Universität Marburg)