ar

Coordination structure and inter­molecular inter­actions in copper(II) acetate com­plexes with 1,10-phenanthroline and 2,2'-bi­py­ri­dine

The crystal structures of two coordination com­pounds, (acetato-κO)(2,2'-bi­py­ri­dine-κ2N,N')(1,10-phenanthroline-κ2N,N')copper(II) acetate hexa­hydrate, [Cu(C2H3O2)(C10H8N2)(C12H8N2)](C2H3O2)·6H2O or [Cu(bipy)(phen)Ac]Ac·6H2O, and (acetato-κO)bis­(2,2'-bi­py­ri­dine-κ2N,N')copper(II) acetate–acetic acid–water (1/1/3), [Cu(C2H3O2)(C10H8N2)2](C2H3O2)·C2H4O2·3H2O or [Cu(bipy)2Ac]Ac·HAc·3H2O, are reported and com­pared with the previously published structure of [Cu(phen)2Ac]Ac·7H2O (phen is 1,10-phenanthroline, bipy for 2,2'-bi­py­ri­dine, ac is acetate and Hac is acetic acid). The geometry around the metal centre is penta­coordinated, but highly distorted in all three cases. The coordination number and the geometric distortion are both discussed in detail, and all com­plexes belong to the space group Poverline{1}. The analysis of the geometric parameters and the Hirshfeld surface properties dnorm and curvedness provide information about the metal–ligand inter­actions in these com­plexes and allow com­parison with similar systems.




ar

Multivalent hy­dro­gen-bonded architectures directed by self-com­plementarity between [Cu(2,2'-bi­imid­az­ole)] and malonate building blocks

The synthesis and structural characterization of four novel supra­molecular hy­dro­gen-bonded arrangements based on self-assembly from mol­ecular `[Cu(2,2'-bi­imid­az­ole)]' modules and malonate anions are pre­sent­ed, namely, tetra­kis­(2,2'-bi­imid­az­ole)di-μ-chlorido-dimal­on­atotricopper(II) penta­hydrate, [Cu3(C3H2O4)2Cl2(C6H6N4)4]·5H2O or [Cu(H2biim)2(μ-Cl)Cu0.5(mal)]2·5H2O, aqua­(2,2'-bi­imid­az­ole)­mal­on­atocopper(II) dihydrate, [Cu(C3H2O4)(C6H6N4)(H2O)]·2H2O or [Cu(H2biim)(mal)(H2O)]·2H2O, bis­[aqua­bis­(2,2'-bi­imid­az­ole)­cop­per(II)] di­mal­on­atodi­perchloratocopper(II) 2.2-hydrate, [Cu(C6H6N4)2(H2O)]2[Cu(C3H2O4)(ClO4)2]·2.2H2O or [Cu(H2biim)2(H2O)]2[Cu(mal)2(ClO4)2]·2.2H2O, and bis­(2,2'-bi­imid­az­ole)­copper(II) bis­[bis­(2,2'-bi­imid­az­ole)(2-carb­oxy­acetato)mal­on­atocopper(II)] tridecahydrate, [Cu(C6H6N4)2][Cu(C3H2O4)(C3H3O4)(C6H6N4)2]·13H2O or [Cu(H2biim)2][Cu(H2biim)2(Hmal)(mal)]2·13H2O. These as­sem­blies are characterized by self-com­plementary donor–acceptor mol­ecular inter­actions, demonstrating a recurrent and distinctive pattern of hy­dro­gen-bonding preferences among the carboxyl­ate, carb­oxy­lic acid and N—H groups of the coordinated 2,2'-bi­imid­az­ole and malonate ligands. Additionally, co­or­din­ation of the carboxyl­ate group with the metallic centre helps sustain re­mark­able supra­molecular assemblies, such as layers, helices, double helix columns or 3D channeled architectures, including mixed-metal com­plexes, into a single structure.




ar

Revisiting a natural wine salt: calcium (2R,3R)-tar­trate tetra­hydrate

The crystal structure of the salt calcium (2R,3R)-tar­trate tetra­hydrate {sys­tem­atic name: poly[[di­aqua­[μ4-(2R,3R)-2,3-di­hydroxy­butane­dioato]calcium(II)] di­hydrate]}, {[Ca(C4H8O8)(H2O)2]·2H2O}n, is reported. The absolute configuration of the crystal was established unambiguously using anomalous dispersion effects in the diffraction patterns. High-quality data also allowed the location and free refinement of all the H atoms, and therefore to a careful analysis of the hy­dro­gen-bond inter­actions.




ar

Mol­ecular and crystal structures of six poly(arylsulfin­yl)- and poly(aryl­sulfan­yl)fer­ro­cenes

Starting from (p-tolyl­sulfin­yl)fer­ro­cene (1), a mixture of the complete series [CpFe{C5H5–n(SOTol-p)n}] (n = 2–4) (2–4) in all regioisomers was obtained. After chromatographic separation, crystals of 1,2-bis­[(4-methyl­benzene)­sulfin­yl]fer­ro­cene, 2a, and 1,3-bis­[(4-methyl­benzene)­sulfin­yl]fer­ro­cene, 2b, both [Fe(C5H5)(C19H17O2S2)], as well as of 1,2,3-tris­[(4-methyl­benzene)­sulfin­yl]fer­ro­cene, [Fe(C5H5)(C26H23O3S3)], 3a, and 1,2,3,4-tetra­kis­[(4-methyl­benzene)­sul­fin­yl]fer­ro­cene ethyl acetate 0.75-solvate, [Fe(C5H5)(C33H29O4S4)]·0.75C4H8O2, 4, could be isolated. Their mol­ecular and crystal structures are compared with each other and also with the so far un­reported structures of related 1,2-bis­(phenyl­sulfan­yl)fer­ro­cene, [Fe(C5H5)(C17H13S2)], 5, and 1,2,3,4-tetra­kis­(phenyl­sulfan­yl)fer­ro­cene, [Fe(C5H5)(C29H21S4)], 6. In all the sulfinyl structures, the O atoms of the S=O groups are in equatorial positions, except for that in tetrasubstituted 4. All the arene rings of these com­pounds (except for one ring in 4) are in axial positions directed away from the Fe atom, mostly in a near perpendicular orientation with respect to the plane of the cyclo­penta­di­en­yl ring. The main inter­molecular inter­actions in the crystals are C—H⋯H—C, C—H⋯π and C—H⋯O, while C—H⋯S inter­actions are much less important, except for tetra­sul­fan­yl com­pound 6. π–π inter­actions (intra­molecular) are only important in com­pound 3a. Hirshfeld analysis shows that dispersion terms are dominant for the inter­action energies of all six com­pounds. In general, the calculated total inter­action energies increase with increasing number of substituents and are higher for the sulfinyl than for the sul­fan­yl groups.




ar

Crystal clear: the impact of crystal structure in the development of high-performance organic semiconductors

 




ar

The High-Pressure Freezing Laboratory for Macromolecular Crystallography (HPMX), an ancillary tool for the macromolecular crystallography beamlines at the ESRF

This article describes the High-Pressure Freezing Laboratory for Macromolecular Crystallography (HPMX) at the ESRF, and highlights new and complementary research opportunities that can be explored using this facility. The laboratory is dedicated to investigating interactions between macromolecules and gases in crystallo, and finds applications in many fields of research, including fundamental biology, biochemistry, and environmental and medical science. At present, the HPMX laboratory offers the use of different high-pressure cells adapted for helium, argon, krypton, xenon, nitrogen, oxygen, carbon dioxide and methane. Important scientific applications of high pressure to macromolecules at the HPMX include noble-gas derivatization of crystals to detect and map the internal architecture of proteins (pockets, tunnels and channels) that allows the storage and diffusion of ligands or substrates/products, the investigation of the catalytic mechanisms of gas-employing enzymes (using oxygen, carbon dioxide or methane as substrates) to possibly decipher intermediates, and studies of the conformational fluctuations or structure modifications that are necessary for proteins to function. Additionally, cryo-cooling protein crystals under high pressure (helium or argon at 2000 bar) enables the addition of cryo-protectant to be avoided and noble gases can be employed to produce derivatives for structure resolution. The high-pressure systems are designed to process crystals along a well defined pathway in the phase diagram (pressure–temperature) of the gas to cryo-cool the samples according to the three-step `soak-and-freeze method'. Firstly, crystals are soaked in a pressurized pure gas atmosphere (at 294 K) to introduce the gas and facilitate its inter­actions within the macromolecules. Samples are then flash-cooled (at 100 K) while still under pressure to cryo-trap macromolecule–gas complexation states or pressure-induced protein modifications. Finally, the samples are recovered after depressurization at cryo-temperatures. The final section of this publication presents a selection of different typical high-pressure experiments carried out at the HPMX, showing that this technique has already answered a wide range of scientific questions. It is shown that the use of different gases and pressure conditions can be used to probe various effects, such as mapping the functional internal architectures of enzymes (tunnels in the haloalkane dehalogenase DhaA) and allosteric sites on membrane-protein surfaces, the interaction of non-inert gases with proteins (oxygen in the hydrogenase ReMBH) and pressure-induced structural changes of proteins (tetramer dissociation in urate oxidase). The technique is versatile and the provision of pressure cells and their application at the HPMX is gradually being extended to address new scientific questions.




ar

A web-based dashboard for RELION metadata visualization

Cryo-electron microscopy (cryo-EM) has witnessed radical progress in the past decade, driven by developments in hardware and software. While current software packages include processing pipelines that simplify the image-processing workflow, they do not prioritize the in-depth analysis of crucial metadata, limiting troubleshooting for challenging data sets. The widely used RELION software package lacks a graphical native representation of the underlying metadata. Here, two web-based tools are introduced: relion_live.py, which offers real-time feedback on data collection, aiding swift decision-making during data acquisition, and relion_analyse.py, a graphical interface to represent RELION projects by plotting essential metadata including interactive data filtration and analysis. A useful script for estimating ice thickness and data quality during movie pre-processing is also presented. These tools empower researchers to analyse data efficiently and allow informed decisions during data collection and processing.




ar

From femtoseconds to minutes: time-resolved macromolecular crystallography at XFELs and synchrotrons

Over the last decade, the development of time-resolved serial crystallography (TR-SX) at X-ray free-electron lasers (XFELs) and synchrotrons has allowed researchers to study phenomena occurring in proteins on the femtosecond-to-minute timescale, taking advantage of many technical and methodological breakthroughs. Protein crystals of various sizes are presented to the X-ray beam in either a static or a moving medium. Photoactive proteins were naturally the initial systems to be studied in TR-SX experiments using pump–probe schemes, where the pump is a pulse of visible light. Other reaction initiations through small-molecule diffusion are gaining momentum. Here, selected examples of XFEL and synchrotron time-resolved crystallography studies will be used to highlight the specificities of the various instruments and methods with respect to time resolution, and are compared with cryo-trapping studies.




ar

Fragment-based screening targeting an open form of the SARS-CoV-2 main protease binding pocket

To identify starting points for therapeutics targeting SARS-CoV-2, the Paul Scherrer Institute and Idorsia decided to collaboratively perform an X-ray crystallographic fragment screen against its main protease. Fragment-based screening was carried out using crystals with a pronounced open conformation of the substrate-binding pocket. Of 631 soaked fragments, a total of 29 hits bound either in the active site (24 hits), a remote binding pocket (three hits) or at crystal-packing interfaces (two hits). Notably, two fragments with a pose that was sterically incompatible with a more occluded crystal form were identified. Two isatin-based electrophilic fragments bound covalently to the catalytic cysteine residue. The structures also revealed a surprisingly strong influence of the crystal form on the binding pose of three published fragments used as positive controls, with implications for fragment screening by crystallography.




ar

The crystal structure of mycothiol disulfide reductase (Mtr) provides mechanistic insight into the specific low-molecular-weight thiol reductase activity of Actinobacteria

Low-molecular-weight (LMW) thiols are involved in many processes in all organisms, playing a protective role against reactive species, heavy metals, toxins and antibiotics. Actinobacteria, such as Mycobacterium tuberculosis, use the LMW thiol mycothiol (MSH) to buffer the intracellular redox environment. The NADPH-dependent FAD-containing oxidoreductase mycothiol disulfide reductase (Mtr) is known to reduce oxidized mycothiol disulfide (MSSM) to MSH, which is crucial to maintain the cellular redox balance. In this work, the first crystal structures of Mtr are presented, expanding the structural knowledge and understanding of LMW thiol reductases. The structural analyses and docking calculations provide insight into the nature of Mtrs, with regard to the binding and reduction of the MSSM substrate, in the context of related oxidoreductases. The putative binding site for MSSM suggests a similar binding to that described for the homologous glutathione reductase and its respective substrate glutathione disulfide, but with distinct structural differences shaped to fit the bulkier MSSM substrate, assigning Mtrs as uniquely functioning reductases. As MSH has been acknowledged as an attractive antitubercular target, the structural findings presented in this work may contribute towards future antituberculosis drug development.




ar

Characterization of novel mevalonate kinases from the tardigrade Ramazzottius varieornatus and the psychrophilic archaeon Methanococcoides burtonii

Mevalonate kinase is central to the isoprenoid biosynthesis pathway. Here, high-resolution X-ray crystal structures of two mevalonate kinases are presented: a eukaryotic protein from Ramazzottius varieornatus and an archaeal protein from Methanococcoides burtonii. Both enzymes possess the highly conserved motifs of the GHMP enzyme superfamily, with notable differences between the two enzymes in the N-terminal part of the structures. Biochemical characterization of the two enzymes revealed major differences in their sensitivity to geranyl pyrophosphate and farnesyl pyrophosphate, and in their thermal stabilities. This work adds to the understanding of the structural basis of enzyme inhibition and thermostability in mevalonate kinases.




ar

Structural determination and modeling of ciliary microtubules

The axoneme, a microtubule-based array at the center of every cilium, has been the subject of structural investigations for decades, but only recent advances in cryo-EM and cryo-ET have allowed a molecular-level interpretation of the entire complex to be achieved. The unique properties of the nine doublet microtubules and central pair of singlet microtubules that form the axoneme, including the highly decorated tubulin lattice and the docking of massive axonemal complexes, provide opportunities and challenges for sample preparation, 3D reconstruction and atomic modeling. Here, the approaches used for cryo-EM and cryo-ET of axonemes are reviewed, while highlighting the unique opportunities provided by the latest generation of AI-guided tools that are transforming structural biology.




ar

Mononuclear binding and catalytic activity of europium(III) and gadolinium(III) at the active site of the model metalloenzyme phosphotriesterase

Lanthanide ions have ideal chemical properties for catalysis, such as hard Lewis acidity, fast ligand-exchange kinetics, high coordination-number preferences and low geometric requirements for coordination. As a result, many small-molecule lanthanide catalysts have been described in the literature. Yet, despite the ability of enzymes to catalyse highly stereoselective reactions under gentle conditions, very few lanthanoenzymes have been investigated. In this work, the mononuclear binding of europium(III) and gadolinium(III) to the active site of a mutant of the model enzyme phosphotriesterase are described using X-ray crystallography at 1.78 and 1.61 Å resolution, respectively. It is also shown that despite coordinating a single non-natural metal cation, the PTE-R18 mutant is still able to maintain esterase activity.




ar

Scaling and merging macromolecular diffuse scattering with mdx2

Diffuse scattering is a promising method to gain additional insight into protein dynamics from macromolecular crystallography experiments. Bragg intensities yield the average electron density, while the diffuse scattering can be processed to obtain a three-dimensional reciprocal-space map that is further analyzed to determine correlated motion. To make diffuse scattering techniques more accessible, software for data processing called mdx2 has been created that is both convenient to use and simple to extend and modify. mdx2 is written in Python, and it interfaces with DIALS to implement self-contained data-reduction workflows. Data are stored in NeXus format for software interchange and convenient visualization. mdx2 can be run on the command line or imported as a package, for instance to encapsulate a complete workflow in a Jupyter notebook for reproducible computing and education. Here, mdx2 version 1.0 is described, a new release incorporating state-of-the-art techniques for data reduction. The implementation of a complete multi-crystal scaling and merging workflow is described, and the methods are tested using a high-redundancy data set from cubic insulin. It is shown that redundancy can be leveraged during scaling to correct systematic errors and obtain accurate and reproducible measurements of weak diffuse signals.




ar

Identifying and avoiding radiation damage in macromolecular crystallography

Radiation damage remains one of the major impediments to accurate structure solution in macromolecular crystallography. The artefacts of radiation damage can manifest as structural changes that result in incorrect biological interpretations being drawn from a model, they can reduce the resolution to which data can be collected and they can even prevent structure solution entirely. In this article, we discuss how to identify and mitigate against the effects of radiation damage at each stage in the macromolecular crystal structure-solution pipeline.




ar

A small step towards an important goal: fragment screen of the c-di-AMP-synthesizing enzyme CdaA

CdaA is the most widespread diadenylate cyclase in many bacterial species, including several multidrug-resistant human pathogens. The enzymatic product of CdaA, cyclic di-AMP, is a secondary messenger that is essential for the viability of many bacteria. Its absence in humans makes CdaA a very promising and attractive target for the development of new antibiotics. Here, the structural results are presented of a crystallographic fragment screen against CdaA from Listeria monocytogenes, a saprophytic Gram-positive bacterium and an opportunistic food-borne pathogen that can cause listeriosis in humans and animals. Two of the eight fragment molecules reported here were localized in the highly conserved ATP-binding site. These fragments could serve as potential starting points for the development of antibiotics against several CdaA-dependent bacterial species.




ar

Pillar data-acquisition strategies for cryo-electron tomography of beam-sensitive biological samples

For cryo-electron tomography (cryo-ET) of beam-sensitive biological specimens, a planar sample geometry is typically used. As the sample is tilted, the effective thickness of the sample along the direction of the electron beam increases and the signal-to-noise ratio concomitantly decreases, limiting the transfer of information at high tilt angles. In addition, the tilt range where data can be collected is limited by a combination of various sample-environment constraints, including the limited space in the objective lens pole piece and the possible use of fixed conductive braids to cool the specimen. Consequently, most tilt series are limited to a maximum of ±70°, leading to the presence of a missing wedge in Fourier space. The acquisition of cryo-ET data without a missing wedge, for example using a cylindrical sample geometry, is hence attractive for volumetric analysis of low-symmetry structures such as organelles or vesicles, lysis events, pore formation or filaments for which the missing information cannot be compensated by averaging techniques. Irrespective of the geometry, electron-beam damage to the specimen is an issue and the first images acquired will transfer more high-resolution information than those acquired last. There is also an inherent trade-off between higher sampling in Fourier space and avoiding beam damage to the sample. Finally, the necessity of using a sufficient electron fluence to align the tilt images means that this fluence needs to be fractionated across a small number of images; therefore, the order of data acquisition is also a factor to consider. Here, an n-helix tilt scheme is described and simulated which uses overlapping and interleaved tilt series to maximize the use of a pillar geometry, allowing the entire pillar volume to be reconstructed as a single unit. Three related tilt schemes are also evaluated that extend the continuous and classic dose-symmetric tilt schemes for cryo-ET to pillar samples to enable the collection of isotropic information across all spatial frequencies. A fourfold dose-symmetric scheme is proposed which provides a practical compromise between uniform information transfer and complexity of data acquisition.




ar

Introduction of the Capsules environment to support further growth of the SBGrid structural biology software collection

The expansive scientific software ecosystem, characterized by millions of titles across various platforms and formats, poses significant challenges in maintaining reproducibility and provenance in scientific research. The diversity of independently developed applications, evolving versions and heterogeneous components highlights the need for rigorous methodologies to navigate these complexities. In response to these challenges, the SBGrid team builds, installs and configures over 530 specialized software applications for use in the on-premises and cloud-based computing environments of SBGrid Consortium members. To address the intricacies of supporting this diverse application collection, the team has developed the Capsule Software Execution Environment, generally referred to as Capsules. Capsules rely on a collection of programmatically generated bash scripts that work together to isolate the runtime environment of one application from all other applications, thereby providing a transparent cross-platform solution without requiring specialized tools or elevated account privileges for researchers. Capsules facilitate modular, secure software distribution while maintaining a centralized, conflict-free environment. The SBGrid platform, which combines Capsules with the SBGrid collection of structural biology applications, aligns with FAIR goals by enhancing the findability, accessibility, interoperability and reusability of scientific software, ensuring seamless functionality across diverse computing environments. Its adaptability enables application beyond structural biology into other scientific fields.




ar

Deep-learning map segmentation for protein X-ray crystallographic structure determination

When solving a structure of a protein from single-wavelength anomalous diffraction X-ray data, the initial phases obtained by phasing from an anomalously scattering substructure usually need to be improved by an iterated electron-density modification. In this manuscript, the use of convolutional neural networks (CNNs) for segmentation of the initial experimental phasing electron-density maps is proposed. The results reported demonstrate that a CNN with U-net architecture, trained on several thousands of electron-density maps generated mainly using X-ray data from the Protein Data Bank in a supervised learning, can improve current density-modification methods.




ar

Managing macromolecular crystallographic data with a laboratory information management system

Protein crystallography is an established method to study the atomic structures of macromolecules and their complexes. A prerequisite for successful structure determination is diffraction-quality crystals, which may require extensive optimization of both the protein and the conditions, and hence projects can stretch over an extended period, with multiple users being involved. The workflow from crystallization and crystal treatment to deposition and publication is well defined, and therefore an electronic laboratory information management system (LIMS) is well suited to management of the data. Completion of the project requires key information on all the steps being available and this information should also be made available according to the FAIR principles. As crystallized samples are typically shipped between facilities, a key feature to be captured in the LIMS is the exchange of metadata between the crystallization facility of the home laboratory and, for example, synchrotron facilities. On completion, structures are deposited in the Protein Data Bank (PDB) and the LIMS can include the PDB code in its database, completing the chain of custody from crystallization to structure deposition and publication. A LIMS designed for macromolecular crystallography, IceBear, is available as a standalone installation and as a hosted service, and the implementation of key features for the capture of metadata in IceBear is discussed as an example.




ar

Cryo2RT: a high-throughput method for room-temperature macromolecular crystallography from cryo-cooled crystals

Advances in structural biology have relied heavily on synchrotron cryo-crystallography and cryogenic electron microscopy to elucidate biological processes and for drug discovery. However, disparities between cryogenic and room-temperature (RT) crystal structures pose challenges. Here, Cryo2RT, a high-throughput RT data-collection method from cryo-cooled crystals that leverages the cryo-crystallography workflow, is introduced. Tested on endothiapepsin crystals with four soaked fragments, thaumatin and SARS-CoV-2 3CLpro, Cryo2RT reveals unique ligand-binding poses, offers a comparable throughput to cryo-crystallography and eases the exploration of structural dynamics at various temperatures.




ar

Structural analysis of a ligand-triggered intermolecular disulfide switch in a major latex protein from opium poppy

Several proteins from plant pathogenesis-related family 10 (PR10) are highly abundant in the latex of opium poppy and have recently been shown to play diverse and important roles in the biosynthesis of benzylisoquinoline alkaloids (BIAs). The recent determination of the first crystal structures of PR10-10 showed how large conformational changes in a surface loop and adjacent β-strand are coupled to the binding of BIA compounds to the central hydrophobic binding pocket. A more detailed analysis of these conformational changes is now reported to further clarify how ligand binding is coupled to the formation and cleavage of an intermolecular disulfide bond that is only sterically allowed when the BIA binding pocket is empty. To decouple ligand binding from disulfide-bond formation, each of the two highly conserved cysteine residues (Cys59 and Cys155) in PR10-10 was replaced with serine using site-directed mutagenesis. Crystal structures of the Cys59Ser mutant were determined in the presence of papaverine and in the absence of exogenous BIA compounds. A crystal structure of the Cys155Ser mutant was also determined in the absence of exogenous BIA compounds. All three of these crystal structures reveal conformations similar to that of wild-type PR10-10 with bound BIA compounds. In the absence of exogenous BIA compounds, the Cys59Ser and Cys155Ser mutants appear to bind an unidentified ligand or mixture of ligands that was presumably introduced during expression of the proteins in Escherichia coli. The analysis of conformational changes triggered by the binding of BIA compounds suggests a molecular mechanism coupling ligand binding to the disruption of an intermolecular disulfide bond. This mechanism may be involved in the regulation of biosynthetic reactions in plants and possibly other organisms.




ar

Comparison of two crystal polymorphs of NowGFP reveals a new conformational state trapped by crystal packing

Crystal polymorphism serves as a strategy to study the conformational flexibility of proteins. However, the relationship between protein crystal packing and protein conformation often remains elusive. In this study, two distinct crystal forms of a green fluorescent protein variant, NowGFP, are compared: a previously identified monoclinic form (space group C2) and a newly discovered ortho­rhombic form (space group P212121). Comparative analysis reveals that both crystal forms exhibit nearly identical linear assemblies of NowGFP molecules interconnected through similar crystal contacts. However, a notable difference lies in the stacking of these assemblies: parallel in the monoclinic form and perpendicular in the orthorhombic form. This distinct mode of stacking leads to different crystal contacts and induces structural alteration in one of the two molecules within the asymmetric unit of the orthorhombic crystal form. This new conformational state captured by orthorhombic crystal packing exhibits two unique features: a conformational shift of the β-barrel scaffold and a restriction of pH-dependent shifts of the key residue Lys61, which is crucial for the pH-dependent spectral shift of this protein. These findings demonstrate a clear connection between crystal packing and alternative conformational states of proteins, providing insights into how structural variations influence the function of fluorescent proteins.




ar

Robust and automatic beamstop shadow outlier rejection: combining crystallographic statistics with modern clustering under a semi-supervised learning strategy

During the automatic processing of crystallographic diffraction experiments, beamstop shadows are often unaccounted for or only partially masked. As a result of this, outlier reflection intensities are integrated, which is a known issue. Traditional statistical diagnostics have only limited effectiveness in identifying these outliers, here termed Not-Excluded-unMasked-Outliers (NEMOs). The diagnostic tool AUSPEX allows visual inspection of NEMOs, where they form a typical pattern: clusters at the low-resolution end of the AUSPEX plots of intensities or amplitudes versus resolution. To automate NEMO detection, a new algorithm was developed by combining data statistics with a density-based clustering method. This approach demonstrates a promising performance in detecting NEMOs in merged data sets without disrupting existing data-reduction pipelines. Re-refinement results indicate that excluding the identified NEMOs can effectively enhance the quality of subsequent structure-determination steps. This method offers a prospective automated means to assess the efficacy of a beamstop mask, as well as highlighting the potential of modern pattern-recognition techniques for automating outlier exclusion during data processing, facilitating future adaptation to evolving experimental strategies.




ar

Utilizing anomalous signals for element identification in macromolecular crystallography

AlphaFold2 has revolutionized structural biology by offering unparalleled accuracy in predicting protein structures. Traditional methods for determining protein structures, such as X-ray crystallography and cryo-electron microscopy, are often time-consuming and resource-intensive. AlphaFold2 provides models that are valuable for molecular replacement, aiding in model building and docking into electron density or potential maps. However, despite its capabilities, models from AlphaFold2 do not consistently match the accuracy of experimentally determined structures, need to be validated experimentally and currently miss some crucial information, such as post-translational modifications, ligands and bound ions. In this paper, the advantages are explored of collecting X-ray anomalous data to identify chemical elements, such as metal ions, which are key to understanding certain structures and functions of proteins. This is achieved through methods such as calculating anomalous difference Fourier maps or refining the imaginary component of the anomalous scattering factor f''. Anomalous data can serve as a valuable complement to the information provided by AlphaFold2 models and this is particularly significant in elucidating the roles of metal ions.




ar

Structural studies of β-glucosidase from the thermophilic bacterium Caldicellulosiruptor saccharolyticus

β-Glucosidase from the thermophilic bacterium Caldicellulosiruptor saccharo­lyticus (Bgl1) has been denoted as having an attractive catalytic profile for various industrial applications. Bgl1 catalyses the final step of in the decomposition of cellulose, an unbranched glucose polymer that has attracted the attention of researchers in recent years as it is the most abundant renewable source of reduced carbon in the biosphere. With the aim of enhancing the thermostability of Bgl1 for a broad spectrum of biotechnological processes, it has been subjected to structural studies. Crystal structures of Bgl1 and its complex with glucose were determined at 1.47 and 1.95 Å resolution, respectively. Bgl1 is a member of glycosyl hydrolase family 1 (GH1 superfamily, EC 3.2.1.21) and the results showed that the 3D structure of Bgl1 follows the overall architecture of the GH1 family, with a classical (β/α)8 TIM-barrel fold. Comparisons of Bgl1 with sequence or structural homologues of β-glucosidase reveal quite similar structures but also unique structural features in Bgl1 with plausible functional roles.




ar

CHiMP: deep-learning tools trained on protein crystallization micrographs to enable automation of experiments

A group of three deep-learning tools, referred to collectively as CHiMP (Crystal Hits in My Plate), were created for analysis of micrographs of protein crystallization experiments at the Diamond Light Source (DLS) synchrotron, UK. The first tool, a classification network, assigns images into categories relating to experimental outcomes. The other two tools are networks that perform both object detection and instance segmentation, resulting in masks of individual crystals in the first case and masks of crystallization droplets in addition to crystals in the second case, allowing the positions and sizes of these entities to be recorded. The creation of these tools used transfer learning, where weights from a pre-trained deep-learning network were used as a starting point and repurposed by further training on a relatively small set of data. Two of the tools are now integrated at the VMXi macromolecular crystallography beamline at DLS, where they have the potential to absolve the need for any user input, both for monitoring crystallization experiments and for triggering in situ data collections. The third is being integrated into the XChem fragment-based drug-discovery screening platform, also at DLS, to allow the automatic targeting of acoustic compound dispensing into crystallization droplets.




ar

The success rate of processed predicted models in molecular replacement: implications for experimental phasing in the AlphaFold era

The availability of highly accurate protein structure predictions from AlphaFold2 (AF2) and similar tools has hugely expanded the applicability of molecular replacement (MR) for crystal structure solution. Many structures can be solved routinely using raw models, structures processed to remove unreliable parts or models split into distinct structural units. There is therefore an open question around how many and which cases still require experimental phasing methods such as single-wavelength anomalous diffraction (SAD). Here, this question is addressed using a large set of PDB depositions that were solved by SAD. A large majority (87%) could be solved using unedited or minimally edited AF2 predictions. A further 18 (4%) yield straightforwardly to MR after splitting of the AF2 prediction using Slice'N'Dice, although different splitting methods succeeded on slightly different sets of cases. It is also found that further unique targets can be solved by alternative modelling approaches such as ESMFold (four cases), alternative MR approaches such as ARCIMBOLDO and AMPLE (two cases each), and multimeric model building with AlphaFold-Multimer or UniFold (three cases). Ultimately, only 12 cases, or 3% of the SAD-phased set, did not yield to any form of MR tested here, offering valuable hints as to the number and the characteristics of cases where experimental phasing remains essential for macromolecular structure solution.




ar

Structure determination using high-order spatial correlations in single-particle X-ray scattering

Single-particle imaging using X-ray free-electron lasers (XFELs) is a promising technique for observing nanoscale biological samples under near-physiological conditions. However, as the sample's orientation in each diffraction pattern is unknown, advanced algorithms are required to reconstruct the 3D diffraction intensity volume and subsequently the sample's density model. While most approaches perform 3D reconstruction via determining the orientation of each diffraction pattern, a correlation-based approach utilizes the averaged spatial correlations of diffraction intensities over all patterns, making it well suited for processing experimental data with a poor signal-to-noise ratio of individual patterns. Here, a method is proposed to determine the 3D structure of a sample by analyzing the double, triple and quadruple spatial correlations in diffraction patterns. This ab initio method can reconstruct the basic shape of an irregular unsymmetric 3D sample without requiring any prior knowledge of the sample. The impact of background and noise on correlations is investigated and corrected to ensure the success of reconstruction under simulated experimental conditions. Additionally, the feasibility of using the correlation-based approach to process incomplete partial diffraction patterns is demonstrated. The proposed method is a variable addition to existing algorithms for 3D reconstruction and will further promote the development and adoption of XFEL single-particle imaging techniques.




ar

Orientational ordering and assembly of silica–nickel Janus particles in a magnetic field

The orientation ordering and assembly behavior of silica–nickel Janus particles in a static external magnetic field were probed by ultra small-angle X-ray scattering (USAXS). Even in a weak applied field, the net magnetic moments of the individual particles aligned in the direction of the field, as indicated by the anisotropy in the recorded USAXS patterns. X-ray photon correlation spectroscopy (XPCS) measurements on these suspensions revealed that the corresponding particle dynamics are primarily Brownian diffusion [Zinn, Sharpnack & Narayanan (2023). Soft Matter, 19, 2311–2318]. At higher fields, the magnetic forces led to chain-like configurations of particles, as indicated by an additional feature in the USAXS pattern. A theoretical framework is provided for the quantitative interpretation of the observed anisotropic scattering diagrams and the corresponding degree of orientation. No anisotropy was detected when the magnetic field was applied along the beam direction, which is also replicated by the model. The method presented here could be useful for the interpretation of oriented scattering patterns from a wide variety of particulate systems. The combination of USAXS and XPCS is a powerful approach for investigating asymmetric colloidal particles in external fields.




ar

Dynamic X-ray speckle-tracking imaging with high-accuracy phase retrieval based on deep learning

Speckle-tracking X-ray imaging is an attractive candidate for dynamic X-ray imaging owing to its flexible setup and simultaneous yields of phase, transmission and scattering images. However, traditional speckle-tracking imaging methods suffer from phase distortion at locations with abrupt changes in density, which is always the case for real samples, limiting the applications of the speckle-tracking X-ray imaging method. In this paper, we report a deep-learning based method which can achieve dynamic X-ray speckle-tracking imaging with high-accuracy phase retrieval. The calibration results of a phantom show that the profile of the retrieved phase is highly consistent with the theoretical one. Experiments of polyurethane foaming demonstrated that the proposed method revealed the evolution of the complicated microstructure of the bubbles accurately. The proposed method is a promising solution for dynamic X-ray imaging with high-accuracy phase retrieval, and has extensive applications in metrology and quantitative analysis of dynamics in material science, physics, chemistry and biomedicine.




ar

Refining short-range order parameters from the three-dimensional diffuse scattering in single-crystal electron diffraction data

Our study compares short-range order parameters refined from the diffuse scattering in single-crystal X-ray and single-crystal electron diffraction data. Nb0.84CoSb was chosen as a reference material. The correlations between neighbouring vacancies and the displacements of Sb and Co atoms were refined from the diffuse scattering using a Monte Carlo refinement in DISCUS. The difference between the Sb and Co displacements refined from the diffuse scattering and the Sb and Co displacements refined from the Bragg reflections in single-crystal X-ray diffraction data is 0.012 (7) Å for the refinement on diffuse scattering in single-crystal X-ray diffraction data and 0.03 (2) Å for the refinement on the diffuse scattering in single-crystal electron diffraction data. As electron diffraction requires much smaller crystals than X-ray diffraction, this opens up the possibility of refining short-range order parameters in many technologically relevant materials for which no crystals large enough for single-crystal X-ray diffraction are available.




ar

Solving protein structures by combining structure prediction, molecular replacement and direct-methods-aided model completion

Highly accurate protein structure prediction can generate accurate models of protein and protein–protein complexes in X-ray crystallography. However, the question of how to make more effective use of predicted models for completing structure analysis, and which strategies should be employed for the more challenging cases such as multi-helical structures, multimeric structures and extremely large structures, both in the model preparation and in the completion steps, remains open for discussion. In this paper, a new strategy is proposed based on the framework of direct methods and dual-space iteration, which can greatly simplify the pre-processing steps of predicted models both in normal and in challenging cases. Following this strategy, full-length models or the conservative structural domains could be used directly as the starting model, and the phase error and the model bias between the starting model and the real structure would be modified in the direct-methods-based dual-space iteration. Many challenging cases (from CASP14) have been tested for the general applicability of this constructive strategy, and almost complete models have been generated with reasonable statistics. The hybrid strategy therefore provides a meaningful scheme for X-ray structure determination using a predicted model as the starting point.




ar

The prediction of single-molecule magnet properties via deep learning

This paper uses deep learning to present a proof-of-concept for data-driven chemistry in single-molecule magnets (SMMs). Previous discussions within SMM research have proposed links between molecular structures (crystal structures) and single-molecule magnetic properties; however, these have only interpreted the results. Therefore, this study introduces a data-driven approach to predict the properties of SMM structures using deep learning. The deep-learning model learns the structural features of the SMM molecules by extracting the single-molecule magnetic properties from the 3D coordinates presented in this paper. The model accurately determined whether a molecule was a single-molecule magnet, with an accuracy rate of approximately 70% in predicting the SMM properties. The deep-learning model found SMMs from 20 000 metal complexes extracted from the Cambridge Structural Database. Using deep-learning models for predicting SMM properties and guiding the design of novel molecules is promising.




ar

Community recommendations on cryoEM data archiving and validation

In January 2020, a workshop was held at EMBL-EBI (Hinxton, UK) to discuss data requirements for the deposition and validation of cryoEM structures, with a focus on single-particle analysis. The meeting was attended by 47 experts in data processing, model building and refinement, validation, and archiving of such structures. This report describes the workshop's motivation and history, the topics discussed, and the resulting consensus recommendations. Some challenges for future methods-development efforts in this area are also highlighted, as is the implementation to date of some of the recommendations.




ar

Cocrystals of a coumarin derivative: an efficient approach towards anti-leishmanial cocrystals against MIL-resistant Leishmania tropica

Leishmaniasis is a neglected parasitic tropical disease with numerous clinical manifestations. One of the causative agents of cutaneous leishmaniasis (CL) is Leishmania tropica (L. tropica) known for causing ulcerative lesions on the skin. The adverse effects of the recommended available drugs, such as amphotericin B and pentavalent antimonial, and the emergence of drug resistance in parasites, mean the search for new safe and effective anti-leishmanial agents is crucial. Miltefosine (MIL) was the first recommended oral medication, but its use is now limited because of the rapid emergence of resistance. Pharmaceutical cocrystallization is an effective method to improve the physicochemical and biological properties of active pharmaceutical ingredients (APIs). Herein, we describe the cocrystallization of coumarin-3-carb­oxy­lic acid (CU, 1a; 2-oxobenzo­pyrane-3-carb­oxy­lic acid, C10H6O4) with five coformers [2-amino-3-bromo­pyridine (1b), 2-amino-5-(tri­fluoro­methyl)-pyridine (1c), 2-amino-6-methyl­pyridine (1d), p-amino­benzoic acid (1e) and amitrole (1f)] in a 1:1 stoichiometric ratio via the neat grinding method. The cocrystals 2–6 obtained were characterized via single-crystal X-ray diffraction, powder X-ray diffraction, differential scanning calorimetry and thermogravimetric analysis, as well as Fourier transform infrared spectroscopy. Non-covalent interactions, such as van der Waals, hydrogen bonding, C—H⋯π and π⋯π interactions contribute significantly towards the packing of a crystal structure and alter the physicochemical and biological activity of CU. In this research, newly synthesized cocrystals were evaluated for their anti-leishmanial activity against the MIL-resistant L. tropica and cytotoxicity against the 3T3 (normal fibroblast) cell line. Among the non-cytotoxic cocrystals synthesized (2–6), CU:1b (2, IC50 = 61.83 ± 0.59 µM), CU:1c (3, 125.7 ± 1.15 µM) and CU:1d (4, 48.71 ± 0.75 µM) appeared to be potent anti-leishmanial agents and showed several-fold more anti-leishmanial potential than the tested standard drug (MIL, IC50 = 169.55 ± 0.078 µM). The results indicate that cocrystals 2–4 are promising anti-leishmanial agents which require further exploration.




ar

Dynamical refinement with multipolar electron scattering factors

Dynamical refinement is a well established method for refining crystal structures against 3D electron diffraction (ED) data and its benefits have been discussed in the literature [Palatinus, Petříček & Corrêa, (2015). Acta Cryst. A71, 235–244; Palatinus, Corrêa et al. (2015). Acta Cryst. B71, 740–751]. However, until now, dynamical refinements have only been conducted using the independent atom model (IAM). Recent research has shown that a more accurate description can be achieved by applying the transferable aspherical atom model (TAAM), but this has been limited only to kinematical refinements [Gruza et al. (2020). Acta Cryst. A76, 92–109; Jha et al. (2021). J. Appl. Cryst. 54, 1234–1243]. In this study, we combine dynamical refinement with TAAM for the crystal structure of 1-methyl­uracil, using data from precession ED. Our results show that this approach improves the residual Fourier electrostatic potential and refinement figures of merit. Furthermore, it leads to systematic changes in the atomic displacement parameters of all atoms and the positions of hydrogen atoms. We found that the refinement results are sensitive to the parameters used in the TAAM modelling process. Though our results show that TAAM offers superior performance compared with IAM in all cases, they also show that TAAM parameters obtained by periodic DFT calculations on the refined structure are superior to the TAAM parameters from the UBDB/MATTS database. It appears that multipolar parameters transferred from the database may not be sufficiently accurate to provide a satisfactory description of all details of the electrostatic potential probed by the 3D ED experiment.




ar

The ABC toxin complex from Yersinia entomophaga can package three different cytotoxic components expressed from distinct genetic loci in an unfolded state: the structures of both shell and cargo

Bacterial ABC toxin complexes (Tcs) comprise three core proteins: TcA, TcB and TcC. The TcA protein forms a pentameric assembly that attaches to the surface of target cells and penetrates the cell membrane. The TcB and TcC proteins assemble as a heterodimeric TcB–TcC subcomplex that makes a hollow shell. This TcB–TcC subcomplex self-cleaves and encapsulates within the shell a cytotoxic `cargo' encoded by the C-terminal region of the TcC protein. Here, we describe the structure of a previously uncharacterized TcC protein from Yersinia entomophaga, encoded by a gene at a distant genomic location from the genes encoding the rest of the toxin complex, in complex with the TcB protein. When encapsulated within the TcB–TcC shell, the C-terminal toxin adopts an unfolded and disordered state, with limited areas of local order stabilized by the chaperone-like inner surface of the shell. We also determined the structure of the toxin cargo alone and show that when not encapsulated within the shell, it adopts an ADP-ribosyltransferase fold most similar to the catalytic domain of the SpvB toxin from Salmonella typhimurium. Our structural analysis points to a likely mechanism whereby the toxin acts directly on actin, modifying it in a way that prevents normal polymerization.




ar

Crystal structure of human peptidylarginine deiminase type VI (PAD6) provides insights into its inactivity

Human peptidylarginine deiminase isoform VI (PAD6), which is predominantly limited to cytoplasmic lattices in the mammalian oocytes in ovarian tissue, is essential for female fertility. It belongs to the peptidylarginine deiminase (PAD) enzyme family that catalyzes the conversion of arginine residues to citrulline in proteins. In contrast to other members of the family, recombinant PAD6 was previously found to be catalytically inactive. We sought to provide structural insight into the human homologue to shed light on this observation. We report here the first crystal structure of PAD6, determined at 1.7 Å resolution. PAD6 follows the same domain organization as other structurally known PAD isoenzymes. Further structural analysis and size-exclusion chromatography show that PAD6 behaves as a homodimer similar to PAD4. Differential scanning fluorimetry suggests that PAD6 does not coordinate Ca2+ which agrees with acidic residues found to coordinate Ca2+ in other PAD homologs not being conserved in PAD6. The crystal structure of PAD6 shows similarities with the inactive state of apo PAD2, in which the active site conformation is unsuitable for catalytic citrullination. The putative active site of PAD6 adopts a non-productive conformation that would not allow protein–substrate binding due to steric hindrance with rigid secondary structure elements. This observation is further supported by the lack of activity on the histone H3 and cytokeratin 5 substrates. These findings suggest a different mechanism for enzymatic activation compared with other PADs; alternatively, PAD6 may exert a non-enzymatic function in the cytoplasmic lattice of oocytes and early embryos.




ar

RCSB Protein Data Bank: supporting research and education worldwide through explorations of experimentally determined and computationally predicted atomic level 3D biostructures

The Protein Data Bank (PDB) was established as the first open-access digital data resource in biology and medicine in 1971 with seven X-ray crystal structures of proteins. Today, the PDB houses >210 000 experimentally determined, atomic level, 3D structures of proteins and nucleic acids as well as their complexes with one another and small molecules (e.g. approved drugs, enzyme cofactors). These data provide insights into fundamental biology, biomedicine, bioenergy and biotechnology. They proved particularly important for understanding the SARS-CoV-2 global pandemic. The US-funded Research Collaboratory for Structural Bioinformatics Protein Data Bank (RCSB PDB) and other members of the Worldwide Protein Data Bank (wwPDB) partnership jointly manage the PDB archive and support >60 000 `data depositors' (structural biologists) around the world. wwPDB ensures the quality and integrity of the data in the ever-expanding PDB archive and supports global open access without limitations on data usage. The RCSB PDB research-focused web portal at https://www.rcsb.org/ (RCSB.org) supports millions of users worldwide, representing a broad range of expertise and interests. In addition to retrieving 3D structure data, PDB `data consumers' access comparative data and external annotations, such as information about disease-causing point mutations and genetic variations. RCSB.org also provides access to >1 000 000 computed structure models (CSMs) generated using artificial intelligence/machine-learning methods. To avoid doubt, the provenance and reliability of experimentally determined PDB structures and CSMs are identified. Related training materials are available to support users in their RCSB.org explorations.




ar

Linking solid-state phenomena via energy differences in `archetype crystal structures'

Categorization underlies understanding. Conceptualizing solid-state structures of organic molecules with `archetype crystal structures' bridges established categories of disorder, polymorphism and solid solutions and is herein extended to special position and high-Z' structures. The concept was developed in the context of disorder modelling [Dittrich, B. (2021). IUCrJ, 8, 305–318] and relies on adding quantum chemical energy differences between disorder components to other criteria as an explanation as to why disorder – and disappearing disorder – occurs in an average structure. Part of the concept is that disorder, as probed by diffraction, affects entire molecules, rather than just the parts of a molecule with differing conformations, and the finding that an R·T energy difference between disorder archetypes is usually not exceeded. An illustrative example combining disorder and special positions is the crystal structure of oestradiol hemihydrate analysed here, where its space-group/subgroup relationship is required to explain its disorder of hydrogen-bonded hydrogen atoms. In addition, we show how high-Z' structures can also be analysed energetically and understood via archetypes: high-Z' structures occur when an energy gain from combining different rather than overall alike conformations in a crystal significantly exceeds R·T, and this finding is discussed in the context of earlier explanations in the literature. Twinning is not related to archetype structures since it involves macroscopic domains of the same crystal structure. Archetype crystal structures are distinguished from crystal structure prediction trial structures in that an experimental reference structure is required for them. Categorization into archetype structures also has practical relevance, leading to a new practice of disorder modelling in experimental least-squares refinement alluded to in the above-mentioned publication.




ar

Structural insights into the molecular mechanism of phytoplasma immunodominant membrane protein

Immunodominant membrane protein (IMP) is a prevalent membrane protein in phytoplasma and has been confirmed to be an F-actin-binding protein. However, the intricate molecular mechanisms that govern the function of IMP require further elucidation. In this study, the X-ray crystallographic structure of IMP was determined and insights into its interaction with plant actin are provided. A comparative analysis with other proteins demonstrates that IMP shares structural homology with talin rod domain-containing protein 1 (TLNRD1), which also functions as an F-actin-binding protein. Subsequent molecular-docking studies of IMP and F-actin reveal that they possess complementary surfaces, suggesting a stable interaction. The low potential energy and high confidence score of the IMP–F-actin binding model indicate stable binding. Additionally, by employing immunoprecipitation and mass spectrometry, it was discovered that IMP serves as an interaction partner for the phytoplasmal effector causing phyllody 1 (PHYL1). It was then shown that both IMP and PHYL1 are highly expressed in the S2 stage of peanut witches' broom phytoplasma-infected Catharanthus roseus. The association between IMP and PHYL1 is substantiated through in vivo immunoprecipitation, an in vitro cross-linking assay and molecular-docking analysis. Collectively, these findings expand the current understanding of IMP interactions and enhance the comprehension of the interaction of IMP with plant F-actin. They also unveil a novel interaction pathway that may influence phytoplasma pathogenicity and host plant responses related to PHYL1. This discovery could pave the way for the development of new strategies to overcome phytoplasma-related plant diseases.




ar

Toward a quantitative description of solvation structure: a framework for differential solution scattering measurements

Appreciating that the role of the solute–solvent and other outer-sphere interactions is essential for understanding chemistry and chemical dynamics in solution, experimental approaches are needed to address the structural consequences of these interactions, complementing condensed-matter simulations and coarse-grained theories. High-energy X-ray scattering (HEXS) combined with pair distribution function analysis presents the opportunity to probe these structures directly and to develop quantitative, atomistic models of molecular systems in situ in the solution phase. However, at concentrations relevant to solution-phase chemistry, the total scattering signal is dominated by the bulk solvent, prompting researchers to adopt a differential approach to eliminate this unwanted background. Though similar approaches are well established in quantitative structural studies of macromolecules in solution by small- and wide-angle X-ray scattering (SAXS/WAXS), analogous studies in the HEXS regime—where sub-ångström spatial resolution is achieved—remain underdeveloped, in part due to the lack of a rigorous theoretical description of the experiment. To address this, herein we develop a framework for differential solution scattering experiments conducted at high energies, which includes concepts of the solvent-excluded volume introduced to describe SAXS/WAXS data, as well as concepts from the time-resolved X-ray scattering community. Our theory is supported by numerical simulations and experiment and paves the way for establishing quantitative methods to determine the atomic structures of small molecules in solution with resolution approaching that of crystallography.




ar

A step towards 6D WAXD tensor tomography

X-ray scattering/diffraction tensor tomography techniques are promising methods to acquire the 3D texture information of heterogeneous biological tissues at micrometre resolution. However, the methods suffer from a long overall acquisition time due to multi-dimensional scanning across real and reciprocal space. Here, a new approach is introduced to obtain 3D reciprocal information of each illuminated scanning volume using mathematic modeling, which is equivalent to a physical scanning procedure for collecting the full reciprocal information required for voxel reconstruction. The virtual reciprocal scanning scheme was validated by a simulated 6D wide-angle X-ray diffraction tomography experiment. The theoretical validation of the method represents an important technological advancement for 6D diffraction tensor tomography and a crucial step towards pervasive applications in the characterization of heterogeneous materials.




ar

The evolution of raw data archiving and the growth of its importance in crystallography

The hardware for data archiving has expanded capacities for digital storage enormously in the past decade or more. The IUCr evaluated the costs and benefits of this within an official working group which advised that raw data archiving would allow ground truth reproducibility in published studies. Consultations of the IUCr's Commissions ensued via a newly constituted standing advisory committee, the Committee on Data. At all stages, the IUCr financed workshops to facilitate community discussions and possible methods of raw data archiving implementation. The recent launch of the IUCrData journal's Raw Data Letters is a milestone in the implementation of raw data archiving beyond the currently published studies: it includes diffraction patterns that have not been fully interpreted, if at all. The IUCr 75th Congress in Melbourne included a workshop on raw data reuse, discussing the successes and ongoing challenges of raw data reuse. This article charts the efforts of the IUCr to facilitate discussions and plans relating to raw data archiving and reuse within the various communities of crystallography, diffraction and scattering.




ar

Structural insight into piezo-solvatochromism of Reichardt's dye

To date, accurate modelling of the solvation process is challenging, often over-simplifying the solvent–solute interactions. The interplay between the molecular arrangement associated with the solvation process and crystal nucleation has been investigated by analysis of the piezo-solvatochromic behaviour of Reichardt's dye, ET(1), in methanol, ethanol and acetone under high pressure. High-pressure single-crystal X-ray diffraction and UV–Vis spectroscopy reveal the impact of solute–solvent interactions on the optical properties of ET(1). The study underscores the intricate relationship between solvent properties, molecular conformation and crystal packing. The connection between liquid and solid phases emphasizes the capabilities of high-pressure methods for expanding the field of crystal engineering. The high-pressure environment allowed the determination of the crystal structures reported here that are built from organic molecules fourfold solvated with ethanol or methanol: ET(1)·4CH3OH and ET(1)·4C2H5OH·H2O. The observed piezo-solvatochromic effects highlight the potential of ET(1) in nonlinear optoelectronics and expand the application of solvatochromic chemical indicators to pressure sensors.




ar

From X-ray crystallographic structure to intrinsic thermodynamics of protein–ligand binding using carbonic anhydrase isozymes as a model system

Carbonic anhydrase (CA) was among the first proteins whose X-ray crystal structure was solved to atomic resolution. CA proteins have essentially the same fold and similar active centers that differ in only several amino acids. Primary sulfonamides are well defined, strong and specific binders of CA. However, minor variations in chemical structure can significantly alter their binding properties. Over 1000 sulfonamides have been designed, synthesized and evaluated to understand the correlations between the structure and thermodynamics of their binding to the human CA isozyme family. Compound binding was determined by several binding assays: fluorescence-based thermal shift assay, stopped-flow enzyme activity inhibition assay, isothermal titration calorimetry and competition assay for enzyme expressed on cancer cell surfaces. All assays have advantages and limitations but are necessary for deeper characterization of these protein–ligand interactions. Here, the concept and importance of intrinsic binding thermodynamics is emphasized and the role of structure–thermodynamics correlations for the novel inhibitors of CA IX is discussed – an isozyme that is overexpressed in solid hypoxic tumors, and thus these inhibitors may serve as anticancer drugs. The abundant structural and thermodynamic data are assembled into the Protein–Ligand Binding Database to understand general protein–ligand recognition principles that could be used in drug discovery.




ar

A predicted model-aided reconstruction algorithm for X-ray free-electron laser single-particle imaging

Ultra-intense, ultra-fast X-ray free-electron lasers (XFELs) enable the imaging of single protein molecules under ambient temperature and pressure. A crucial aspect of structure reconstruction involves determining the relative orientations of each diffraction pattern and recovering the missing phase information. In this paper, we introduce a predicted model-aided algorithm for orientation determination and phase retrieval, which has been tested on various simulated datasets and has shown significant improvements in the success rate, accuracy and efficiency of XFEL data reconstruction.




ar

A modified phase-retrieval algorithm to facilitate automatic de novo macromolecular structure determination in single-wavelength anomalous diffraction

The success of experimental phasing in macromolecular crystallography relies primarily on the accurate locations of heavy atoms bound to the target crystal. To improve the process of substructure determination, a modified phase-retrieval algorithm built on the framework of the relaxed alternating averaged reflection (RAAR) algorithm has been developed. Importantly, the proposed algorithm features a combination of the π-half phase perturbation for weak reflections and enforces the direct-method-based tangent formula for strong reflections in reciprocal space. The proposed algorithm is extensively demonstrated on a total of 100 single-wavelength anomalous diffraction (SAD) experimental datasets, comprising both protein and nucleic acid structures of different qualities. Compared with the standard RAAR algorithm, the modified phase-retrieval algorithm exhibits significantly improved effectiveness and accuracy in SAD substructure determination, highlighting the importance of additional constraints for algorithmic performance. Furthermore, the proposed algorithm can be performed without human intervention under most conditions owing to the self-adaptive property of the input parameters, thus making it convenient to be integrated into the structural determination pipeline. In conjunction with the IPCAS software suite, we demonstrated experimentally that automatic de novo structure determination is possible on the basis of our proposed algorithm.




ar

Benchmarking predictive methods for small-angle X-ray scattering from atomic coordinates of proteins using maximum likelihood consensus data

Stimulated by informal conversations at the XVII International Small Angle Scattering (SAS) conference (Traverse City, 2017), an international team of experts undertook a round-robin exercise to produce a large dataset from proteins under standard solution conditions. These data were used to generate consensus SAS profiles for xylose isomerase, urate oxidase, xylanase, lysozyme and ribonuclease A. Here, we apply a new protocol using maximum likelihood with a larger number of the contributed datasets to generate improved consensus profiles. We investigate the fits of these profiles to predicted profiles from atomic coordinates that incorporate different models to account for the contribution to the scattering of water molecules of hydration surrounding proteins in solution. Programs using an implicit, shell-type hydration layer generally optimize fits to experimental data with the aid of two parameters that adjust the volume of the bulk solvent excluded by the protein and the contrast of the hydration layer. For these models, we found the error-weighted residual differences between the model and the experiment generally reflected the subsidiary maxima and minima in the consensus profiles that are determined by the size of the protein plus the hydration layer. By comparison, all-atom solute and solvent molecular dynamics (MD) simulations are without the benefit of adjustable parameters and, nonetheless, they yielded at least equally good fits with residual differences that are less reflective of the structure in the consensus profile. Further, where MD simulations accounted for the precise solvent composition of the experiment, specifically the inclusion of ions, the modelled radius of gyration values were significantly closer to the experiment. The power of adjustable parameters to mask real differences between a model and the structure present in solution is demonstrated by the results for the conformationally dynamic ribonuclease A and calculations with pseudo-experimental data. This study shows that, while methods invoking an implicit hydration layer have the unequivocal advantage of speed, care is needed to understand the influence of the adjustable parameters. All-atom solute and solvent MD simulations are slower but are less susceptible to false positives, and can account for thermal fluctuations in atomic positions, and more accurately represent the water molecules of hydration that contribute to the scattering profile.