or New insights into the domain of unknown function (DUF) of EccC5, the pivotal ATPase providing the secretion driving force to the ESX-5 secretion system By journals.iucr.org Published On :: 2024-05-28 Type VII secretion (T7S) systems, also referred to as ESAT-6 secretion (ESX) systems, are molecular machines that have gained great attention due to their implications in cell homeostasis and in host–pathogen interactions in mycobacteria. The latter include important human pathogens such as Mycobacterium tuberculosis (Mtb), the etiological cause of human tuberculosis, which constitutes a pandemic accounting for more than one million deaths every year. The ESX-5 system is exclusively found in slow-growing pathogenic mycobacteria, where it mediates the secretion of a large family of virulence factors: the PE and PPE proteins. The secretion driving force is provided by EccC5, a multidomain ATPase that operates using four globular cytosolic domains: an N-terminal domain of unknown function (EccC5DUF) and three FtsK/SpoIIIE ATPase domains. Recent structural and functional studies of ESX-3 and ESX-5 systems have revealed EccCDUF to be an ATPase-like fold domain with potential ATPase activity, the functionality of which is essential for secretion. Here, the crystal structure of the MtbEccC5DUF domain is reported at 2.05 Å resolution, which reveals a nucleotide-free structure with degenerated cis-acting and trans-acting elements involved in ATP binding and hydrolysis. This crystallographic study, together with a biophysical assessment of the interaction of MtbEccC5DUF with ATP/Mg2+, supports the absence of ATPase activity proposed for this domain. It is shown that this degeneration is also present in DUF domains from other ESX and ESX-like systems, which are likely to exhibit poor or null ATPase activity. Moreover, based on an in silico model of the N-terminal region of MtbEccC5DUF, it is hypothesized that MtbEccC5DUF is a degenerated ATPase domain that may have retained the ability to hexamerize. These observations draw attention to DUF domains as structural elements with potential implications in the opening and closure of the membrane pore during the secretion process via their involvement in inter-protomer interactions. Full Article text
or What shapes template-matching performance in cryogenic electron tomography in situ? By journals.iucr.org Published On :: 2024-05-28 The detection of specific biological macromolecules in cryogenic electron tomography data is frequently approached by applying cross-correlation-based 3D template matching. To reduce computational cost and noise, high binning is used to aggregate voxels before template matching. This remains a prevalent practice in both practical applications and methods development. Here, the relation between template size, shape and angular sampling is systematically evaluated to identify ribosomes in a ground-truth annotated data set. It is shown that at the commonly used binning, a detailed subtomogram average, a sphere and a heart emoji result in near-identical performance. These findings indicate that with current template-matching practices macromolecules can only be detected with high precision if their shape and size are sufficiently different from the background. Using theoretical considerations, the experimental results are rationalized and it is discussed why primarily low-frequency information remains at high binning and that template matching fails to be accurate because similarly shaped and sized macromolecules have similar low-frequency spectra. These challenges are discussed and potential enhancements for future template-matching methodologies are proposed. Full Article text
or Pillar data-acquisition strategies for cryo-electron tomography of beam-sensitive biological samples By journals.iucr.org Published On :: 2024-06-03 For cryo-electron tomography (cryo-ET) of beam-sensitive biological specimens, a planar sample geometry is typically used. As the sample is tilted, the effective thickness of the sample along the direction of the electron beam increases and the signal-to-noise ratio concomitantly decreases, limiting the transfer of information at high tilt angles. In addition, the tilt range where data can be collected is limited by a combination of various sample-environment constraints, including the limited space in the objective lens pole piece and the possible use of fixed conductive braids to cool the specimen. Consequently, most tilt series are limited to a maximum of ±70°, leading to the presence of a missing wedge in Fourier space. The acquisition of cryo-ET data without a missing wedge, for example using a cylindrical sample geometry, is hence attractive for volumetric analysis of low-symmetry structures such as organelles or vesicles, lysis events, pore formation or filaments for which the missing information cannot be compensated by averaging techniques. Irrespective of the geometry, electron-beam damage to the specimen is an issue and the first images acquired will transfer more high-resolution information than those acquired last. There is also an inherent trade-off between higher sampling in Fourier space and avoiding beam damage to the sample. Finally, the necessity of using a sufficient electron fluence to align the tilt images means that this fluence needs to be fractionated across a small number of images; therefore, the order of data acquisition is also a factor to consider. Here, an n-helix tilt scheme is described and simulated which uses overlapping and interleaved tilt series to maximize the use of a pillar geometry, allowing the entire pillar volume to be reconstructed as a single unit. Three related tilt schemes are also evaluated that extend the continuous and classic dose-symmetric tilt schemes for cryo-ET to pillar samples to enable the collection of isotropic information across all spatial frequencies. A fourfold dose-symmetric scheme is proposed which provides a practical compromise between uniform information transfer and complexity of data acquisition. Full Article text
or Introduction of the Capsules environment to support further growth of the SBGrid structural biology software collection By journals.iucr.org Published On :: 2024-06-04 The expansive scientific software ecosystem, characterized by millions of titles across various platforms and formats, poses significant challenges in maintaining reproducibility and provenance in scientific research. The diversity of independently developed applications, evolving versions and heterogeneous components highlights the need for rigorous methodologies to navigate these complexities. In response to these challenges, the SBGrid team builds, installs and configures over 530 specialized software applications for use in the on-premises and cloud-based computing environments of SBGrid Consortium members. To address the intricacies of supporting this diverse application collection, the team has developed the Capsule Software Execution Environment, generally referred to as Capsules. Capsules rely on a collection of programmatically generated bash scripts that work together to isolate the runtime environment of one application from all other applications, thereby providing a transparent cross-platform solution without requiring specialized tools or elevated account privileges for researchers. Capsules facilitate modular, secure software distribution while maintaining a centralized, conflict-free environment. The SBGrid platform, which combines Capsules with the SBGrid collection of structural biology applications, aligns with FAIR goals by enhancing the findability, accessibility, interoperability and reusability of scientific software, ensuring seamless functionality across diverse computing environments. Its adaptability enables application beyond structural biology into other scientific fields. Full Article text
or Deep-learning map segmentation for protein X-ray crystallographic structure determination By journals.iucr.org Published On :: 2024-06-27 When solving a structure of a protein from single-wavelength anomalous diffraction X-ray data, the initial phases obtained by phasing from an anomalously scattering substructure usually need to be improved by an iterated electron-density modification. In this manuscript, the use of convolutional neural networks (CNNs) for segmentation of the initial experimental phasing electron-density maps is proposed. The results reported demonstrate that a CNN with U-net architecture, trained on several thousands of electron-density maps generated mainly using X-ray data from the Protein Data Bank in a supervised learning, can improve current density-modification methods. Full Article text
or Factors affecting macromolecule orientations in thin films formed in cryo-EM By journals.iucr.org Published On :: 2024-06-27 The formation of a vitrified thin film embedded with randomly oriented macromolecules is an essential prerequisite for cryogenic sample electron microscopy. Most commonly, this is achieved using the plunge-freeze method first described nearly 40 years ago. Although this is a robust method, the behaviour of different macromolecules shows great variation upon freezing and often needs to be optimized to obtain an isotropic, high-resolution reconstruction. For a macromolecule in such a film, the probability of encountering the air–water interface in the time between blotting and freezing and adopting preferred orientations is very high. 3D reconstruction using preferentially oriented particles often leads to anisotropic and uninterpretable maps. Currently, there are no general solutions to this prevalent issue, but several approaches largely focusing on sample preparation with the use of additives and novel grid modifications have been attempted. In this study, the effect of physical and chemical factors on the orientations of macromolecules was investigated through an analysis of selected well studied macromolecules, and important parameters that determine the behaviour of proteins on cryo-EM grids were revealed. These insights highlight the nature of the interactions that cause preferred orientations and can be utilized to systematically address orientation bias for any given macromolecule and to provide a framework to design small-molecule additives to enhance sample stability and behaviour. Full Article text
or A structural role for tryptophan in proteins, and the ubiquitous Trp Cδ1—H⋯O=C (backbone) hydrogen bond By journals.iucr.org Published On :: 2024-06-28 Tryptophan is the most prominent amino acid found in proteins, with multiple functional roles. Its side chain is made up of the hydrophobic indole moiety, with two groups that act as donors in hydrogen bonds: the Nɛ—H group, which is a potent donor in canonical hydrogen bonds, and a polarized Cδ1—H group, which is capable of forming weaker, noncanonical hydrogen bonds. Due to adjacent electron-withdrawing moieties, C—H⋯O hydrogen bonds are ubiquitous in macromolecules, albeit contingent on the polarization of the donor C—H group. Consequently, Cα—H groups (adjacent to the carbonyl and amino groups of flanking peptide bonds), as well as the Cɛ1—H and Cδ2—H groups of histidines (adjacent to imidazole N atoms), are known to serve as donors in hydrogen bonds, for example stabilizing parallel and antiparallel β-sheets. However, the nature and the functional role of interactions involving the Cδ1—H group of the indole ring of tryptophan are not well characterized. Here, data mining of high-resolution (r ≤ 1.5 Å) crystal structures from the Protein Data Bank was performed and ubiquitous close contacts between the Cδ1—H groups of tryptophan and a range of electronegative acceptors were identified, specifically main-chain carbonyl O atoms immediately upstream and downstream in the polypeptide chain. The stereochemical analysis shows that most of the interactions bear all of the hallmarks of proper hydrogen bonds. At the same time, their cohesive nature is confirmed by quantum-chemical calculations, which reveal interaction energies of 1.5–3.0 kcal mol−1, depending on the specific stereochemistry. Full Article text
or A snapshot love story: what serial crystallography has done and will do for us By journals.iucr.org Published On :: 2024-07-10 Serial crystallography, born from groundbreaking experiments at the Linac Coherent Light Source in 2009, has evolved into a pivotal technique in structural biology. Initially pioneered at X-ray free-electron laser facilities, it has now expanded to synchrotron-radiation facilities globally, with dedicated experimental stations enhancing its accessibility. This review gives an overview of current developments in serial crystallography, emphasizing recent results in time-resolved crystallography, and discussing challenges and shortcomings. Full Article text
or Managing macromolecular crystallographic data with a laboratory information management system By journals.iucr.org Published On :: 2024-07-10 Protein crystallography is an established method to study the atomic structures of macromolecules and their complexes. A prerequisite for successful structure determination is diffraction-quality crystals, which may require extensive optimization of both the protein and the conditions, and hence projects can stretch over an extended period, with multiple users being involved. The workflow from crystallization and crystal treatment to deposition and publication is well defined, and therefore an electronic laboratory information management system (LIMS) is well suited to management of the data. Completion of the project requires key information on all the steps being available and this information should also be made available according to the FAIR principles. As crystallized samples are typically shipped between facilities, a key feature to be captured in the LIMS is the exchange of metadata between the crystallization facility of the home laboratory and, for example, synchrotron facilities. On completion, structures are deposited in the Protein Data Bank (PDB) and the LIMS can include the PDB code in its database, completing the chain of custody from crystallization to structure deposition and publication. A LIMS designed for macromolecular crystallography, IceBear, is available as a standalone installation and as a hosted service, and the implementation of key features for the capture of metadata in IceBear is discussed as an example. Full Article text
or Crystallographic fragment-binding studies of the Mycobacterium tuberculosis trifunctional enzyme suggest binding pockets for the tails of the acyl-CoA substrates at its active sites and a potential substrate-channeling path between them By journals.iucr.org Published On :: 2024-07-16 The Mycobacterium tuberculosis trifunctional enzyme (MtTFE) is an α2β2 tetrameric enzyme in which the α-chain harbors the 2E-enoyl-CoA hydratase (ECH) and 3S-hydroxyacyl-CoA dehydrogenase (HAD) active sites, and the β-chain provides the 3-ketoacyl-CoA thiolase (KAT) active site. Linear, medium-chain and long-chain 2E-enoyl-CoA molecules are the preferred substrates of MtTFE. Previous crystallographic binding and modeling studies identified binding sites for the acyl-CoA substrates at the three active sites, as well as the NAD binding pocket at the HAD active site. These studies also identified three additional CoA binding sites on the surface of MtTFE that are different from the active sites. It has been proposed that one of these additional sites could be of functional relevance for the substrate channeling (by surface crawling) of reaction intermediates between the three active sites. Here, 226 fragments were screened in a crystallographic fragment-binding study of MtTFE crystals, resulting in the structures of 16 MtTFE–fragment complexes. Analysis of the 121 fragment-binding events shows that the ECH active site is the `binding hotspot' for the tested fragments, with 41 binding events. The mode of binding of the fragments bound at the active sites provides additional insight into how the long-chain acyl moiety of the substrates can be accommodated at their proposed binding pockets. In addition, the 20 fragment-binding events between the active sites identify potential transient binding sites of reaction intermediates relevant to the possible channeling of substrates between these active sites. These results provide a basis for further studies to understand the functional relevance of the latter binding sites and to identify substrates for which channeling is crucial. Full Article text
or Cryo2RT: a high-throughput method for room-temperature macromolecular crystallography from cryo-cooled crystals By journals.iucr.org Published On :: 2024-07-25 Advances in structural biology have relied heavily on synchrotron cryo-crystallography and cryogenic electron microscopy to elucidate biological processes and for drug discovery. However, disparities between cryogenic and room-temperature (RT) crystal structures pose challenges. Here, Cryo2RT, a high-throughput RT data-collection method from cryo-cooled crystals that leverages the cryo-crystallography workflow, is introduced. Tested on endothiapepsin crystals with four soaked fragments, thaumatin and SARS-CoV-2 3CLpro, Cryo2RT reveals unique ligand-binding poses, offers a comparable throughput to cryo-crystallography and eases the exploration of structural dynamics at various temperatures. Full Article text
or Structural analysis of a ligand-triggered intermolecular disulfide switch in a major latex protein from opium poppy By journals.iucr.org Published On :: 2024-08-29 Several proteins from plant pathogenesis-related family 10 (PR10) are highly abundant in the latex of opium poppy and have recently been shown to play diverse and important roles in the biosynthesis of benzylisoquinoline alkaloids (BIAs). The recent determination of the first crystal structures of PR10-10 showed how large conformational changes in a surface loop and adjacent β-strand are coupled to the binding of BIA compounds to the central hydrophobic binding pocket. A more detailed analysis of these conformational changes is now reported to further clarify how ligand binding is coupled to the formation and cleavage of an intermolecular disulfide bond that is only sterically allowed when the BIA binding pocket is empty. To decouple ligand binding from disulfide-bond formation, each of the two highly conserved cysteine residues (Cys59 and Cys155) in PR10-10 was replaced with serine using site-directed mutagenesis. Crystal structures of the Cys59Ser mutant were determined in the presence of papaverine and in the absence of exogenous BIA compounds. A crystal structure of the Cys155Ser mutant was also determined in the absence of exogenous BIA compounds. All three of these crystal structures reveal conformations similar to that of wild-type PR10-10 with bound BIA compounds. In the absence of exogenous BIA compounds, the Cys59Ser and Cys155Ser mutants appear to bind an unidentified ligand or mixture of ligands that was presumably introduced during expression of the proteins in Escherichia coli. The analysis of conformational changes triggered by the binding of BIA compounds suggests a molecular mechanism coupling ligand binding to the disruption of an intermolecular disulfide bond. This mechanism may be involved in the regulation of biosynthetic reactions in plants and possibly other organisms. Full Article text
or Surface-mutagenesis strategies to enable structural biology crystallization platforms By journals.iucr.org Published On :: 2024-08-29 A key prerequisite for the successful application of protein crystallography in drug discovery is to establish a robust crystallization system for a new drug-target protein fast enough to deliver crystal structures when the first inhibitors have been identified in the hit-finding campaign or, at the latest, in the subsequent hit-to-lead process. The first crucial step towards generating well folded proteins with a high likelihood of crystallizing is the identification of suitable truncation variants of the target protein. In some cases an optimal length variant alone is not sufficient to support crystallization and additional surface mutations need to be introduced to obtain suitable crystals. In this contribution, four case studies are presented in which rationally designed surface modifications were key to establishing crystallization conditions for the target proteins (the protein kinases Aurora-C, IRAK4 and BUB1, and the KRAS–SOS1 complex). The design process which led to well diffracting crystals is described and the crystal packing is analysed to understand retrospectively how the specific surface mutations promoted successful crystallization. The presented design approaches are routinely used in our team to support the establishment of robust crystallization systems which enable structure-guided inhibitor optimization for hit-to-lead and lead-optimization projects in pharmaceutical research. Full Article text
or Microcrystal electron diffraction structure of Toll-like receptor 2 TIR-domain-nucleated MyD88 TIR-domain higher-order assembly By journals.iucr.org Published On :: 2024-09-04 Eukaryotic TIR (Toll/interleukin-1 receptor protein) domains signal via TIR–TIR interactions, either by self-association or by interaction with other TIR domains. In mammals, TIR domains are found in Toll-like receptors (TLRs) and cytoplasmic adaptor proteins involved in pro-inflammatory signaling. Previous work revealed that the MAL TIR domain (MALTIR) nucleates the assembly of MyD88TIR into crystalline arrays in vitro. A microcrystal electron diffraction (MicroED) structure of the MyD88TIR assembly has previously been solved, revealing a two-stranded higher-order assembly of TIR domains. In this work, it is demonstrated that the TIR domain of TLR2, which is reported to signal as a heterodimer with either TLR1 or TLR6, induces the formation of crystalline higher-order assemblies of MyD88TIR in vitro, whereas TLR1TIR and TLR6TIR do not. Using an improved data-collection protocol, the MicroED structure of TLR2TIR-induced MyD88TIR microcrystals was determined at a higher resolution (2.85 Å) and with higher completeness (89%) compared with the previous structure of the MALTIR-induced MyD88TIR assembly. Both assemblies exhibit conformational differences in several areas that are important for signaling (for example the BB loop and CD loop) compared with their monomeric structures. These data suggest that TLR2TIR and MALTIR interact with MyD88 in an analogous manner during signaling, nucleating MyD88TIR assemblies unidirectionally. Full Article text
or Comparison of two crystal polymorphs of NowGFP reveals a new conformational state trapped by crystal packing By journals.iucr.org Published On :: 2024-09-02 Crystal polymorphism serves as a strategy to study the conformational flexibility of proteins. However, the relationship between protein crystal packing and protein conformation often remains elusive. In this study, two distinct crystal forms of a green fluorescent protein variant, NowGFP, are compared: a previously identified monoclinic form (space group C2) and a newly discovered orthorhombic form (space group P212121). Comparative analysis reveals that both crystal forms exhibit nearly identical linear assemblies of NowGFP molecules interconnected through similar crystal contacts. However, a notable difference lies in the stacking of these assemblies: parallel in the monoclinic form and perpendicular in the orthorhombic form. This distinct mode of stacking leads to different crystal contacts and induces structural alteration in one of the two molecules within the asymmetric unit of the orthorhombic crystal form. This new conformational state captured by orthorhombic crystal packing exhibits two unique features: a conformational shift of the β-barrel scaffold and a restriction of pH-dependent shifts of the key residue Lys61, which is crucial for the pH-dependent spectral shift of this protein. These findings demonstrate a clear connection between crystal packing and alternative conformational states of proteins, providing insights into how structural variations influence the function of fluorescent proteins. Full Article text
or Utilizing anomalous signals for element identification in macromolecular crystallography By journals.iucr.org Published On :: 2024-09-18 AlphaFold2 has revolutionized structural biology by offering unparalleled accuracy in predicting protein structures. Traditional methods for determining protein structures, such as X-ray crystallography and cryo-electron microscopy, are often time-consuming and resource-intensive. AlphaFold2 provides models that are valuable for molecular replacement, aiding in model building and docking into electron density or potential maps. However, despite its capabilities, models from AlphaFold2 do not consistently match the accuracy of experimentally determined structures, need to be validated experimentally and currently miss some crucial information, such as post-translational modifications, ligands and bound ions. In this paper, the advantages are explored of collecting X-ray anomalous data to identify chemical elements, such as metal ions, which are key to understanding certain structures and functions of proteins. This is achieved through methods such as calculating anomalous difference Fourier maps or refining the imaginary component of the anomalous scattering factor f''. Anomalous data can serve as a valuable complement to the information provided by AlphaFold2 models and this is particularly significant in elucidating the roles of metal ions. Full Article text
or Structural studies of β-glucosidase from the thermophilic bacterium Caldicellulosiruptor saccharolyticus By journals.iucr.org Published On :: 2024-10-01 β-Glucosidase from the thermophilic bacterium Caldicellulosiruptor saccharolyticus (Bgl1) has been denoted as having an attractive catalytic profile for various industrial applications. Bgl1 catalyses the final step of in the decomposition of cellulose, an unbranched glucose polymer that has attracted the attention of researchers in recent years as it is the most abundant renewable source of reduced carbon in the biosphere. With the aim of enhancing the thermostability of Bgl1 for a broad spectrum of biotechnological processes, it has been subjected to structural studies. Crystal structures of Bgl1 and its complex with glucose were determined at 1.47 and 1.95 Å resolution, respectively. Bgl1 is a member of glycosyl hydrolase family 1 (GH1 superfamily, EC 3.2.1.21) and the results showed that the 3D structure of Bgl1 follows the overall architecture of the GH1 family, with a classical (β/α)8 TIM-barrel fold. Comparisons of Bgl1 with sequence or structural homologues of β-glucosidase reveal quite similar structures but also unique structural features in Bgl1 with plausible functional roles. Full Article text
or The success rate of processed predicted models in molecular replacement: implications for experimental phasing in the AlphaFold era By journals.iucr.org Published On :: 2024-10-03 The availability of highly accurate protein structure predictions from AlphaFold2 (AF2) and similar tools has hugely expanded the applicability of molecular replacement (MR) for crystal structure solution. Many structures can be solved routinely using raw models, structures processed to remove unreliable parts or models split into distinct structural units. There is therefore an open question around how many and which cases still require experimental phasing methods such as single-wavelength anomalous diffraction (SAD). Here, this question is addressed using a large set of PDB depositions that were solved by SAD. A large majority (87%) could be solved using unedited or minimally edited AF2 predictions. A further 18 (4%) yield straightforwardly to MR after splitting of the AF2 prediction using Slice'N'Dice, although different splitting methods succeeded on slightly different sets of cases. It is also found that further unique targets can be solved by alternative modelling approaches such as ESMFold (four cases), alternative MR approaches such as ARCIMBOLDO and AMPLE (two cases each), and multimeric model building with AlphaFold-Multimer or UniFold (three cases). Ultimately, only 12 cases, or 3% of the SAD-phased set, did not yield to any form of MR tested here, offering valuable hints as to the number and the characteristics of cases where experimental phasing remains essential for macromolecular structure solution. Full Article text
or EMhub: a web platform for data management and on-the-fly processing in scientific facilities By journals.iucr.org Published On :: 2024-10-07 Most scientific facilities produce large amounts of heterogeneous data at a rapid pace. Managing users, instruments, reports and invoices presents additional challenges. To address these challenges, EMhub, a web platform designed to support the daily operations and record-keeping of a scientific facility, has been introduced. EMhub enables the easy management of user information, instruments, bookings and projects. The application was initially developed to meet the needs of a cryoEM facility, but its functionality and adaptability have proven to be broad enough to be extended to other data-generating centers. The expansion of EMHub is enabled by the modular nature of its core functionalities. The application allows external processes to be connected via a REST API, automating tasks such as folder creation, user and password generation, and the execution of real-time data-processing pipelines. EMhub has been used for several years at the Swedish National CryoEM Facility and has been installed in the CryoEM center at the Structural Biology Department at St. Jude Children's Research Hospital. A fully automated single-particle pipeline has been implemented for on-the-fly data processing and analysis. At St. Jude, the X-Ray Crystallography Center and the Single-Molecule Imaging Center have already expanded the platform to support their operational and data-management workflows. Full Article text
or Analysis of crystallographic phase retrieval using iterative projection algorithms By journals.iucr.org Published On :: 2024-10-23 For protein crystals in which more than two thirds of the volume is occupied by solvent, the featureless nature of the solvent region often generates a constraint that is powerful enough to allow direct phasing of X-ray diffraction data. Practical implementation relies on the use of iterative projection algorithms with good global convergence properties to solve the difficult nonconvex phase-retrieval problem. In this paper, some aspects of phase retrieval using iterative projection algorithms are systematically explored, where the diffraction data and density-value distributions in the protein and solvent regions provide the sole constraints. The analysis is based on the addition of random error to the phases of previously determined protein crystal structures, followed by evaluation of the ability to recover the correct phase set as the distance from the solution increases. The properties of the difference-map (DM), relaxed–reflect–reflect (RRR) and relaxed averaged alternating reflectors (RAAR) algorithms are compared. All of these algorithms prove to be effective for crystallographic phase retrieval, and the useful ranges of the adjustable parameter which controls their behavior are established. When these algorithms converge to the solution, the algorithm trajectory becomes stationary; however, the density function continues to fluctuate significantly around its mean position. It is shown that averaging over the algorithm trajectory in the stationary region, following convergence, improves the density estimate, with this procedure outperforming previous approaches for phase or density refinement. Full Article text
or STEM SerialED: achieving high-resolution data for ab initio structure determination of beam-sensitive nanocrystalline materials By journals.iucr.org Published On :: 2024-01-01 Serial electron diffraction (SerialED), which applies a snapshot data acquisition strategy for each crystal, was introduced to tackle the problem of radiation damage in the structure determination of beam-sensitive materials by three-dimensional electron diffraction (3DED). The snapshot data acquisition in SerialED can be realized using both transmission and scanning transmission electron microscopes (TEM/STEM). However, the current SerialED workflow based on STEM setups requires special external devices and software, which limits broader adoption. Here, we present a simplified experimental implementation of STEM-based SerialED on Thermo Fisher Scientific STEMs using common proprietary software interfaced through Python scripts to automate data collection. Specifically, we utilize TEM Imaging and Analysis (TIA) scripting and TEM scripting to access the STEM functionalities of the microscope, and DigitalMicrograph scripting to control the camera for snapshot data acquisition. Data analysis adapts the existing workflow using the software CrystFEL, which was developed for serial X-ray crystallography. Our workflow for STEM SerialED can be used on any Gatan or Thermo Fisher Scientific camera. We apply this workflow to collect high-resolution STEM SerialED data from two aluminosilicate zeolites, zeolite Y and ZSM-25. We demonstrate, for the first time, ab initio structure determination through direct methods using STEM SerialED data. Zeolite Y is relatively stable under the electron beam, and STEM SerialED data extend to 0.60 Å. We show that the structural model obtained using STEM SerialED data merged from 358 crystals is nearly identical to that using continuous rotation electron diffraction data from one crystal. This demonstrates that accurate structures can be obtained from STEM SerialED. Zeolite ZSM-25 is very beam-sensitive and has a complex structure. We show that STEM SerialED greatly improves the data resolution of ZSM-25, compared with serial rotation electron diffraction (SerialRED), from 1.50 to 0.90 Å. This allows, for the first time, the use of standard phasing methods, such as direct methods, for the ab initio structure determination of ZSM-25. Full Article text
or Structure determination using high-order spatial correlations in single-particle X-ray scattering By journals.iucr.org Published On :: 2024-01-01 Single-particle imaging using X-ray free-electron lasers (XFELs) is a promising technique for observing nanoscale biological samples under near-physiological conditions. However, as the sample's orientation in each diffraction pattern is unknown, advanced algorithms are required to reconstruct the 3D diffraction intensity volume and subsequently the sample's density model. While most approaches perform 3D reconstruction via determining the orientation of each diffraction pattern, a correlation-based approach utilizes the averaged spatial correlations of diffraction intensities over all patterns, making it well suited for processing experimental data with a poor signal-to-noise ratio of individual patterns. Here, a method is proposed to determine the 3D structure of a sample by analyzing the double, triple and quadruple spatial correlations in diffraction patterns. This ab initio method can reconstruct the basic shape of an irregular unsymmetric 3D sample without requiring any prior knowledge of the sample. The impact of background and noise on correlations is investigated and corrected to ensure the success of reconstruction under simulated experimental conditions. Additionally, the feasibility of using the correlation-based approach to process incomplete partial diffraction patterns is demonstrated. The proposed method is a variable addition to existing algorithms for 3D reconstruction and will further promote the development and adoption of XFEL single-particle imaging techniques. Full Article text
or Orientational ordering and assembly of silica–nickel Janus particles in a magnetic field By journals.iucr.org Published On :: 2024-01-01 The orientation ordering and assembly behavior of silica–nickel Janus particles in a static external magnetic field were probed by ultra small-angle X-ray scattering (USAXS). Even in a weak applied field, the net magnetic moments of the individual particles aligned in the direction of the field, as indicated by the anisotropy in the recorded USAXS patterns. X-ray photon correlation spectroscopy (XPCS) measurements on these suspensions revealed that the corresponding particle dynamics are primarily Brownian diffusion [Zinn, Sharpnack & Narayanan (2023). Soft Matter, 19, 2311–2318]. At higher fields, the magnetic forces led to chain-like configurations of particles, as indicated by an additional feature in the USAXS pattern. A theoretical framework is provided for the quantitative interpretation of the observed anisotropic scattering diagrams and the corresponding degree of orientation. No anisotropy was detected when the magnetic field was applied along the beam direction, which is also replicated by the model. The method presented here could be useful for the interpretation of oriented scattering patterns from a wide variety of particulate systems. The combination of USAXS and XPCS is a powerful approach for investigating asymmetric colloidal particles in external fields. Full Article text
or Conformation–aggregation interplay in the simplest aliphatic ethers probed under high pressure By journals.iucr.org Published On :: 2024-01-01 The structures of the simplest symmetric primary ethers [(CnH2n+1)2O, n = 1–3] determined under high pressure revealed their conformational preferences and intermolecular interactions. In three new polymorphs of diethyl ether (C2H5)2O, high pressure promotes intermolecular CH⋯O contacts and enforces a conversion from the trans–trans conformer present in the α, β and γ phases to the trans–gauche conformer, which is higher in energy by 6.4 kJ mol−1, in the δ phase. Two new polymorphs of dimethyl ether (CH3)2O display analogous transformations of the CH⋯O bonds. The crystal structure of di-n-propyl ether (C3H7)2O, determined for the first time, is remarkably stable over the whole pressure range investigated from 1.70 up to 5.30 GPa. Full Article text
or Refining short-range order parameters from the three-dimensional diffuse scattering in single-crystal electron diffraction data By journals.iucr.org Published On :: 2024-01-01 Our study compares short-range order parameters refined from the diffuse scattering in single-crystal X-ray and single-crystal electron diffraction data. Nb0.84CoSb was chosen as a reference material. The correlations between neighbouring vacancies and the displacements of Sb and Co atoms were refined from the diffuse scattering using a Monte Carlo refinement in DISCUS. The difference between the Sb and Co displacements refined from the diffuse scattering and the Sb and Co displacements refined from the Bragg reflections in single-crystal X-ray diffraction data is 0.012 (7) Å for the refinement on diffuse scattering in single-crystal X-ray diffraction data and 0.03 (2) Å for the refinement on the diffuse scattering in single-crystal electron diffraction data. As electron diffraction requires much smaller crystals than X-ray diffraction, this opens up the possibility of refining short-range order parameters in many technologically relevant materials for which no crystals large enough for single-crystal X-ray diffraction are available. Full Article text
or Orientational analysis of atomic pair correlations in nanocrystalline indium oxide thin films By journals.iucr.org Published On :: 2024-01-01 The application of grazing-incidence total X-ray scattering (GITXS) for pair distribution function (PDF) analysis using >50 keV X-rays from synchrotron light sources has created new opportunities for structural characterization of supported thin films with high resolution. Compared with grazing-incidence wide-angle X-ray scattering, which is only useful for highly ordered materials, GITXS/PDFs expand such analysis to largely disordered or nanostructured materials by examining the atomic pair correlations dependent on the direction relative to the surface of the supporting substrate. A characterization of nanocrystalline In2O3-derived thin films is presented here with in-plane-isotropic and out-of-plane-anisotropic orientational ordering of the atomic structure, each synthesized using different techniques. The atomic orientations of such films are known to vary based on the synthetic conditions. Here, an azimuthal orientational analysis of these films using GITXS with a single incident angle is shown to resolve the markedly different orientations of the atomic structures with respect to the planar support and the different degrees of long-range order, and hence, the terminal surface chemistries. It is anticipated that orientational analysis of GITXS/PDF data will offer opportunities to extend structural analyses of thin films by providing a means to qualitatively determine the major atomic orientation within nanocrystalline and, eventually, non-crystalline films. Full Article text
or Structural dissection of two redox proteins from the shipworm symbiont Teredinibacter turnerae By journals.iucr.org Published On :: 2024-03-01 The discovery of lytic polysaccharide monooxygenases (LPMOs), a family of copper-dependent enzymes that play a major role in polysaccharide degradation, has revealed the importance of oxidoreductases in the biological utilization of biomass. In fungi, a range of redox proteins have been implicated as working in harness with LPMOs to bring about polysaccharide oxidation. In bacteria, less is known about the interplay between redox proteins and LPMOs, or how the interaction between the two contributes to polysaccharide degradation. We therefore set out to characterize two previously unstudied proteins from the shipworm symbiont Teredinibacter turnerae that were initially identified by the presence of carbohydrate binding domains appended to uncharacterized domains with probable redox functions. Here, X-ray crystal structures of several domains from these proteins are presented together with initial efforts to characterize their functions. The analysis suggests that the target proteins are unlikely to function as LPMO electron donors, raising new questions as to the potential redox functions that these large extracellular multi-haem-containing c-type cytochromes may perform in these bacteria. Full Article text
or Transferable Hirshfeld atom model for rapid evaluation of aspherical atomic form factors By journals.iucr.org Published On :: 2024-03-01 Form factors based on aspherical models of atomic electron density have brought great improvement in the accuracies of hydrogen atom parameters derived from X-ray crystal structure refinement. Today, two main groups of such models are available, the banks of transferable atomic densities parametrized using the Hansen–Coppens multipole model which allows for rapid evaluation of atomic form factors and Hirshfeld atom refinement (HAR)-related methods which are usually more accurate but also slower. In this work, a model that combines the ideas utilized in the two approaches is tested. It uses atomic electron densities based on Hirshfeld partitions of electron densities, which are precalculated and stored in a databank. This model was also applied during the refinement of the structures of five small molecules. A comparison of the resulting hydrogen atom parameters with those derived from neutron diffraction data indicates that they are more accurate than those obtained with the Hansen–Coppens based databank, and only slightly less accurate than those obtained with a version of HAR that neglects the crystal environment. The advantage of using HAR becomes more noticeable when the effects of the environment are included. To speed up calculations, atomic densities were represented by multipole expansion with spherical harmonics up to l = 7, which used numerical radial functions (a different approach to that applied in the Hansen–Coppens model). Calculations of atomic form factors for the small protein crambin (at 0.73 Å resolution) took only 68 s using 12 CPU cores. Full Article text
or Dynamical refinement with multipolar electron scattering factors By journals.iucr.org Published On :: 2024-03-21 Dynamical refinement is a well established method for refining crystal structures against 3D electron diffraction (ED) data and its benefits have been discussed in the literature [Palatinus, Petříček & Corrêa, (2015). Acta Cryst. A71, 235–244; Palatinus, Corrêa et al. (2015). Acta Cryst. B71, 740–751]. However, until now, dynamical refinements have only been conducted using the independent atom model (IAM). Recent research has shown that a more accurate description can be achieved by applying the transferable aspherical atom model (TAAM), but this has been limited only to kinematical refinements [Gruza et al. (2020). Acta Cryst. A76, 92–109; Jha et al. (2021). J. Appl. Cryst. 54, 1234–1243]. In this study, we combine dynamical refinement with TAAM for the crystal structure of 1-methyluracil, using data from precession ED. Our results show that this approach improves the residual Fourier electrostatic potential and refinement figures of merit. Furthermore, it leads to systematic changes in the atomic displacement parameters of all atoms and the positions of hydrogen atoms. We found that the refinement results are sensitive to the parameters used in the TAAM modelling process. Though our results show that TAAM offers superior performance compared with IAM in all cases, they also show that TAAM parameters obtained by periodic DFT calculations on the refined structure are superior to the TAAM parameters from the UBDB/MATTS database. It appears that multipolar parameters transferred from the database may not be sufficiently accurate to provide a satisfactory description of all details of the electrostatic potential probed by the 3D ED experiment. Full Article text
or Droplet microfluidics for time-resolved serial crystallography By journals.iucr.org Published On :: 2024-03-01 Serial crystallography requires large numbers of microcrystals and robust strategies to rapidly apply substrates to initiate reactions in time-resolved studies. Here, we report the use of droplet miniaturization for the controlled production of uniform crystals, providing an avenue for controlled substrate addition and synchronous reaction initiation. The approach was evaluated using two enzymatic systems, yielding 3 µm crystals of lysozyme and 2 µm crystals of Pdx1, an Arabidopsis enzyme involved in vitamin B6 biosynthesis. A seeding strategy was used to overcome the improbability of Pdx1 nucleation occurring with diminishing droplet volumes. Convection within droplets was exploited for rapid crystal mixing with ligands. Mixing times of <2 ms were achieved. Droplet microfluidics for crystal size engineering and rapid micromixing can be utilized to advance time-resolved serial crystallography. Full Article text
or Nanostructure and dynamics of N-truncated copper amyloid-β peptides from advanced X-ray absorption fine structure By journals.iucr.org Published On :: 2024-04-11 An X-ray absorption spectroscopy (XAS) electrochemical cell was used to collect high-quality XAS measurements of N-truncated Cu:amyloid-β (Cu:Aβ) samples under near-physiological conditions. N-truncated Cu:Aβ peptide complexes contribute to oxidative stress and neurotoxicity in Alzheimer's patients' brains. However, the redox properties of copper in different Aβ peptide sequences are inconsistent. Therefore, the geometry of binding sites for the copper binding in Aβ4–8/12/16 was determined using novel advanced extended X-ray absorption fine structure (EXAFS) analysis. This enables these peptides to perform redox cycles in a manner that might produce toxicity in human brains. Fluorescence XAS measurements were corrected for systematic errors including defective-pixel data, monochromator glitches and dispersion of pixel spectra. Experimental uncertainties at each data point were measured explicitly from the point-wise variance of corrected pixel measurements. The copper-binding environments of Aβ4–8/12/16 were precisely determined by fitting XAS measurements with propagated experimental uncertainties, advanced analysis and hypothesis testing, providing a mechanism to pursue many similarly complex questions in bioscience. The low-temperature XAS measurements here determine that CuII is bound to the first amino acids in the high-affinity amino-terminal copper and nickel (ATCUN) binding motif with an oxygen in a tetragonal pyramid geometry in the Aβ4–8/12/16 peptides. Room-temperature XAS electrochemical-cell measurements observe metal reduction in the Aβ4–16 peptide. Robust investigations of XAS provide structural details of CuII binding with a very different bis-His motif and a water oxygen in a quasi-tetrahedral geometry. Oxidized XAS measurements of Aβ4–12/16 imply that both CuII and CuIII are accommodated in an ATCUN-like binding site. Hypotheses for these CuI, CuII and CuIII geometries were proven and disproven using the novel data and statistical analysis including F tests. Structural parameters were determined with an accuracy some tenfold better than literature claims of past work. A new protocol was also developed using EXAFS data analysis for monitoring radiation damage. This gives a template for advanced analysis of complex biosystems. Full Article text
or KINNTREX: a neural network to unveil protein mechanisms from time-resolved X-ray crystallography By journals.iucr.org Published On :: 2024-04-25 Here, a machine-learning method based on a kinetically informed neural network (NN) is introduced. The proposed method is designed to analyze a time series of difference electron-density maps from a time-resolved X-ray crystallographic experiment. The method is named KINNTREX (kinetics-informed NN for time-resolved X-ray crystallography). To validate KINNTREX, multiple realistic scenarios were simulated with increasing levels of complexity. For the simulations, time-resolved X-ray data were generated that mimic data collected from the photocycle of the photoactive yellow protein. KINNTREX only requires the number of intermediates and approximate relaxation times (both obtained from a singular valued decomposition) and does not require an assumption of a candidate mechanism. It successfully predicts a consistent chemical kinetic mechanism, together with difference electron-density maps of the intermediates that appear during the reaction. These features make KINNTREX attractive for tackling a wide range of biomolecular questions. In addition, the versatility of KINNTREX can inspire more NN-based applications to time-resolved data from biological macromolecules obtained by other methods. Full Article text
or Time-series analysis of rhenium(I) organometallic covalent binding to a model protein for drug development By journals.iucr.org Published On :: 2024-04-19 Metal-based complexes with their unique chemical properties, including multiple oxidation states, radio-nuclear capabilities and various coordination geometries yield value as potential pharmaceuticals. Understanding the interactions between metals and biological systems will prove key for site-specific coordination of new metal-based lead compounds. This study merges the concepts of target coordination with fragment-based drug methodologies, supported by varying the anomalous scattering of rhenium along with infrared spectroscopy, and has identified rhenium metal sites bound covalently with two amino acid types within the model protein. A time-based series of lysozyme-rhenium-imidazole (HEWL-Re-Imi) crystals was analysed systematically over a span of 38 weeks. The main rhenium covalent coordination is observed at His15, Asp101 and Asp119. Weak (i.e. noncovalent) interactions are observed at other aspartic, asparagine, proline, tyrosine and tryptophan side chains. Detailed bond distance comparisons, including precision estimates, are reported, utilizing the diffraction precision index supplemented with small-molecule data from the Cambridge Structural Database. Key findings include changes in the protein structure induced at the rhenium metal binding site, not observed in similar metal-free structures. The binding sites are typically found along the solvent-channel-accessible protein surface. The three primary covalent metal binding sites are consistent throughout the time series, whereas binding to neighbouring amino acid residues changes through the time series. Co-crystallization was used, consistently yielding crystals four days after setup. After crystal formation, soaking of the compound into the crystal over 38 weeks is continued and explains these structural adjustments. It is the covalent bond stability at the three sites, their proximity to the solvent channel and the movement of residues to accommodate the metal that are important, and may prove useful for future radiopharmaceutical development including target modification. Full Article text
or RCSB Protein Data Bank: supporting research and education worldwide through explorations of experimentally determined and computationally predicted atomic level 3D biostructures By journals.iucr.org Published On :: 2024-04-10 The Protein Data Bank (PDB) was established as the first open-access digital data resource in biology and medicine in 1971 with seven X-ray crystal structures of proteins. Today, the PDB houses >210 000 experimentally determined, atomic level, 3D structures of proteins and nucleic acids as well as their complexes with one another and small molecules (e.g. approved drugs, enzyme cofactors). These data provide insights into fundamental biology, biomedicine, bioenergy and biotechnology. They proved particularly important for understanding the SARS-CoV-2 global pandemic. The US-funded Research Collaboratory for Structural Bioinformatics Protein Data Bank (RCSB PDB) and other members of the Worldwide Protein Data Bank (wwPDB) partnership jointly manage the PDB archive and support >60 000 `data depositors' (structural biologists) around the world. wwPDB ensures the quality and integrity of the data in the ever-expanding PDB archive and supports global open access without limitations on data usage. The RCSB PDB research-focused web portal at https://www.rcsb.org/ (RCSB.org) supports millions of users worldwide, representing a broad range of expertise and interests. In addition to retrieving 3D structure data, PDB `data consumers' access comparative data and external annotations, such as information about disease-causing point mutations and genetic variations. RCSB.org also provides access to >1 000 000 computed structure models (CSMs) generated using artificial intelligence/machine-learning methods. To avoid doubt, the provenance and reliability of experimentally determined PDB structures and CSMs are identified. Related training materials are available to support users in their RCSB.org explorations. Full Article text
or Biophysical and structural study of La Crosse virus endonuclease inhibition for the development of new antiviral options By journals.iucr.org Published On :: 2024-04-24 The large Bunyavirales order includes several families of viruses with a segmented ambisense (−) RNA genome and a cytoplasmic life cycle that starts by synthesizing viral mRNA. The initiation of transcription, which is common to all members, relies on an endonuclease activity that is responsible for cap-snatching. In La Crosse virus, an orthobunyavirus, it has previously been shown that the cap-snatching endonuclease resides in the N-terminal domain of the L protein. Orthobunyaviruses are transmitted by arthropods and cause diseases in cattle. However, California encephalitis virus, La Crosse virus and Jamestown Canyon virus are North American species that can cause encephalitis in humans. No vaccines or antiviral drugs are available. In this study, three known Influenza virus endonuclease inhibitors (DPBA, L-742,001 and baloxavir) were repurposed on the La Crosse virus endonuclease. Their inhibition was evaluated by fluorescence resonance energy transfer and their mode of binding was then assessed by differential scanning fluorimetry and microscale thermophoresis. Finally, two crystallographic structures were obtained in complex with L-742,001 and baloxavir, providing access to the structural determinants of inhibition and offering key information for the further development of Bunyavirales endonuclease inhibitors. Full Article text
or Toward a quantitative description of solvation structure: a framework for differential solution scattering measurements By journals.iucr.org Published On :: 2024-05-01 Appreciating that the role of the solute–solvent and other outer-sphere interactions is essential for understanding chemistry and chemical dynamics in solution, experimental approaches are needed to address the structural consequences of these interactions, complementing condensed-matter simulations and coarse-grained theories. High-energy X-ray scattering (HEXS) combined with pair distribution function analysis presents the opportunity to probe these structures directly and to develop quantitative, atomistic models of molecular systems in situ in the solution phase. However, at concentrations relevant to solution-phase chemistry, the total scattering signal is dominated by the bulk solvent, prompting researchers to adopt a differential approach to eliminate this unwanted background. Though similar approaches are well established in quantitative structural studies of macromolecules in solution by small- and wide-angle X-ray scattering (SAXS/WAXS), analogous studies in the HEXS regime—where sub-ångström spatial resolution is achieved—remain underdeveloped, in part due to the lack of a rigorous theoretical description of the experiment. To address this, herein we develop a framework for differential solution scattering experiments conducted at high energies, which includes concepts of the solvent-excluded volume introduced to describe SAXS/WAXS data, as well as concepts from the time-resolved X-ray scattering community. Our theory is supported by numerical simulations and experiment and paves the way for establishing quantitative methods to determine the atomic structures of small molecules in solution with resolution approaching that of crystallography. Full Article text
or Evolution of structure and spectroscopic properties of a new 1,3-diacetylpyrene polymorph with temperature and pressure By journals.iucr.org Published On :: 2024-05-10 A new polymorph of 1,3-diacetylpyrene has been obtained from its melt and thoroughly characterized using single-crystal X-ray diffraction, steady-state UV–Vis spectroscopy and periodic density functional theory calculations. Experimental studies covered the temperature range from 90 to 390 K and the pressure range from atmospheric to 4.08 GPa. Optimal sample placement in a diamond anvil cell according to our previously presented methodology ensured over 80% data coverage up to 0.8 Å for a monoclinic sample. Unrestrained Hirshfeld atom refinement of the high-pressure crystal structures was successful and anharmonic behavior of carbonyl oxygen atoms was observed. Unlike the previously characterized polymorph, the structure of 2°AP-β is based on infinite π-stacks of antiparallel 2°AP molecules. 2°AP-β displays piezochromism and piezofluorochromism which are directly related to the variation in interplanar distances within the π-stacking. The importance of weak intermolecular interactions is reflected in the substantial negative thermal expansion coefficient of −55.8 (57) MK−1 in the direction of C—H⋯O interactions. Full Article text
or A step towards 6D WAXD tensor tomography By journals.iucr.org Published On :: 2024-05-09 X-ray scattering/diffraction tensor tomography techniques are promising methods to acquire the 3D texture information of heterogeneous biological tissues at micrometre resolution. However, the methods suffer from a long overall acquisition time due to multi-dimensional scanning across real and reciprocal space. Here, a new approach is introduced to obtain 3D reciprocal information of each illuminated scanning volume using mathematic modeling, which is equivalent to a physical scanning procedure for collecting the full reciprocal information required for voxel reconstruction. The virtual reciprocal scanning scheme was validated by a simulated 6D wide-angle X-ray diffraction tomography experiment. The theoretical validation of the method represents an important technological advancement for 6D diffraction tensor tomography and a crucial step towards pervasive applications in the characterization of heterogeneous materials. Full Article text
or The importance of definitions in crystallography By journals.iucr.org Published On :: 2024-05-28 This paper was motivated by the articles `Same or different – that is the question' in CrystEngComm (July 2020) and `Change to the definition of a crystal' in the IUCr Newsletter (June 2021). Experimental approaches to crystal comparisons require rigorously defined classifications in crystallography and beyond. Since crystal structures are determined in a rigid form, their strongest equivalence in practice is rigid motion, which is a composition of translations and rotations in 3D space. Conventional representations based on reduced cells and standardizations theoretically distinguish all periodic crystals. However, all cell-based representations are inherently discontinuous under almost any atomic displacement that can arbitrarily scale up a reduced cell. Hence, comparison of millions of known structures in materials databases requires continuous distance metrics. Full Article text
or The evolution of raw data archiving and the growth of its importance in crystallography By journals.iucr.org Published On :: 2024-06-12 The hardware for data archiving has expanded capacities for digital storage enormously in the past decade or more. The IUCr evaluated the costs and benefits of this within an official working group which advised that raw data archiving would allow ground truth reproducibility in published studies. Consultations of the IUCr's Commissions ensued via a newly constituted standing advisory committee, the Committee on Data. At all stages, the IUCr financed workshops to facilitate community discussions and possible methods of raw data archiving implementation. The recent launch of the IUCrData journal's Raw Data Letters is a milestone in the implementation of raw data archiving beyond the currently published studies: it includes diffraction patterns that have not been fully interpreted, if at all. The IUCr 75th Congress in Melbourne included a workshop on raw data reuse, discussing the successes and ongoing challenges of raw data reuse. This article charts the efforts of the IUCr to facilitate discussions and plans relating to raw data archiving and reuse within the various communities of crystallography, diffraction and scattering. Full Article text
or A predicted model-aided reconstruction algorithm for X-ray free-electron laser single-particle imaging By journals.iucr.org Published On :: 2024-06-21 Ultra-intense, ultra-fast X-ray free-electron lasers (XFELs) enable the imaging of single protein molecules under ambient temperature and pressure. A crucial aspect of structure reconstruction involves determining the relative orientations of each diffraction pattern and recovering the missing phase information. In this paper, we introduce a predicted model-aided algorithm for orientation determination and phase retrieval, which has been tested on various simulated datasets and has shown significant improvements in the success rate, accuracy and efficiency of XFEL data reconstruction. Full Article text
or A modified phase-retrieval algorithm to facilitate automatic de novo macromolecular structure determination in single-wavelength anomalous diffraction By journals.iucr.org Published On :: 2024-06-21 The success of experimental phasing in macromolecular crystallography relies primarily on the accurate locations of heavy atoms bound to the target crystal. To improve the process of substructure determination, a modified phase-retrieval algorithm built on the framework of the relaxed alternating averaged reflection (RAAR) algorithm has been developed. Importantly, the proposed algorithm features a combination of the π-half phase perturbation for weak reflections and enforces the direct-method-based tangent formula for strong reflections in reciprocal space. The proposed algorithm is extensively demonstrated on a total of 100 single-wavelength anomalous diffraction (SAD) experimental datasets, comprising both protein and nucleic acid structures of different qualities. Compared with the standard RAAR algorithm, the modified phase-retrieval algorithm exhibits significantly improved effectiveness and accuracy in SAD substructure determination, highlighting the importance of additional constraints for algorithmic performance. Furthermore, the proposed algorithm can be performed without human intervention under most conditions owing to the self-adaptive property of the input parameters, thus making it convenient to be integrated into the structural determination pipeline. In conjunction with the IPCAS software suite, we demonstrated experimentally that automatic de novo structure determination is possible on the basis of our proposed algorithm. Full Article text
or Benchmarking predictive methods for small-angle X-ray scattering from atomic coordinates of proteins using maximum likelihood consensus data By journals.iucr.org Published On :: 2024-07-10 Stimulated by informal conversations at the XVII International Small Angle Scattering (SAS) conference (Traverse City, 2017), an international team of experts undertook a round-robin exercise to produce a large dataset from proteins under standard solution conditions. These data were used to generate consensus SAS profiles for xylose isomerase, urate oxidase, xylanase, lysozyme and ribonuclease A. Here, we apply a new protocol using maximum likelihood with a larger number of the contributed datasets to generate improved consensus profiles. We investigate the fits of these profiles to predicted profiles from atomic coordinates that incorporate different models to account for the contribution to the scattering of water molecules of hydration surrounding proteins in solution. Programs using an implicit, shell-type hydration layer generally optimize fits to experimental data with the aid of two parameters that adjust the volume of the bulk solvent excluded by the protein and the contrast of the hydration layer. For these models, we found the error-weighted residual differences between the model and the experiment generally reflected the subsidiary maxima and minima in the consensus profiles that are determined by the size of the protein plus the hydration layer. By comparison, all-atom solute and solvent molecular dynamics (MD) simulations are without the benefit of adjustable parameters and, nonetheless, they yielded at least equally good fits with residual differences that are less reflective of the structure in the consensus profile. Further, where MD simulations accounted for the precise solvent composition of the experiment, specifically the inclusion of ions, the modelled radius of gyration values were significantly closer to the experiment. The power of adjustable parameters to mask real differences between a model and the structure present in solution is demonstrated by the results for the conformationally dynamic ribonuclease A and calculations with pseudo-experimental data. This study shows that, while methods invoking an implicit hydration layer have the unequivocal advantage of speed, care is needed to understand the influence of the adjustable parameters. All-atom solute and solvent MD simulations are slower but are less susceptible to false positives, and can account for thermal fluctuations in atomic positions, and more accurately represent the water molecules of hydration that contribute to the scattering profile. Full Article text
or Comprehensive encoding of conformational and compositional protein structural ensembles through the mmCIF data structure By journals.iucr.org Published On :: 2024-06-25 In the folded state, biomolecules exchange between multiple conformational states crucial for their function. However, most structural models derived from experiments and computational predictions only encode a single state. To represent biomolecules accurately, we must move towards modeling and predicting structural ensembles. Information about structural ensembles exists within experimental data from X-ray crystallography and cryo-electron microscopy. Although new tools are available to detect conformational and compositional heterogeneity within these ensembles, the legacy PDB data structure does not robustly encapsulate this complexity. We propose modifications to the macromolecular crystallographic information file (mmCIF) to improve the representation and interrelation of conformational and compositional heterogeneity. These modifications will enable the capture of macromolecular ensembles in a human and machine-interpretable way, potentially catalyzing breakthroughs for ensemble–function predictions, analogous to the achievements of AlphaFold with single-structure prediction. Full Article text
or High-accuracy measurement, advanced theory and analysis of the evolution of satellite transitions in manganese Kα using XR-HERFD By journals.iucr.org Published On :: 2024-06-21 Here, the novel technique of extended-range high-energy-resolution fluorescence detection (XR-HERFD) has successfully observed the n = 2 satellite in manganese to a high accuracy. The significance of the satellite signature presented is many hundreds of standard errors and well beyond typical discovery levels of three to six standard errors. This satellite is a sensitive indicator for all manganese-containing materials in condensed matter. The uncertainty in the measurements has been defined, which clearly observes multiple peaks and structure indicative of complex physical quantum-mechanical processes. Theoretical calculations of energy eigenvalues, shake-off probability and Auger rates are also presented, which explain the origin of the satellite from physical n = 2 shake-off processes. The evolution in the intensity of this satellite is measured relative to the full Kα spectrum of manganese to investigate satellite structure, and therefore many-body processes, as a function of incident energy. Results demonstrate that the many-body reduction factor S02 should not be modelled with a constant value as is currently done. This work makes a significant contribution to the challenge of understanding many-body processes and interpreting HERFD or resonant inelastic X-ray scattering spectra in a quantitative manner. Full Article text
or Crystallographic phase identifier of a convolutional self-attention neural network (CPICANN) on powder diffraction patterns By journals.iucr.org Published On :: 2024-06-27 Spectroscopic data, particularly diffraction data, are essential for materials characterization due to their comprehensive crystallographic information. The current crystallographic phase identification, however, is very time consuming. To address this challenge, we have developed a real-time crystallographic phase identifier based on a convolutional self-attention neural network (CPICANN). Trained on 692 190 simulated powder X-ray diffraction (XRD) patterns from 23 073 distinct inorganic crystallographic information files, CPICANN demonstrates superior phase-identification power. Single-phase identification on simulated XRD patterns yields 98.5 and 87.5% accuracies with and without elemental information, respectively, outperforming JADE software (68.2 and 38.7%, respectively). Bi-phase identification on simulated XRD patterns achieves 84.2 and 51.5% accuracies, respectively. In experimental settings, CPICANN achieves an 80% identification accuracy, surpassing JADE software (61%). Integration of CPICANN into XRD refinement software will significantly advance the cutting-edge technology in XRD materials characterization. Full Article text
or Exploring serial crystallography for drug discovery By journals.iucr.org Published On :: 2024-07-29 Structure-based drug design is highly dependent on the availability of structures of the protein of interest in complex with lead compounds. Ideally, this information can be used to guide the chemical optimization of a compound into a pharmaceutical drug candidate. A limitation of the main structural method used today – conventional X-ray crystallography – is that it only provides structural information about the protein complex in its frozen state. Serial crystallography is a relatively new approach that offers the possibility to study protein structures at room temperature (RT). Here, we explore the use of serial crystallography to determine the structures of the pharmaceutical target, soluble epoxide hydrolase. We introduce a new method to screen for optimal microcrystallization conditions suitable for use in serial crystallography and present a number of RT ligand-bound structures of our target protein. From a comparison between the RT structural data and previously published cryo-temperature structures, we describe an example of a temperature-dependent difference in the ligand-binding mode and observe that flexible loops are better resolved at RT. Finally, we discuss the current limitations and potential future advances of serial crystallography for use within pharmaceutical drug discovery. Full Article text
or Texture tomography, a versatile framework to study crystalline texture in 3D By journals.iucr.org Published On :: 2024-07-24 Crystallographic texture is a key organization feature of many technical and biological materials. In these materials, especially hierarchically structured ones, the preferential alignment of the nano constituents heavily influences the macroscopic behavior of the material. To study local crystallographic texture with both high spatial and angular resolution, we developed Texture Tomography (TexTOM). This approach allows the user to model the diffraction data of polycrystalline materials using the full reciprocal space of the crystal ensemble and describe the texture in each voxel via an orientation distribution function, hence it provides 3D reconstructions of the local texture by measuring the probabilities of all crystal orientations. The TexTOM approach addresses limitations associated with existing models: it correlates the intensities from several Bragg reflections, thus reducing ambiguities resulting from symmetry. Further, it yields quantitative probability distributions of local real space crystal orientations without further assumptions about the sample structure. Finally, its efficient mathematical formulation enables reconstructions faster than the time scale of the experiment. This manuscript presents the mathematical model, the inversion strategy and its current experimental implementation. We show characterizations of simulated data as well as experimental data obtained from a synthetic, inorganic model sample: the silica–witherite biomorph. TexTOM provides a versatile framework to reconstruct 3D quantitative texture information for polycrystalline samples; it opens the door for unprecedented insights into the nanostructural makeup of natural and technical materials. Full Article text
or On the structure refinement of metal complexes against 3D electron diffraction data using multipolar scattering factors By journals.iucr.org Published On :: 2024-08-15 This study examines various methods for modelling the electron density and, thus, the electrostatic potential of an organometallic complex for use in crystal structure refinement against 3D electron diffraction (ED) data. It focuses on modelling the scattering factors of iron(III), considering the electron density distribution specific for coordination with organic linkers. We refined the structural model of the metal–organic complex, iron(III) acetylacetonate (FeAcAc), using both the independent atom model (IAM) and the transferable aspherical atom model (TAAM). TAAM refinement initially employed multipolar parameters from the MATTS databank for acetylacetonate, while iron was modelled with a spherical and neutral approach (TAAM ligand). Later, custom-made TAAM scattering factors for Fe—O coordination were derived from DFT calculations [TAAM-ligand-Fe(III)]. Our findings show that, in this compound, the TAAM scattering factor corresponding to Fe3+ has a lower scattering amplitude than the Fe3+ charged scattering factor described by IAM. When using scattering factors corresponding to the oxidation state of iron, IAM inaccurately represents electrostatic potential maps and overestimates the scattering potential of the iron. In addition, TAAM significantly improved the fitting of the model to the data, shown by improved R1 values, goodness-of-fit (GooF) and reduced noise in the Fourier difference map (based on the residual distribution analysis). For 3D ED, R1 values improved from 19.36% (IAM) to 17.44% (TAAM-ligand) and 17.49% (TAAM-ligand-Fe3+), and for single-crystal X-ray diffraction (SCXRD) from 3.82 to 2.03% and 1.98%, respectively. For 3D ED, the most significant R1 reductions occurred in the low-resolution region (8.65–2.00 Å), dropping from 20.19% (IAM) to 14.67% and 14.89% for TAAM-ligand and TAAM-ligand-Fe(III), respectively, with less improvement in high-resolution ranges (2.00–0.85 Å). This indicates that the major enhancements are due to better scattering modelling in low-resolution zones. Furthermore, when using TAAM instead of IAM, there was a noticeable improvement in the shape of the thermal ellipsoids, which more closely resembled those of an SCXRD-refined model. This study demonstrates the applicability of more sophisticated scattering factors to improve the refinement of metal–organic complexes against 3D ED data, suggesting the need for more accurate modelling methods and highlighting the potential of TAAM in examining the charge distribution of large molecular structures using 3D ED. Full Article text
or Phase quantification using deep neural network processing of XRD patterns By journals.iucr.org Published On :: 2024-08-12 Mineral identification and quantification are key to the understanding and, hence, the capacity to predict material properties. The method of choice for mineral quantification is powder X-ray diffraction (XRD), generally using a Rietveld refinement approach. However, a successful Rietveld refinement requires preliminary identification of the phases that make up the sample. This is generally carried out manually, and this task becomes extremely long or virtually impossible in the case of very large datasets such as those from synchrotron X-ray diffraction computed tomography. To circumvent this issue, this article proposes a novel neural network (NN) method for automating phase identification and quantification. An XRD pattern calculation code was used to generate large datasets of synthetic data that are used to train the NN. This approach offers significant advantages, including the ability to construct databases with a substantial number of XRD patterns and the introduction of extensive variability into these patterns. To enhance the performance of the NN, a specifically designed loss function for proportion inference was employed during the training process, offering improved efficiency and stability compared with traditional functions. The NN, trained exclusively with synthetic data, proved its ability to identify and quantify mineral phases on synthetic and real XRD patterns. Trained NN errors were equal to 0.5% for phase quantification on the synthetic test set, and 6% on the experimental data, in a system containing four phases of contrasting crystal structures (calcite, gibbsite, dolomite and hematite). The proposed method is freely available on GitHub and allows for major advances since it can be applied to any dataset, regardless of the mineral phases present. Full Article text