
Advanced exploitation of unmerged reflection data during processing and refinement with autoPROC and BUSTER

The validation of structural models obtained by macromolecular X-ray crystallography against experimental diffraction data, whether before deposition into the PDB or after, is typically carried out exclusively against the merged data that are eventually archived along with the atomic coordinates. It is shown here that the availability of unmerged reflection data enables valuable additional analyses to be performed that yield improvements in the final models, and tools are presented to implement them, together with examples of the results to which they give access. The first example is the automatic identification and removal of image ranges affected by loss of crystal centering or by excessive decay of the diffraction pattern as a result of radiation damage. The second example is the 'reflection-auditing' process, whereby individual merged data items showing especially poor agreement with model predictions during refinement are investigated thanks to the specific metadata (such as image number and detector position) that are available for the corresponding unmerged data, potentially revealing previously undiagnosed instrumental, experimental or processing problems. The third example is the calculation of so-called F(early) − F(late) maps from carefully selected subsets of unmerged amplitude data, which can not only highlight the location and extent of radiation damage but can also provide guidance towards suitable fine-grained parametrizations to model the localized effects of such damage.
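The idea behind an F(early) − F(late) difference can be illustrated with a minimal sketch. It assumes that unmerged observations are available as (hkl, image number, amplitude) tuples that are already scaled and mapped to a common asymmetric unit; the function name, the tuple layout and the image cut-offs are hypothetical and are not taken from autoPROC or BUSTER. A real map would additionally need model phases and a Fourier synthesis, which are omitted here.

```python
from collections import defaultdict


def early_late_differences(observations, early_max_image, late_min_image):
    """Return {hkl: mean F(early) - mean F(late)} for reflections seen in both subsets.

    observations: iterable of (hkl, image_number, amplitude) tuples (hypothetical layout).
    Images <= early_max_image count as "early", images >= late_min_image as "late";
    observations in between are ignored.
    """
    early = defaultdict(list)
    late = defaultdict(list)
    for hkl, image, amplitude in observations:
        if image <= early_max_image:
            early[hkl].append(amplitude)
        elif image >= late_min_image:
            late[hkl].append(amplitude)

    differences = {}
    # Only reflections measured in both subsets can contribute a difference.
    for hkl in early.keys() & late.keys():
        f_early = sum(early[hkl]) / len(early[hkl])
        f_late = sum(late[hkl]) / len(late[hkl])
        differences[hkl] = f_early - f_late
    return differences


# Toy observations (made-up numbers, for illustration only).
toy_observations = [
    ((1, 0, 0), 5, 100.0),
    ((1, 0, 0), 10, 98.0),
    ((1, 0, 0), 300, 90.0),
    ((0, 2, 0), 8, 50.0),
    ((0, 2, 0), 150, 49.0),   # falls between the two subsets and is ignored
    ((0, 2, 0), 310, 47.0),
    ((0, 0, 3), 12, 20.0),    # no late observation, so no difference is produced
]
toy_differences = early_late_differences(toy_observations, 50, 250)
```

Keeping a gap between the early and late subsets is one plausible way to make the difference signal stand out; how the subsets are actually selected is described in the paper itself, not in this sketch.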





EMinsight: a tool to capture cryoEM microscope configuration and experimental outcomes for analysis and deposition

The widespread adoption of cryoEM technologies for structural biology has pushed the discipline to new frontiers. A significant worldwide effort has refined the single-particle analysis (SPA) workflow into a reasonably standardized procedure. Significant investments of development time have been made, particularly in sample preparation, microscope data-collection efficiency, pipeline analyses and data archiving. The widespread adoption of specific commercial microscopes, software for controlling them and best practices developed at facilities worldwide has also begun to establish a degree of standardization of the data structures coming from the SPA workflow. There is an opportunity to capitalize on this moment in the maturation of the field, to capture metadata from SPA experiments and correlate the metadata with experimental outcomes, which is presented here in a set of programs called EMinsight. This tool aims to prototype the framework and types of analyses that could lead to new insights into optimal microscope configurations as well as to define methods for metadata capture to assist with the archiving of cryoEM SPA data. It is also envisaged that this tool will be useful to microscope operators and facilities looking to rapidly generate reports on SPA data-collection and screening sessions.





Structural determination and modeling of ciliary microtubules

The axoneme, a microtubule-based array at the center of every cilium, has been the subject of structural investigations for decades, but only recent advances in cryo-EM and cryo-ET have allowed a molecular-level interpretation of the entire complex to be achieved. The unique properties of the nine doublet microtubules and central pair of singlet microtubules that form the axoneme, including the highly decorated tubulin lattice and the docking of massive axonemal complexes, provide opportunities and challenges for sample preparation, 3D reconstruction and atomic modeling. Here, the approaches used for cryo-EM and cryo-ET of axonemes are reviewed, while highlighting the unique opportunities provided by the latest generation of AI-guided tools that are transforming structural biology.





Tomo Live: an on-the-fly reconstruction pipeline to judge data quality for cryo-electron tomography workflows

Data acquisition and processing for cryo-electron tomography can be a significant bottleneck for users. To simplify and streamline the cryo-ET workflow, Tomo Live, an on-the-fly solution that automates the alignment and reconstruction of tilt-series data, enabling real-time data-quality assessment, has been developed. Through the integration of Tomo Live into the data-acquisition workflow for cryo-ET, motion correction is performed directly after each of the acquired tilt angles. Immediately after the tilt-series acquisition has completed, an unattended tilt-series alignment and reconstruction into a 3D volume is performed. The results are displayed in real time in a dedicated remote web platform that runs on the microscope hardware. Through this web platform, users can review the acquired data (aligned stack and 3D volume) and several quality metrics that are obtained during the alignment and reconstruction process. These quality metrics can be used for fast feedback for subsequent acquisitions to save time. Parameters such as Alignment Accuracy, Deleted Tilts and Tilt Axis Correction Angle are visualized as graphs and can be used as filters to export only the best tomograms (raw data, reconstruction and intermediate data) for further processing. Here, the Tomo Live algorithms and workflow are described and representative results on several biological samples are presented. The Tomo Live workflow is accessible to both expert and non-expert users, making it a valuable tool for the continued advancement of structural biology, cell biology and histology.





VitroJet: new features and case studies

Single-particle cryo-electron microscopy has become a widely adopted method in structural biology due to many recent technological advances in microscopes, detectors and image processing. Before being able to inspect a biological sample in an electron microscope, it needs to be deposited in a thin layer on a grid and rapidly frozen. The VitroJet was designed with this aim, as well as avoiding the delicate manual handling and transfer steps that occur during the conventional grid-preparation process. Since its creation, numerous technical developments have resulted in a device that is now widely utilized in multiple laboratories worldwide. It features plasma treatment, low-volume sample deposition through pin printing, optical ice-thickness measurement and cryofixation of pre-clipped Autogrids through jet vitrification. This paper presents recent technical improvements to the VitroJet and the benefits that it brings to the cryo-EM workflow. A wide variety of applications are shown: membrane proteins, nucleosomes, fatty-acid synthase, Tobacco mosaic virus, lipid nanoparticles, tick-borne encephalitis viruses and bacteriophages. These case studies illustrate the advancement of the VitroJet into an instrument that enables accurate control and reproducibility, demonstrating its suitability for time-efficient cryo-EM structure determination.





Efficient in situ screening of and data collection from microcrystals in crystallization plates

A considerable bottleneck in serial crystallography at XFEL and synchrotron sources is the efficient production of large quantities of homogeneous, well-diffracting microcrystals. Efficient high-throughput screening of batch-grown microcrystals and the determination of ground-state structures from different conditions is thus of considerable value in the early stages of a project. Here, a highly sample-efficient methodology to measure serial crystallography data from microcrystals by raster scanning within standard in situ 96-well crystallization plates is described. Structures were determined from very small quantities of microcrystal suspension and the results were compared with those from other sample-delivery methods. The analysis of a two-dimensional batch crystallization screen using this method is also described as a useful guide for further optimization and the selection of appropriate conditions for scaling up microcrystallization.





Mononuclear binding and catalytic activity of europium(III) and gadolinium(III) at the active site of the model metalloenzyme phosphotriesterase

Lanthanide ions have ideal chemical properties for catalysis, such as hard Lewis acidity, fast ligand-exchange kinetics, high coordination-number preferences and low geometric requirements for coordination. As a result, many small-molecule lanthanide catalysts have been described in the literature. Yet, despite the ability of enzymes to catalyse highly stereoselective reactions under gentle conditions, very few lanthanoenzymes have been investigated. In this work, the mononuclear binding of europium(III) and gadolinium(III) to the active site of a mutant of the model enzyme phosphotriesterase is described using X-ray crystallography at 1.78 and 1.61 Å resolution, respectively. It is also shown that despite coordinating a single non-natural metal cation, the PTE-R18 mutant is still able to maintain esterase activity.





Scaling and merging macromolecular diffuse scattering with mdx2

Diffuse scattering is a promising method to gain additional insight into protein dynamics from macromolecular crystallography experiments. Bragg intensities yield the average electron density, while the diffuse scattering can be processed to obtain a three-dimensional reciprocal-space map that is further analyzed to determine correlated motion. To make diffuse scattering techniques more accessible, software for data processing called mdx2 has been created that is both convenient to use and simple to extend and modify. mdx2 is written in Python, and it interfaces with DIALS to implement self-contained data-reduction workflows. Data are stored in NeXus format for software interchange and convenient visualization. mdx2 can be run on the command line or imported as a package, for instance to encapsulate a complete workflow in a Jupyter notebook for reproducible computing and education. Here, mdx2 version 1.0 is described, a new release incorporating state-of-the-art techniques for data reduction. The implementation of a complete multi-crystal scaling and merging workflow is described, and the methods are tested using a high-redundancy data set from cubic insulin. It is shown that redundancy can be leveraged during scaling to correct systematic errors and obtain accurate and reproducible measurements of weak diffuse signals.
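How redundancy helps when merging can be illustrated with a generic inverse-variance weighted mean. This is a minimal sketch under stated assumptions, not the mdx2 API or its scaling model: the function name, the tuple layout and the single multiplicative scale factor per observation are all hypothetical simplifications.

```python
from collections import defaultdict


def merge_observations(observations):
    """Merge redundant measurements of the same reciprocal-space point.

    observations: iterable of (key, intensity, sigma, scale) tuples (hypothetical layout),
    where `scale` is a multiplicative scale factor assumed to be known already.
    Returns {key: (merged_intensity, merged_sigma)} using inverse-variance weights.
    """
    sums = defaultdict(lambda: [0.0, 0.0])  # key -> [sum(w * I), sum(w)]
    for key, intensity, sigma, scale in observations:
        corrected = intensity / scale        # put the observation on the common scale
        corrected_sigma = sigma / scale      # the uncertainty scales the same way
        weight = 1.0 / corrected_sigma ** 2
        sums[key][0] += weight * corrected
        sums[key][1] += weight
    return {
        key: (weighted_sum / weight_sum, weight_sum ** -0.5)
        for key, (weighted_sum, weight_sum) in sums.items()
    }


# Two made-up measurements of the same voxel, the second recorded at twice the scale.
merged = merge_observations([
    ("voxel_1", 10.0, 1.0, 1.0),
    ("voxel_1", 24.0, 2.0, 2.0),
])
```

With more independent measurements per point, the merged uncertainty shrinks, which is the basic reason why high redundancy makes weak signals measurable; correcting systematic errors additionally requires a scaling model, which this sketch takes as given.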





HEIDI: an experiment-management platform enabling high-throughput fragment and compound screening

The Swiss Light Source facilitates fragment-based drug-discovery campaigns for academic and industrial users through the Fast Fragment and Compound Screening (FFCS) software suite. This framework is further enriched by the option to utilize the Smart Digital User (SDU) software for automated data collection across the PXI, PXII and PXIII beamlines. In this work, the newly developed HEIDI webpage (https://heidi.psi.ch) is introduced: a platform crafted using state-of-the-art software architecture and web technologies for sample management of rotational data experiments. The HEIDI webpage features a data-review tab for enhanced result visualization and provides programmatic access through a representational state transfer application programming interface (REST API). The migration of the local FFCS MongoDB instance to the cloud is highlighted and detailed. This transition ensures secure, encrypted and consistently accessible data through a robust and reliable REST API tailored for the FFCS software suite. Collectively, these advancements not only significantly elevate the user experience, but also pave the way for future expansions and improvements in the capabilities of the system.





STOPGAP: an open-source package for template matching, subtomogram alignment and classification

Cryo-electron tomography (cryo-ET) enables molecular-resolution 3D imaging of complex biological specimens such as viral particles, cellular sections and, in some cases, whole cells. This enables the structural characterization of molecules in their near-native environments, without the need for purification or separation, thereby preserving biological information such as conformational states and spatial relationships between different molecular species. Subtomogram averaging is an image-processing workflow that allows users to leverage cryo-ET data to identify and localize target molecules, determine high-resolution structures of repeating molecular species and classify different conformational states. Here, STOPGAP, an open-source package for subtomogram averaging that is designed to provide users with fine control over each of these steps, is described. In providing detailed descriptions of the image-processing algorithms that STOPGAP uses, this manuscript is also intended to serve as a technical resource to users as well as for further community-driven software development.





A database overview of metal-coordination distances in metalloproteins

Metalloproteins are ubiquitous in all living organisms and take part in a very wide range of biological processes. For this reason, their experimental characterization is crucial to obtain improved knowledge of their structure and biological functions. The three-dimensional structure represents highly relevant information since it provides insight into the interaction between the metal ion(s) and the protein fold. Such interactions determine the chemical reactivity of the bound metal. The available PDB structures can contain errors due to experimental factors such as poor resolution and radiation damage. A lack of use of distance restraints during the refinement and validation process also impacts the structure quality. Here, the aim was to obtain a thorough overview of the distribution of the distances between metal ions and their donor atoms through the statistical analysis of a data set based on more than 115 000 metal-binding sites in proteins. This analysis not only produced reference data that can be used by experimentalists to support the structure-determination process, for example as refinement restraints, but also resulted in an improved insight into how protein coordination occurs for different metals and the nature of their binding interactions. In particular, the features of carboxylate coordination were inspected, which is the only type of interaction that is commonly present for nearly all metals.





Identifying and avoiding radiation damage in macromolecular crystallography

Radiation damage remains one of the major impediments to accurate structure solution in macromolecular crystallography. The artefacts of radiation damage can manifest as structural changes that result in incorrect biological interpretations being drawn from a model, they can reduce the resolution to which data can be collected and they can even prevent structure solution entirely. In this article, we discuss how to identify and mitigate the effects of radiation damage at each stage in the macromolecular crystal structure-solution pipeline.





A small step towards an important goal: fragment screen of the c-di-AMP-synthesizing enzyme CdaA

CdaA is the most widespread diadenylate cyclase in many bacterial species, including several multidrug-resistant human pathogens. The enzymatic product of CdaA, cyclic di-AMP, is a secondary messenger that is essential for the viability of many bacteria. Its absence in humans makes CdaA a very promising and attractive target for the development of new antibiotics. Here, the structural results of a crystallographic fragment screen against CdaA from Listeria monocytogenes, a saprophytic Gram-positive bacterium and an opportunistic food-borne pathogen that can cause listeriosis in humans and animals, are presented. Two of the eight fragment molecules reported here were localized in the highly conserved ATP-binding site. These fragments could serve as potential starting points for the development of antibiotics against several CdaA-dependent bacterial species.





What shapes template-matching performance in cryogenic electron tomography in situ?

The detection of specific biological macromolecules in cryogenic electron tomography data is frequently approached by applying cross-correlation-based 3D template matching. To reduce computational cost and noise, high binning is used to aggregate voxels before template matching. This remains a prevalent practice in both practical applications and methods development. Here, the relation between template size, shape and angular sampling is systematically evaluated to identify ribosomes in a ground-truth annotated data set. It is shown that at the commonly used binning, a detailed subtomogram average, a sphere and a heart emoji result in near-identical performance. These findings indicate that with current template-matching practices macromolecules can only be detected with high precision if their shape and size are sufficiently different from the background. Using theoretical considerations, the experimental results are rationalized and it is discussed why primarily low-frequency information remains at high binning and that template matching fails to be accurate because similarly shaped and sized macromolecules have similar low-frequency spectra. These challenges are discussed and potential enhancements for future template-matching methodologies are proposed.
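The loss of shape information at high binning can be illustrated with a one-dimensional toy example. It is a minimal sketch, not the analysis performed in the paper: the two profiles are made-up signals, and `bin_mean` and `ncc` are hypothetical helper names. A "detailed" profile and a featureless box of the same width correlate only moderately at full sampling, but become indistinguishable once the signal is averaged into bins that are coarser than the detail.

```python
import math


def bin_mean(signal, factor):
    """Aggregate consecutive samples by averaging (len(signal) must be a multiple of factor)."""
    return [sum(signal[i:i + factor]) / factor for i in range(0, len(signal), factor)]


def ncc(a, b):
    """Normalized cross-correlation (Pearson correlation) of two equal-length signals."""
    mean_a = sum(a) / len(a)
    mean_b = sum(b) / len(b)
    numerator = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    denominator = math.sqrt(
        sum((x - mean_a) ** 2 for x in a) * sum((y - mean_b) ** 2 for y in b)
    )
    return numerator / denominator


# Made-up 1D profiles with the same extent and total mass but different fine structure.
detailed = [0, 0, 0, 0, 2, 0, 2, 0, 0, 2, 0, 2, 0, 0, 0, 0]
box = [0, 0, 0, 0, 1, 1, 1, 1, 1, 1, 1, 1, 0, 0, 0, 0]

score_unbinned = ncc(detailed, box)
score_binned = ncc(bin_mean(detailed, 4), bin_mean(box, 4))
```

In this toy case the unbinned correlation is about 0.58, whereas after four-fold binning both profiles reduce to the same coarse signal and the correlation is 1. Only the size and coarse shape survive, so any template with a similar low-frequency footprint scores equally well.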





High-confidence placement of low-occupancy fragments into electron density using the anomalous signal of sulfur and halogen atoms

Fragment-based drug design using X-ray crystallography is a powerful technique to enable the development of new lead compounds, or probe molecules, against biological targets. This study addresses the need to determine fragment binding orientations for low-occupancy fragments with incomplete electron density, an essential step before further development of the molecule. Halogen atoms play multiple roles in drug discovery due to their unique combination of electronegativity, steric effects and hydrophobic properties. Fragments incorporating halogen atoms serve as promising starting points in hit-to-lead development as they often establish halogen bonds with target proteins, potentially enhancing binding affinity and selectivity, as well as counteracting drug resistance. Here, the aim was to unambiguously identify the binding orientations of fragment hits for SARS-CoV-2 nonstructural protein 1 (nsp1) which contain a combination of sulfur and/or chlorine, bromine and iodine substituents. The binding orientations of carefully selected nsp1 analogue hits were investigated using their anomalous scattering combined with Pan-Dataset Density Analysis (PanDDA). Anomalous difference Fourier maps derived from diffraction data collected with both standard and long-wavelength X-rays were compared. The discrepancies observed in the maps of iodine-containing fragments collected at different energies were attributed to site-specific radiation damage stemming from the strong X-ray absorption of I atoms, which is likely to cause cleavage of the C—I bond. A reliable and effective data-collection strategy to unambiguously determine the binding orientations of low-occupancy fragments containing sulfur and/or halogen atoms while mitigating radiation damage is presented.





Validation of electron-microscopy maps using solution small-angle X-ray scattering

The determination of the atomic resolution structure of biomacromolecules is essential for understanding details of their function. Traditionally, such a structure determination has been performed with crystallographic or nuclear resonance methods, but during the last decade, cryogenic transmission electron microscopy (cryo-TEM) has become an equally important tool. As the blotting and flash-freezing of the samples can induce conformational changes, external validation tools are required to ensure that the vitrified samples are representative of the solution. Although many validation tools have already been developed, most of them rely on fully resolved atomic models, which prevents early screening of the cryo-TEM maps. Here, a novel and automated method for performing such a validation utilizing small-angle X-ray scattering measurements, publicly available through the new software package AUSAXS, is introduced and implemented. The method has been tested on both simulated and experimental data, where it was shown to work remarkably well as a validation tool. The method provides a dummy atomic model derived from the EM map which best represents the solution structure.





A structural role for tryptophan in proteins, and the ubiquitous Trp Cδ1—H⋯O=C (backbone) hydrogen bond

Tryptophan is the most prominent amino acid found in proteins, with multiple functional roles. Its side chain is made up of the hydrophobic indole moiety, with two groups that act as donors in hydrogen bonds: the Nɛ—H group, which is a potent donor in canonical hydrogen bonds, and a polarized Cδ1—H group, which is capable of forming weaker, noncanonical hydrogen bonds. Due to adjacent electron-withdrawing moieties, C—H⋯O hydrogen bonds are ubiquitous in macromolecules, albeit contingent on the polarization of the donor C—H group. Consequently, Cα—H groups (adjacent to the carbonyl and amino groups of flanking peptide bonds), as well as the Cɛ1—H and Cδ2—H groups of histidines (adjacent to imidazole N atoms), are known to serve as donors in hydrogen bonds, for example stabilizing parallel and antiparallel β-sheets. However, the nature and the functional role of interactions involving the Cδ1—H group of the indole ring of tryptophan are not well characterized. Here, data mining of high-resolution (r ≤ 1.5 Å) crystal structures from the Protein Data Bank was performed and ubiquitous close contacts between the Cδ1—H groups of tryptophan and a range of electronegative acceptors were identified, specifically main-chain carbonyl O atoms immediately upstream and downstream in the polypeptide chain. The stereochemical analysis shows that most of the interactions bear all of the hallmarks of proper hydrogen bonds. At the same time, their cohesive nature is confirmed by quantum-chemical calculations, which reveal interaction energies of 1.5–3.0 kcal mol−1, depending on the specific stereochemistry.





A snapshot love story: what serial crystallography has done and will do for us

Serial crystallography, born from groundbreaking experiments at the Linac Coherent Light Source in 2009, has evolved into a pivotal technique in structural biology. Initially pioneered at X-ray free-electron laser facilities, it has now expanded to synchrotron-radiation facilities globally, with dedicated experimental stations enhancing its accessibility. This review gives an overview of current developments in serial crystallography, emphasizing recent results in time-resolved crystallography, and discussing challenges and shortcomings.





Managing macromolecular crystallographic data with a laboratory information management system

Protein crystallography is an established method to study the atomic structures of macromolecules and their complexes. A prerequisite for successful structure determination is diffraction-quality crystals, which may require extensive optimization of both the protein and the conditions, and hence projects can stretch over an extended period, with multiple users being involved. The workflow from crystallization and crystal treatment to deposition and publication is well defined, and therefore an electronic laboratory information management system (LIMS) is well suited to management of the data. Completion of the project requires key information on all the steps being available and this information should also be made available according to the FAIR principles. As crystallized samples are typically shipped between facilities, a key feature to be captured in the LIMS is the exchange of metadata between the crystallization facility of the home laboratory and, for example, synchrotron facilities. On completion, structures are deposited in the Protein Data Bank (PDB) and the LIMS can include the PDB code in its database, completing the chain of custody from crystallization to structure deposition and publication. A LIMS designed for macromolecular crystallography, IceBear, is available as a standalone installation and as a hosted service, and the implementation of key features for the capture of metadata in IceBear is discussed as an example.





The crystal structure of Shethna protein II (FeSII) from Azotobacter vinelandii suggests a domain swap

The Azotobacter vinelandii FeSII protein forms an oxygen-resistant complex with the nitrogenase MoFe and Fe proteins. FeSII is an adrenodoxin-type ferredoxin that forms a dimer in solution. Previously, the crystal structure was solved [Schlesier et al. (2016), J. Am. Chem. Soc. 138, 239–247] with five copies in the asymmetric unit. One copy is a normal adrenodoxin domain that forms a dimer with its crystallographic symmetry mate. The other four copies are in an 'open' conformation with a loop flipped out exposing the 2Fe–2S cluster. The open and closed conformations were interpreted as oxidized and reduced, respectively, and the large conformational change in the open configuration allowed binding to nitrogenase. Here, the structure of FeSII was independently solved in the same crystal form. The positioning of the atoms in the unit cell is similar to the earlier report. However, the interpretation of the structure is different. The 'open' conformation is interpreted as the product of a crystallization-induced domain swap. The 2Fe–2S cluster is not exposed to solvent, but in the crystal its interacting helix is replaced by the same helix residues from a crystal symmetry mate. The domain swap is complicated, as it is unusual in being in the middle of the protein rather than at a terminus, and it creates arrangements of molecules that can be interpreted in multiple ways. It is also cautioned that crystal structures should be interpreted in terms of the contents of the entire crystal rather than of one asymmetric unit.





Protonation of histidine rings using quantum-mechanical methods

Histidine can be protonated on either or both of the two N atoms of the imidazole moiety. Each of the three possible forms occurs as a result of the stereochemical environment of the histidine side chain. In an atomic model, it is possible to predict which form is most likely to be correct by comparing the possible protonation states in situ and examining possible hydrogen bonding and metal coordination. A more direct method is described that uses quantum-mechanical methods to calculate, also in situ, the minimum geometry and energy for comparison, and therefore to more accurately identify the most likely protonation state.





Crystallographic fragment-binding studies of the Mycobacterium tuberculosis trifunctional enzyme suggest binding pockets for the tails of the acyl-CoA substrates at its active sites and a potential substrate-channeling path between them

The Mycobacterium tuberculosis trifunctional enzyme (MtTFE) is an α2β2 tetrameric enzyme in which the α-chain harbors the 2E-enoyl-CoA hydratase (ECH) and 3S-hydroxyacyl-CoA dehydrogenase (HAD) active sites, and the β-chain provides the 3-ketoacyl-CoA thiolase (KAT) active site. Linear, medium-chain and long-chain 2E-enoyl-CoA molecules are the preferred substrates of MtTFE. Previous crystallographic binding and modeling studies identified binding sites for the acyl-CoA substrates at the three active sites, as well as the NAD binding pocket at the HAD active site. These studies also identified three additional CoA binding sites on the surface of MtTFE that are different from the active sites. It has been proposed that one of these additional sites could be of functional relevance for the substrate channeling (by surface crawling) of reaction intermediates between the three active sites. Here, 226 fragments were screened in a crystallographic fragment-binding study of MtTFE crystals, resulting in the structures of 16 MtTFE–fragment complexes. Analysis of the 121 fragment-binding events shows that the ECH active site is the 'binding hotspot' for the tested fragments, with 41 binding events. The mode of binding of the fragments bound at the active sites provides additional insight into how the long-chain acyl moiety of the substrates can be accommodated at their proposed binding pockets. In addition, the 20 fragment-binding events between the active sites identify potential transient binding sites of reaction intermediates relevant to the possible channeling of substrates between these active sites. These results provide a basis for further studies to understand the functional relevance of the latter binding sites and to identify substrates for which channeling is crucial.





Structural analysis of a ligand-triggered intermolecular disulfide switch in a major latex protein from opium poppy

Several proteins from plant pathogenesis-related family 10 (PR10) are highly abundant in the latex of opium poppy and have recently been shown to play diverse and important roles in the biosynthesis of benzylisoquinoline alkaloids (BIAs). The recent determination of the first crystal structures of PR10-10 showed how large conformational changes in a surface loop and adjacent β-strand are coupled to the binding of BIA compounds to the central hydrophobic binding pocket. A more detailed analysis of these conformational changes is now reported to further clarify how ligand binding is coupled to the formation and cleavage of an intermolecular disulfide bond that is only sterically allowed when the BIA binding pocket is empty. To decouple ligand binding from disulfide-bond formation, each of the two highly conserved cysteine residues (Cys59 and Cys155) in PR10-10 was replaced with serine using site-directed mutagenesis. Crystal structures of the Cys59Ser mutant were determined in the presence of papaverine and in the absence of exogenous BIA compounds. A crystal structure of the Cys155Ser mutant was also determined in the absence of exogenous BIA compounds. All three of these crystal structures reveal conformations similar to that of wild-type PR10-10 with bound BIA compounds. In the absence of exogenous BIA compounds, the Cys59Ser and Cys155Ser mutants appear to bind an unidentified ligand or mixture of ligands that was presumably introduced during expression of the proteins in Escherichia coli. The analysis of conformational changes triggered by the binding of BIA compounds suggests a molecular mechanism coupling ligand binding to the disruption of an intermolecular disulfide bond. This mechanism may be involved in the regulation of biosynthetic reactions in plants and possibly other organisms.





Post-translational modifications in the Protein Data Bank

Proteins frequently undergo covalent modification at the post-translational level, which involves the covalent attachment of chemical groups onto amino acids. This can entail the singular or multiple addition of small groups, such as phosphorylation; long-chain modifications, such as glycosylation; small proteins, such as ubiquitination; as well as the interconversion of chemical groups, such as the formation of pyroglutamic acid. These post-translational modifications (PTMs) are essential for the normal functioning of cells, as they can alter the physicochemical properties of amino acids and therefore influence enzymatic activity, protein localization, protein–protein interactions and protein stability. Despite their inherent importance, accurately depicting PTMs in experimental studies of protein structures often poses a challenge. This review highlights the role of PTMs in protein structures, as well as the prevalence of PTMs in the Protein Data Bank, directing the reader to accurately built examples suitable for use as a modelling reference.




an

Robust and automatic beamstop shadow outlier rejection: combining crystallographic statistics with modern clustering under a semi-supervised learning strategy

During the automatic processing of crystallographic diffraction experiments, beamstop shadows are often unaccounted for or only partially masked. As a result of this, outlier reflection intensities are integrated, which is a known issue. Traditional statistical diagnostics have only limited effectiveness in identifying these outliers, here termed Not-Excluded-unMasked-Outliers (NEMOs). The diagnostic tool AUSPEX allows visual inspection of NEMOs, where they form a typical pattern: clusters at the low-resolution end of the AUSPEX plots of intensities or amplitudes versus resolution. To automate NEMO detection, a new algorithm was developed by combining data statistics with a density-based clustering method. This approach demonstrates a promising performance in detecting NEMOs in merged data sets without disrupting existing data-reduction pipelines. Re-refinement results indicate that excluding the identified NEMOs can effectively enhance the quality of subsequent structure-determination steps. This method offers a prospective automated means to assess the efficacy of a beamstop mask, as well as highlighting the potential of modern pattern-recognition techniques for automating outlier exclusion during data processing, facilitating future adaptation to evolving experimental strategies.
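As a rough illustration of how a simple data statistic might be combined with density-based clustering, the following Python sketch pre-filters unusually weak reflections and then keeps only those that form a dense group at low resolution. It is not the published algorithm: the function names, the weak-intensity threshold, the axis scaling, the `eps` and `min_pts` values and the toy reflections are all assumptions chosen for illustration.

```python
from math import hypot
from statistics import median


def dbscan(points, eps, min_pts):
    """Minimal DBSCAN: returns one label per point, -1 meaning noise."""
    labels = [None] * len(points)

    def neighbours(i):
        xi, yi = points[i]
        return [j for j, (xj, yj) in enumerate(points)
                if hypot(xi - xj, yi - yj) <= eps]

    cluster = -1
    for i in range(len(points)):
        if labels[i] is not None:
            continue
        seeds = neighbours(i)
        if len(seeds) < min_pts:
            labels[i] = -1  # provisional noise; may become a border point later
            continue
        cluster += 1
        labels[i] = cluster
        queue = list(seeds)
        while queue:
            j = queue.pop()
            if labels[j] == -1:
                labels[j] = cluster  # noise reachable from a core point: border
            if labels[j] is not None:
                continue
            labels[j] = cluster
            reach = neighbours(j)
            if len(reach) >= min_pts:
                queue.extend(reach)  # j is a core point: keep expanding
    return labels


def flag_nemo_candidates(reflections, weak_fraction=0.05, eps=0.5, min_pts=3):
    """reflections: list of (1/d^2, intensity). Returns indices of flagged items."""
    typical = median(i for _, i in reflections)
    # Statistical pre-filter (assumed): far weaker than the typical intensity.
    weak = [k for k, (_, i) in enumerate(reflections) if i < weak_fraction * typical]
    # Put both axes on comparable (hypothetical) scales before clustering.
    points = [(reflections[k][0] * 1000.0, reflections[k][1] / typical) for k in weak]
    labels = dbscan(points, eps, min_pts)
    return [k for k, label in zip(weak, labels) if label != -1]


# Toy data: five ordinary low-resolution reflections, a dense group of four
# suspiciously weak ones, and one isolated weak reflection.
toy = [(0.0010, 950.0), (0.0020, 1020.0), (0.0030, 880.0), (0.0040, 1100.0),
       (0.0050, 990.0),
       (0.0012, 3.0), (0.0015, -2.0), (0.0018, 5.0), (0.0021, 1.0),
       (0.0048, 4.0)]
print(flag_nemo_candidates(toy))  # [5, 6, 7, 8]
```

In this toy case the isolated weak reflection is left alone because it has no dense neighbourhood, which is the behaviour one would want from a clustering step that targets shadowed regions rather than individual weak data.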




an

Utilizing anomalous signals for element identification in macromolecular crystallography

AlphaFold2 has revolutionized structural biology by offering unparalleled accuracy in predicting protein structures. Traditional methods for determining protein structures, such as X-ray crystallography and cryo-electron microscopy, are often time-consuming and resource-intensive. AlphaFold2 provides models that are valuable for molecular replacement, aiding in model building and docking into electron density or potential maps. However, despite its capabilities, models from AlphaFold2 do not consistently match the accuracy of experimentally determined structures, need to be validated experimentally and currently miss some crucial information, such as post-translational modifications, ligands and bound ions. In this paper, the advantages are explored of collecting X-ray anomalous data to identify chemical elements, such as metal ions, which are key to understanding certain structures and functions of proteins. This is achieved through methods such as calculating anomalous difference Fourier maps or refining the imaginary component of the anomalous scattering factor f''. Anomalous data can serve as a valuable complement to the information provided by AlphaFold2 models and this is particularly significant in elucidating the roles of metal ions.




an

EMhub: a web platform for data management and on-the-fly processing in scientific facilities

Most scientific facilities produce large amounts of heterogeneous data at a rapid pace. Managing users, instruments, reports and invoices presents additional challenges. To address these challenges, EMhub, a web platform designed to support the daily operations and record-keeping of a scientific facility, has been introduced. EMhub enables the easy management of user information, instruments, bookings and projects. The application was initially developed to meet the needs of a cryoEM facility, but its functionality and adaptability have proven to be broad enough to be extended to other data-generating centers. The expansion of EMhub is enabled by the modular nature of its core functionalities. The application allows external processes to be connected via a REST API, automating tasks such as folder creation, user and password generation, and the execution of real-time data-processing pipelines. EMhub has been used for several years at the Swedish National CryoEM Facility and has been installed in the CryoEM center at the Structural Biology Department at St. Jude Children's Research Hospital. A fully automated single-particle pipeline has been implemented for on-the-fly data processing and analysis. At St. Jude, the X-Ray Crystallography Center and the Single-Molecule Imaging Center have already expanded the platform to support their operational and data-management workflows.




an

Structure and stability of an apo thermophilic esterase that hydrolyzes polyhydroxybutyrate

Pollution from plastics is a global problem that threatens the biosphere for a host of reasons, including the time scale that it takes for most plastics to degrade. Biodegradation is an ideal solution for remediating bioplastic waste as it does not require the high temperatures necessary for thermal degradation and does not introduce additional pollutants into the environment. Numerous organisms can scavenge for bioplastics, such as polylactic acid (PLA) or poly-(R)-hydroxybutyrate (PHB), which they can use as an energy source. Recently, a promiscuous PHBase from the thermophilic soil bacterium Lihuaxuella thermophila (LtPHBase) was identified. LtPHBase can accommodate many substrates, including PHB granules and films and PHB block copolymers, as well as the unrelated polymers PLA and polycaprolactone (PCL). LtPHBase uses the expected Ser–His–Asp catalytic triad for hydrolysis, with optimal enzyme activity near 70°C. Here, the 1.75 Å resolution crystal structure of apo LtPHBase is presented and its chemical stability is profiled. Knowledge of its substrate preferences was extended to different-sized PHB granules. It is shown that LtPHBase is highly resistant to unfolding, with barriers typical for thermophilic enzymes, and shows a preference for low-molecular-mass PHB granules. These insights have implications for the long-term potential of LtPHBase as an industrial PHB hydrolase and shed light on the evolutionary role that this enzyme plays in bacterial metabolism.




an

Analysis of crystallographic phase retrieval using iterative projection algorithms

For protein crystals in which more than two thirds of the volume is occupied by solvent, the featureless nature of the solvent region often generates a constraint that is powerful enough to allow direct phasing of X-ray diffraction data. Practical implementation relies on the use of iterative projection algorithms with good global convergence properties to solve the difficult nonconvex phase-retrieval problem. In this paper, some aspects of phase retrieval using iterative projection algorithms are systematically explored, where the diffraction data and density-value distributions in the protein and solvent regions provide the sole constraints. The analysis is based on the addition of random error to the phases of previously determined protein crystal structures, followed by evaluation of the ability to recover the correct phase set as the distance from the solution increases. The properties of the difference-map (DM), relaxed–reflect–reflect (RRR) and relaxed averaged alternating reflectors (RAAR) algorithms are compared. All of these algorithms prove to be effective for crystallographic phase retrieval, and the useful ranges of the adjustable parameter which controls their behavior are established. When these algorithms converge to the solution, the algorithm trajectory becomes stationary; however, the density function continues to fluctuate significantly around its mean position. It is shown that averaging over the algorithm trajectory in the stationary region, following convergence, improves the density estimate, with this procedure outperforming previous approaches for phase or density refinement.
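To make the idea of an iterative projection algorithm with trajectory averaging more concrete, here is a minimal Python sketch of a relaxed-reflect-reflect style update, x ← x + β[P_B(2P_A(x) − x) − P_A(x)], applied to a deliberately trivial two-dimensional problem. The two constraint sets are assumed toy stand-ins (two straight lines), not the diffraction-data and density-value constraints of the paper, and the function names, β value and iteration counts are hypothetical. Because this toy problem is convex, the trajectory simply settles at the solution, so the averaging step only shows where such an average would be taken.

```python
def project_a(p):
    """Projection onto toy constraint set A: the line y = 0."""
    return (p[0], 0.0)


def project_b(p):
    """Projection onto toy constraint set B: the line x - y = 1."""
    t = (p[0] - p[1] - 1.0) / 2.0
    return (p[0] - t, p[1] + t)


def rrr_with_averaging(start, beta=0.75, n_iter=200, n_average=50):
    """Run the relaxed update and average the estimate over the final iterations."""
    x = start
    tail = []
    for k in range(n_iter):
        pa = project_a(x)
        # Reflect the iterate through set A: R_A(x) = 2 P_A(x) - x.
        reflected = (2.0 * pa[0] - x[0], 2.0 * pa[1] - x[1])
        pb = project_b(reflected)
        # Relaxed update: x <- x + beta * (P_B(R_A(x)) - P_A(x)).
        x = (x[0] + beta * (pb[0] - pa[0]), x[1] + beta * (pb[1] - pa[1]))
        if k >= n_iter - n_average:
            # Collect the solution estimate P_A(x) over the assumed stationary region.
            tail.append(project_a(x))
    n = len(tail)
    return (sum(p[0] for p in tail) / n, sum(p[1] for p in tail) / n)


estimate = rrr_with_averaging((3.0, 2.0))
print(estimate)  # close to the intersection point (1, 0)
```

In a real nonconvex phase-retrieval setting the iterate would keep fluctuating after convergence, and the average over the tail would then differ from any single iterate.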




an

The role of alkyl chain length in the melt and solution crystallization of paliperidone aliphatic prodrugs

Fatty acid-derivative prodrugs have been utilized extensively to improve the physicochemical, biopharmaceutical and pharmacokinetic properties of active pharmaceutical ingredients. However, to our knowledge, the crystallization behavior of prodrugs modified with different fatty acids has not been explored. In the present work, a series of paliperidone aliphatic prodrugs with alkyl chain lengths ranging from C4 to C16 was investigated with respect to crystal structure, crystal morphology and crystallization kinetics. The paliperidone derivatives exhibited isostructural crystal packing, despite the different alkyl chain lengths, and crystallized with the dominant (100) face in both melt and solution. The rate of crystallization for paliperidone derivatives in the melt increases with alkyl chain length owing to greater molecular mobility. In contrast, the longer chains prolong the nucleation induction time and reduce the crystal growth kinetics in solution. The results show a correlation between difficulty of nucleation in solution and the interfacial energy. This work provides insight into the crystallization behavior of paliperidone aliphatic prodrugs and reveals that the role of alkyl chain length in the crystallization behavior has a strong dependence on the crystallization method.




an

STEM SerialED: achieving high-resolution data for ab initio structure determination of beam-sensitive nanocrystalline materials

Serial electron diffraction (SerialED), which applies a snapshot data acquisition strategy for each crystal, was introduced to tackle the problem of radiation damage in the structure determination of beam-sensitive materials by three-dimensional electron diffraction (3DED). The snapshot data acquisition in SerialED can be realized using both transmission and scanning transmission electron microscopes (TEM/STEM). However, the current SerialED workflow based on STEM setups requires special external devices and software, which limits broader adoption. Here, we present a simplified experimental implementation of STEM-based SerialED on Thermo Fisher Scientific STEMs using common proprietary software interfaced through Python scripts to automate data collection. Specifically, we utilize TEM Imaging and Analysis (TIA) scripting and TEM scripting to access the STEM functionalities of the microscope, and DigitalMicrograph scripting to control the camera for snapshot data acquisition. Data analysis adapts the existing workflow using the software CrystFEL, which was developed for serial X-ray crystallography. Our workflow for STEM SerialED can be used on any Gatan or Thermo Fisher Scientific camera. We apply this workflow to collect high-resolution STEM SerialED data from two aluminosilicate zeolites, zeolite Y and ZSM-25. We demonstrate, for the first time, ab initio structure determination through direct methods using STEM SerialED data. Zeolite Y is relatively stable under the electron beam, and STEM SerialED data extend to 0.60 Å. We show that the structural model obtained using STEM SerialED data merged from 358 crystals is nearly identical to that using continuous rotation electron diffraction data from one crystal. This demonstrates that accurate structures can be obtained from STEM SerialED. Zeolite ZSM-25 is very beam-sensitive and has a complex structure. We show that STEM SerialED greatly improves the data resolution of ZSM-25, compared with serial rotation electron diffraction (SerialRED), from 1.50 to 0.90 Å. This allows, for the first time, the use of standard phasing methods, such as direct methods, for the ab initio structure determination of ZSM-25.




an

Orientational ordering and assembly of silica–nickel Janus particles in a magnetic field

The orientation ordering and assembly behavior of silica–nickel Janus particles in a static external magnetic field were probed by ultra small-angle X-ray scattering (USAXS). Even in a weak applied field, the net magnetic moments of the individual particles aligned in the direction of the field, as indicated by the anisotropy in the recorded USAXS patterns. X-ray photon correlation spectroscopy (XPCS) measurements on these suspensions revealed that the corresponding particle dynamics are primarily Brownian diffusion [Zinn, Sharpnack & Narayanan (2023). Soft Matter, 19, 2311–2318]. At higher fields, the magnetic forces led to chain-like configurations of particles, as indicated by an additional feature in the USAXS pattern. A theoretical framework is provided for the quantitative interpretation of the observed anisotropic scattering diagrams and the corresponding degree of orientation. No anisotropy was detected when the magnetic field was applied along the beam direction, which is also replicated by the model. The method presented here could be useful for the interpretation of oriented scattering patterns from a wide variety of particulate systems. The combination of USAXS and XPCS is a powerful approach for investigating asymmetric colloidal particles in external fields.




an

Refining short-range order parameters from the three-dimensional diffuse scattering in single-crystal electron diffraction data

Our study compares short-range order parameters refined from the diffuse scattering in single-crystal X-ray and single-crystal electron diffraction data. Nb0.84CoSb was chosen as a reference material. The correlations between neighbouring vacancies and the displacements of Sb and Co atoms were refined from the diffuse scattering using a Monte Carlo refinement in DISCUS. The difference between the Sb and Co displacements refined from the diffuse scattering and the Sb and Co displacements refined from the Bragg reflections in single-crystal X-ray diffraction data is 0.012 (7) Å for the refinement on diffuse scattering in single-crystal X-ray diffraction data and 0.03 (2) Å for the refinement on the diffuse scattering in single-crystal electron diffraction data. As electron diffraction requires much smaller crystals than X-ray diffraction, this opens up the possibility of refining short-range order parameters in many technologically relevant materials for which no crystals large enough for single-crystal X-ray diffraction are available.




an

C-SPAM: an open-source time-resolved specimen vitrification device with light-activated molecules

Molecular structures can be determined in vitro and in situ with cryo-electron microscopy (cryo-EM). Specimen preparation is a major obstacle in cryo-EM. Typical sample preparation is orders of magnitude slower than biological processes. Time-resolved cryo-EM (TR-cryo-EM) can capture short-lived states. Here, Cryo-EM sample preparation with light-activated molecules (C-SPAM) is presented, an open-source, photochemistry-coupled device for TR-cryo-EM that enables millisecond resolution and tunable timescales across broad biological applications.




an

Solving protein structures by combining structure prediction, molecular replacement and direct-methods-aided model completion

Highly accurate protein structure prediction can generate accurate models of protein and protein–protein complexes in X-ray crystallography. However, the question of how to make more effective use of predicted models for completing structure analysis, and which strategies should be employed for the more challenging cases such as multi-helical structures, multimeric structures and extremely large structures, both in the model preparation and in the completion steps, remains open for discussion. In this paper, a new strategy is proposed based on the framework of direct methods and dual-space iteration, which can greatly simplify the pre-processing steps of predicted models both in normal and in challenging cases. Following this strategy, full-length models or the conservative structural domains could be used directly as the starting model, and the phase error and the model bias between the starting model and the real structure would be modified in the direct-methods-based dual-space iteration. Many challenging cases (from CASP14) have been tested for the general applicability of this constructive strategy, and almost complete models have been generated with reasonable statistics. The hybrid strategy therefore provides a meaningful scheme for X-ray structure determination using a predicted model as the starting point.




an

Orientational analysis of atomic pair correlations in nanocrystalline indium oxide thin films

The application of grazing-incidence total X-ray scattering (GITXS) for pair distribution function (PDF) analysis using >50 keV X-rays from synchrotron light sources has created new opportunities for structural characterization of supported thin films with high resolution. Compared with grazing-incidence wide-angle X-ray scattering, which is only useful for highly ordered materials, GITXS/PDFs expand such analysis to largely disordered or nanostructured materials by examining the atomic pair correlations dependent on the direction relative to the surface of the supporting substrate. A characterization of nanocrystalline In2O3-derived thin films is presented here with in-plane-isotropic and out-of-plane-anisotropic orientational ordering of the atomic structure, each synthesized using different techniques. The atomic orientations of such films are known to vary based on the synthetic conditions. Here, an azimuthal orientational analysis of these films using GITXS with a single incident angle is shown to resolve the markedly different orientations of the atomic structures with respect to the planar support and the different degrees of long-range order, and hence, the terminal surface chemistries. It is anticipated that orientational analysis of GITXS/PDF data will offer opportunities to extend structural analyses of thin films by providing a means to qualitatively determine the major atomic orientation within nanocrystalline and, eventually, non-crystalline films.




an

The interoperability of crystallographic data and databases

Interoperability of crystallographic data with other disciplines is essential for the smooth and rapid progress of structure-based science in the computer age. Within crystallography and closely related subject areas, there is already a high level of conformance to the generally accepted FAIR principles (that data be findable, accessible, interoperable and reusable) through the adoption of common information exchange protocols by databases, publishers, instrument vendors, experimental facilities and software authors. Driven by the success within these domains, the IUCr has worked closely with CODATA (the Committee on Data of the International Science Council) to help develop the latter's commitment to cross-domain integration of discipline-specific data. The IUCr has, in particular, emphasized the need for standards relating to data quality and completeness as an adjunct to the FAIR data landscape. This can ensure definitive reusable data, which in turn can aid interoperability across domains. A microsymposium at the IUCr 2023 Congress provided an up-to-date survey of data interoperability within and outside of crystallography, expounded using a broad range of examples.




an

Crystal structure of vancomycin bound to the resistance determinant d-alanine-d-serine

Vancomycin is a glycopeptide antibiotic that for decades has been a mainstay of treatment for persistent bacterial infections. However, the spread of antibiotic resistance threatens its continued utility. In particular, vancomycin-resistant enterococci (VRE) have become a pressing clinical challenge. Vancomycin acts by binding and sequestering the intermediate Lipid II in cell-wall biosynthesis, specifically recognizing a d-alanine-d-alanine dipeptide motif within the Lipid II molecule. VRE achieve resistance by remodeling this motif to either d-alanine-d-lactate or d-alanine-d-serine; the former substitution essentially abolishes recognition by vancomycin of Lipid II, whereas the latter reduces the affinity of the antibiotic by roughly one order of magnitude. The complex of vancomycin bound to d-alanine-d-serine has been crystallized, and its 1.20 Å X-ray crystal structure is presented here. This structure reveals that the d-alanine-d-serine ligand is bound in essentially the same position and same pose as the native d-alanine-d-alanine ligand. The serine-containing ligand appears to be slightly too large to be comfortably accommodated in this way, suggesting one possible contribution to the reduced binding affinity. In addition, two flexible hydroxyl groups – one from the serine side chain of the ligand, and the other from a glucose sugar on the antibiotic – are locked into single conformations in the complex, which is likely to contribute an unfavorable entropic component to the recognition of the serine-containing ligand.




an

The curious case of proton migration under pressure in the malonic acid and 4,4'-bipyridine cocrystal

In the search for new active pharmaceutical ingredients, the precise control of the chemistry of cocrystals becomes essential. One crucial step within this chemistry is proton migration between cocrystal coformers to form a salt, usually anticipated by the empirical ΔpKa rule. Due to the effective role it plays in modifying intermolecular distances and interactions, pressure adds a new dimension to the ΔpKa rule. Still, this variable has been scarcely applied to induce proton-transfer reactions within these systems. In our study, high-pressure X-ray diffraction and Raman spectroscopy experiments, supported by DFT calculations, reveal modifications to the protonation states of the 4,4'-bipyridine (BIPY) and malonic acid (MA) cocrystal (BIPYMA) that allow the conversion of the cocrystal phase into ionic salt polymorphs. On compression, neutral BIPYMA and monoprotonated (BIPYH⁺MA⁻) species coexist up to 3.1 GPa, where a phase transition to a structure of P2₁/c symmetry occurs, induced by a double proton-transfer reaction forming BIPYH₂²⁺MA²⁻. The low-pressure C2/c phase is recovered at 2.4 GPa on decompression, leading to a 0.7 GPa hysteresis pressure range. This is one of a few studies on proton transfer in multicomponent crystals that shows how susceptible the interconversion between differently charged species is to even slight pressure changes, and how the proton transfer can be a triggering factor leading to changes in the crystal symmetry. These new data, coupled with information from previous reports on proton-transfer reactions between coformers, extend the applicability of the ΔpKa rule incorporating the pressure required to induce salt formation.




an

Structural analysis of nanocrystals by pair distribution function combining electron diffraction with crystal tilting

As an important characterization method, pair distribution function (PDF) has been extensively used in structural analysis of nanomaterials, providing key insights into the degree of crystallinity, atomic structure, local disorder etc. The collection of scattering signals with good statistics is necessary for a reliable structural analysis. However, current conventional electron diffraction experiments using PDF (ePDF) are limited in their ability to acquire continuous diffraction rings for large nanoparticles. Herein, a new method – tilt-ePDF – is proposed to improve the data quality and compatibility of ePDF by a combination of electron diffraction and specimen tilting. In the present work, a tilt-series of electron diffraction patterns was collected from gold nanoparticles with three different sizes and a standard sample polycrystalline aluminium film for ePDF analysis. The results show that tilt-ePDF can not only enhance the continuity of diffraction rings, but can also improve the signal-to-noise ratio in the high scattering angle range. As a result, compared with conventional ePDF data, tilt-ePDF data provide structure parameters with a better accuracy and lower residual factors in the refinement against the crystal structure. This method provides a new way of utilizing ePDF to obtain accurate local structure information from nanoparticles.




an

Community recommendations on cryoEM data archiving and validation

In January 2020, a workshop was held at EMBL-EBI (Hinxton, UK) to discuss data requirements for the deposition and validation of cryoEM structures, with a focus on single-particle analysis. The meeting was attended by 47 experts in data processing, model building and refinement, validation, and archiving of such structures. This report describes the workshop's motivation and history, the topics discussed, and the resulting consensus recommendations. Some challenges for future methods-development efforts in this area are also highlighted, as is the implementation to date of some of the recommendations.




an

Cocrystals of a coumarin derivative: an efficient approach towards anti-leishmanial cocrystals against MIL-resistant Leishmania tropica

Leishmaniasis is a neglected parasitic tropical disease with numerous clinical manifestations. One of the causative agents of cutaneous leishmaniasis (CL) is Leishmania tropica (L. tropica) known for causing ulcerative lesions on the skin. The adverse effects of the recommended available drugs, such as amphotericin B and pentavalent antimonial, and the emergence of drug resistance in parasites, mean the search for new safe and effective anti-leishmanial agents is crucial. Miltefosine (MIL) was the first recommended oral medication, but its use is now limited because of the rapid emergence of resistance. Pharmaceutical cocrystallization is an effective method to improve the physicochemical and biological properties of active pharmaceutical ingredients (APIs). Herein, we describe the cocrystallization of coumarin-3-carboxylic acid (CU, 1a; 2-oxobenzopyrane-3-carboxylic acid, C10H6O4) with five coformers [2-amino-3-bromopyridine (1b), 2-amino-5-(trifluoromethyl)-pyridine (1c), 2-amino-6-methylpyridine (1d), p-aminobenzoic acid (1e) and amitrole (1f)] in a 1:1 stoichiometric ratio via the neat grinding method. The cocrystals 2–6 obtained were characterized via single-crystal X-ray diffraction, powder X-ray diffraction, differential scanning calorimetry and thermogravimetric analysis, as well as Fourier transform infrared spectroscopy. Non-covalent interactions, such as van der Waals, hydrogen bonding, C—H⋯π and π⋯π interactions contribute significantly towards the packing of a crystal structure and alter the physicochemical and biological activity of CU. In this research, newly synthesized cocrystals were evaluated for their anti-leishmanial activity against the MIL-resistant L. tropica and cytotoxicity against the 3T3 (normal fibroblast) cell line. Among the non-cytotoxic cocrystals synthesized (2–6), CU:1b (2, IC50 = 61.83 ± 0.59 µM), CU:1c (3, 125.7 ± 1.15 µM) and CU:1d (4, 48.71 ± 0.75 µM) appeared to be potent anti-leishmanial agents and showed several-fold more anti-leishmanial potential than the tested standard drug (MIL, IC50 = 169.55 ± 0.078 µM). The results indicate that cocrystals 2–4 are promising anti-leishmanial agents which require further exploration.
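The fold differences implied by the reported IC50 values can be checked with a few lines of Python. This is only a worked piece of arithmetic, assuming that "fold" is read as the ratio of the standard drug's IC50 to that of each cocrystal (a lower IC50 indicating higher potency); the dictionary keys and the function name are illustrative, and the reported uncertainties are ignored.

```python
# Mean IC50 values (micromolar) as quoted in the abstract above.
ic50_um = {
    "MIL (standard)": 169.55,
    "CU:1b (2)": 61.83,
    "CU:1c (3)": 125.7,
    "CU:1d (4)": 48.71,
}


def fold_vs_reference(reference_ic50, compound_ic50):
    """Ratio of reference IC50 to compound IC50; values above 1 favour the compound."""
    return reference_ic50 / compound_ic50


reference = ic50_um["MIL (standard)"]
folds = {
    name: fold_vs_reference(reference, value)
    for name, value in ic50_um.items()
    if name != "MIL (standard)"
}

for name, fold in folds.items():
    print(f"{name}: {fold:.2f}-fold")
```

On this reading the ratios are roughly 2.7 for cocrystal 2, 1.3 for cocrystal 3 and 3.5 for cocrystal 4.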




an

Transferable Hirshfeld atom model for rapid evaluation of aspherical atomic form factors

Form factors based on aspherical models of atomic electron density have brought great improvement in the accuracies of hydrogen atom parameters derived from X-ray crystal structure refinement. Today, two main groups of such models are available: banks of transferable atomic densities parametrized using the Hansen–Coppens multipole model, which allow for rapid evaluation of atomic form factors, and Hirshfeld atom refinement (HAR)-related methods, which are usually more accurate but also slower. In this work, a model that combines the ideas utilized in the two approaches is tested. It uses atomic electron densities based on Hirshfeld partitions of electron densities, which are precalculated and stored in a databank. This model was also applied during the refinement of the structures of five small molecules. A comparison of the resulting hydrogen atom parameters with those derived from neutron diffraction data indicates that they are more accurate than those obtained with the Hansen–Coppens based databank, and only slightly less accurate than those obtained with a version of HAR that neglects the crystal environment. The advantage of using HAR becomes more noticeable when the effects of the environment are included. To speed up calculations, atomic densities were represented by multipole expansion with spherical harmonics up to l = 7, which used numerical radial functions (a different approach to that applied in the Hansen–Coppens model). Calculations of atomic form factors for the small protein crambin (at 0.73 Å resolution) took only 68 s using 12 CPU cores.




an

Nanostructure and dynamics of N-truncated copper amyloid-β peptides from advanced X-ray absorption fine structure

An X-ray absorption spectroscopy (XAS) electrochemical cell was used to collect high-quality XAS measurements of N-truncated Cu:amyloid-β (Cu:Aβ) samples under near-physiological conditions. N-truncated Cu:Aβ peptide complexes contribute to oxidative stress and neurotoxicity in Alzheimer's patients' brains. However, the redox properties of copper in different Aβ peptide sequences are inconsistent. Therefore, the geometry of the copper-binding sites in Aβ4–8/12/16 was determined using novel advanced extended X-ray absorption fine structure (EXAFS) analysis. This enables these peptides to perform redox cycles in a manner that might produce toxicity in human brains. Fluorescence XAS measurements were corrected for systematic errors including defective-pixel data, monochromator glitches and dispersion of pixel spectra. Experimental uncertainties at each data point were measured explicitly from the point-wise variance of corrected pixel measurements. The copper-binding environments of Aβ4–8/12/16 were precisely determined by fitting XAS measurements with propagated experimental uncertainties, advanced analysis and hypothesis testing, providing a mechanism to pursue many similarly complex questions in bioscience. The low-temperature XAS measurements here determine that CuII is bound to the first amino acids in the high-affinity amino-terminal copper and nickel (ATCUN) binding motif with an oxygen in a tetragonal pyramid geometry in the Aβ4–8/12/16 peptides. Room-temperature XAS electrochemical-cell measurements observe metal reduction in the Aβ4–16 peptide. Robust investigations of XAS provide structural details of CuII binding with a very different bis-His motif and a water oxygen in a quasi-tetrahedral geometry. Oxidized XAS measurements of Aβ4–12/16 imply that both CuII and CuIII are accommodated in an ATCUN-like binding site. Hypotheses for these CuI, CuII and CuIII geometries were proven and disproven using the novel data and statistical analysis including F tests. Structural parameters were determined with an accuracy some tenfold better than literature claims of past work. A new protocol was also developed using EXAFS data analysis for monitoring radiation damage. This gives a template for advanced analysis of complex biosystems.




an

The ABC toxin complex from Yersinia entomophaga can package three different cytotoxic components expressed from distinct genetic loci in an unfolded state: the structures of both shell and cargo

Bacterial ABC toxin complexes (Tcs) comprise three core proteins: TcA, TcB and TcC. The TcA protein forms a pentameric assembly that attaches to the surface of target cells and penetrates the cell membrane. The TcB and TcC proteins assemble as a heterodimeric TcB–TcC subcomplex that makes a hollow shell. This TcB–TcC subcomplex self-cleaves and encapsulates within the shell a cytotoxic `cargo' encoded by the C-terminal region of the TcC protein. Here, we describe the structure of a previously uncharacterized TcC protein from Yersinia entomophaga, encoded by a gene at a distant genomic location from the genes encoding the rest of the toxin complex, in complex with the TcB protein. When encapsulated within the TcB–TcC shell, the C-terminal toxin adopts an unfolded and disordered state, with limited areas of local order stabilized by the chaperone-like inner surface of the shell. We also determined the structure of the toxin cargo alone and show that when not encapsulated within the shell, it adopts an ADP-ribosyltransferase fold most similar to the catalytic domain of the SpvB toxin from Salmonella typhimurium. Our structural analysis points to a likely mechanism whereby the toxin acts directly on actin, modifying it in a way that prevents normal polymerization.




an

KINNTREX: a neural network to unveil protein mechanisms from time-resolved X-ray crystallography

Here, a machine-learning method based on a kinetically informed neural network (NN) is introduced. The proposed method is designed to analyze a time series of difference electron-density maps from a time-resolved X-ray crystallographic experiment. The method is named KINNTREX (kinetics-informed NN for time-resolved X-ray crystallography). To validate KINNTREX, multiple realistic scenarios were simulated with increasing levels of complexity. For the simulations, time-resolved X-ray data were generated that mimic data collected from the photocycle of the photoactive yellow protein. KINNTREX only requires the number of intermediates and approximate relaxation times (both obtained from a singular value decomposition) and does not require an assumption of a candidate mechanism. It successfully predicts a consistent chemical kinetic mechanism, together with difference electron-density maps of the intermediates that appear during the reaction. These features make KINNTREX attractive for tackling a wide range of biomolecular questions. In addition, the versatility of KINNTREX can inspire more NN-based applications to time-resolved data from biological macromolecules obtained by other methods.
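As a rough illustration of the preprocessing step mentioned above, the sketch below applies a singular value decomposition to a synthetic time series of difference maps and counts the significant singular values. This is not KINNTREX itself. The matrix layout (voxels by time points), the relative threshold, the two relaxation times and the noise level are all assumptions chosen for the example.

```python
import numpy as np


def count_significant_components(maps, rel_threshold=0.01):
    """Count singular values above rel_threshold times the largest one.

    maps: array of shape (n_voxels, n_timepoints), one difference map per column.
    Returns (n_significant, singular_values, right_singular_vectors).
    """
    _, s, vt = np.linalg.svd(maps, full_matrices=False)
    n_significant = int(np.sum(s > rel_threshold * s[0]))
    return n_significant, s, vt


# Synthetic data: two hypothetical processes with relaxation times of
# 1e-4 s and 1e-2 s, sampled at 40 logarithmically spaced time points.
rng = np.random.default_rng(0)
times = np.logspace(-6, 0, 40)
spatial = rng.normal(size=(500, 2))                  # mock spatial signatures
temporal = np.stack([np.exp(-times / 1e-4),
                     np.exp(-times / 1e-2)])         # exponential decays
maps = spatial @ temporal + 1e-4 * rng.normal(size=(500, 40))

n_sig, singular_values, right_vectors = count_significant_components(maps)
```

In a scheme of this kind, the leading rows of `right_vectors` carry the time dependence, so approximate relaxation times could then be estimated by fitting them with sums of exponentials. That fitting step is not shown here.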




an

Crystal structure of human peptidylarginine deiminase type VI (PAD6) provides insights into its inactivity

Human peptidylarginine deiminase isoform VI (PAD6), which is predominantly limited to cytoplasmic lattices in the mammalian oocytes in ovarian tissue, is essential for female fertility. It belongs to the peptidylarginine deiminase (PAD) enzyme family that catalyzes the conversion of arginine residues to citrulline in proteins. In contrast to other members of the family, recombinant PAD6 was previously found to be catalytically inactive. We sought to provide structural insight into the human homologue to shed light on this observation. We report here the first crystal structure of PAD6, determined at 1.7 Å resolution. PAD6 follows the same domain organization as other structurally known PAD isoenzymes. Further structural analysis and size-exclusion chromatography show that PAD6 behaves as a homodimer, similar to PAD4. Differential scanning fluorimetry suggests that PAD6 does not coordinate Ca2+, which agrees with the observation that the acidic residues that coordinate Ca2+ in other PAD homologs are not conserved in PAD6. The crystal structure of PAD6 shows similarities with the inactive state of apo PAD2, in which the active site conformation is unsuitable for catalytic citrullination. The putative active site of PAD6 adopts a non-productive conformation that would not allow protein–substrate binding due to steric hindrance with rigid secondary structure elements. This observation is further supported by the lack of activity on the histone H3 and cytokeratin 5 substrates. These findings suggest a different mechanism for enzymatic activation compared with other PADs; alternatively, PAD6 may exert a non-enzymatic function in the cytoplasmic lattice of oocytes and early embryos.




an

Time-series analysis of rhenium(I) organometallic covalent binding to a model protein for drug development

Metal-based complexes, with their unique chemical properties, including multiple oxidation states, radio-nuclear capabilities and various coordination geometries, yield value as potential pharmaceuticals. Understanding the interactions between metals and biological systems will prove key for site-specific coordination of new metal-based lead compounds. This study merges the concepts of target coordination with fragment-based drug methodologies, supported by varying the anomalous scattering of rhenium along with infrared spectroscopy, and has identified rhenium metal sites bound covalently with two amino acid types within the model protein. A time-based series of lysozyme-rhenium-imidazole (HEWL-Re-Imi) crystals was analysed systematically over a span of 38 weeks. The main rhenium covalent coordination is observed at His15, Asp101 and Asp119. Weak (i.e. noncovalent) interactions are observed at other aspartic, asparagine, proline, tyrosine and tryptophan side chains. Detailed bond distance comparisons, including precision estimates, are reported, utilizing the diffraction precision index supplemented with small-molecule data from the Cambridge Structural Database. Key findings include changes in the protein structure induced at the rhenium metal binding site, not observed in similar metal-free structures. The binding sites are typically found along the solvent-channel-accessible protein surface. The three primary covalent metal binding sites are consistent throughout the time series, whereas binding to neighbouring amino acid residues changes through the time series. Co-crystallization was used, consistently yielding crystals four days after setup. After crystal formation, soaking of the compound into the crystal continues over the 38 weeks, which explains these structural adjustments. It is the covalent bond stability at the three sites, their proximity to the solvent channel and the movement of residues to accommodate the metal that are important, and may prove useful for future radiopharmaceutical development including target modification.




an

RCSB Protein Data Bank: supporting research and education worldwide through explorations of experimentally determined and computationally predicted atomic level 3D biostructures

The Protein Data Bank (PDB) was established as the first open-access digital data resource in biology and medicine in 1971 with seven X-ray crystal structures of proteins. Today, the PDB houses >210 000 experimentally determined, atomic level, 3D structures of proteins and nucleic acids as well as their complexes with one another and small molecules (e.g. approved drugs, enzyme cofactors). These data provide insights into fundamental biology, biomedicine, bioenergy and biotechnology. They proved particularly important for understanding the SARS-CoV-2 global pandemic. The US-funded Research Collaboratory for Structural Bioinformatics Protein Data Bank (RCSB PDB) and other members of the Worldwide Protein Data Bank (wwPDB) partnership jointly manage the PDB archive and support >60 000 `data depositors' (structural biologists) around the world. wwPDB ensures the quality and integrity of the data in the ever-expanding PDB archive and supports global open access without limitations on data usage. The RCSB PDB research-focused web portal at https://www.rcsb.org/ (RCSB.org) supports millions of users worldwide, representing a broad range of expertise and interests. In addition to retrieving 3D structure data, PDB `data consumers' access comparative data and external annotations, such as information about disease-causing point mutations and genetic variations. RCSB.org also provides access to >1 000 000 computed structure models (CSMs) generated using artificial intelligence/machine-learning methods. To avoid doubt, the provenance and reliability of experimentally determined PDB structures and CSMs are identified. Related training materials are available to support users in their RCSB.org explorations.




an

Chaperone-mediated MHC-I peptide exchange in antigen presentation

This work focuses on molecules that are encoded by the major histocompatibility complex (MHC) and that bind self-, foreign- or tumor-derived peptides and display these at the cell surface for recognition by receptors on T lymphocytes (T cell receptors, TCR) and natural killer (NK) cells. Over the past few decades, a vast knowledge base has accumulated on the structures of MHC molecules and of MHC/TCR complexes with specificity for many different peptides. In recent years, the structures of MHC-I molecules complexed with chaperones that assist in peptide loading have been revealed by X-ray crystallography and cryogenic electron microscopy. These structures have been further studied using mutagenesis, molecular dynamics and NMR approaches. This review summarizes the current structures and dynamic principles that govern peptide exchange as these relate to the process of antigen presentation.