for Structural insights into conformational switching in latency-associated peptide between transforming growth factor β-1 bound and unbound states By scripts.iucr.org Published On :: 2020-02-06 Transforming growth factor β-1 (TGFβ-1) is a secreted signalling protein that directs many cellular processes and is an attractive target for the treatment of several diseases. The primary endogenous activity regulatory mechanism for TGFβ-1 is sequestration by its pro-peptide, latency-associated peptide (LAP), which sterically prohibits receptor binding by caging TGFβ-1. As such, recombinant LAP is promising as a protein-based therapeutic for modulating TGFβ-1 activity; however, the mechanism of binding is incompletely understood. Comparison of the crystal structure of unbound LAP (solved here to 3.5 Å resolution) with that of the bound complex shows that LAP is in a more open and extended conformation when unbound to TGFβ-1. Analysis suggests a mechanism of binding TGFβ-1 through a large-scale conformational change that includes contraction of the inter-monomer interface and caging by the `straight-jacket' domain that may occur in partnership through a loop-to-helix transition in the core jelly-roll fold. This conformational change does not appear to include a repositioning of the integrin-binding motif as previously proposed. X-ray scattering-based modelling supports this mechanism and reveals possible orientations and ensembles in solution. Although native LAP is heavily glycosylated, solution scattering experiments show that the overall folding and flexibility of unbound LAP are not influenced by glycan modification. The combination of crystallography, solution scattering and biochemical experiments reported here provide insight into the mechanism of LAP sequestration of TGFβ-1 that is of fundamental importance for therapeutic development. Full Article text
for Comparing serial X-ray crystallography and microcrystal electron diffraction (MicroED) as methods for routine structure determination from small macromolecular crystals By scripts.iucr.org Published On :: 2020-02-26 Innovative new crystallographic methods are facilitating structural studies from ever smaller crystals of biological macromolecules. In particular, serial X-ray crystallography and microcrystal electron diffraction (MicroED) have emerged as useful methods for obtaining structural information from crystals on the nanometre to micrometre scale. Despite the utility of these methods, their implementation can often be difficult, as they present many challenges that are not encountered in traditional macromolecular crystallography experiments. Here, XFEL serial crystallography experiments and MicroED experiments using batch-grown microcrystals of the enzyme cyclophilin A are described. The results provide a roadmap for researchers hoping to design macromolecular microcrystallography experiments, and they highlight the strengths and weaknesses of the two methods. Specifically, we focus on how the different physical conditions imposed by the sample-preparation and delivery methods required for each type of experiment affect the crystal structure of the enzyme. Full Article text
for The achievable resolution for X-ray imaging of cells and other soft biological material By scripts.iucr.org Published On :: 2020-03-07 X-ray imaging of soft materials is often difficult because of the low contrast of the components. This particularly applies to frozen hydrated biological cells where the feature of interest can have a similar density to the surroundings. As a consequence, a high dose is often required to achieve the desired resolution. However, the maximum dose that a specimen can tolerate is limited by radiation damage. Results from 3D coherent diffraction imaging (CDI) of frozen hydrated specimens have given resolutions of ∼80 nm compared with the expected resolution of 10 nm predicted from theoretical considerations for identifying a protein embedded in water. Possible explanations for this include the inapplicability of the dose-fractionation theorem, the difficulty of phase determination, an overall object-size dependence on the required fluence and dose, a low contrast within the biological cell, insufficient exposure, and a variety of practical difficulties such as scattering from surrounding material. A recent article [Villaneuva-Perez et al. (2018), Optica, 5, 450–457] concluded that imaging by Compton scattering gave a large dose advantage compared with CDI because of the object-size dependence for CDI. An object-size dependence would severely limit the applicability of CDI and perhaps related coherence-based methods for structural studies. This article specifically includes the overall object size in the analysis of the fluence and dose requirements for coherent imaging in order to investigate whether there is a dependence on object size. The applicability of the dose-fractionation theorem is also discussed. The analysis is extended to absorption-based imaging and imaging by incoherent scattering (Compton) and fluorescence. This article includes analysis of the dose required for imaging specific low-contrast cellular organelles as well as for protein against water. This article concludes that for both absorption-based and coherent diffraction imaging, the dose-fractionation theorem applies and the required dose is independent of the overall size of the object. For incoherent-imaging methods such as Compton scattering, the required dose depends on the X-ray path length through the specimen. For all three types of imaging, the dependence of fluence and dose on a resolution d goes as 1/d4 when imaging uniform-density voxels. The independence of CDI on object size means that there is no advantage for Compton scattering over coherent-based imaging methods. The most optimistic estimate of achievable resolution is 3 nm for imaging protein molecules in water/ice using lensless imaging methods in the water window. However, the attainable resolution depends on a variety of assumptions including the model for radiation damage as a function of resolution, the efficiency of any phase-retrieval process, the actual contrast of the feature of interest within the cell and the definition of resolution itself. There is insufficient observational information available regarding the most appropriate model for radiation damage in frozen hydrated biological material. It is advocated that, in order to compare theory with experiment, standard methods of reporting results covering parameters such as the feature examined (e.g. which cellular organelle), resolution, contrast, depth of the material (for 2D), estimate of noise and dose should be adopted. Full Article text
for Probing the structural pathway of conformational polymorph nucleation by comparing a series of α,ω-alkanedicarboxylic acids By scripts.iucr.org Published On :: 2020-03-26 Herein the nucleation pathway of conformational polymorphs was revealed by studying the relationships and distinctions among a series of α,ω-alkanedicarboxylic acids [HOOC–(CH2)n−2–COOH, named DAn, where n = 5, 7, 9, 11, 13, 15] in the solid state and in solution. Their polymorphic outcomes, with the exception of DA5, show solvent dependence: form I with conformation I crystallizes from solvents with hydrogen-bond donating (HBD) ability, whereas form II with conformation II crystallizes preferentially from solvents with no HBD ability. In contrast, form II of DA5 does not crystallize in any of the solvents used. Quantum mechanical computation showed that there is no direct conformational link between the solvents and the resultant polymorphic outcomes. Surprisingly, solute aggregates were found in no-HBD solvents by Fourier transform infrared spectroscopy, and only monomers could be detected in HBD solvents, suggesting stronger solvation. Furthermore, it was found that all six compounds including DA5 followed the same pattern in solution. Moreover, crystal-packing efficiency calculations and stability tests stated that dimorphs of DA5 bear a greater stability difference than others. These suggest that the rearrangement from conformation II to I could not be limited by hard desolvation in HBD solvents, where form I was also obtained. In other systems, metastable II was produced in the same solvents, probably as a result of the rearrangement being limited by hard desolvation. In this work, a comparative study uncovers the proposed nucleation pathway: difficulty in desolvation has a remarkable effect on the result of rearrangement and nucleation outcome. Full Article text
for Hypothesis for a mechanism of beam-induced motion in cryo-electron microscopy By scripts.iucr.org Published On :: 2020-03-26 Estimates of heat-transfer rates during plunge-cooling and the patterns of ice observed in cryo-EM samples indicate that the grid bars cool much more slowly than do the support foil and sample near the middle of the grid openings. The resulting transient temperature differences generate transient tensile stresses in the support foil. Most of this foil stress develops while the sample is liquid and cooling toward its glass transition Tg, and so does not generate tensile sample stress. As the grid bars continue cooling towards the cryogen temperature and contracting, the tensile stress in the foil is released, placing the sample in compressive stress. Radiation-induced creep in the presence of this compressive stress should generate a doming of the sample in the foil openings, as is observed experimentally. Crude estimates of the magnitude of the doming that may be generated by this mechanism are consistent with observation. Several approaches to reducing beam-induced motion are discussed. Full Article text
for A complete compendium of crystal structures for the human SEPT3 subgroup reveals functional plasticity at a specific septin interface By scripts.iucr.org Published On :: 2020-03-28 Human septins 3, 9 and 12 are the only members of a specific subgroup of septins that display several unusual features, including the absence of a C-terminal coiled coil. This particular subgroup (the SEPT3 septins) are present in rod-like octameric protofilaments but are lacking in similar hexameric assemblies, which only contain representatives of the three remaining subgroups. Both hexamers and octamers can self-assemble into mixed filaments by end-to-end association, implying that the SEPT3 septins may facilitate polymerization but not necessarily function. These filaments frequently associate into higher order complexes which associate with biological membranes, triggering a wide range of cellular events. In the present work, a complete compendium of crystal structures for the GTP-binding domains of all of the SEPT3 subgroup members when bound to either GDP or to a GTP analogue is provided. The structures reveal a unique degree of plasticity at one of the filamentous interfaces (dubbed NC). Specifically, structures of the GDP and GTPγS complexes of SEPT9 reveal a squeezing mechanism at the NC interface which would expel a polybasic region from its binding site and render it free to interact with negatively charged membranes. On the other hand, a polyacidic region associated with helix α5', the orientation of which is particular to this subgroup, provides a safe haven for the polybasic region when retracted within the interface. Together, these results suggest a mechanism which couples GTP binding and hydrolysis to membrane association and implies a unique role for the SEPT3 subgroup in this process. These observations can be accounted for by constellations of specific amino-acid residues that are found only in this subgroup and by the absence of the C-terminal coiled coil. Such conclusions can only be reached owing to the completeness of the structural studies presented here. Full Article text
for The active form of quinol-dependent nitric oxide reductase from Neisseria meningitidis is a dimer By scripts.iucr.org Published On :: 2020-03-21 Neisseria meningitidis is carried by nearly a billion humans, causing developmental impairment and over 100 000 deaths a year. A quinol-dependent nitric oxide reductase (qNOR) plays a critical role in the survival of the bacterium in the human host. X-ray crystallographic analyses of qNOR, including that from N. meningitidis (NmqNOR) reported here at 3.15 Å resolution, show monomeric assemblies, despite the more active dimeric sample being used for crystallization. Cryo-electron microscopic analysis of the same chromatographic fraction of NmqNOR, however, revealed a dimeric assembly at 3.06 Å resolution. It is shown that zinc (which is used in crystallization) binding near the dimer-stabilizing TMII region contributes to the disruption of the dimer. A similar destabilization is observed in the monomeric (∼85 kDa) cryo-EM structure of a mutant (Glu494Ala) qNOR from the opportunistic pathogen Alcaligenes (Achromobacter) xylosoxidans, which primarily migrates as a monomer. The monomer–dimer transition of qNORs seen in the cryo-EM and crystallographic structures has wider implications for structural studies of multimeric membrane proteins. X-ray crystallographic and cryo-EM structural analyses have been performed on the same chromatographic fraction of NmqNOR to high resolution. This represents one of the first examples in which the two approaches have been used to reveal a monomeric assembly in crystallo and a dimeric assembly in vitrified cryo-EM grids. A number of factors have been identified that may trigger the destabilization of helices that are necessary to preserve the integrity of the dimer. These include zinc binding near the entry of the putative proton-transfer channel and the preservation of the conformational integrity of the active site. The mutation near the active site results in disruption of the active site, causing an additional destabilization of helices (TMIX and TMX) that flank the proton-transfer channel helices, creating an inert monomeric enzyme. Full Article text
for Scanning electron microscopy as a method for sample visualization in protein X-ray crystallography By scripts.iucr.org Published On :: 2020-04-10 Developing methods to determine high-resolution structures from micrometre- or even submicrometre-sized protein crystals has become increasingly important in recent years. This applies to both large protein complexes and membrane proteins, where protein production and the subsequent growth of large homogeneous crystals is often challenging, and to samples which yield only micro- or nanocrystals such as amyloid or viral polyhedrin proteins. The versatile macromolecular crystallography microfocus (VMXm) beamline at Diamond Light Source specializes in X-ray diffraction measurements from micro- and nanocrystals. Because of the possibility of measuring data from crystalline samples that approach the resolution limit of visible-light microscopy, the beamline design includes a scanning electron microscope (SEM) to visualize, locate and accurately centre crystals for X-ray diffraction experiments. To ensure that scanning electron microscopy is an appropriate method for sample visualization, tests were carried out to assess the effect of SEM radiation on diffraction quality. Cytoplasmic polyhedrosis virus polyhedrin protein crystals cryocooled on electron-microscopy grids were exposed to SEM radiation before X-ray diffraction data were collected. After processing the data with DIALS, no statistically significant difference in data quality was found between datasets collected from crystals exposed and not exposed to SEM radiation. This study supports the use of an SEM as a tool for the visualization of protein crystals and as an integrated visualization tool on the VMXm beamline. Full Article text
for Fast and accurate defocus modulation for improved tunability of cryo-EM experiments By scripts.iucr.org Published On :: 2020-04-25 Current data collection strategies in electron cryo-microscopy (cryo-EM) record multiframe movies with static optical settings. This limits the number of adjustable parameters that can be used to optimize the experiment. Here, a method for fast and accurate defocus (FADE) modulation during movie acquisition is proposed. It uses the objective lens aperture as an electrostatic pole that locally modifies the electron beam potential. The beam potential variation is converted to defocus change by the typically undesired chromatic aberration of the objective lens. The simplicity, electrostatic principle and low electrical impedance of the device allow fast switching speeds that will enable per-frame defocus modulation of cryo-EM movies. Researchers will be able to define custom defocus `recipes' and tailor the experiment for optimal information extraction from the sample. The FADE method could help to convert the microscope into a more dynamic and flexible optical platform that delivers better performance in cryo-EM single-particle analysis and electron cryo-tomography. Full Article text
for Molecular conformational evolution mechanism during nucleation of crystals in solution By scripts.iucr.org Published On :: 2020-04-24 Nucleation of crystals from solution is fundamental to many natural and industrial processes. In this work, the molecular mechanism of conformational polymorphism nucleation and the links between the molecular conformation in solutions and in crystals were investigated in detail by using 5-nitrofurazone as the model compound. Different polymorphs were prepared, and the conformations in solutions obtained by dissolving different polymorphs were analysed and compared. The solutions of 5-nitrofurazone were proven to contain multiple conformers through quantum chemical computation, Raman spectra analysis, 2D nuclear Overhauser effect spectroscopy spectra analysis and molecular dynamics simulation. The conformational evolution and desolvation path was illustrated according to the 1H NMR spectra of solutions with different concentrations. Finally, based on all the above analysis, the molecular conformational evolution path during nucleation of 5-nitrofurazone was illustrated. The results presented in this work shed a new light on the molecular mechanism of conformational polymorphism nucleation in solution. Full Article text
for Structures of substrate- and product-bound forms of a multi-domain copper nitrite reductase shed light on the role of domain tethering in protein complexes By scripts.iucr.org Published On :: 2020-04-25 Copper-containing nitrite reductases (CuNiRs) are found in all three kingdoms of life and play a major role in the denitrification branch of the global nitrogen cycle where nitrate is used in place of dioxygen as an electron acceptor in respiratory energy metabolism. Several C- and N-terminal redox domain tethered CuNiRs have been identified and structurally characterized during the last decade. Our understanding of the role of tethered domains in these new classes of three-domain CuNiRs, where an extra cytochrome or cupredoxin domain is tethered to the catalytic two-domain CuNiRs, has remained limited. This is further compounded by a complete lack of substrate-bound structures for these tethered CuNiRs. There is still no substrate-bound structure for any of the as-isolated wild-type tethered enzymes. Here, structures of nitrite and product-bound states from a nitrite-soaked crystal of the N-terminal cupredoxin-tethered enzyme from the Hyphomicrobium denitrificans strain 1NES1 (Hd1NES1NiR) are provided. These, together with the as-isolated structure of the same species, provide clear evidence for the role of the N-terminal peptide bearing the conserved His27 in water-mediated anchoring of the substrate at the catalytic T2Cu site. Our data indicate a more complex role of tethering than the intuitive advantage for a partner-protein electron-transfer complex by narrowing the conformational search in such a combined system. Full Article text
for CM01: a facility for cryo-electron microscopy at the European Synchrotron By scripts.iucr.org Published On :: 2019-05-28 Recent improvements in direct electron detectors, microscope technology and software provided the stimulus for a `quantum leap' in the application of cryo-electron microscopy in structural biology, and many national and international centres have since been created in order to exploit this. Here, a new facility for cryo-electron microscopy focused on single-particle reconstruction of biological macromolecules that has been commissioned at the European Synchrotron Radiation Facility (ESRF) is presented. The facility is operated by a consortium of institutes co-located on the European Photon and Neutron Campus and is managed in a similar fashion to a synchrotron X-ray beamline. It has been open to the ESRF structural biology user community since November 2017 and will remain open during the 2019 ESRF–EBS shutdown. Full Article text
for Prediction of models for ordered solvent in macromolecular structures by a classifier based upon resolution-independent projections of local feature data By scripts.iucr.org Published On :: 2019-07-30 Current software tools for the automated building of models for macromolecular X-ray crystal structures are capable of assembling high-quality models for ordered macromolecule and small-molecule scattering components with minimal or no user supervision. Many of these tools also incorporate robust functionality for modelling the ordered water molecules that are found in nearly all macromolecular crystal structures. However, no current tools focus on differentiating these ubiquitous water molecules from other frequently occurring multi-atom solvent species, such as sulfate, or the automated building of models for such species. PeakProbe has been developed specifically to address the need for such a tool. PeakProbe predicts likely solvent models for a given point (termed a `peak') in a structure based on analysis (`probing') of its local electron density and chemical environment. PeakProbe maps a total of 19 resolution-dependent features associated with electron density and two associated with the local chemical environment to a two-dimensional score space that is independent of resolution. Peaks are classified based on the relative frequencies with which four different classes of solvent (including water) are observed within a given region of this score space as determined by large-scale sampling of solvent models in the Protein Data Bank. Designed to classify peaks generated from difference density maxima, PeakProbe also incorporates functionality for identifying peaks associated with model errors or clusters of peaks likely to correspond to multi-atom solvent, and for the validation of existing solvent models using solvent-omit electron-density maps. When tasked with classifying peaks into one of four distinct solvent classes, PeakProbe achieves greater than 99% accuracy for both peaks derived directly from the atomic coordinates of existing solvent models and those based on difference density maxima. While the program is still under development, a fully functional version is publicly available. PeakProbe makes extensive use of cctbx libraries, and requires a PHENIX licence and an up-to-date phenix.python environment for execution. Full Article text
for Combining random microseed matrix screening and the magic triangle for the efficient structure solution of a potential lysin from bacteriophage P68 By scripts.iucr.org Published On :: 2019-07-02 Two commonly encountered bottlenecks in the structure determination of a protein by X-ray crystallography are screening for conditions that give high-quality crystals and, in the case of novel structures, finding derivatization conditions for experimental phasing. In this study, the phasing molecule 5-amino-2,4,6-triiodoisophthalic acid (I3C) was added to a random microseed matrix screen to generate high-quality crystals derivatized with I3C in a single optimization experiment. I3C, often referred to as the magic triangle, contains an aromatic ring scaffold with three bound I atoms. This approach was applied to efficiently phase the structures of hen egg-white lysozyme and the N-terminal domain of the Orf11 protein from Staphylococcus phage P68 (Orf11 NTD) using SAD phasing. The structure of Orf11 NTD suggests that it may play a role as a virion-associated lysin or endolysin. Full Article text
for Sequence assignment for low-resolution modelling of protein crystal structures By scripts.iucr.org Published On :: 2019-07-31 The performance of automated model building in crystal structure determination usually decreases with the resolution of the experimental data, and may result in fragmented models and incorrect side-chain assignment. Presented here are new methods for machine-learning-based docking of main-chain fragments to the sequence and for their sequence-independent connection using a dedicated library of protein fragments. The combined use of these new methods noticeably increases sequence coverage and reduces fragmentation of the protein models automatically built with ARP/wARP. Full Article text
for Methods for merging data sets in electron cryo-microscopy By scripts.iucr.org Published On :: 2019-08-23 Recent developments have resulted in electron cryo-microscopy (cryo-EM) becoming a useful tool for the structure determination of biological macromolecules. For samples containing inherent flexibility, heterogeneity or preferred orientation, the collection of extensive cryo-EM data using several conditions and microscopes is often required. In such a scenario, merging cryo-EM data sets is advantageous because it allows improved three-dimensional reconstructions to be obtained. Since data sets are not always collected with the same pixel size, merging data can be challenging. Here, two methods to combine cryo-EM data are described. Both involve the calculation of a rescaling factor from independent data sets. The effects of errors in the scaling factor on the results of data merging are also estimated. The methods described here provide a guideline for cryo-EM users who wish to combine data sets from the same type of microscope and detector. Full Article text
for Structures of the substrate-binding protein YfeA in apo and zinc-reconstituted holo forms By scripts.iucr.org Published On :: 2019-08-22 In the structural biology of bacterial substrate-binding proteins (SBPs), a growing number of comparisons between substrate-bound and substrate-free forms of metal atom-binding (cluster A-I) SBPs have revealed minimal structural differences between forms. These observations contrast with SBPs that bind substrates such as amino acids or nucleic acids and may undergo >60° rigid-body rotations. Substrate transfer in these SBPs is described by a Venus flytrap model, although this model may not apply to all SBPs. In this report, structures are presented of substrate-free (apo) and reconstituted substrate-bound (holo) YfeA, a polyspecific cluster A-I SBP from Yersinia pestis. It is demonstrated that an apo cluster A-I SBP can be purified by fractionation when co-expressed with its cognate transporter, adding an alternative strategy to the mutagenesis or biochemical treatment used to generate other apo cluster A-I SBPs. The apo YfeA structure contains 111 disordered protein atoms in a mobile helix located in the flexible carboxy-terminal lobe. Metal binding triggers a 15-fold reduction in the solvent-accessible surface area of the metal-binding site and reordering of the 111 protein atoms in the mobile helix. The flexible lobe undergoes a 13.6° rigid-body rotation that is driven by a spring-hammer metal-binding mechanism. This asymmetric rigid-body rotation may be unique to metal atom-binding SBPs (i.e. clusters A-I, A-II and D-IV). Full Article text
for Flexible workflows for on-the-fly electron-microscopy single-particle image processing using Scipion By scripts.iucr.org Published On :: 2019-10-01 Electron microscopy of macromolecular structures is an approach that is in increasing demand in the field of structural biology. The automation of image acquisition has greatly increased the potential throughput of electron microscopy. Here, the focus is on the possibilities in Scipion to implement flexible and robust image-processing workflows that allow the electron-microscope operator and the user to monitor the quality of image acquisition, assessing very simple acquisition measures or obtaining a first estimate of the initial volume, or the data resolution and heterogeneity, without any need for programming skills. These workflows can implement intelligent automatic decisions and they can warn the user of possible acquisition failures. These concepts are illustrated by analysis of the well known 2.2 Å resolution β-galactosidase data set. Full Article text
for Well-based crystallization of lipidic cubic phase microcrystals for serial X-ray crystallography experiments By scripts.iucr.org Published On :: 2019-10-01 Serial crystallography is having an increasing impact on structural biology. This emerging technique opens up new possibilities for studying protein structures at room temperature and investigating structural dynamics using time-resolved X-ray diffraction. A limitation of the method is the intrinsic need for large quantities of well ordered micrometre-sized crystals. Here, a method is presented to screen for conditions that produce microcrystals of membrane proteins in the lipidic cubic phase using a well-based crystallization approach. A key advantage over earlier approaches is that the progress of crystal formation can be easily monitored without interrupting the crystallization process. In addition, the protocol can be scaled up to efficiently produce large quantities of crystals for serial crystallography experiments. Using the well-based crystallization methodology, novel conditions for the growth of showers of microcrystals of three different membrane proteins have been developed. Diffraction data are also presented from the first user serial crystallography experiment performed at MAX IV Laboratory. Full Article text
for Using Phaser and ensembles to improve the performance of SIMBAD By scripts.iucr.org Published On :: 2020-01-01 The conventional approach to search-model identification in molecular replacement (MR) is to screen a database of known structures using the target sequence. However, this strategy is not always effective, for example when the relationship between sequence and structural similarity fails or when the crystal contents are not those expected. An alternative approach is to identify suitable search models directly from the experimental data. SIMBAD is a sequence-independent MR pipeline that uses either a crystal lattice search or MR functions to directly locate suitable search models from databases. The previous version of SIMBAD used the fast AMoRe rotation-function search. Here, a new version of SIMBAD which makes use of Phaser and its likelihood scoring to improve the sensitivity of the pipeline is presented. It is shown that the additional compute time potentially required by the more sophisticated scoring is counterbalanced by the greater sensitivity, allowing more cases to trigger early-termination criteria, rather than running to completion. Using Phaser solved 17 out of 25 test cases in comparison to the ten solved with AMoRe, and it is shown that use of ensemble search models produces additional performance benefits. Full Article text
for Improved chemistry restraints for crystallographic refinement by integrating the Amber force field into Phenix By scripts.iucr.org Published On :: 2020-01-01 The refinement of biomolecular crystallographic models relies on geometric restraints to help to address the paucity of experimental data typical in these experiments. Limitations in these restraints can degrade the quality of the resulting atomic models. Here, an integration of the full all-atom Amber molecular-dynamics force field into Phenix crystallographic refinement is presented, which enables more complete modeling of biomolecular chemistry. The advantages of the force field include a carefully derived set of torsion-angle potentials, an extensive and flexible set of atom types, Lennard–Jones treatment of nonbonded interactions and a full treatment of crystalline electrostatics. The new combined method was tested against conventional geometry restraints for over 22 000 protein structures. Structures refined with the new method show substantially improved model quality. On average, Ramachandran and rotamer scores are somewhat better, clashscores and MolProbity scores are significantly improved, and the modeling of electrostatics leads to structures that exhibit more, and more correct, hydrogen bonds than those refined using traditional geometry restraints. In general it is found that model improvements are greatest at lower resolutions, prompting plans to add the Amber target function to real-space refinement for use in electron cryo-microscopy. This work opens the door to the future development of more advanced applications such as Amber-based ensemble refinement, quantum-mechanical representation of active sites and improved geometric restraints for simulated annealing. Full Article text
for Factors influencing estimates of coordinate error for molecular replacement By scripts.iucr.org Published On :: 2020-01-01 Good prior estimates of the effective root-mean-square deviation (r.m.s.d.) between the atomic coordinates of the model and the target optimize the signal in molecular replacement, thereby increasing the success rate in difficult cases. Previous studies using protein structures solved by X-ray crystallography as models showed that optimal error estimates (refined after structure solution) were correlated with the sequence identity between the model and target, and with the number of residues in the model. Here, this work has been extended to find additional correlations between parameters of the model and the target and hence improved prior estimates of the coordinate error. Using a graph database, a curated set of 6030 molecular-replacement calculations using models that had been solved by X-ray crystallography was analysed to consider about 120 model and target parameters. Improved estimates were achieved by replacing the sequence identity with the Gonnet score for sequence similarity, as well as by considering the resolution of the target structure and the MolProbity score of the model. This approach was extended by analysing 12 610 additional molecular-replacement calculations where the model was determined by NMR. The median r.m.s.d. between pairs of models in an ensemble was found to be correlated with the estimated r.m.s.d. to the target. For models solved by NMR, the overall coordinate error estimates were larger than for structures determined by X-ray crystallography, and were more highly correlated with the number of residues. Full Article text
for Development of SPACE-II for rapid sample exchange at SPring-8 macromolecular crystallography beamlines By scripts.iucr.org Published On :: 2020-01-31 Reducing the sample-exchange time is a crucial issue in maximizing the throughput of macromolecular crystallography (MX) beamlines because the diffraction data collection itself is completed within a minute in the era of pixel-array detectors. To this end, an upgraded sample changer, SPACE-II, has been developed on the basis of the previous model, SPACE (SPring-8 Precise Automatic Cryo-sample Exchanger), at the BL41XU beamline at SPring-8. SPACE-II achieves one sample-exchange step within 16 s, of which its action accounts for only 11 s, because of three features: (i) the implementation of twin arms that enable samples to be exchanged in one cycle of mount-arm action, (ii) the implementation of long-stroke mount arms that allow samples to be exchanged without withdrawal of the detector and (iii) the use of a fast-moving translation and rotation stage for the mount arms. By pre-holding the next sample prior to the sample-exchange sequence, the time was further decreased to 11 s in the case of automatic data collection, of which the action of SPACE-II accounted for 8 s. Moreover, the sample capacity was expanded from four to eight Uni-Pucks. The performance of SPACE-II has been demonstrated in over two years of operation at BL41XU; the average number of samples mounted on the diffractometer in one day was increased from 132 to 185, with an error rate of 0.089%, which counted incidents in which users could not continue with an experiment without recovery work by entering the experimental hutch. On the basis of these results, SPACE-II has been installed at three other MX beamlines at SPring-8 as of July 2019. The fast and highly reliable SPACE-II is now one of the most important pieces of infrastructure for the MX beamlines at SPring-8, providing users with the opportunity to fully make use of limited beamtime with brilliant X-rays. Full Article text
for SEQUENCE SLIDER: expanding polyalanine fragments for phasing with multiple side-chain hypotheses By scripts.iucr.org Published On :: 2020-02-25 Fragment-based molecular-replacement methods can solve a macromolecular structure quasi-ab initio. ARCIMBOLDO, using a common secondary-structure or tertiary-structure template or a library of folds, locates these with Phaser and reveals the rest of the structure by density modification and autotracing in SHELXE. The latter stage is challenging when dealing with diffraction data at lower resolution, low solvent content, high β-sheet composition or situations in which the initial fragments represent a low fraction of the total scattering or where their accuracy is low. SEQUENCE SLIDER aims to overcome these complications by extending the initial polyalanine fragment with side chains in a multisolution framework. Its use is illustrated on test cases and previously unknown structures. The selection and order of fragments to be extended follows the decrease in log-likelihood gain (LLG) calculated with Phaser upon the omission of each single fragment. When the starting substructure is derived from a remote homolog, sequence assignment to fragments is restricted by the original alignment. Otherwise, the secondary-structure prediction is matched to that found in fragments and traces. Sequence hypotheses are trialled in a brute-force approach through side-chain building and refinement. Scoring the refined models through their LLG in Phaser may allow discrimination of the correct sequence or filter the best partial structures for further density modification and autotracing. The default limits for the number of models to pursue are hardware dependent. In its most economic implementation, suitable for a single laptop, the main-chain trace is extended as polyserine rather than trialling models with different sequence assignments, which requires a grid or multicore machine. SEQUENCE SLIDER has been instrumental in solving two novel structures: that of MltC from 2.7 Å resolution data and that of a pneumococcal lipoprotein with 638 residues and 35% solvent content. Full Article text
for The use of local structural similarity of distant homologues for crystallographic model building from a molecular-replacement solution By scripts.iucr.org Published On :: 2020-02-28 The performance of automated protein model building usually decreases with resolution, mainly owing to the lower information content of the experimental data. This calls for a more elaborate use of the available structural information about macromolecules. Here, a new method is presented that uses structural homologues to improve the quality of protein models automatically constructed using ARP/wARP. The method uses local structural similarity between deposited models and the model being built, and results in longer main-chain fragments that in turn can be more reliably docked to the protein sequence. The application of the homology-based model extension method to the example of a CFA synthase at 2.7 Å resolution resulted in a more complete model with almost all of the residues correctly built and docked to the sequence. The method was also evaluated on 1493 molecular-replacement solutions at a resolution of 4.0 Å and better that were submitted to the ARP/wARP web service for model building. A significant improvement in the completeness and sequence coverage of the built models has been observed. Full Article text
for ALIXE: a phase-combination tool for fragment-based molecular replacement By scripts.iucr.org Published On :: 2020-02-25 Fragment-based molecular replacement exploits the use of very accurate yet incomplete search models. In the case of the ARCIMBOLDO programs, consistent phase sets produced from the placement and refinement of fragments with Phaser can be combined in order to increase their signal before proceeding to the step of density modification and autotracing with SHELXE. The program ALIXE compares multiple phase sets, evaluating mean phase differences to determine their common origin, and subsequently produces sets of combined phases that group consistent solutions. In this work, its use on different scenarios of very partial molecular-replacement solutions and its performance after the development of a much-optimized set of algorithms are described. The program is available both standalone and integrated within the ARCIMBOLDO programs. ALIXE has been analysed to identify its rate-limiting steps while exploring the best parameterization to improve its performance and make this software efficient enough to work on modest hardware. The algorithm has been parallelized and redesigned to meet the typical landscape of solutions. Analysis of pairwise correlation between the phase sets has also been explored to test whether this would provide additional insight. ALIXE can be used to exhaustively analyse all partial solutions produced or to complement those already selected for expansion, and also to reduce the number of redundant solutions, which is particularly relevant to the case of coiled coils, or to combine partial solutions from different programs. In each case parallelization and optimization to provide speedup makes its use amenable to typical hardware found in crystallography. ARCIMBOLDO_BORGES and ARCIMBOLDO_SHREDDER now call on ALIXE by default. Full Article text
for Estimating local protein model quality: prospects for molecular replacement By scripts.iucr.org Published On :: 2020-03-03 Model quality assessment programs estimate the quality of protein models and can be used to estimate local error in protein models. ProQ3D is the most recent and most accurate version of our software. Here, it is demonstrated that it is possible to use local error estimates to substantially increase the quality of the models for molecular replacement (MR). Adjusting the B factors using ProQ3D improved the log-likelihood gain (LLG) score by over 50% on average, resulting in significantly more successful models in MR compared with not using error estimates. On a data set of 431 homology models to address difficult MR targets, models with error estimates from ProQ3D received an LLG of >50 for almost half of the models 209/431 (48.5%), compared with 175/431 (40.6%) for the previous version, ProQ2, and only 74/431 (17.2%) for models with no error estimates, clearly demonstrating the added value of using error estimates to enable MR for more targets. ProQ3D is available from http://proq3.bioinfo.se/ both as a server and as a standalone download. Full Article text
for Measuring and using information gained by observing diffraction data By scripts.iucr.org Published On :: 2020-02-25 The information gained by making a measurement, termed the Kullback–Leibler divergence, assesses how much more precisely the true quantity is known after the measurement was made (the posterior probability distribution) than before (the prior probability distribution). It provides an upper bound for the contribution that an observation can make to the total likelihood score in likelihood-based crystallographic algorithms. This makes information gain a natural criterion for deciding which data can legitimately be omitted from likelihood calculations. Many existing methods use an approximation for the effects of measurement error that breaks down for very weak and poorly measured data. For such methods a different (higher) information threshold is appropriate compared with methods that account well for even large measurement errors. Concerns are raised about a current trend to deposit data that have been corrected for anisotropy, sharpened and pruned without including the original unaltered measurements. If not checked, this trend will have serious consequences for the reuse of deposited data by those who hope to repeat calculations using improved new methods. Full Article text
for ALEPH: a network-oriented approach for the generation of fragment-based libraries and for structure interpretation By scripts.iucr.org Published On :: 2020-02-26 The analysis of large structural databases reveals general features and relationships among proteins, providing useful insight. A different approach is required to characterize ubiquitous secondary-structure elements, where flexibility is essential in order to capture small local differences. The ALEPH software is optimized for the analysis and the extraction of small protein folds by relying on their geometry rather than on their sequence. The annotation of the structural variability of a given fold provides valuable information for fragment-based molecular-replacement methods, in which testing alternative model hypotheses can succeed in solving difficult structures when no homology models are available or are successful. ARCIMBOLDO_BORGES combines the use of composite secondary-structure elements as a search model with density modification and tracing to reveal the rest of the structure when both steps are successful. This phasing method relies on general fold libraries describing variations around a given pattern of β-sheets and helices extracted using ALEPH. The program introduces characteristic vectors defined from the main-chain atoms as a way to describe the geometrical properties of the structure. ALEPH encodes structural properties in a graph network, the exploration of which allows secondary-structure annotation, decomposition of a structure into small compact folds, generation of libraries of models representing a variation of a given fold and finally superposition of these folds onto a target structure. These functions are available through a graphical interface designed to interactively show the results of structure manipulation, annotation, fold decomposition, clustering and library generation. ALEPH can produce pictures of the graphs, structures and folds for publication purposes. Full Article text
for Scaling diffraction data in the DIALS software package: algorithms and new approaches for multi-crystal scaling By scripts.iucr.org Published On :: 2020-03-31 In processing X-ray diffraction data, the intensities obtained from integration of the diffraction images must be corrected for experimental effects in order to place all intensities on a common scale both within and between data collections. Scaling corrects for effects such as changes in sample illumination, absorption and, to some extent, global radiation damage that cause the measured intensities of symmetry-equivalent observations to differ throughout a data set. This necessarily requires a prior evaluation of the point-group symmetry of the crystal. This paper describes and evaluates the scaling algorithms implemented within the DIALS data-processing package and demonstrates the effectiveness and key features of the implementation on example macromolecular crystallographic rotation data. In particular, the scaling algorithms enable new workflows for the scaling of multi-crystal or multi-sweep data sets, providing the analysis required to support current trends towards collecting data from ever-smaller samples. In addition, the implementation of a free-set validation method is discussed, which allows the quantification of the suitability of scaling-model and algorithm choices. Full Article text
for Development of basic building blocks for cryo-EM: the emcore and emvis software libraries By scripts.iucr.org Published On :: 2020-03-31 Image-processing software has always been an integral part of structure determination by cryogenic electron microscopy (cryo-EM). Recent advances in hardware and software are recognized as one of the key factors in the so-called cryo-EM resolution revolution. Increasing computational power has opened many possibilities to consider more demanding algorithms, which in turn allow more complex biological problems to be tackled. Moreover, data processing has become more accessible to many experimental groups, with computations that used to last for many days at supercomputing facilities now being performed in hours on personal workstations. All of these advances, together with the rapid expansion of the community, continue to pose challenges and new demands on the software-development side. In this article, the development of emcore and emvis, two basic software libraries for image manipulation and data visualization in cryo-EM, is presented. The main goal is to provide basic functionality organized in modular components that other developers can reuse to implement new algorithms or build graphical applications. An additional aim is to showcase the importance of following established practices in software engineering, with the hope that this could be a first step towards a more standardized way of developing and distributing software in the field. Full Article text
for Pressure-induced transformation of CH3NH3PbI3: the role of the noble-gas pressure transmitting media By scripts.iucr.org Published On :: 2019-05-18 The photovoltaic perovskite, methylammonium lead triiodide [CH3NH3PbI3 (MAPbI3)], is one of the most efficient materials for solar energy conversion. Various kinds of chemical and physical modifications have been applied to MAPbI3 towards better understanding of the relation between composition, structure, electronic properties and energy conversion efficiency of this material. Pressure is a particularly useful tool, as it can substantially reduce the interatomic spacing in this relatively soft material and cause significant modifications to the electronic structure. Application of high pressure induces changes in the crystal symmetry up to a threshold level above which it leads to amorphization. Here, a detailed structural study of MAPbI3 at high hydrostatic pressures using Ne and Ar as pressure transmitting media is reported. Single-crystal X-ray diffraction experiments with synchrotron radiation at room temperature in the 0–20 GPa pressure range show that atoms of both gaseous media, Ne and Ar, are gradually incorporated into MAPbI3, thus leading to marked structural changes of the material. Specifically, Ne stabilizes the high-pressure phase of NexMAPbI3 and prevents amorphization up to 20 GPa. After releasing the pressure, the crystal has the composition of Ne0.97MAPbI3, which remains stable under ambient conditions. In contrast, above 2.4 GPa, Ar accelerates an irreversible amorphization. The distinct impacts of Ne and Ar are attributed to differences in their chemical reactivity under pressure inside the restricted space between the PbI6 octahedra. Full Article text
for The TELL automatic sample changer for macromolecular crystallography By scripts.iucr.org Published On :: 2020-03-31 In this paper, the design and functionalities of the high-throughput TELL sample exchange system for macromolecular crystallography is presented. TELL was developed at the Paul Scherrer Institute with a focus on speed, storage capacity and reliability to serve the three macromolecular crystallography beamlines of the Swiss Light Source, as well as the SwissMX instrument at SwissFEL. Full Article text
for A phase-retrieval toolbox for X-ray holography and tomography By scripts.iucr.org Published On :: 2020-04-14 Propagation-based phase-contrast X-ray imaging is by now a well established imaging technique, which – as a full-field technique – is particularly useful for tomography applications. Since it can be implemented with synchrotron radiation and at laboratory micro-focus sources, it covers a wide range of applications. A limiting factor in its development has been the phase-retrieval step, which was often performed using methods with a limited regime of applicability, typically based on linearization. In this work, a much larger set of algorithms, which covers a wide range of cases (experimental parameters, objects and constraints), is compiled into a single toolbox – the HoloTomoToolbox – which is made publicly available. Importantly, the unified structure of the implemented phase-retrieval functions facilitates their use and performance test on different experimental data. Full Article text
for ID30A-3 (MASSIF-3) – a beamline for macromolecular crystallography at the ESRF with a small intense beam By scripts.iucr.org Published On :: 2020-04-29 ID30A-3 (or MASSIF-3) is a mini-focus (beam size 18 µm × 14 µm) highly intense (2.0 × 1013 photons s−1), fixed-energy (12.81 keV) beamline for macromolecular crystallography (MX) experiments at the European Synchrotron Radiation Facility (ESRF). MASSIF-3 is one of two fixed-energy beamlines sited on the first branch of the canted undulator setup on the ESRF ID30 port and is equipped with a MD2 micro-diffractometer, a Flex HCD sample changer, and an Eiger X 4M fast hybrid photon-counting detector. MASSIF-3 is recommended for collecting diffraction data from single small crystals (≤15 µm in one dimension) or for experiments using serial methods. The end-station has been in full user operation since December 2014, and here its current characteristics and capabilities are described. Full Article text
for Versatile compact heater design for in situ nano-tomography by transmission X-ray microscopy By scripts.iucr.org Published On :: 2020-04-16 A versatile, compact heater designed at National Synchrotron Light Source-II for in situ X-ray nano-imaging in a full-field transmission X-ray microscope is presented. Heater design for nano-imaging is challenging, combining tight spatial constraints with stringent design requirements for the temperature range and stability. Finite-element modeling and analytical calculations were used to determine the heater design parameters. Performance tests demonstrated reliable and stable performance, including maintaining the exterior casing close to room temperature while the heater is operating at above 1100°C, a homogenous heating zone and small temperature fluctuations. Two scientific experiments are presented to demonstrate the heater capabilities: (i) in situ 3D nano-tomography including a study of metal dealloying in a liquid molten salt extreme environment, and (ii) a study of pore formation in icosahedral quasicrystals. The progression of structural changes in both studies were clearly resolved in 3D, showing that the new heater enables powerful capabilities to directly visualize and quantify 3D morphological evolution of materials under real conditions by X-ray nano-imaging at elevated temperature during synthesis, fabrication and operation processes. This heater design concept can be applied to other applications where a precise, compact heater design is required. Full Article text
for LamNI – an instrument for X-ray scanning microscopy in laminography geometry By scripts.iucr.org Published On :: 2020-04-06 Across all branches of science, medicine and engineering, high-resolution microscopy is required to understand functionality. Although optical methods have been developed to `defeat' the diffraction limit and produce 3D images, and electrons have proven ever more useful in creating pictures of small objects or thin sections, so far there is no substitute for X-ray microscopy in providing multiscale 3D images of objects with a single instrument and minimal labeling and preparation. A powerful technique proven to continuously access length scales from 10 nm to 10 µm is ptychographic X-ray computed tomography, which, on account of the orthogonality of the tomographic rotation axis to the illuminating beam, still has the limitation of necessitating pillar-shaped samples of small (ca 10 µm) diameter. Large-area planar samples are common in science and engineering, and it is therefore highly desirable to create an X-ray microscope that can examine such samples without the extraction of pillars. Computed laminography, where the axis of rotation is not perpendicular to the illumination direction, solves this problem. This entailed the development of a new instrument, LamNI, dedicated to high-resolution 3D scanning X-ray microscopy via hard X-ray ptychographic laminography. Scanning precision is achieved by a dedicated interferometry scheme and the instrument covers a scan range of 12 mm × 12 mm with a position stability of 2 nm and positioning errors below 5 nm. A new feature of LamNI is a pair of counter-rotating stages carrying the sample and interferometric mirrors, respectively. Full Article text
for A design of resonant inelastic X-ray scattering (RIXS) spectrometer for spatial- and time-resolved spectroscopy By scripts.iucr.org Published On :: 2020-04-16 The optical design of a Hettrick–Underwood-style soft X-ray spectrometer with Wolter type 1 mirrors is presented. The spectrometer with a nominal length of 3.1 m can achieve a high resolving power (resolving power higher than 10000) in the soft X-ray regime when a small source beam (<3 µm in the grating dispersion direction) and small pixel detector (5 µm effective pixel size) are used. Adding Wolter mirrors to the spectrometer before its dispersive elements can realize the spatial imaging capability, which finds applications in the spectroscopic studies of spatially dependent electronic structures in tandem catalysts, heterostructures, etc. In the pump–probe experiments where the pump beam perturbs the materials followed by the time-delayed probe beam to reveal the transient evolution of electronic structures, the imaging capability of the Wolter mirrors can offer the pixel-equivalent femtosecond time delay between the pump and probe beams when their wavefronts are not collinear. In combination with some special sample handing systems, such as liquid jets and droplets, the imaging capability can also be used to study the time-dependent electronic structure of chemical transformation spanning multiple time domains from microseconds to nanoseconds. The proposed Wolter mirrors can also be adopted to the existing soft X-ray spectrometers that use the Hettrick–Underwood optical scheme, expanding their capabilities in materials research. Full Article text
for High-efficiency ultra-precision comparator for d-spacing mapping measurement of silicon By scripts.iucr.org Published On :: 2020-03-13 This article describes a high-efficiency experimental configuration for a self-referenced lattice comparator with a `brush beam' of synchrotron radiation from a bending magnet and two linear position-sensitive photon-counting-type X-ray detectors. The efficiency is more than ten times greater compared with the `pencil-beam' configuration and a pair of zero-dimensional detectors. A solution for correcting the systematic deviation of d-spacing measurements caused by the horizontal non-uniformity of the brush beam is provided. Also, the use of photon-counting-type one-dimensional detectors not only improves the spatial resolution of the measurements remarkably but can also adjust the sample's attitude angles easily. Full Article text
for Foreword to the special virtual issue on X-ray free-electron lasers By scripts.iucr.org Published On :: 2020-05-01 Full Article text
for Development of an X-ray imaging detector for high-energy X-ray microtomography By journals.iucr.org Published On :: A dedicated X-ray imaging detector for 200 keV high-energy X-ray microtomography was developed to realize high-efficiency high-resolution imaging while keeping the field of view wide. Full Article text
for A high-power, high-repetition-rate THz source for pump–probe experiments at Linac Coherent Light Source II By journals.iucr.org Published On :: Full Article text
for Comprehensive characterization of TSV etching performance with phase-contrast X-ray microtomography By journals.iucr.org Published On :: A complete method of comprehensive and quantitative evaluation of through-silicon via reliability using a highly sensitive phase-contrast X-ray microtomography was established. Quantitative characterizations include 3D local morphology and overall consistency of statistics. Full Article text
for Hard X-ray phase-contrast-enhanced micro-CT for quantifying interfaces within brittle dense root-filling-restored human teeth By journals.iucr.org Published On :: Phase-contrast enhanced micro-computed tomography reveals huge discontinuities at the interfaces between dental fillings and the tooth substrate. Despite the complex micromorphology, gaps in bonding could be visualized and quantified in 3D. Full Article text
for Forthcoming article in Journal of Synchrotron Radiation By journals.iucr.org Published On :: Full Article Still image
for GIDVis: a comprehensive software tool for geometry-independent grazing-incidence X-ray diffraction data analysis and pole-figure calculations By scripts.iucr.org Published On :: 2019-05-31 GIDVis is a software package based on MATLAB specialized for, but not limited to, the visualization and analysis of grazing-incidence thin-film X-ray diffraction data obtained during sample rotation around the surface normal. GIDVis allows the user to perform detector calibration, data stitching, intensity corrections, standard data evaluation (e.g. cuts and integrations along specific reciprocal-space directions), crystal phase analysis etc. To take full advantage of the measured data in the case of sample rotation, pole figures can easily be calculated from the experimental data for any value of the scattering angle covered. As an example, GIDVis is applied to phase analysis and the evaluation of the epitaxial alignment of pentacenequinone crystallites on a single-crystalline Au(111) surface. Full Article text
for ClickX: a visualization-based program for preprocessing of serial crystallography data By scripts.iucr.org Published On :: 2019-05-28 Serial crystallography is a powerful technique in structure determination using many small crystals at X-ray free-electron laser or synchrotron radiation facilities. The large diffraction data volumes require high-throughput software to preprocess the raw images for subsequent analysis. ClickX is a program designated for serial crystallography data preprocessing, capable of rapid data sorting for online feedback and peak-finding refinement by parameter optimization. The graphical user interface (GUI) provides convenient access to various operations such as pattern visualization, statistics plotting and parameter tuning. A batch job module is implemented to facilitate large-data-volume processing. A two-step geometry calibration for single-panel detectors is also integrated into the GUI, where the beam center and detector tilting angles are optimized using an ellipse center shifting method first, then all six parameters, including the photon energy and detector distance, are refined together using a residual minimization method. Implemented in Python, ClickX has good portability and extensibility, so that it can be installed, configured and used on any computing platform that provides a Python interface or common data file format. ClickX has been tested in online analysis at the Pohang Accelerator Laboratory X-ray Free-Electron Laser, Korea, and the Linac Coherent Light Source, USA. It has also been applied in post-experimental data analysis. The source code is available via https://github.com/LiuLab-CSRC/ClickX under a GNU General Public License. Full Article text
for High-performance Python for crystallographic computing By scripts.iucr.org Published On :: 2019-07-24 The Python programming language, combined with the numerical computing library NumPy and the scientific computing library SciPy, has become the de facto standard for scientific computing in a variety of fields. This popularity is mainly due to the ease with which a Python program can be written and executed (easy syntax, dynamical typing, no compilation etc.), coupled with the existence of a large number of specialized third-party libraries that aim to lift all the limitations of the raw Python language. NumPy introduces vector programming, improving execution speeds, whereas SciPy brings a wealth of highly optimized and reliable scientific functions. There are cases, however, where vector programming alone is not sufficient to reach optimal performance. This issue is addressed with dedicated compilers that aim to translate Python code into native and statically typed code with support for the multi-core architectures of modern processors. In the present article it is shown how these approaches can be efficiently used to tackle different problems, with increasing complexity, that are relevant to crystallography: the 2D Laue function, scattering from a strained 2D crystal, scattering from 3D nanocrystals and, finally, diffraction from films and multilayers. For each case, detailed implementations and explanations of the functioning of the algorithms are provided. Different Python compilers (namely NumExpr, Numba, Pythran and Cython) are used to improve performance and are benchmarked against state-of-the-art NumPy implementations. All examples are also provided as commented and didactic Python (Jupyter) notebooks that can be used as starting points for crystallographers curious to enter the Python ecosystem or wishing to accelerate their existing codes. Full Article text
for DatView: a graphical user interface for visualizing and querying large data sets in serial femtosecond crystallography By scripts.iucr.org Published On :: 2019-10-31 DatView is a new graphical user interface (GUI) for plotting parameters to explore correlations, identify outliers and export subsets of data. It was designed to simplify and expedite analysis of very large unmerged serial femtosecond crystallography (SFX) data sets composed of indexing results from hundreds of thousands of microcrystal diffraction patterns. However, DatView works with any tabulated data, offering its functionality to many applications outside serial crystallography. In DatView's user-friendly GUI, selections are drawn onto plots and synchronized across all other plots, so correlations between multiple parameters in large multi-parameter data sets can be rapidly identified. It also includes an item viewer for displaying images in the current selection alongside the associated metadata. For serial crystallography data processed by indexamajig from CrystFEL [White, Kirian, Martin, Aquila, Nass, Barty & Chapman (2012). J. Appl. Cryst. 45, 335–341], DatView generates a table of parameters and metadata from stream files and, optionally, the associated HDF5 files. By combining the functionality of several commonly needed tools for SFX in a single GUI that operates on tabulated data, the time needed to load and calculate statistics from large data sets is reduced. This paper describes how DatView facilitates (i) efficient feedback during data collection by examining trends in time, sample position or any parameter, (ii) determination of optimal indexing and integration parameters via the comparison mode, (iii) identification of systematic errors in unmerged SFX data sets, and (iv) sorting and highly flexible data filtering (plot selections, Boolean filters and more), including direct export of subset CrystFEL stream files for further processing. Full Article text
for FXD-CSD-GUI: a graphical user interface for the X-ray-diffraction-based determination of crystallite size distributions By scripts.iucr.org Published On :: 2019-10-22 Bragg intensities can be used to analyse crystal size distributions in a method called FXD-CSD, which is based on the fast measurement of many Bragg spots using two-dimensional detectors. This work presents the Python-based software and its graphical user interface FXD-CSD-GUI. The GUI enables user-friendly data handling and processing and provides both graphical and numerical crystal size distribution results. Full Article text