
TAAM refinement on high-resolution experimental and simulated 3D ED/MicroED data for organic molecules

3D electron diffraction (3D ED), or microcrystal electron diffraction (MicroED), has become an alternative technique for determining the high-resolution crystal structures of compounds from sub-micron-sized crystals. Here, we considered l-alanine, α-glycine and urea, which are known to form good-quality crystals, and collected high-resolution 3D ED data on our in-house TEM instrument. In this study, we present a comparison of independent atom model (IAM) and transferable aspherical atom model (TAAM) kinematical refinement against experimental and simulated data. TAAM refinement on both experimental and simulated data clearly improves the model fitting statistics (R factors and residual electrostatic potential) compared to IAM refinement. This shows that TAAM better represents the experimental electrostatic potential of organic crystals than IAM. Furthermore, we compared the geometrical parameters and atomic displacement parameters (ADPs) resulting from the experimental refinements with the simulated refinements, with the periodic density functional theory (DFT) calculations and with published X-ray and neutron crystal structures. The TAAM refinements on the 3D ED data did not improve the accuracy of the bond lengths between the non-H atoms. The experimental 3D ED data provided more accurate H-atom positions than the IAM refinements on the X-ray diffraction data. The IAM refinements against 3D ED data had a tendency to lead to slightly longer X—H bond lengths than TAAM, but the difference was statistically insignificant. Atomic displacement parameters were too large by tens of percent for l-alanine and α-glycine. Most probably, other unmodelled effects were causing this behaviour, such as radiation damage or dynamical scattering.
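The R factors used above as a model-fitting statistic can be illustrated with a minimal sketch. It assumes the conventional definition R1 = Σ||Fo| − |Fc|| / Σ|Fo|; the amplitude values and the names `f_obs`, `f_iam` and `f_taam` are hypothetical and are not data from the study.

```python
def r_factor(f_obs, f_calc):
    """Conventional R1: sum of | |Fo| - |Fc| | divided by the sum of |Fo|."""
    if len(f_obs) != len(f_calc):
        raise ValueError("observed and calculated lists must have equal length")
    numerator = sum(abs(abs(o) - abs(c)) for o, c in zip(f_obs, f_calc))
    denominator = sum(abs(o) for o in f_obs)
    if denominator == 0:
        raise ValueError("sum of observed amplitudes is zero")
    return numerator / denominator


# Hypothetical structure-factor amplitudes, for illustration only.
f_obs = [10.0, 20.0, 30.0, 40.0]
f_iam = [12.0, 17.0, 33.0, 38.0]   # made-up amplitudes from a spherical-atom model
f_taam = [11.0, 19.0, 31.0, 39.0]  # made-up amplitudes from an aspherical-atom model

r_iam = r_factor(f_obs, f_iam)
r_taam = r_factor(f_obs, f_taam)
print(f"R1(IAM) = {r_iam:.3f}, R1(TAAM) = {r_taam:.3f}")
```

A lower R1 means that the calculated amplitudes track the observed ones more closely; this is the sense in which one model can be said to fit better than another.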





The crystal structure of the ammonium salt of 2-aminomalonic acid

The salt ammonium 2-aminomalonate (systematic name: ammonium 2-azaniumylpropanedioate), NH4+·C3H4NO4−, was synthesized in diethyl ether from the starting materials malonic acid, ammonia and bromine. The salt was recrystallized from water as colourless blocks. In the solid state, intramolecular medium–strong N—H⋯O, weak C—H⋯O and weak C—H⋯N hydrogen bonds build a three-dimensional network.





Crystal structure and cryomagnetic study of a mononuclear erbium(III) oxamate inclusion complex

The synthesis, crystal structure and magnetic properties of an oxamate-containing erbium(III) complex, namely, tetrabutylammonium aqua[N-(2,4,6-trimethylphenyl)oxamato]erbium(III)–dimethyl sulfoxide–water (1/3/1.5), (C16H36N)[Er(C11H12NO3)4(H2O)]·3C2H6OS·1.5H2O or n-Bu4N[Er(Htmpa)4(H2O)]·3DMSO·1.5H2O (1), are reported. The crystal structure of 1 reveals the occurrence of an erbium(III) ion, which is surrounded by four N-phenyl-substituted oxamate ligands and one water molecule in a nine-coordinated environment, together with one tetrabutylammonium cation acting as a counter-ion, and one water and three dimethyl sulfoxide (DMSO) molecules of crystallization. Variable-temperature static (dc) and dynamic (ac) magnetic measurements were carried out for this mononuclear complex, revealing that it behaves as a field-induced single-ion magnet (SIM) below 5.0 K.





Synthesis, spectroscopic and crystallographic characterization of various cymantrenyl thioethers [Mn{C5HxBry(SMe)z}(PPh3)(CO)2]

Starting from [Mn(C5H4Br)(PPh3)(CO)2] (1a), the cymantrenyl thioethers [Mn(C5H4SMe)(PPh3)(CO)2] (1b) and [Mn{C5H4–nBr(SMe)n}(PPh3)(CO)2] (n = 1 for compound 2, n = 2 for 3 and n = 3 for 4) were obtained, using either n-butyllithium (n-BuLi), lithium diisopropylamide (LDA) or lithium tetramethylpiperidide (LiTMP) as base, followed by electrophilic quenching with MeSSMe. Stepwise consecutive reaction of [Mn(C5Br5)(PPh3)(CO)2] with n-BuLi and MeSSMe led finally to [Mn{C5(SMe)5}(PPh3)(CO)2] (11), only the fifth complex to be reported containing a perthiolated cyclopentadienyl ring. The molecular and crystal structures of 1b, 3, 4 and 11 were determined and were studied for the occurrence of S⋯S and S⋯Br interactions. It turned out that although some interactions of this type occurred, they were of minor importance for the arrangement of the molecules in the crystal.





Crystal structure of the cytotoxic macrocyclic trichothecene Isororidin A

The highly cytotoxic macrocyclic trichothecene Isororidin A (C29H40O9) was isolated from the fungus Myrothecium verrucaria, endophytic on the wild medicinal plant 'Datura' (Datura stramonium L.), and was characterized by one- (1D) and two-dimensional (2D) NMR spectroscopy. The three-dimensional structure of Isororidin A has been confirmed by X-ray crystallography at 0.81 Å resolution from crystals grown in the orthorhombic space group P212121, with one molecule per asymmetric unit. Isororidin A is the epimer, at position C-13' of the macrocyclic ring, of Roridin A, whose structure was previously described by X-ray crystallography.





Occupational modulation in the (3+1)-dimensional incommensurate structure of (2S,3S)-2-amino-3-hydroxy-3-methyl-4-phenoxybutanoic acid dihydrate

The incommensurately modulated structure of (2S,3S)-2-amino-3-hydroxy-3-methyl-4-phenoxybutanoic acid dihydrate (C11H15NO4·2H2O or I·2H2O) is described in the (3+1)-dimensional superspace group P212121(0β0)000 (β = 0.357). The loss of the three-dimensional periodicity is ascribed to the occupational modulation of one positionally disordered solvent water molecule, where the two positions are related by a small translation [ca 0.666 (9) Å] and ∼168 (5)° rotation about one of its O—H bonds, with an average 0.624 (3):0.376 (3) occupancy ratio. The occupational modulation of this molecule arises due to the competition between the different hydrogen-bonding motifs associated with each position. The structure can be very well refined in the average approximation (all satellite reflections disregarded) in the space group P212121, with the water molecule refined as disordered over two positions in a 0.625 (16):0.375 (16) ratio. The refinement in the commensurate threefold supercell approximation in the space group P1121 is also of high quality, with the six corresponding water molecules exhibiting three different occupancy ratios averaging 0.635:0.365.





Further evaluation of the shape of atomic Hirshfeld surfaces: M⋯H contacts and homoatomic bonds

It is well known that Hirshfeld surfaces provide an easy and straightforward way of analysing intermolecular interactions in the crystal environment. The use of atomic Hirshfeld surfaces has also demonstrated that such surfaces carry information related to chemical bonds, which allows a deeper evaluation of the structures. Here we briefly summarize the approach of atomic Hirshfeld surfaces while further evaluating the kind of information that can be retrieved from them. We show that the analysis of the metal-centre Hirshfeld surfaces from structures refined via Hirshfeld Atom Refinement (HAR) allows accurate evaluation of contacts of type M⋯H, and that such contacts can be related to the overall shape of the surfaces. The compounds analysed were tetraaquabis(3-carboxypropionato)metal(II), [M(C4H3O4)2(H2O)4], for metal(II)/M = manganese/Mn, cobalt/Co, nickel/Ni and zinc/Zn. We also evaluate the sensitivity of the surfaces by an investigation of seemingly flat surfaces through analysis of the curvature functions in the direction of C—C bonds. The obtained values not only demonstrate variations in curvature but also show a correlation with the hybridization of the C atoms involved in the bond.





2,4-Diarylpyrroles: synthesis, characterization and crystallographic insights

Three 2,4-diarylpyrroles were synthesized starting from 4-nitrobutanones and the crystal structures of two derivatives were analysed. These are 4-(4-methoxyphenyl)-2-(thiophen-2-yl)-1H-pyrrole, C15H13NOS, and 3-(4-bromophenyl)-2-nitroso-5-phenyl-1H-pyrrole, C16H11BrN2O. Although pyrroles without substituents at the α-position with respect to the N atom are very air sensitive and tend to polymerize, we succeeded in growing an adequate crystal for X-ray diffraction analysis. Further derivatization using sodium nitrite afforded a nitrosyl pyrrole derivative, which crystallized in the triclinic space group P1̄ with Z = 6. Thus, herein we report the first crystal structure of a nitrosyl pyrrole. Interestingly, the co-operative hydrogen bonds in this NO-substituted pyrrole lead to a trimeric structure with bifurcated halogen bonds at the ends, forming a two-dimensional (2D) layer with interstitial voids having a radius of 5 Å, similar to some reported macrocyclic porphyrins.





Crystal structures of two unexpected products of vicinal diamines left to crystallize in acetone

Herein we report the crystal structures of two benzodiazepines obtained by reacting N,N'-(4,5-diamino-1,2-phenylene)bis(4-methylbenzenesulfonamide) (1) or 4,5-(4-methylbenzenesulfonamido)benzene-1,2-diaminium dichloride (1·2HCl) with acetone, giving 2,2,4-trimethyl-8,9-bis(4-methylbenzenesulfonamido)-2,3-dihydro-5H-1,5-benzodiazepine, C26H30N4O4S2 (2), and 2,2,4-trimethyl-8,9-bis(4-methylbenzenesulfonamido)-2,3-dihydro-5H-1,5-benzodiazepin-1-ium chloride 0.3-hydrate, C26H31N4O4S2+·Cl−·0.3H2O (3). Compounds 2 and 3 were first obtained in attempts to recrystallize 1 and 1·2HCl using acetone as solvent. This solvent reacted with the vicinal diamines present in the molecular structures, forming a 5H-1,5-benzodiazepine ring. In the crystal structure of 2, the seven-membered ring of benzodiazepine adopts a boat-like conformation, while upon protonation, observed in the crystal structure of 3, it adopts an envelope-like conformation. In both crystalline compounds, the tosylamide N atoms are not in resonance with the arene ring, mainly due to hydrogen bonds and steric hindrance caused by the large vicinal groups in the aromatic ring. At a supramolecular level, the crystal structure is maintained by a combination of hydrogen bonds and hydrophobic interactions. In 2, amine-to-tosyl N—H⋯O and amide-to-imine N—H⋯N hydrogen bonds can be observed. In contrast, in 3, the chloride counter-ion and water molecule result in most of the hydrogen bonds being of the amide-to-chloride and ammonium-to-chloride N—H⋯Cl types, while the amine interacts with the tosyl group, as seen in 2. In conclusion, we report the synthesis of 1, 1·2HCl and 2, as well as their chemical characterization. For 2, two synthetic methods are described, i.e. solvent-mediated crystallization and synthesis via a more efficient and cleaner route as a polycrystalline material. Salt 3 was only obtained as presented, with only a few crystals being formed.





Coordination variety of phenyltetrazolato and dimethylamido ligands in dimeric Ti, Zr, and Ta complexes

Three structurally diverse 5-phenyltetrazolato (Tz) Ti, Zr, and Ta complexes, namely, (C2H8N)[Ti2(C7H5N4)5(C2H6N)4]·1.45C6H6 or (Me2NH2)[Ti2(NMe2)4(2,3-μ-Tz)3(2-η1-Tz)2]·1.45C6H6 (1·1.45C6H6), [Zr2(C7H5N4)6(C2H6N)2(C2H7N)2]·1.12C6H6·0.382CH2Cl2 or [Zr2(Me2NH)2(NMe2)2(2,3-μ-Tz)3(2-η1-Tz)2(1,2-η2-Tz)]·1.12C6H6·0.38CH2Cl2 (2·1.12C6H6·0.38CH2Cl2), and (C2H8N)2[Ta2(C7H5N4)8(C2H6N)2O]·0.25C7H8 or (Me2NH2)2[Ta2(NMe2)2(2,3-μ-Tz)2(2-η1-Tz)6O]·0.25C7H8 (3·0.25C7H8), where TzH is 5-phenyl-1H-tetrazole, have been synthesized and structurally characterized. All three complexes are dinuclear; the Ti center in 1 is six-coordinate, whereas the Zr and Ta atoms in 2 and 3 are seven-coordinate. The coordination environments of the Ti centers in 1 are similar, and so are the ligations of the Ta centers in 3. In contrast, the two Zr centers in 2 bear a different number of ligands, one of which is a bidentate η2-5-phenyltetrazolato ligand that has not been observed previously for d-block elements. The dimethylamido ligand, present in the starting materials, remained unchanged, or was converted to dimethylamine and dimethylammonium during the synthesis. Dimethylamine coordinates as a neutral ligand, whereas dimethylammonium is retained as a hydrogen-bonded entity bridging Tz ligands.





3D electron diffraction studies of synthetic rhabdophane (DyPO4·nH2O)

In this study, we report the results of continuous rotation electron diffraction studies of single DyPO4·nH2O (rhabdophane) nanocrystals. The diffraction patterns can be fit to a trigonal lattice (P3121) with lattice parameters a = 7.019 (5) and c = 6.417 (5) Å. However, there is also a set of diffuse background scattering features present that are associated with a disordered superstructure that is double these lattice parameters and fits with an arrangement of water molecules present in the structure pore. Pair distribution function (PDF) maps based on the diffuse background allowed the extent of the water correlation to be estimated, with 2–3 nm correlation along the c axis and ∼5 nm along the a/b axis.





On the importance of crystal structures for organic thin film transistors

Historically, knowledge of the molecular packing within the crystal structures of organic semiconductors has been instrumental in understanding their solid-state electronic properties. Nowadays, crystal structures are thus becoming increasingly important for enabling engineering properties, understanding polymorphism in bulk and in thin films, exploring dynamics and elucidating phase-transition mechanisms. This review article introduces the most salient and recent results of the field.





Introducing the Best practice in crystallography series

 





Crystal clear: the impact of crystal structure in the development of high-performance organic semiconductors

 





The TR-icOS setup at the ESRF: time-resolved microsecond UV–Vis absorption spectroscopy on protein crystals

The technique of time-resolved macromolecular crystallography (TR-MX) has recently been rejuvenated at synchrotrons, resulting in the design of dedicated beamlines. Using pump–probe schemes, this should make the mechanistic study of photoactive proteins and other suitable systems possible with time resolutions down to microseconds. In order to identify relevant time delays, time-resolved spectroscopic experiments directly performed on protein crystals are often desirable. To this end, an instrument has been built at the icOS Lab (in crystallo Optical Spectroscopy Laboratory) at the European Synchrotron Radiation Facility using reflective focusing objectives with a tuneable nanosecond laser as a pump and a microsecond xenon flash lamp as a probe, called the TR-icOS (time-resolved icOS) setup. Using this instrument, pump–probe spectra can rapidly be recorded from single crystals with time delays ranging from a few microseconds to seconds and beyond. This can be repeated at various laser pulse energies to track the potential presence of artefacts arising from two-photon absorption, which amounts to a power titration of a photoreaction. This approach has been applied to monitor the rise and decay of the M state in the photocycle of crystallized bacteriorhodopsin and showed that the photocycle is increasingly altered with laser pulses of peak fluence greater than 100 mJ cm−2, providing experimental laser and delay parameters for a successful TR-MX experiment.





Deep residual networks for crystallography trained on synthetic data

The use of artificial intelligence to process diffraction images is challenged by the need to assemble large and precisely designed training data sets. To address this, a codebase called Resonet was developed for synthesizing diffraction data and training residual neural networks on these data. Here, two per-pattern capabilities of Resonet are demonstrated: (i) interpretation of crystal resolution and (ii) identification of overlapping lattices. Resonet was tested across a compilation of diffraction images from synchrotron experiments and X-ray free-electron laser experiments. Crucially, these models readily execute on graphics processing units and can thus significantly outperform conventional algorithms. While Resonet is currently utilized to provide real-time feedback for macromolecular crystallography users at the Stanford Synchrotron Radiation Lightsource, its simple Python-based interface makes it easy to embed in other processing frameworks. This work highlights the utility of physics-based simulation for training deep neural networks and lays the groundwork for the development of additional models to enhance diffraction collection and analysis.





Investigation of how gate residues in the main channel affect the catalytic activity of Scytalidium thermophilum catalase

Catalase is an antioxidant enzyme that breaks down hydrogen peroxide (H2O2) into molecular oxygen and water. In all monofunctional catalases the pathway that H2O2 takes to the catalytic centre is via the `main channel'. However, the structure of this channel differs in large-subunit and small-subunit catalases. In large-subunit catalases the channel is 15 Å longer and consists of two distinct parts, including a hydrophobic lower region near the heme and a hydrophilic upper region where multiple H2O2 routes are possible. Conserved glutamic acid and threonine residues are located near the intersection of these two regions. Mutations of these two residues in the Scytalidium thermophilum catalase had no significant effect on catalase activity. However, the secondary phenol oxidase activity was markedly altered, with kcat and kcat/Km values that were significantly increased in the five variants E484A, E484I, T188D, T188I and T188F. These variants also showed a lower affinity for inhibitors of oxidase activity than the wild-type enzyme and a higher affinity for phenolic substrates. Oxidation of heme b to heme d did not occur in most of the studied variants. Structural changes in solvent-chain integrity and channel architecture were also observed. In summary, modification of the main-channel gate glutamic acid and threonine residues has a greater influence on the secondary activity of the catalase enzyme, and the oxidation of heme b to heme d is predominantly inhibited by their conversion to aliphatic and aromatic residues.





A service-based approach to cryoEM facility processing pipelines at eBIC

Electron cryo-microscopy image-processing workflows are typically composed of elements that may, broadly speaking, be categorized as high-throughput workloads which transition to high-performance workloads as preprocessed data are aggregated. The high-throughput elements are of particular importance in the context of live processing, where an optimal response is highly coupled to the temporal profile of the data collection. In other words, each movie should be processed as quickly as possible at the earliest opportunity. The high level of disconnected parallelization in the high-throughput problem directly allows a completely scalable solution across a distributed computer system, with the only technical obstacle being an efficient and reliable implementation. The cloud computing frameworks primarily developed for the deployment of high-availability web applications provide an environment with a number of appealing features for such high-throughput processing tasks. Here, an implementation of an early-stage processing pipeline for electron cryotomography experiments using a service-based architecture deployed on a Kubernetes cluster is discussed in order to demonstrate the benefits of this approach and how it may be extended to scenarios of considerably increased complexity.





The crystal structure of mycothiol disulfide reductase (Mtr) provides mechanistic insight into the specific low-molecular-weight thiol reductase activity of Actinobacteria

Low-molecular-weight (LMW) thiols are involved in many processes in all organisms, playing a protective role against reactive species, heavy metals, toxins and antibiotics. Actinobacteria, such as Mycobacterium tuberculosis, use the LMW thiol mycothiol (MSH) to buffer the intracellular redox environment. The NADPH-dependent FAD-containing oxidoreductase mycothiol disulfide reductase (Mtr) is known to reduce oxidized mycothiol disulfide (MSSM) to MSH, which is crucial to maintain the cellular redox balance. In this work, the first crystal structures of Mtr are presented, expanding the structural knowledge and understanding of LMW thiol reductases. The structural analyses and docking calculations provide insight into the nature of Mtrs, with regard to the binding and reduction of the MSSM substrate, in the context of related oxidoreductases. The putative binding site for MSSM suggests a similar binding to that described for the homologous glutathione reductase and its respective substrate glutathione disulfide, but with distinct structural differences shaped to fit the bulkier MSSM substrate, assigning Mtrs as uniquely functioning reductases. As MSH has been acknowledged as an attractive antitubercular target, the structural findings presented in this work may contribute towards future antituberculosis drug development.





Characterization of novel mevalonate kinases from the tardigrade Ramazzottius varieornatus and the psychrophilic archaeon Methanococcoides burtonii

Mevalonate kinase is central to the isoprenoid biosynthesis pathway. Here, high-resolution X-ray crystal structures of two mevalonate kinases are presented: a eukaryotic protein from Ramazzottius varieornatus and an archaeal protein from Methanococcoides burtonii. Both enzymes possess the highly conserved motifs of the GHMP enzyme superfamily, with notable differences between the two enzymes in the N-terminal part of the structures. Biochemical characterization of the two enzymes revealed major differences in their sensitivity to geranyl pyrophosphate and farnesyl pyrophosphate, and in their thermal stabilities. This work adds to the understanding of the structural basis of enzyme inhibition and thermostability in mevalonate kinases.





EMinsight: a tool to capture cryoEM microscope configuration and experimental outcomes for analysis and deposition

The widespread adoption of cryoEM technologies for structural biology has pushed the discipline to new frontiers. A significant worldwide effort has refined the single-particle analysis (SPA) workflow into a reasonably standardized procedure. Significant investments of development time have been made, particularly in sample preparation, microscope data-collection efficiency, pipeline analyses and data archiving. The widespread adoption of specific commercial microscopes, software for controlling them and best practices developed at facilities worldwide has also begun to establish a degree of standardization of the data structures coming from the SPA workflow. There is an opportunity to capitalize on this moment in the maturation of the field, to capture metadata from SPA experiments and correlate the metadata with experimental outcomes, which is presented here in a set of programs called EMinsight. This tool aims to prototype the framework and types of analyses that could lead to new insights into optimal microscope configurations as well as to define methods for metadata capture to assist with the archiving of cryoEM SPA data. It is also envisaged that this tool will be useful to microscope operators and facilities looking to rapidly generate reports on SPA data-collection and screening sessions.





Structural determination and modeling of ciliary microtubules

The axoneme, a microtubule-based array at the center of every cilium, has been the subject of structural investigations for decades, but only recent advances in cryo-EM and cryo-ET have allowed a molecular-level interpretation of the entire complex to be achieved. The unique properties of the nine doublet microtubules and central pair of singlet microtubules that form the axoneme, including the highly decorated tubulin lattice and the docking of massive axonemal complexes, provide opportunities and challenges for sample preparation, 3D reconstruction and atomic modeling. Here, the approaches used for cryo-EM and cryo-ET of axonemes are reviewed, while highlighting the unique opportunities provided by the latest generation of AI-guided tools that are transforming structural biology.





Efficient in situ screening of and data collection from microcrystals in crystallization plates

A considerable bottleneck in serial crystallography at XFEL and synchrotron sources is the efficient production of large quantities of homogeneous, well diffracting microcrystals. Efficient high-throughput screening of batch-grown microcrystals and the determination of ground-state structures from different conditions is thus of considerable value in the early stages of a project. Here, a highly sample-efficient methodology to measure serial crystallography data from microcrystals by raster scanning within standard in situ 96-well crystallization plates is described. Structures were determined from very small quantities of microcrystal suspension and the results were compared with those from other sample-delivery methods. The analysis of a two-dimensional batch crystallization screen using this method is also described as a useful guide for further optimization and the selection of appropriate conditions for scaling up microcrystallization.





Mononuclear binding and catalytic activity of europium(III) and gadolinium(III) at the active site of the model metalloenzyme phosphotriesterase

Lanthanide ions have ideal chemical properties for catalysis, such as hard Lewis acidity, fast ligand-exchange kinetics, high coordination-number preferences and low geometric requirements for coordination. As a result, many small-molecule lanthanide catalysts have been described in the literature. Yet, despite the ability of enzymes to catalyse highly stereoselective reactions under gentle conditions, very few lanthanoenzymes have been investigated. In this work, the mononuclear binding of europium(III) and gadolinium(III) to the active site of a mutant of the model enzyme phosphotriesterase is described using X-ray crystallography at 1.78 and 1.61 Å resolution, respectively. It is also shown that despite coordinating a single non-natural metal cation, the PTE-R18 mutant is still able to maintain esterase activity.





STOPGAP: an open-source package for template matching, subtomogram alignment and classification

Cryo-electron tomography (cryo-ET) enables molecular-resolution 3D imaging of complex biological specimens such as viral particles, cellular sections and, in some cases, whole cells. This enables the structural characterization of molecules in their near-native environments, without the need for purification or separation, thereby preserving biological information such as conformational states and spatial relationships between different molecular species. Subtomogram averaging is an image-processing workflow that allows users to leverage cryo-ET data to identify and localize target molecules, determine high-resolution structures of repeating molecular species and classify different conformational states. Here, STOPGAP, an open-source package for subtomogram averaging that is designed to provide users with fine control over each of these steps, is described. In providing detailed descriptions of the image-processing algorithms that STOPGAP uses, this manuscript is also intended to serve as a technical resource to users as well as for further community-driven software development.





What shapes template-matching performance in cryogenic electron tomography in situ?

The detection of specific biological macromolecules in cryogenic electron tomography data is frequently approached by applying cross-correlation-based 3D template matching. To reduce computational cost and noise, high binning is used to aggregate voxels before template matching. This remains a prevalent practice in both practical applications and methods development. Here, the relation between template size, shape and angular sampling is systematically evaluated to identify ribosomes in a ground-truth annotated data set. It is shown that at the commonly used binning, a detailed subtomogram average, a sphere and a heart emoji result in near-identical performance. These findings indicate that with current template-matching practices macromolecules can only be detected with high precision if their shape and size are sufficiently different from the background. Using theoretical considerations, the experimental results are rationalized and it is discussed why primarily low-frequency information remains at high binning and that template matching fails to be accurate because similarly shaped and sized macromolecules have similar low-frequency spectra. These challenges are discussed and potential enhancements for future template-matching methodologies are proposed.
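The statement that primarily low-frequency information remains at high binning can be illustrated with a one-dimensional toy example. This is a minimal sketch under stated assumptions: binning is modelled as plain averaging of adjacent samples, and the signals and the helper `bin_signal` are hypothetical and are not taken from the study.

```python
def bin_signal(samples, factor):
    """Average consecutive groups of `factor` samples (simple 1D binning)."""
    if factor < 1:
        raise ValueError("binning factor must be at least 1")
    n_bins = len(samples) // factor
    return [
        sum(samples[i * factor:(i + 1) * factor]) / factor
        for i in range(n_bins)
    ]


# Hypothetical signals: fine detail alternates sign at every sample,
# whereas the coarse shape is a broad box spanning many samples.
fine_detail = [1.0, -1.0] * 8
coarse_shape = [0.0] * 4 + [1.0] * 8 + [0.0] * 4

binned_detail = bin_signal(fine_detail, 4)
binned_shape = bin_signal(coarse_shape, 4)

print(binned_detail)  # the alternating detail averages away
print(binned_shape)   # the broad box survives binning
```

In this toy case the high-frequency detail cancels out completely while the coarse outline is preserved, so two objects that differ only in fine detail become indistinguishable after binning.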





Pillar data-acquisition strategies for cryo-electron tomography of beam-sensitive biological samples

For cryo-electron tomography (cryo-ET) of beam-sensitive biological specimens, a planar sample geometry is typically used. As the sample is tilted, the effective thickness of the sample along the direction of the electron beam increases and the signal-to-noise ratio concomitantly decreases, limiting the transfer of information at high tilt angles. In addition, the tilt range where data can be collected is limited by a combination of various sample-environment constraints, including the limited space in the objective lens pole piece and the possible use of fixed conductive braids to cool the specimen. Consequently, most tilt series are limited to a maximum of ±70°, leading to the presence of a missing wedge in Fourier space. The acquisition of cryo-ET data without a missing wedge, for example using a cylindrical sample geometry, is hence attractive for volumetric analysis of low-symmetry structures such as organelles or vesicles, lysis events, pore formation or filaments for which the missing information cannot be compensated by averaging techniques. Irrespective of the geometry, electron-beam damage to the specimen is an issue and the first images acquired will transfer more high-resolution information than those acquired last. There is also an inherent trade-off between higher sampling in Fourier space and avoiding beam damage to the sample. Finally, the necessity of using a sufficient electron fluence to align the tilt images means that this fluence needs to be fractionated across a small number of images; therefore, the order of data acquisition is also a factor to consider. Here, an n-helix tilt scheme is described and simulated which uses overlapping and interleaved tilt series to maximize the use of a pillar geometry, allowing the entire pillar volume to be reconstructed as a single unit. 
Three related tilt schemes are also evaluated that extend the continuous and classic dose-symmetric tilt schemes for cryo-ET to pillar samples to enable the collection of isotropic information across all spatial frequencies. A fourfold dose-symmetric scheme is proposed which provides a practical compromise between uniform information transfer and complexity of data acquisition.
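For orientation, the planar-sample baseline that these pillar schemes extend can be sketched in a few lines. The code below is a simplified illustration only: it generates a strictly alternating dose-symmetric acquisition order (published dose-symmetric schemes may group tilts differently) and estimates the unsampled angular fraction for a single-axis tilt series. It does not implement the n-helix or fourfold schemes, and the function names and the 10° step are assumptions.

```python
def dose_symmetric_order(max_tilt, step):
    """Return tilt angles (degrees) starting at 0 and alternating +/- outward."""
    if step <= 0 or max_tilt < 0:
        raise ValueError("step must be positive and max_tilt non-negative")
    order = [0]
    tilt = step
    while tilt <= max_tilt:
        # Low tilts are acquired first, while the specimen is least damaged.
        order.extend([tilt, -tilt])
        tilt += step
    return order


def missing_wedge_fraction(max_tilt):
    """Fraction of the 180 degree angular range not covered by +/- max_tilt."""
    if not 0 <= max_tilt <= 90:
        raise ValueError("max_tilt must lie between 0 and 90 degrees")
    return (180 - 2 * max_tilt) / 180


order = dose_symmetric_order(70, 10)
wedge = missing_wedge_fraction(70)
print(order)
print(f"missing wedge: {wedge:.1%} of the angular range")
```

With a ±70° limit, roughly 22% of the angular range is never sampled; a pillar geometry that permits full rotation removes this gap, which is the motivation for the schemes described above.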





Deep-learning map segmentation for protein X-ray crystallographic structure determination

When solving the structure of a protein from single-wavelength anomalous diffraction X-ray data, the initial phases obtained by phasing from an anomalously scattering substructure usually need to be improved by iterated electron-density modification. In this manuscript, the use of convolutional neural networks (CNNs) for segmentation of the initial experimental phasing electron-density maps is proposed. The results reported demonstrate that a CNN with U-net architecture, trained by supervised learning on several thousand electron-density maps generated mainly using X-ray data from the Protein Data Bank, can improve current density-modification methods.




ic

Validation of electron-microscopy maps using solution small-angle X-ray scattering

The determination of the atomic resolution structure of biomacromolecules is essential for understanding details of their function. Traditionally, such a structure determination has been performed with crystallographic or nuclear magnetic resonance (NMR) methods, but during the last decade, cryogenic transmission electron microscopy (cryo-TEM) has become an equally important tool. As the blotting and flash-freezing of the samples can induce conformational changes, external validation tools are required to ensure that the vitrified samples are representative of the solution. Although many validation tools have already been developed, most of them rely on fully resolved atomic models, which prevents early screening of the cryo-TEM maps. Here, a novel and automated method for performing such a validation utilizing small-angle X-ray scattering measurements, publicly available through the new software package AUSAXS, is introduced and implemented. The method has been tested on both simulated and experimental data, where it was shown to work remarkably well as a validation tool. The method provides a dummy atomic model derived from the EM map which best represents the solution structure.
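The abstract does not state how scattering is computed from the dummy atomic model. One standard way to obtain a SAXS curve from a set of point scatterers is the Debye equation, I(q) = Σ_i Σ_j f_i f_j sin(q r_ij)/(q r_ij). The sketch below assumes identical dummy atoms with a constant form factor; the function name and parameters are hypothetical and are not taken from AUSAXS.

```python
import math


def debye_intensity(coords, q, form_factor=1.0):
    """Debye-equation intensity for identical point scatterers.

    coords: list of (x, y, z) positions; q: scattering vector magnitude in
    reciprocal units of the coordinates. Assumes a q-independent form factor.
    """
    total = 0.0
    for a in coords:
        for b in coords:
            x = q * math.dist(a, b)
            # sin(x)/x tends to 1 as x tends to 0 (self terms and q = 0).
            total += 1.0 if x < 1e-12 else math.sin(x) / x
    return form_factor ** 2 * total


if __name__ == "__main__":
    dummy_atoms = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0)]
    for q in (0.0, 1.0, math.pi):
        print(q, debye_intensity(dummy_atoms, q))
```

A curve computed this way for each candidate dummy model could then be compared with the measured SAXS profile, for example through a chi-squared fit.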




ic

Managing macromolecular crystallographic data with a laboratory information management system

Protein crystallography is an established method to study the atomic structures of macromolecules and their complexes. A prerequisite for successful structure determination is diffraction-quality crystals, which may require extensive optimization of both the protein and the conditions, and hence projects can stretch over an extended period, with multiple users being involved. The workflow from crystallization and crystal treatment to deposition and publication is well defined, and therefore an electronic laboratory information management system (LIMS) is well suited to management of the data. Completion of the project requires key information on all the steps being available and this information should also be made available according to the FAIR principles. As crystallized samples are typically shipped between facilities, a key feature to be captured in the LIMS is the exchange of metadata between the crystallization facility of the home laboratory and, for example, synchrotron facilities. On completion, structures are deposited in the Protein Data Bank (PDB) and the LIMS can include the PDB code in its database, completing the chain of custody from crystallization to structure deposition and publication. A LIMS designed for macromolecular crystallography, IceBear, is available as a standalone installation and as a hosted service, and the implementation of key features for the capture of metadata in IceBear is discussed as an example.




ic

Protonation of histidine rings using quantum-mechanical methods

Histidine can be protonated on either or both of the two N atoms of the imidazole moiety. Each of the three possible forms occurs as a result of the stereochemical environment of the histidine side chain. In an atomic model, comparing the possible protonation states in situ, looking at possible hydrogen bonding and metal coordination, it is possible to predict which is most likely to be correct. A more direct method is described that uses quantum-mechanical methods to calculate, also in situ, the minimum geometry and energy for comparison, and therefore to more accurately identify the most likely protonation state.




ic

Crystallographic fragment-binding studies of the Mycobacterium tuberculosis trifunctional enzyme suggest binding pockets for the tails of the acyl-CoA substrates at its active sites and a potential substrate-channeling path between them

The Mycobacterium tuberculosis trifunctional enzyme (MtTFE) is an α2β2 tetrameric enzyme in which the α-chain harbors the 2E-enoyl-CoA hydratase (ECH) and 3S-hydroxyacyl-CoA dehydrogenase (HAD) active sites, and the β-chain provides the 3-ketoacyl-CoA thiolase (KAT) active site. Linear, medium-chain and long-chain 2E-enoyl-CoA molecules are the preferred substrates of MtTFE. Previous crystallographic binding and modeling studies identified binding sites for the acyl-CoA substrates at the three active sites, as well as the NAD binding pocket at the HAD active site. These studies also identified three additional CoA binding sites on the surface of MtTFE that are different from the active sites. It has been proposed that one of these additional sites could be of functional relevance for the substrate channeling (by surface crawling) of reaction intermediates between the three active sites. Here, 226 fragments were screened in a crystallographic fragment-binding study of MtTFE crystals, resulting in the structures of 16 MtTFE–fragment complexes. Analysis of the 121 fragment-binding events shows that the ECH active site is the 'binding hotspot' for the tested fragments, with 41 binding events. The mode of binding of the fragments bound at the active sites provides additional insight into how the long-chain acyl moiety of the substrates can be accommodated at their proposed binding pockets. In addition, the 20 fragment-binding events between the active sites identify potential transient binding sites of reaction intermediates relevant to the possible channeling of substrates between these active sites. These results provide a basis for further studies to understand the functional relevance of the latter binding sites and to identify substrates for which channeling is crucial.




ic

Post-translational modifications in the Protein Data Bank

Proteins frequently undergo covalent modification at the post-translational level, which involves the covalent attachment of chemical groups onto amino acids. This can entail the singular or multiple addition of small groups, such as phosphorylation; long-chain modifications, such as glycosylation; small proteins, such as ubiquitination; as well as the interconversion of chemical groups, such as the formation of pyroglutamic acid. These post-translational modifications (PTMs) are essential for the normal functioning of cells, as they can alter the physicochemical properties of amino acids and therefore influence enzymatic activity, protein localization, protein–protein interactions and protein stability. Despite their inherent importance, accurately depicting PTMs in experimental studies of protein structures often poses a challenge. This review highlights the role of PTMs in protein structures, as well as the prevalence of PTMs in the Protein Data Bank, directing the reader to accurately built examples suitable for use as a modelling reference.




ic

Microcrystal electron diffraction structure of Toll-like receptor 2 TIR-domain-nucleated MyD88 TIR-domain higher-order assembly

Eukaryotic TIR (Toll/interleukin-1 receptor protein) domains signal via TIR–TIR interactions, either by self-association or by interaction with other TIR domains. In mammals, TIR domains are found in Toll-like receptors (TLRs) and cytoplasmic adaptor proteins involved in pro-inflammatory signaling. Previous work revealed that the MAL TIR domain (MALTIR) nucleates the assembly of MyD88TIR into crystalline arrays in vitro. A microcrystal electron diffraction (MicroED) structure of the MyD88TIR assembly has previously been solved, revealing a two-stranded higher-order assembly of TIR domains. In this work, it is demonstrated that the TIR domain of TLR2, which is reported to signal as a heterodimer with either TLR1 or TLR6, induces the formation of crystalline higher-order assemblies of MyD88TIR in vitro, whereas TLR1TIR and TLR6TIR do not. Using an improved data-collection protocol, the MicroED structure of TLR2TIR-induced MyD88TIR microcrystals was determined at a higher resolution (2.85 Å) and with higher completeness (89%) compared with the previous structure of the MALTIR-induced MyD88TIR assembly. Both assemblies exhibit conformational differences in several areas that are important for signaling (for example the BB loop and CD loop) compared with their monomeric structures. These data suggest that TLR2TIR and MALTIR interact with MyD88 in an analogous manner during signaling, nucleating MyD88TIR assemblies unidirectionally.




ic

Robust and automatic beamstop shadow outlier rejection: combining crystallographic statistics with modern clustering under a semi-supervised learning strategy

During the automatic processing of crystallographic diffraction experiments, beamstop shadows are often unaccounted for or only partially masked. As a result of this, outlier reflection intensities are integrated, which is a known issue. Traditional statistical diagnostics have only limited effectiveness in identifying these outliers, here termed Not-Excluded-unMasked-Outliers (NEMOs). The diagnostic tool AUSPEX allows visual inspection of NEMOs, where they form a typical pattern: clusters at the low-resolution end of the AUSPEX plots of intensities or amplitudes versus resolution. To automate NEMO detection, a new algorithm was developed by combining data statistics with a density-based clustering method. This approach demonstrates a promising performance in detecting NEMOs in merged data sets without disrupting existing data-reduction pipelines. Re-refinement results indicate that excluding the identified NEMOs can effectively enhance the quality of subsequent structure-determination steps. This method offers a prospective automated means to assess the efficacy of a beamstop mask, as well as highlighting the potential of modern pattern-recognition techniques for automating outlier exclusion during data processing, facilitating future adaptation to evolving experimental strategies.




ic

Utilizing anomalous signals for element identification in macromolecular crystallography

AlphaFold2 has revolutionized structural biology by offering unparalleled accuracy in predicting protein structures. Traditional methods for determining protein structures, such as X-ray crystallography and cryo-electron microscopy, are often time-consuming and resource-intensive. AlphaFold2 provides models that are valuable for molecular replacement, aiding in model building and docking into electron density or potential maps. However, despite its capabilities, models from AlphaFold2 do not consistently match the accuracy of experimentally determined structures, need to be validated experimentally and currently miss some crucial information, such as post-translational modifications, ligands and bound ions. In this paper, the advantages of collecting X-ray anomalous data to identify chemical elements, such as metal ions, which are key to understanding certain structures and functions of proteins, are explored. This identification is achieved through methods such as calculating anomalous difference Fourier maps or refining the imaginary component of the anomalous scattering factor f''. Anomalous data can serve as a valuable complement to the information provided by AlphaFold2 models and this is particularly significant in elucidating the roles of metal ions.




ic

Structural studies of β-glucosidase from the thermophilic bacterium Caldicellulosiruptor saccharolyticus

β-Glucosidase from the thermophilic bacterium Caldicellulosiruptor saccharolyticus (Bgl1) has been denoted as having an attractive catalytic profile for various industrial applications. Bgl1 catalyses the final step in the decomposition of cellulose, an unbranched glucose polymer that has attracted the attention of researchers in recent years as it is the most abundant renewable source of reduced carbon in the biosphere. With the aim of enhancing the thermostability of Bgl1 for a broad spectrum of biotechnological processes, it has been subjected to structural studies. Crystal structures of Bgl1 and its complex with glucose were determined at 1.47 and 1.95 Å resolution, respectively. Bgl1 is a member of glycosyl hydrolase family 1 (GH1 superfamily, EC 3.2.1.21) and the results showed that the 3D structure of Bgl1 follows the overall architecture of the GH1 family, with a classical (β/α)8 TIM-barrel fold. Comparisons of Bgl1 with sequence or structural homologues of β-glucosidase reveal quite similar structures but also unique structural features in Bgl1 with plausible functional roles.




ic

CHiMP: deep-learning tools trained on protein crystallization micrographs to enable automation of experiments

A group of three deep-learning tools, referred to collectively as CHiMP (Crystal Hits in My Plate), were created for analysis of micrographs of protein crystallization experiments at the Diamond Light Source (DLS) synchrotron, UK. The first tool, a classification network, assigns images into categories relating to experimental outcomes. The other two tools are networks that perform both object detection and instance segmentation, resulting in masks of individual crystals in the first case and masks of crystallization droplets in addition to crystals in the second case, allowing the positions and sizes of these entities to be recorded. The creation of these tools used transfer learning, where weights from a pre-trained deep-learning network were used as a starting point and repurposed by further training on a relatively small set of data. Two of the tools are now integrated at the VMXi macromolecular crystallography beamline at DLS, where they have the potential to remove the need for any user input, both for monitoring crystallization experiments and for triggering in situ data collections. The third is being integrated into the XChem fragment-based drug-discovery screening platform, also at DLS, to allow the automatic targeting of acoustic compound dispensing into crystallization droplets.




ic

The success rate of processed predicted models in molecular replacement: implications for experimental phasing in the AlphaFold era

The availability of highly accurate protein structure predictions from AlphaFold2 (AF2) and similar tools has hugely expanded the applicability of molecular replacement (MR) for crystal structure solution. Many structures can be solved routinely using raw models, structures processed to remove unreliable parts or models split into distinct structural units. There is therefore an open question around how many and which cases still require experimental phasing methods such as single-wavelength anomalous diffraction (SAD). Here, this question is addressed using a large set of PDB depositions that were solved by SAD. A large majority (87%) could be solved using unedited or minimally edited AF2 predictions. A further 18 (4%) yield straightforwardly to MR after splitting of the AF2 prediction using Slice'N'Dice, although different splitting methods succeeded on slightly different sets of cases. It is also found that further unique targets can be solved by alternative modelling approaches such as ESMFold (four cases), alternative MR approaches such as ARCIMBOLDO and AMPLE (two cases each), and multimeric model building with AlphaFold-Multimer or UniFold (three cases). Ultimately, only 12 cases, or 3% of the SAD-phased set, did not yield to any form of MR tested here, offering valuable hints as to the number and the characteristics of cases where experimental phasing remains essential for macromolecular structure solution.




ic

EMhub: a web platform for data management and on-the-fly processing in scientific facilities

Most scientific facilities produce large amounts of heterogeneous data at a rapid pace. Managing users, instruments, reports and invoices presents additional challenges. To address these challenges, EMhub, a web platform designed to support the daily operations and record-keeping of a scientific facility, has been introduced. EMhub enables the easy management of user information, instruments, bookings and projects. The application was initially developed to meet the needs of a cryoEM facility, but its functionality and adaptability have proven to be broad enough to be extended to other data-generating centers. The expansion of EMhub is enabled by the modular nature of its core functionalities. The application allows external processes to be connected via a REST API, automating tasks such as folder creation, user and password generation, and the execution of real-time data-processing pipelines. EMhub has been used for several years at the Swedish National CryoEM Facility and has been installed in the CryoEM center at the Structural Biology Department at St. Jude Children's Research Hospital. A fully automated single-particle pipeline has been implemented for on-the-fly data processing and analysis. At St. Jude, the X-Ray Crystallography Center and the Single-Molecule Imaging Center have already expanded the platform to support their operational and data-management workflows.




ic

Structure and stability of an apo thermophilic esterase that hydrolyzes polyhydroxybutyrate

Pollution from plastics is a global problem that threatens the biosphere for a host of reasons, including the long time scales over which most plastics degrade. Biodegradation is an ideal solution for remediating bioplastic waste as it does not require the high temperatures necessary for thermal degradation and does not introduce additional pollutants into the environment. Numerous organisms can scavenge for bioplastics, such as polylactic acid (PLA) or poly-(R)-hydroxybutyrate (PHB), which they can use as an energy source. Recently, a promiscuous PHBase from the thermophilic soil bacterium Lihuaxuella thermophila (LtPHBase) was identified. LtPHBase can accommodate many substrates, including PHB granules and films and PHB block copolymers, as well as the unrelated polymers PLA and polycaprolactone (PCL). LtPHBase uses the expected Ser–His–Asp catalytic triad for hydrolysis, with optimal enzyme activity near 70°C. Here, the 1.75 Å resolution crystal structure of apo LtPHBase is presented and its chemical stability is profiled. Knowledge of its substrate preferences was extended to different-sized PHB granules. It is shown that LtPHBase is highly resistant to unfolding, with barriers typical for thermophilic enzymes, and shows a preference for low-molecular-mass PHB granules. These insights have implications for the long-term potential of LtPHBase as an industrial PHB hydrolase and shed light on the evolutionary role that this enzyme plays in bacterial metabolism.




ic

Analysis of crystallographic phase retrieval using iterative projection algorithms

For protein crystals in which more than two thirds of the volume is occupied by solvent, the featureless nature of the solvent region often generates a constraint that is powerful enough to allow direct phasing of X-ray diffraction data. Practical implementation relies on the use of iterative projection algorithms with good global convergence properties to solve the difficult nonconvex phase-retrieval problem. In this paper, some aspects of phase retrieval using iterative projection algorithms are systematically explored, where the diffraction data and density-value distributions in the protein and solvent regions provide the sole constraints. The analysis is based on the addition of random error to the phases of previously determined protein crystal structures, followed by evaluation of the ability to recover the correct phase set as the distance from the solution increases. The properties of the difference-map (DM), relaxed–reflect–reflect (RRR) and relaxed averaged alternating reflectors (RAAR) algorithms are compared. All of these algorithms prove to be effective for crystallographic phase retrieval, and the useful ranges of the adjustable parameter which controls their behavior are established. When these algorithms converge to the solution, the algorithm trajectory becomes stationary; however, the density function continues to fluctuate significantly around its mean position. It is shown that averaging over the algorithm trajectory in the stationary region, following convergence, improves the density estimate, with this procedure outperforming previous approaches for phase or density refinement.
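For orientation, the three update rules compared above are commonly written as follows in the phase-retrieval literature. The notation is an assumption made here and is not taken from the abstract: P_A and P_B denote projections onto the two constraint sets (for example the measured diffraction amplitudes and the density-value distributions), R_A and R_B are the corresponding reflectors, x_n is the current iterate and β is the adjustable parameter. Which constraint is labelled A or B varies between authors.

```latex
\begin{align}
R_A &= 2P_A - I, \qquad R_B = 2P_B - I,\\
\text{DM:}\quad x_{n+1} &= x_n + \beta\,\bigl[P_A\bigl(f_B(x_n)\bigr) - P_B\bigl(f_A(x_n)\bigr)\bigr],\\
f_A(x) &= P_A x - \frac{P_A x - x}{\beta}, \qquad f_B(x) = P_B x + \frac{P_B x - x}{\beta},\\
\text{RRR:}\quad x_{n+1} &= x_n + \frac{\beta}{2}\,\bigl(R_B R_A x_n - x_n\bigr),\\
\text{RAAR:}\quad x_{n+1} &= \frac{\beta}{2}\,\bigl(R_A R_B x_n + x_n\bigr) + (1-\beta)\,P_B x_n.
\end{align}
```

In each case a fixed point of the iteration corresponds to a density that satisfies both constraints, and the density estimate is read out by applying one of the projections to the iterate.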




ic

The role of alkyl chain length in the melt and solution crystallization of paliperidone aliphatic prodrugs

Fatty acid-derivative prodrugs have been utilized extensively to improve the physicochemical, biopharmaceutical and pharmacokinetic properties of active pharmaceutical ingredients. However, to our knowledge, the crystallization behavior of prodrugs modified with different fatty acids has not been explored. In the present work, a series of paliperidone aliphatic prodrugs with alkyl chain lengths ranging from C4 to C16 was investigated with respect to crystal structure, crystal morphology and crystallization kinetics. The paliperidone derivatives exhibited isostructural crystal packing, despite the different alkyl chain lengths, and crystallized with the dominant (100) face in both melt and solution. The rate of crystallization for paliperidone derivatives in the melt increases with alkyl chain length owing to greater molecular mobility. In contrast, the longer chains prolong the nucleation induction time and reduce the crystal growth kinetics in solution. The results show a correlation between difficulty of nucleation in solution and the interfacial energy. This work provides insight into the crystallization behavior of paliperidone aliphatic prodrugs and reveals that the role of alkyl chain length in the crystallization behavior has a strong dependence on the crystallization method.




ic

Structure determination using high-order spatial correlations in single-particle X-ray scattering

Single-particle imaging using X-ray free-electron lasers (XFELs) is a promising technique for observing nanoscale biological samples under near-physiological conditions. However, as the sample's orientation in each diffraction pattern is unknown, advanced algorithms are required to reconstruct the 3D diffraction intensity volume and subsequently the sample's density model. While most approaches perform 3D reconstruction via determining the orientation of each diffraction pattern, a correlation-based approach utilizes the averaged spatial correlations of diffraction intensities over all patterns, making it well suited for processing experimental data with a poor signal-to-noise ratio of individual patterns. Here, a method is proposed to determine the 3D structure of a sample by analyzing the double, triple and quadruple spatial correlations in diffraction patterns. This ab initio method can reconstruct the basic shape of an irregular unsymmetric 3D sample without requiring any prior knowledge of the sample. The impact of background and noise on correlations is investigated and corrected to ensure the success of reconstruction under simulated experimental conditions. Additionally, the feasibility of using the correlation-based approach to process incomplete partial diffraction patterns is demonstrated. The proposed method is a valuable addition to existing algorithms for 3D reconstruction and will further promote the development and adoption of XFEL single-particle imaging techniques.




ic

Orientational ordering and assembly of silica–nickel Janus particles in a magnetic field

The orientation ordering and assembly behavior of silica–nickel Janus particles in a static external magnetic field were probed by ultra small-angle X-ray scattering (USAXS). Even in a weak applied field, the net magnetic moments of the individual particles aligned in the direction of the field, as indicated by the anisotropy in the recorded USAXS patterns. X-ray photon correlation spectroscopy (XPCS) measurements on these suspensions revealed that the corresponding particle dynamics are primarily Brownian diffusion [Zinn, Sharpnack & Narayanan (2023). Soft Matter, 19, 2311–2318]. At higher fields, the magnetic forces led to chain-like configurations of particles, as indicated by an additional feature in the USAXS pattern. A theoretical framework is provided for the quantitative interpretation of the observed anisotropic scattering diagrams and the corresponding degree of orientation. No anisotropy was detected when the magnetic field was applied along the beam direction, which is also replicated by the model. The method presented here could be useful for the interpretation of oriented scattering patterns from a wide variety of particulate systems. The combination of USAXS and XPCS is a powerful approach for investigating asymmetric colloidal particles in external fields.




ic

Conformation–aggregation interplay in the simplest aliphatic ethers probed under high pressure

The structures of the simplest symmetric primary ethers [(CnH2n+1)2O, n = 1–3] determined under high pressure revealed their conformational preferences and intermolecular interactions. In three new polymorphs of diethyl ether (C2H5)2O, high pressure promotes intermolecular CH⋯O contacts and enforces a conversion from the trans–trans conformer present in the α, β and γ phases to the trans–gauche conformer, which is higher in energy by 6.4 kJ mol−1, in the δ phase. Two new polymorphs of dimethyl ether (CH3)2O display analogous transformations of the CH⋯O bonds. The crystal structure of di-n-propyl ether (C3H7)2O, determined for the first time, is remarkably stable over the whole pressure range investigated from 1.70 up to 5.30 GPa.




ic

Dynamic X-ray speckle-tracking imaging with high-accuracy phase retrieval based on deep learning

Speckle-tracking X-ray imaging is an attractive candidate for dynamic X-ray imaging owing to its flexible setup and simultaneous yields of phase, transmission and scattering images. However, traditional speckle-tracking imaging methods suffer from phase distortion at locations with abrupt changes in density, which is always the case for real samples, limiting the applications of the speckle-tracking X-ray imaging method. In this paper, we report a deep-learning-based method which can achieve dynamic X-ray speckle-tracking imaging with high-accuracy phase retrieval. The calibration results of a phantom show that the profile of the retrieved phase is highly consistent with the theoretical one. Experiments on polyurethane foaming demonstrated that the proposed method accurately revealed the evolution of the complicated microstructure of the bubbles. The proposed method is a promising solution for dynamic X-ray imaging with high-accuracy phase retrieval, and has extensive applications in metrology and quantitative analysis of dynamics in material science, physics, chemistry and biomedicine.




ic

C-SPAM: an open-source time-resolved specimen vitrification device with light-activated molecules

Molecular structures can be determined in vitro and in situ with cryo-electron microscopy (cryo-EM). Specimen preparation is a major obstacle in cryo-EM. Typical sample preparation is orders of magnitude slower than biological processes. Time-resolved cryo-EM (TR-cryo-EM) can capture short-lived states. Here, Cryo-EM sample preparation with light-activated molecules (C-SPAM) is presented, an open-source, photochemistry-coupled device for TR-cryo-EM that enables millisecond resolution and tunable timescales across broad biological applications.




ic

Solving protein structures by combining structure prediction, molecular replacement and direct-methods-aided model completion

Highly accurate protein structure prediction can generate accurate models of proteins and protein–protein complexes in X-ray crystallography. However, the question of how to make more effective use of predicted models for completing structure analysis, and which strategies should be employed for the more challenging cases such as multi-helical structures, multimeric structures and extremely large structures, both in the model preparation and in the completion steps, remains open for discussion. In this paper, a new strategy is proposed based on the framework of direct methods and dual-space iteration, which can greatly simplify the pre-processing steps of predicted models both in normal and in challenging cases. Following this strategy, full-length models or the conservative structural domains could be used directly as the starting model, and the phase error and the model bias between the starting model and the real structure would be modified in the direct-methods-based dual-space iteration. Many challenging cases (from CASP14) have been tested for the general applicability of this constructive strategy, and almost complete models have been generated with reasonable statistics. The hybrid strategy therefore provides a meaningful scheme for X-ray structure determination using a predicted model as the starting point.




ic

Orientational analysis of atomic pair correlations in nanocrystalline indium oxide thin films

The application of grazing-incidence total X-ray scattering (GITXS) for pair distribution function (PDF) analysis using >50 keV X-rays from synchrotron light sources has created new opportunities for structural characterization of supported thin films with high resolution. Compared with grazing-incidence wide-angle X-ray scattering, which is only useful for highly ordered materials, GITXS/PDFs expand such analysis to largely disordered or nanostructured materials by examining the atomic pair correlations dependent on the direction relative to the surface of the supporting substrate. A characterization of nanocrystalline In2O3-derived thin films is presented here with in-plane-isotropic and out-of-plane-anisotropic orientational ordering of the atomic structure, each synthesized using different techniques. The atomic orientations of such films are known to vary based on the synthetic conditions. Here, an azimuthal orientational analysis of these films using GITXS with a single incident angle is shown to resolve the markedly different orientations of the atomic structures with respect to the planar support and the different degrees of long-range order, and hence, the terminal surface chemistries. It is anticipated that orientational analysis of GITXS/PDF data will offer opportunities to extend structural analyses of thin films by providing a means to qualitatively determine the major atomic orientation within nanocrystalline and, eventually, non-crystalline films.