li

Synthesis, crystal structure and in-silico evaluation of aryl­sul­fon­amide Schiff bases for potential activity against colon cancer

This report presents a comprehensive investigation into the synthesis and characterization of Schiff base com­pounds derived from benzene­sul­fon­amide. The synthesis process, involved the reaction between N-cyclo­amino-2-sulf­anil­amide and various substituted o-salicyl­aldehydes, resulted in a set of com­pounds that were subjected to rigorous characterization using advanced spectral techniques, including 1H NMR, 13C NMR and FT–IR spectroscopy, and single-crystal X-ray diffraction. Furthermore, an in-depth assessment of the synthesized com­pounds was conducted through Absorption, Distribution, Metabolism, Excretion and Toxicity (ADMET) analysis, in conjunction with docking studies, to elucidate their pharmacokinetic profiles and potential. Impressively, the ADMET analysis showcased encouraging drug-likeness properties of the newly synthesized Schiff bases. These computational findings were substanti­ated by mol­ecular properties derived from density functional theory (DFT) calculations using the B3LYP/6-31G* method within the Jaguar Module of Schrödinger 2023-2 from Maestro (Schrodinger LLC, New York, USA). The ex­plor­ation of frontier mol­ecular orbitals (HOMO and LUMO) enabled the computation of global reactivity descriptors (GRDs), encompassing charge separation (Egap) and global softness (S). Notably, within this analysis, one Schiff base, namely, 4-bromo-2-{N-[2-(pyr­rol­idine-1-sul­fonyl)phenyl]car­box­imid­oyl}phenol, 20, em­erged with the smallest charge separation (ΔEgap = 3.5780 eV), signifying heightened potential for biological properties. Conversely, 4-bromo-2-{N-[2-(piper­idine-1-sul­fonyl)phenyl]car­box­imid­oyl}phenol, 17, exhibited the largest charge separation (ΔEgap = 4.9242 eV), implying a relatively lower propensity for biological activity. Moreover, the synthesized Schiff bases displayed re­marke­able inhibition of tankyrase poly(ADP-ribose) polymerase enzymes, integral in colon cancer, surpassing the efficacy of a standard drug used for the same purpose. Additionally, their bioavailability scores aligned closely with established medications such as trifluridine and 5-fluoro­uracil. The ex­plor­ation of mol­ecular electrostatic potential through colour mapping delved into the electronic behaviour and reactivity tendencies intrinsic to this diverse range of mol­ecules.




li

Borotropic shifting of the hydro­tris­[3-(2-furyl)pyrazol-1-yl]borate ligand in high-coordinate lan­tha­nide com­plexes

The coordination of hydro­tris­[3-(2-furyl)pyrazol-1-yl]borate (Tp2-Fu, C21H16BN6O3) to lan­tha­nide(III) ions is achieved for the first time with the com­plex [Ln(Tp2-Fu)2](BPh4)·xCH2Cl2 (1-Ln has Ln = Ce and x = 2; 1-Dy has Ln = Dy and x = 1). This was accom­plished via both hydrous (Ln = Ce) and anhydrous methods (Ln = Dy). When isolating the dysprosium analogue, the filtrate produced a second crop of crystals which were revealed to be the 1,2-borotropic-shifted product [Dy(κ4-Tp2-Fu)(κ5-Tp2-Fu*)](BPh4) (2) {Tp2-Fu* = hydro­bis­[3-(2-furyl)pyrazol-1-yl][5-(2-furyl)pyrazol-1-yl]borate}. We con­clude that the pres­ence of a strong Lewis acid and a sterically crowded coordination environment are contributing factors for the 1,2-borotropic shifting of scorpionate ligands in conjunction with the size of the conical angle with the scorpionate ligand.




li

Mol­ecular structure and selective theophylline com­plexation by conformational change of diethyl N,N'-(1,3-phenyl­ene)dicarbamate

The receptor ability of diethyl N,N'-(1,3-phenyl­ene)dicarbamate (1) to form host–guest com­plexes with theophylline (TEO) and caffeine (CAF) by mechanochemistry was evaluated. The formation of the 1–TEO com­plex (C12H16N2O4·C7H8N4O2) was preferred and involves the conformational change of one of the ethyl carbamate groups of 1 from the endo conformation to the exo conformation to allow the formation of inter­molecular inter­actions. The formation of an N—H⋯O=C hydrogen bond between 1 and TEO triggers the conformational change of 1. CAF mol­ecules are unable to form an N—H⋯O=C hydrogen bond with 1, making the conformational change and, therefore, the formation of the com­plex impossible. Conformational change and selective binding were monitored by IR spectroscopy, solid-state 13C nuclear magnetic resonance and single-crystal X-ray diffraction. The 1–TEO com­plex was characterized by IR spectroscopy, solid-state 13C nuclear magnetic resonance, powder X-ray diffraction and single-crystal X-ray diffraction.




li

Using cocrystals as a tool to study non-crystallizing mol­ecules: crystal structure, Hirshfeld surface analysis and com­putational study of the 1:1 cocrystal of (E)-N-(3,4-di­fluoro­phen­yl)-1-(pyridin-4-yl)methanimine and acetic

Using a 1:1 cocrystal of (E)-N-(3,4-di­fluoro­phen­yl)-1-(pyridin-4-yl)methanimine with acetic acid, C12H8F2N2·C2H4O2, we investigate the influence of F atoms introduced to the aromatic ring on promoting π–π inter­actions. The cocrystal crystallizes in the triclinic space group P1. Through crystallographic analysis and com­putational studies, we reveal the mol­ecular arrangement within this co­crystal, demonstrating the presence of hydrogen bonding between the acetic acid mol­ecule and the pyridyl group, along with π–π inter­actions between the aromatic rings. Our findings highlight the importance of F atoms in promoting π–π inter­actions without necessitating full halogenation of the aromatic ring.




li

Crystal structure of the cytotoxic macrocyclic trichothecene Isororidin A

The highly cytotoxic macrocyclic trichothecene Isororidin A (C29H40O9) was isolated from the fungus Myrothesium verrucaria endophytic on the wild medicinal plant `Datura' (Datura stramonium L.) and was characterized by one- (1D) and two-dimensional (2D) NMR spectroscopy. The three-dimensional structure of Isororidin A has been confirmed by X-ray crystallography at 0.81 Å resolution from crystals grown in the ortho­rhom­bic space group P212121, with one mol­ecule per asymmetric unit. Isororidin A is the epimer of previously described (by X-ray crystallography) Roridin A at position C-13' of the macrocyclic ring.




li

Crystal structures of two unexpected products of vicinal di­amines left to crystallize in acetone

Herein we report the crystal structures of two ben­zo­di­az­e­pines obtained by reacting N,N'-(4,5-di­amino-1,2-phenyl­ene)bis­(4-methyl­ben­zene­sul­fon­am­ide) (1) or 4,5-(4-methyl­ben­zene­sul­fon­am­ido)­ben­zene-1,2-diaminium dichloride (1·2HCl) with acetone, giving 2,2,4-trimethyl-8,9-bis­(4-methyl­ben­zene­sul­fon­am­ido)-2,3-di­hydro-5H-1,5-ben­zo­di­az­e­pine, C26H30N4O4S2 (2), and 2,2,4-tri­methyl-8,9-bis­(4-methyl­ben­zene­sul­fon­am­ido)-2,3-di­hydro-5H-1,5-ben­zo­di­az­e­pin-1-ium chloride 0.3-hydrate, C26H31N4O4S2+·Cl−·0.3H2O (3). Compounds 2 and 3 were first obtained in attempts to recrystallize 1 and 1·2HCl using acetone as solvent. This solvent reacted with the vicinal di­amines present in the mol­ecular structures, forming a 5H-1,5-ben­zo­di­az­e­pine ring. In the crystal structure of 2, the seven-membered ring of ben­zo­di­az­e­pine adopts a boat-like conformation, while upon protonation, observed in the crystal structure of 3, it adopts an envelope-like conformation. In both crystalline com­pounds, the tosyl­amide N atoms are not in resonance with the arene ring, mainly due to hy­dro­gen bonds and steric hindrance caused by the large vicinal groups in the aromatic ring. At a supra­molecular level, the crystal structure is maintained by a combination of hy­dro­gen bonds and hydro­phobic inter­actions. In 2, amine-to-tosyl N—H⋯O and amide-to-imine N—H⋯N hy­dro­gen bonds can be observed. In contrast, in 3, the chloride counter-ion and water mol­ecule result in most of the hy­dro­gen bonds being of the amide-to-chloride and ammonium-to-chloride N—H⋯Cl types, while the amine inter­acts with the tosyl group, as seen in 2. In conclusion, we report the synthesis of 1, 1·2HCl and 2, as well as their chemical characterization. For 2, two synthetic methods are described, i.e. solvent-mediated crystallization and synthesis via a more efficient and cleaner route as a polycrystalline material. Salt 3 was only obtained as presented, with only a few crystals being formed.




li

Salt forms of amides: protonation of acetanilide

Treating the amide acetanilide (N-phenyl­acetamide, C8H9NO) with aqueous strong acids allowed the structures of five hemi-protonated salt forms of acetanilide to be elucidated. N-(1-Hy­droxy­ethyl­idene)anilinium chloride–N-phenyl­acetamide (1/1), [(C8H9NO)2H][Cl], and the bromide, [(C8H9NO)2H][Br], triiodide, [(C8H9NO)2H][I3], tetra­fluoro­borate, [(C8H9NO)2H][BF4], and di­iodo­bromide hemi(diiodine), [(C8H9NO)2H][I2Br]·0.5I2, analogues all feature centrosymmetric dimeric units linked by O—H⋯O hy­dro­gen bonds that extend into one-dimensional hy­dro­gen-bonded chains through N—H⋯X inter­actions, where X is the halide atom of the anion. Protonation occurs at the amide O atom and results in systematic lengthening of the C=O bond and a corresponding shortening of the C—N bond. The size of these geometric changes is similar to those found for hemi-protonated paracetamol structures, but less than those in fully protonated paracetamol structures. The bond angles of the amide fragments are also found to change on protonation, but these angular changes are also influenced by conformation, namely, whether the amide group is coplanar with the phenyl ring or twisted out of plane.




li

Coordination variety of phenyl­tetra­zolato and di­methyl­amido ligands in dimeric Ti, Zr, and Ta com­plexes

Three structurally diverse 5-phenyl­tetra­zolato (Tz) Ti, Zr, and Ta com­plexes, namely, (C2H8N)[Ti2(C7H5N4)5(C2H6N)4]·1.45C6H6 or (Me2NH2)[Ti2(NMe2)4(2,3-μ-Tz)3(2-η1-Tz)2]·1.45C6H6, (1·1.45C6H6), [Zr2(C7H5N4)6(C2H6N)2(C2H7N)2]·1.12C6H6·0.382CH2Cl2 or [Zr2(Me2NH)2(NMe2)2(2,3-μ-Tz)3(2-η1-Tz)2(1,2-η2-Tz)]·1.12C6H6·0.38CH2Cl2 (2·1.12C6H6·0.38CH2Cl2), and (C2H8N)2[Ta2(C7H5N4)8(C2H6N)2O]·0.25C7H8 or (Me2NH2)2[Ta2(NMe2)2(2,3-μ-Tz)2(2-η1-Tz)6O]·0.25C7H8 (3·0.25C7H8), where TzH is 5-phenyl-1H-tetra­zole, have been synthesized and structurally characterized. All three com­plexes are dinuclear; the Ti center in 1 is six-coordinate, whereas the Zr and Ta atoms in 2 and 3 are seven-coordinate. The coordination environments of the Ti centers in 1 are similar, and so are the ligations of the Ta centers in 3. In contrast, the two Zr centers in 2 bear a different number of ligands, one of which is a bidentate η2-5-phenyl­tetra­zolato ligand that has not been observed previously for d-block elements. The di­methyl­amido ligand, present in the starting materials, remained un­changed, or was converted to di­methyl­amine and di­methyl­ammonium during the synthesis. Di­methyl­amine coordinates as a neutral ligand, whereas di­methyl­ammonium is retained as a hy­dro­gen-bonded entity bridging Tz ligands.




li

Coordination structure and inter­molecular inter­actions in copper(II) acetate com­plexes with 1,10-phenanthroline and 2,2'-bi­py­ri­dine

The crystal structures of two coordination com­pounds, (acetato-κO)(2,2'-bi­py­ri­dine-κ2N,N')(1,10-phenanthroline-κ2N,N')copper(II) acetate hexa­hydrate, [Cu(C2H3O2)(C10H8N2)(C12H8N2)](C2H3O2)·6H2O or [Cu(bipy)(phen)Ac]Ac·6H2O, and (acetato-κO)bis­(2,2'-bi­py­ri­dine-κ2N,N')copper(II) acetate–acetic acid–water (1/1/3), [Cu(C2H3O2)(C10H8N2)2](C2H3O2)·C2H4O2·3H2O or [Cu(bipy)2Ac]Ac·HAc·3H2O, are reported and com­pared with the previously published structure of [Cu(phen)2Ac]Ac·7H2O (phen is 1,10-phenanthroline, bipy for 2,2'-bi­py­ri­dine, ac is acetate and Hac is acetic acid). The geometry around the metal centre is penta­coordinated, but highly distorted in all three cases. The coordination number and the geometric distortion are both discussed in detail, and all com­plexes belong to the space group Poverline{1}. The analysis of the geometric parameters and the Hirshfeld surface properties dnorm and curvedness provide information about the metal–ligand inter­actions in these com­plexes and allow com­parison with similar systems.




li

The High-Pressure Freezing Laboratory for Macromolecular Crystallography (HPMX), an ancillary tool for the macromolecular crystallography beamlines at the ESRF

This article describes the High-Pressure Freezing Laboratory for Macromolecular Crystallography (HPMX) at the ESRF, and highlights new and complementary research opportunities that can be explored using this facility. The laboratory is dedicated to investigating interactions between macromolecules and gases in crystallo, and finds applications in many fields of research, including fundamental biology, biochemistry, and environmental and medical science. At present, the HPMX laboratory offers the use of different high-pressure cells adapted for helium, argon, krypton, xenon, nitrogen, oxygen, carbon dioxide and methane. Important scientific applications of high pressure to macromolecules at the HPMX include noble-gas derivatization of crystals to detect and map the internal architecture of proteins (pockets, tunnels and channels) that allows the storage and diffusion of ligands or substrates/products, the investigation of the catalytic mechanisms of gas-employing enzymes (using oxygen, carbon dioxide or methane as substrates) to possibly decipher intermediates, and studies of the conformational fluctuations or structure modifications that are necessary for proteins to function. Additionally, cryo-cooling protein crystals under high pressure (helium or argon at 2000 bar) enables the addition of cryo-protectant to be avoided and noble gases can be employed to produce derivatives for structure resolution. The high-pressure systems are designed to process crystals along a well defined pathway in the phase diagram (pressure–temperature) of the gas to cryo-cool the samples according to the three-step `soak-and-freeze method'. Firstly, crystals are soaked in a pressurized pure gas atmosphere (at 294 K) to introduce the gas and facilitate its inter­actions within the macromolecules. Samples are then flash-cooled (at 100 K) while still under pressure to cryo-trap macromolecule–gas complexation states or pressure-induced protein modifications. Finally, the samples are recovered after depressurization at cryo-temperatures. The final section of this publication presents a selection of different typical high-pressure experiments carried out at the HPMX, showing that this technique has already answered a wide range of scientific questions. It is shown that the use of different gases and pressure conditions can be used to probe various effects, such as mapping the functional internal architectures of enzymes (tunnels in the haloalkane dehalogenase DhaA) and allosteric sites on membrane-protein surfaces, the interaction of non-inert gases with proteins (oxygen in the hydrogenase ReMBH) and pressure-induced structural changes of proteins (tetramer dissociation in urate oxidase). The technique is versatile and the provision of pressure cells and their application at the HPMX is gradually being extended to address new scientific questions.




li

A web-based dashboard for RELION metadata visualization

Cryo-electron microscopy (cryo-EM) has witnessed radical progress in the past decade, driven by developments in hardware and software. While current software packages include processing pipelines that simplify the image-processing workflow, they do not prioritize the in-depth analysis of crucial metadata, limiting troubleshooting for challenging data sets. The widely used RELION software package lacks a graphical native representation of the underlying metadata. Here, two web-based tools are introduced: relion_live.py, which offers real-time feedback on data collection, aiding swift decision-making during data acquisition, and relion_analyse.py, a graphical interface to represent RELION projects by plotting essential metadata including interactive data filtration and analysis. A useful script for estimating ice thickness and data quality during movie pre-processing is also presented. These tools empower researchers to analyse data efficiently and allow informed decisions during data collection and processing.




li

Investigation of how gate residues in the main channel affect the catalytic activity of Scytalidium thermophilum catalase

Catalase is an antioxidant enzyme that breaks down hydrogen peroxide (H2O2) into molecular oxygen and water. In all monofunctional catalases the pathway that H2O2 takes to the catalytic centre is via the `main channel'. However, the structure of this channel differs in large-subunit and small-subunit catalases. In large-subunit catalases the channel is 15 Å longer and consists of two distinct parts, including a hydrophobic lower region near the heme and a hydrophilic upper region where multiple H2O2 routes are possible. Conserved glutamic acid and threonine residues are located near the intersection of these two regions. Mutations of these two residues in the Scytalidium thermophilum catalase had no significant effect on catalase activity. However, the secondary phenol oxidase activity was markedly altered, with kcat and kcat/Km values that were significantly increased in the five variants E484A, E484I, T188D, T188I and T188F. These variants also showed a lower affinity for inhibitors of oxidase activity than the wild-type enzyme and a higher affinity for phenolic substrates. Oxidation of heme b to heme d did not occur in most of the studied variants. Structural changes in solvent-chain integrity and channel architecture were also observed. In summary, modification of the main-channel gate glutamic acid and threonine residues has a greater influence on the secondary activity of the catalase enzyme, and the oxidation of heme b to heme d is predominantly inhibited by their conversion to aliphatic and aromatic residues.




li

Structural flexibility of Toscana virus nucleoprotein in the presence of a single-chain camelid antibody

Phenuiviridae nucleoprotein is the main structural and functional component of the viral cycle, protecting the viral RNA and mediating the essential replication/transcription processes. The nucleoprotein (N) binds the RNA using its globular core and polymerizes through the N-terminus, which is presented as a highly flexible arm, as demonstrated in this article. The nucleoprotein exists in an `open' or a `closed' conformation. In the case of the closed conformation the flexible N-terminal arm folds over the RNA-binding cleft, preventing RNA adsorption. In the open conformation the arm is extended in such a way that both RNA adsorption and N polymerization are possible. In this article, single-crystal X-ray diffraction and small-angle X-ray scattering were used to study the N protein of Toscana virus complexed with a single-chain camelid antibody (VHH) and it is shown that in the presence of the antibody the nucleoprotein is unable to achieve a functional assembly to form a ribonucleoprotein complex.




li

A service-based approach to cryoEM facility processing pipelines at eBIC

Electron cryo-microscopy image-processing workflows are typically composed of elements that may, broadly speaking, be categorized as high-throughput workloads which transition to high-performance workloads as preprocessed data are aggregated. The high-throughput elements are of particular importance in the context of live processing, where an optimal response is highly coupled to the temporal profile of the data collection. In other words, each movie should be processed as quickly as possible at the earliest opportunity. The high level of disconnected parallelization in the high-throughput problem directly allows a completely scalable solution across a distributed computer system, with the only technical obstacle being an efficient and reliable implementation. The cloud computing frameworks primarily developed for the deployment of high-availability web applications provide an environment with a number of appealing features for such high-throughput processing tasks. Here, an implementation of an early-stage processing pipeline for electron cryotomography experiments using a service-based architecture deployed on a Kubernetes cluster is discussed in order to demonstrate the benefits of this approach and how it may be extended to scenarios of considerably increased complexity.




li

Characterization of novel mevalonate kinases from the tardigrade Ramazzottius varieornatus and the psychrophilic archaeon Methanococcoides burtonii

Mevalonate kinase is central to the isoprenoid biosynthesis pathway. Here, high-resolution X-ray crystal structures of two mevalonate kinases are presented: a eukaryotic protein from Ramazzottius varieornatus and an archaeal protein from Methanococcoides burtonii. Both enzymes possess the highly conserved motifs of the GHMP enzyme superfamily, with notable differences between the two enzymes in the N-terminal part of the structures. Biochemical characterization of the two enzymes revealed major differences in their sensitivity to geranyl pyrophosphate and farnesyl pyrophosphate, and in their thermal stabilities. This work adds to the understanding of the structural basis of enzyme inhibition and thermostability in mevalonate kinases.




li

Structural determination and modeling of ciliary microtubules

The axoneme, a microtubule-based array at the center of every cilium, has been the subject of structural investigations for decades, but only recent advances in cryo-EM and cryo-ET have allowed a molecular-level interpretation of the entire complex to be achieved. The unique properties of the nine doublet microtubules and central pair of singlet microtubules that form the axoneme, including the highly decorated tubulin lattice and the docking of massive axonemal complexes, provide opportunities and challenges for sample preparation, 3D reconstruction and atomic modeling. Here, the approaches used for cryo-EM and cryo-ET of axonemes are reviewed, while highlighting the unique opportunities provided by the latest generation of AI-guided tools that are transforming structural biology.




li

Tomo Live: an on-the-fly reconstruction pipeline to judge data quality for cryo-electron tomography workflows

Data acquisition and processing for cryo-electron tomography can be a significant bottleneck for users. To simplify and streamline the cryo-ET workflow, Tomo Live, an on-the-fly solution that automates the alignment and reconstruction of tilt-series data, enabling real-time data-quality assessment, has been developed. Through the integration of Tomo Live into the data-acquisition workflow for cryo-ET, motion correction is performed directly after each of the acquired tilt angles. Immediately after the tilt-series acquisition has completed, an unattended tilt-series alignment and reconstruction into a 3D volume is performed. The results are displayed in real time in a dedicated remote web platform that runs on the microscope hardware. Through this web platform, users can review the acquired data (aligned stack and 3D volume) and several quality metrics that are obtained during the alignment and reconstruction process. These quality metrics can be used for fast feedback for subsequent acquisitions to save time. Parameters such as Alignment Accuracy, Deleted Tilts and Tilt Axis Correction Angle are visualized as graphs and can be used as filters to export only the best tomograms (raw data, reconstruction and intermediate data) for further processing. Here, the Tomo Live algorithms and workflow are described and representative results on several biological samples are presented. The Tomo Live workflow is accessible to both expert and non-expert users, making it a valuable tool for the continued advancement of structural biology, cell biology and histology.




li

Efficient in situ screening of and data collection from microcrystals in crystallization plates

A considerable bottleneck in serial crystallography at XFEL and synchrotron sources is the efficient production of large quantities of homogenous, well diffracting microcrystals. Efficient high-throughput screening of batch-grown microcrystals and the determination of ground-state structures from different conditions is thus of considerable value in the early stages of a project. Here, a highly sample-efficient methodology to measure serial crystallography data from microcrystals by raster scanning within standard in situ 96-well crystallization plates is described. Structures were determined from very small quantities of microcrystal suspension and the results were compared with those from other sample-delivery methods. The analysis of a two-dimensional batch crystallization screen using this method is also described as a useful guide for further optimization and the selection of appropriate conditions for scaling up microcrystallization.




li

Mononuclear binding and catalytic activity of europium(III) and gadolinium(III) at the active site of the model metalloenzyme phosphotriesterase

Lanthanide ions have ideal chemical properties for catalysis, such as hard Lewis acidity, fast ligand-exchange kinetics, high coordination-number preferences and low geometric requirements for coordination. As a result, many small-molecule lanthanide catalysts have been described in the literature. Yet, despite the ability of enzymes to catalyse highly stereoselective reactions under gentle conditions, very few lanthanoenzymes have been investigated. In this work, the mononuclear binding of europium(III) and gadolinium(III) to the active site of a mutant of the model enzyme phosphotriesterase are described using X-ray crystallography at 1.78 and 1.61 Å resolution, respectively. It is also shown that despite coordinating a single non-natural metal cation, the PTE-R18 mutant is still able to maintain esterase activity.




li

Scaling and merging macromolecular diffuse scattering with mdx2

Diffuse scattering is a promising method to gain additional insight into protein dynamics from macromolecular crystallography experiments. Bragg intensities yield the average electron density, while the diffuse scattering can be processed to obtain a three-dimensional reciprocal-space map that is further analyzed to determine correlated motion. To make diffuse scattering techniques more accessible, software for data processing called mdx2 has been created that is both convenient to use and simple to extend and modify. mdx2 is written in Python, and it interfaces with DIALS to implement self-contained data-reduction workflows. Data are stored in NeXus format for software interchange and convenient visualization. mdx2 can be run on the command line or imported as a package, for instance to encapsulate a complete workflow in a Jupyter notebook for reproducible computing and education. Here, mdx2 version 1.0 is described, a new release incorporating state-of-the-art techniques for data reduction. The implementation of a complete multi-crystal scaling and merging workflow is described, and the methods are tested using a high-redundancy data set from cubic insulin. It is shown that redundancy can be leveraged during scaling to correct systematic errors and obtain accurate and reproducible measurements of weak diffuse signals.




li

HEIDI: an experiment-management platform enabling high-throughput fragment and compound screening

The Swiss Light Source facilitates fragment-based drug-discovery campaigns for academic and industrial users through the Fast Fragment and Compound Screening (FFCS) software suite. This framework is further enriched by the option to utilize the Smart Digital User (SDU) software for automated data collection across the PXI, PXII and PXIII beamlines. In this work, the newly developed HEIDI webpage (https://heidi.psi.ch) is introduced: a platform crafted using state-of-the-art software architecture and web technologies for sample management of rotational data experiments. The HEIDI webpage features a data-review tab for enhanced result visualization and provides programmatic access through a representational state transfer application programming interface (REST API). The migration of the local FFCS MongoDB instance to the cloud is highlighted and detailed. This transition ensures secure, encrypted and consistently accessible data through a robust and reliable REST API tailored for the FFCS software suite. Collectively, these advancements not only significantly elevate the user experience, but also pave the way for future expansions and improvements in the capabilities of the system.




li

STOPGAP: an open-source package for template matching, subtomogram alignment and classification

Cryo-electron tomography (cryo-ET) enables molecular-resolution 3D imaging of complex biological specimens such as viral particles, cellular sections and, in some cases, whole cells. This enables the structural characterization of molecules in their near-native environments, without the need for purification or separation, thereby preserving biological information such as conformational states and spatial relationships between different molecular species. Subtomogram averaging is an image-processing workflow that allows users to leverage cryo-ET data to identify and localize target molecules, determine high-resolution structures of repeating molecular species and classify different conformational states. Here, STOPGAP, an open-source package for subtomogram averaging that is designed to provide users with fine control over each of these steps, is described. In providing detailed descriptions of the image-processing algorithms that STOPGAP uses, this manuscript is also intended to serve as a technical resource to users as well as for further community-driven software development.




li

Validation of electron-microscopy maps using solution small-angle X-ray scattering

The determination of the atomic resolution structure of biomacromolecules is essential for understanding details of their function. Traditionally, such a structure determination has been performed with crystallographic or nuclear resonance methods, but during the last decade, cryogenic transmission electron microscopy (cryo-TEM) has become an equally important tool. As the blotting and flash-freezing of the samples can induce conformational changes, external validation tools are required to ensure that the vitrified samples are representative of the solution. Although many validation tools have already been developed, most of them rely on fully resolved atomic models, which prevents early screening of the cryo-TEM maps. Here, a novel and automated method for performing such a validation utilizing small-angle X-ray scattering measurements, publicly available through the new software package AUSAXS, is introduced and implemented. The method has been tested on both simulated and experimental data, where it was shown to work remarkably well as a validation tool. The method provides a dummy atomic model derived from the EM map which best represents the solution structure.




li

Crystallographic fragment-binding studies of the Mycobacterium tuberculosis trifunctional enzyme suggest binding pockets for the tails of the acyl-CoA substrates at its active sites and a potential substrate-channeling path between them

The Mycobacterium tuberculosis trifunctional enzyme (MtTFE) is an α2β2 tetrameric enzyme in which the α-chain harbors the 2E-enoyl-CoA hydratase (ECH) and 3S-hydroxyacyl-CoA dehydrogenase (HAD) active sites, and the β-chain provides the 3-ketoacyl-CoA thiolase (KAT) active site. Linear, medium-chain and long-chain 2E-enoyl-CoA molecules are the preferred substrates of MtTFE. Previous crystallographic binding and modeling studies identified binding sites for the acyl-CoA substrates at the three active sites, as well as the NAD binding pocket at the HAD active site. These studies also identified three additional CoA binding sites on the surface of MtTFE that are different from the active sites. It has been proposed that one of these additional sites could be of functional relevance for the substrate channeling (by surface crawling) of reaction intermediates between the three active sites. Here, 226 fragments were screened in a crystallographic fragment-binding study of MtTFE crystals, resulting in the structures of 16 MtTFE–fragment complexes. Analysis of the 121 fragment-binding events shows that the ECH active site is the `binding hotspot' for the tested fragments, with 41 binding events. The mode of binding of the fragments bound at the active sites provides additional insight into how the long-chain acyl moiety of the substrates can be accommodated at their proposed binding pockets. In addition, the 20 fragment-binding events between the active sites identify potential transient binding sites of reaction intermediates relevant to the possible channeling of substrates between these active sites. These results provide a basis for further studies to understand the functional relevance of the latter binding sites and to identify substrates for which channeling is crucial.




li

Likelihood-based interactive local docking into cryo-EM maps in ChimeraX

The interpretation of cryo-EM maps often includes the docking of known or predicted structures of the components, which is particularly useful when the map resolution is worse than 4 Å. Although it can be effective to search the entire map to find the best placement of a component, the process can be slow when the maps are large. However, frequently there is a well-founded hypothesis about where particular components are located. In such cases, a local search using a map subvolume will be much faster because the search volume is smaller, and more sensitive because optimizing the search volume for the rotation-search step enhances the signal to noise. A Fourier-space likelihood-based local search approach, based on the previously published em_placement software, has been implemented in the new emplace_local program. Tests confirm that the local search approach enhances the speed and sensitivity of the computations. An interactive graphical interface in the ChimeraX molecular-graphics program provides a convenient way to set up and evaluate docking calculations, particularly in defining the part of the map into which the components should be placed.




li

Structural analysis of a ligand-triggered intermolecular disulfide switch in a major latex protein from opium poppy

Several proteins from plant pathogenesis-related family 10 (PR10) are highly abundant in the latex of opium poppy and have recently been shown to play diverse and important roles in the biosynthesis of benzylisoquinoline alkaloids (BIAs). The recent determination of the first crystal structures of PR10-10 showed how large conformational changes in a surface loop and adjacent β-strand are coupled to the binding of BIA compounds to the central hydrophobic binding pocket. A more detailed analysis of these conformational changes is now reported to further clarify how ligand binding is coupled to the formation and cleavage of an intermolecular disulfide bond that is only sterically allowed when the BIA binding pocket is empty. To decouple ligand binding from disulfide-bond formation, each of the two highly conserved cysteine residues (Cys59 and Cys155) in PR10-10 was replaced with serine using site-directed mutagenesis. Crystal structures of the Cys59Ser mutant were determined in the presence of papaverine and in the absence of exogenous BIA compounds. A crystal structure of the Cys155Ser mutant was also determined in the absence of exogenous BIA compounds. All three of these crystal structures reveal conformations similar to that of wild-type PR10-10 with bound BIA compounds. In the absence of exogenous BIA compounds, the Cys59Ser and Cys155Ser mutants appear to bind an unidentified ligand or mixture of ligands that was presumably introduced during expression of the proteins in Escherichia coli. The analysis of conformational changes triggered by the binding of BIA compounds suggests a molecular mechanism coupling ligand binding to the disruption of an intermolecular disulfide bond. This mechanism may be involved in the regulation of biosynthetic reactions in plants and possibly other organisms.




li

Surface-mutagenesis strategies to enable structural biology crystallization platforms

A key prerequisite for the successful application of protein crystallography in drug discovery is to establish a robust crystallization system for a new drug-target protein fast enough to deliver crystal structures when the first inhibitors have been identified in the hit-finding campaign or, at the latest, in the subsequent hit-to-lead process. The first crucial step towards generating well folded proteins with a high likelihood of crystallizing is the identification of suitable truncation variants of the target protein. In some cases an optimal length variant alone is not sufficient to support crystallization and additional surface mutations need to be introduced to obtain suitable crystals. In this contribution, four case studies are presented in which rationally designed surface modifications were key to establishing crystallization conditions for the target proteins (the protein kinases Aurora-C, IRAK4 and BUB1, and the KRAS–SOS1 complex). The design process which led to well diffracting crystals is described and the crystal packing is analysed to understand retrospectively how the specific surface mutations promoted successful crystallization. The presented design approaches are routinely used in our team to support the establishment of robust crystallization systems which enable structure-guided inhibitor optimization for hit-to-lead and lead-optimization projects in pharmaceutical research.




li

Microcrystal electron diffraction structure of Toll-like receptor 2 TIR-domain-nucleated MyD88 TIR-domain higher-order assembly

Eukaryotic TIR (Toll/interleukin-1 receptor protein) domains signal via TIR–TIR interactions, either by self-association or by interaction with other TIR domains. In mammals, TIR domains are found in Toll-like receptors (TLRs) and cytoplasmic adaptor proteins involved in pro-inflammatory signaling. Previous work revealed that the MAL TIR domain (MALTIR) nucleates the assembly of MyD88TIR into crystalline arrays in vitro. A microcrystal electron diffraction (MicroED) structure of the MyD88TIR assembly has previously been solved, revealing a two-stranded higher-order assembly of TIR domains. In this work, it is demonstrated that the TIR domain of TLR2, which is reported to signal as a heterodimer with either TLR1 or TLR6, induces the formation of crystalline higher-order assemblies of MyD88TIR in vitro, whereas TLR1TIR and TLR6TIR do not. Using an improved data-collection protocol, the MicroED structure of TLR2TIR-induced MyD88TIR microcrystals was determined at a higher resolution (2.85 Å) and with higher completeness (89%) compared with the previous structure of the MALTIR-induced MyD88TIR assembly. Both assemblies exhibit conformational differences in several areas that are important for signaling (for example the BB loop and CD loop) compared with their monomeric structures. These data suggest that TLR2TIR and MALTIR interact with MyD88 in an analogous manner during signaling, nucleating MyD88TIR assemblies uni­directionally.




li

Robust and automatic beamstop shadow outlier rejection: combining crystallographic statistics with modern clustering under a semi-supervised learning strategy

During the automatic processing of crystallographic diffraction experiments, beamstop shadows are often unaccounted for or only partially masked. As a result of this, outlier reflection intensities are integrated, which is a known issue. Traditional statistical diagnostics have only limited effectiveness in identifying these outliers, here termed Not-Excluded-unMasked-Outliers (NEMOs). The diagnostic tool AUSPEX allows visual inspection of NEMOs, where they form a typical pattern: clusters at the low-resolution end of the AUSPEX plots of intensities or amplitudes versus resolution. To automate NEMO detection, a new algorithm was developed by combining data statistics with a density-based clustering method. This approach demonstrates a promising performance in detecting NEMOs in merged data sets without disrupting existing data-reduction pipelines. Re-refinement results indicate that excluding the identified NEMOs can effectively enhance the quality of subsequent structure-determination steps. This method offers a prospective automated means to assess the efficacy of a beamstop mask, as well as highlighting the potential of modern pattern-recognition techniques for automating outlier exclusion during data processing, facilitating future adaptation to evolving experimental strategies.




li

Utilizing anomalous signals for element identification in macromolecular crystallography

AlphaFold2 has revolutionized structural biology by offering unparalleled accuracy in predicting protein structures. Traditional methods for determining protein structures, such as X-ray crystallography and cryo-electron microscopy, are often time-consuming and resource-intensive. AlphaFold2 provides models that are valuable for molecular replacement, aiding in model building and docking into electron density or potential maps. However, despite its capabilities, models from AlphaFold2 do not consistently match the accuracy of experimentally determined structures, need to be validated experimentally and currently miss some crucial information, such as post-translational modifications, ligands and bound ions. In this paper, the advantages are explored of collecting X-ray anomalous data to identify chemical elements, such as metal ions, which are key to understanding certain structures and functions of proteins. This is achieved through methods such as calculating anomalous difference Fourier maps or refining the imaginary component of the anomalous scattering factor f''. Anomalous data can serve as a valuable complement to the information provided by AlphaFold2 models and this is particularly significant in elucidating the roles of metal ions.




li

Structural studies of β-glucosidase from the thermophilic bacterium Caldicellulosiruptor saccharolyticus

β-Glucosidase from the thermophilic bacterium Caldicellulosiruptor saccharo­lyticus (Bgl1) has been denoted as having an attractive catalytic profile for various industrial applications. Bgl1 catalyses the final step of in the decomposition of cellulose, an unbranched glucose polymer that has attracted the attention of researchers in recent years as it is the most abundant renewable source of reduced carbon in the biosphere. With the aim of enhancing the thermostability of Bgl1 for a broad spectrum of biotechnological processes, it has been subjected to structural studies. Crystal structures of Bgl1 and its complex with glucose were determined at 1.47 and 1.95 Å resolution, respectively. Bgl1 is a member of glycosyl hydrolase family 1 (GH1 superfamily, EC 3.2.1.21) and the results showed that the 3D structure of Bgl1 follows the overall architecture of the GH1 family, with a classical (β/α)8 TIM-barrel fold. Comparisons of Bgl1 with sequence or structural homologues of β-glucosidase reveal quite similar structures but also unique structural features in Bgl1 with plausible functional roles.




li

CHiMP: deep-learning tools trained on protein crystallization micrographs to enable automation of experiments

A group of three deep-learning tools, referred to collectively as CHiMP (Crystal Hits in My Plate), were created for analysis of micrographs of protein crystallization experiments at the Diamond Light Source (DLS) synchrotron, UK. The first tool, a classification network, assigns images into categories relating to experimental outcomes. The other two tools are networks that perform both object detection and instance segmentation, resulting in masks of individual crystals in the first case and masks of crystallization droplets in addition to crystals in the second case, allowing the positions and sizes of these entities to be recorded. The creation of these tools used transfer learning, where weights from a pre-trained deep-learning network were used as a starting point and repurposed by further training on a relatively small set of data. Two of the tools are now integrated at the VMXi macromolecular crystallography beamline at DLS, where they have the potential to absolve the need for any user input, both for monitoring crystallization experiments and for triggering in situ data collections. The third is being integrated into the XChem fragment-based drug-discovery screening platform, also at DLS, to allow the automatic targeting of acoustic compound dispensing into crystallization droplets.




li

The success rate of processed predicted models in molecular replacement: implications for experimental phasing in the AlphaFold era

The availability of highly accurate protein structure predictions from AlphaFold2 (AF2) and similar tools has hugely expanded the applicability of molecular replacement (MR) for crystal structure solution. Many structures can be solved routinely using raw models, structures processed to remove unreliable parts or models split into distinct structural units. There is therefore an open question around how many and which cases still require experimental phasing methods such as single-wavelength anomalous diffraction (SAD). Here, this question is addressed using a large set of PDB depositions that were solved by SAD. A large majority (87%) could be solved using unedited or minimally edited AF2 predictions. A further 18 (4%) yield straightforwardly to MR after splitting of the AF2 prediction using Slice'N'Dice, although different splitting methods succeeded on slightly different sets of cases. It is also found that further unique targets can be solved by alternative modelling approaches such as ESMFold (four cases), alternative MR approaches such as ARCIMBOLDO and AMPLE (two cases each), and multimeric model building with AlphaFold-Multimer or UniFold (three cases). Ultimately, only 12 cases, or 3% of the SAD-phased set, did not yield to any form of MR tested here, offering valuable hints as to the number and the characteristics of cases where experimental phasing remains essential for macromolecular structure solution.




li

EMhub: a web platform for data management and on-the-fly processing in scientific facilities

Most scientific facilities produce large amounts of heterogeneous data at a rapid pace. Managing users, instruments, reports and invoices presents additional challenges. To address these challenges, EMhub, a web platform designed to support the daily operations and record-keeping of a scientific facility, has been introduced. EMhub enables the easy management of user information, instruments, bookings and projects. The application was initially developed to meet the needs of a cryoEM facility, but its functionality and adaptability have proven to be broad enough to be extended to other data-generating centers. The expansion of EMHub is enabled by the modular nature of its core functionalities. The application allows external processes to be connected via a REST API, automating tasks such as folder creation, user and password generation, and the execution of real-time data-processing pipelines. EMhub has been used for several years at the Swedish National CryoEM Facility and has been installed in the CryoEM center at the Structural Biology Department at St. Jude Children's Research Hospital. A fully automated single-particle pipeline has been implemented for on-the-fly data processing and analysis. At St. Jude, the X-Ray Crystallography Center and the Single-Molecule Imaging Center have already expanded the platform to support their operational and data-management workflows.




li

Structure and stability of an apo thermophilic esterase that hydrolyzes polyhydroxybutyrate

Pollution from plastics is a global problem that threatens the biosphere for a host of reasons, including the time scale that it takes for most plastics to degrade. Biodegradation is an ideal solution for remediating bioplastic waste as it does not require the high temperatures necessary for thermal degradation and does not introduce additional pollutants into the environment. Numerous organisms can scavenge for bioplastics, such as polylactic acid (PLA) or poly-(R)-hydroxybutyrate (PHB), which they can use as an energy source. Recently, a promiscuous PHBase from the thermophilic soil bacterium Lihuaxuella thermophila (LtPHBase) was identified. LtPHBase can accommodate many substrates, including PHB granules and films and PHB block copolymers, as well as the unrelated polymers polylactic acid (PLA) and polycaprolactone (PCL). LtPHBase uses the expected Ser–His–Asp catalytic triad for hydrolysis at an optimal enzyme activity near 70°C. Here, the 1.75 Å resolution crystal structure of apo LtPHBase is presented and its chemical stability is profiled. Knowledge of its substrate preferences was extended to different-sized PHB granules. It is shown that LtPHBase is highly resistant to unfolding, with barriers typical for thermophilic enzymes, and shows a preference for low-molecular-mass PHB granules. These insights have implications for the long-term potential of LtPHBase as an industrial PHB hydrolase and shed light on the evolutionary role that this enzyme plays in bacterial metabolism.




li

The role of alkyl chain length in the melt and solution crystallization of paliperidone aliphatic prodrugs

Fatty acid-derivative prodrugs have been utilized extensively to improve the physicochemical, biopharmaceutical and pharmacokinetic properties of active pharmaceutical ingredients. However, to our knowledge, the crystallization behavior of prodrugs modified with different fatty acids has not been explored. In the present work, a series of paliperidone aliphatic prodrugs with alkyl chain lengths ranging from C4 to C16 was investigated with respect to crystal structure, crystal morphology and crystallization kinetics. The paliperidone derivatives exhibited isostructural crystal packing, despite the different alkyl chain lengths, and crystallized with the dominant (100) face in both melt and solution. The rate of crystallization for paliperidone derivatives in the melt increases with alkyl chain length owing to greater molecular mobility. In contrast, the longer chains prolong the nucleation induction time and reduce the crystal growth kinetics in solution. The results show a correlation between difficulty of nucleation in solution and the interfacial energy. This work provides insight into the crystallization behavior of paliperidone aliphatic prodrugs and reveals that the role of alkyl chain length in the crystallization behavior has a strong dependence on the crystallization method.




li

STEM SerialED: achieving high-resolution data for ab initio structure determination of beam-sensitive nanocrystalline materials

Serial electron diffraction (SerialED), which applies a snapshot data acquisition strategy for each crystal, was introduced to tackle the problem of radiation damage in the structure determination of beam-sensitive materials by three-dimensional electron diffraction (3DED). The snapshot data acquisition in SerialED can be realized using both transmission and scanning transmission electron microscopes (TEM/STEM). However, the current SerialED workflow based on STEM setups requires special external devices and software, which limits broader adoption. Here, we present a simplified experimental implementation of STEM-based SerialED on Thermo Fisher Scientific STEMs using common proprietary software interfaced through Python scripts to automate data collection. Specifically, we utilize TEM Imaging and Analysis (TIA) scripting and TEM scripting to access the STEM functionalities of the microscope, and DigitalMicrograph scripting to control the camera for snapshot data acquisition. Data analysis adapts the existing workflow using the software CrystFEL, which was developed for serial X-ray crystallography. Our workflow for STEM SerialED can be used on any Gatan or Thermo Fisher Scientific camera. We apply this workflow to collect high-resolution STEM SerialED data from two aluminosilicate zeolites, zeolite Y and ZSM-25. We demonstrate, for the first time, ab initio structure determination through direct methods using STEM SerialED data. Zeolite Y is relatively stable under the electron beam, and STEM SerialED data extend to 0.60 Å. We show that the structural model obtained using STEM SerialED data merged from 358 crystals is nearly identical to that using continuous rotation electron diffraction data from one crystal. This demonstrates that accurate structures can be obtained from STEM SerialED. Zeolite ZSM-25 is very beam-sensitive and has a complex structure. We show that STEM SerialED greatly improves the data resolution of ZSM-25, compared with serial rotation electron diffraction (SerialRED), from 1.50 to 0.90 Å. This allows, for the first time, the use of standard phasing methods, such as direct methods, for the ab initio structure determination of ZSM-25.




li

Orientational ordering and assembly of silica–nickel Janus particles in a magnetic field

The orientation ordering and assembly behavior of silica–nickel Janus particles in a static external magnetic field were probed by ultra small-angle X-ray scattering (USAXS). Even in a weak applied field, the net magnetic moments of the individual particles aligned in the direction of the field, as indicated by the anisotropy in the recorded USAXS patterns. X-ray photon correlation spectroscopy (XPCS) measurements on these suspensions revealed that the corresponding particle dynamics are primarily Brownian diffusion [Zinn, Sharpnack & Narayanan (2023). Soft Matter, 19, 2311–2318]. At higher fields, the magnetic forces led to chain-like configurations of particles, as indicated by an additional feature in the USAXS pattern. A theoretical framework is provided for the quantitative interpretation of the observed anisotropic scattering diagrams and the corresponding degree of orientation. No anisotropy was detected when the magnetic field was applied along the beam direction, which is also replicated by the model. The method presented here could be useful for the interpretation of oriented scattering patterns from a wide variety of particulate systems. The combination of USAXS and XPCS is a powerful approach for investigating asymmetric colloidal particles in external fields.




li

Conformation–aggregation interplay in the simplest aliphatic ethers probed under high pressure

The structures of the simplest symmetric primary ethers [(CnH2n+1)2O, n = 1–3] determined under high pressure revealed their conformational preferences and intermolecular interactions. In three new polymorphs of di­ethyl ether (C2H5)2O, high pressure promotes intermolecular CH⋯O contacts and enforces a conversion from the trans–trans conformer present in the α, β and γ phases to the trans–gauche conformer, which is higher in energy by 6.4 kJ mol−1, in the δ phase. Two new polymorphs of di­methyl ether (CH3)2O display analogous transformations of the CH⋯O bonds. The crystal structure of di-n-propyl ether (C3H7)2O, determined for the first time, is remarkably stable over the whole pressure range investigated from 1.70 up to 5.30 GPa.




li

C-SPAM: an open-source time-resolved specimen vitrification device with light-activated molecules

Molecular structures can be determined in vitro and in situ with cryo-electron microscopy (cryo-EM). Specimen preparation is a major obstacle in cryo-EM. Typical sample preparation is orders of magnitude slower than biological processes. Time-resolved cryo-EM (TR-cryo-EM) can capture short-lived states. Here, Cryo-EM sample preparation with light-activated molecules (C-SPAM) is presented, an open-source, photochemistry-coupled device for TR-cryo-EM that enables millisecond resolution and tunable timescales across broad biological applications.




li

Orientational analysis of atomic pair correlations in nanocrystalline indium oxide thin films

The application of grazing-incidence total X-ray scattering (GITXS) for pair distribution function (PDF) analysis using >50 keV X-rays from synchrotron light sources has created new opportunities for structural characterization of supported thin films with high resolution. Compared with grazing-incidence wide-angle X-ray scattering, which is only useful for highly ordered materials, GITXS/PDFs expand such analysis to largely disordered or nanostructured materials by examining the atomic pair correlations dependent on the direction relative to the surface of the supporting substrate. A characterization of nanocrystalline In2O3-derived thin films is presented here with in-plane-isotropic and out-of-plane-anisotropic orientational ordering of the atomic structure, each synthesized using different techniques. The atomic orientations of such films are known to vary based on the synthetic conditions. Here, an azimuthal orientational analysis of these films using GITXS with a single incident angle is shown to resolve the markedly different orientations of the atomic structures with respect to the planar support and the different degrees of long-range order, and hence, the terminal surface chemistries. It is anticipated that orientational analysis of GITXS/PDF data will offer opportunities to extend structural analyses of thin films by providing a means to qualitatively determine the major atomic orientation within nanocrystalline and, eventually, non-crystalline films.




li

The interoperability of crystallographic data and databases

Interoperability of crystallographic data with other disciplines is essential for the smooth and rapid progress of structure-based science in the computer age. Within crystallography and closely related subject areas, there is already a high level of conformance to the generally accepted FAIR principles (that data be findable, accessible, interoperable and reusable) through the adoption of common information exchange protocols by databases, publishers, instrument vendors, experimental facilities and software authors. Driven by the success within these domains, the IUCr has worked closely with CODATA (the Committee on Data of the International Science Council) to help develop the latter's commitment to cross-domain integration of discipline-specific data. The IUCr has, in particular, emphasized the need for standards relating to data quality and completeness as an adjunct to the FAIR data landscape. This can ensure definitive reusable data, which in turn can aid interoperability across domains. A microsymposium at the IUCr 2023 Congress provided an up-to-date survey of data interoperability within and outside of crystallography, expounded using a broad range of examples.




li

What is isostructurality? Questions on the definition

Investigation of isostructurality leads to a deeper understanding of close-packing principles and contributes to the ability of crystal engineering. A given packing motif may tolerate small molecular changes within a limit. Slight alterations of a crystal packing arrangement are carried out in order to fine-tune the structural and macroscopic properties, keeping the balance of the spatial requirements and electrostatic effects of the altered molecules in the crystals, preserving their isostructurality. Even so, the definition of isostructurality is not explicit about several issues. Are the corresponding structures required to have the same stoichiometry, Z', symmetry elements and the same space group? Because it is not obvious in the definition, studies on structure analysis and software calculating various numerical descriptors developed for the quantitative comparison of the degree of similarity of isostructural crystals self-define their criteria. The extent of the difference between corresponding crystal structures referred to as isostructural is not limited. Should it be determined numerically? There is nothing in the definition about a demand for similar supramolecular arrangements in isostructural crystals. Should the similarity of supramolecular interactions be a criterion of isostructurality? The definition of isostructurality deserves reconsideration regarding symmetry, measure of similarity and formation of supramolecular interactions.




li

Community recommendations on cryoEM data archiving and validation

In January 2020, a workshop was held at EMBL-EBI (Hinxton, UK) to discuss data requirements for the deposition and validation of cryoEM structures, with a focus on single-particle analysis. The meeting was attended by 47 experts in data processing, model building and refinement, validation, and archiving of such structures. This report describes the workshop's motivation and history, the topics discussed, and the resulting consensus recommendations. Some challenges for future methods-development efforts in this area are also highlighted, as is the implementation to date of some of the recommendations.




li

Time-series analysis of rhenium(I) organometallic covalent binding to a model protein for drug development

Metal-based complexes with their unique chemical properties, including multiple oxidation states, radio-nuclear capabilities and various coordination geometries yield value as potential pharmaceuticals. Understanding the interactions between metals and biological systems will prove key for site-specific coordination of new metal-based lead compounds. This study merges the concepts of target coordination with fragment-based drug methodologies, supported by varying the anomalous scattering of rhenium along with infrared spectroscopy, and has identified rhenium metal sites bound covalently with two amino acid types within the model protein. A time-based series of lysozyme-rhenium-imidazole (HEWL-Re-Imi) crystals was analysed systematically over a span of 38 weeks. The main rhenium covalent coordination is observed at His15, Asp101 and Asp119. Weak (i.e. noncovalent) interactions are observed at other aspartic, asparagine, proline, tyrosine and tryptophan side chains. Detailed bond distance comparisons, including precision estimates, are reported, utilizing the diffraction precision index supplemented with small-molecule data from the Cambridge Structural Database. Key findings include changes in the protein structure induced at the rhenium metal binding site, not observed in similar metal-free structures. The binding sites are typically found along the solvent-channel-accessible protein surface. The three primary covalent metal binding sites are consistent throughout the time series, whereas binding to neighbouring amino acid residues changes through the time series. Co-crystallization was used, consistently yielding crystals four days after setup. After crystal formation, soaking of the compound into the crystal over 38 weeks is continued and explains these structural adjustments. It is the covalent bond stability at the three sites, their proximity to the solvent channel and the movement of residues to accommodate the metal that are important, and may prove useful for future radiopharmaceutical development including target modification.




li

Linking solid-state phenomena via energy differences in `archetype crystal structures'

Categorization underlies understanding. Conceptualizing solid-state structures of organic molecules with `archetype crystal structures' bridges established categories of disorder, polymorphism and solid solutions and is herein extended to special position and high-Z' structures. The concept was developed in the context of disorder modelling [Dittrich, B. (2021). IUCrJ, 8, 305–318] and relies on adding quantum chemical energy differences between disorder components to other criteria as an explanation as to why disorder – and disappearing disorder – occurs in an average structure. Part of the concept is that disorder, as probed by diffraction, affects entire molecules, rather than just the parts of a molecule with differing conformations, and the finding that an R·T energy difference between disorder archetypes is usually not exceeded. An illustrative example combining disorder and special positions is the crystal structure of oestradiol hemihydrate analysed here, where its space-group/subgroup relationship is required to explain its disorder of hydrogen-bonded hydrogen atoms. In addition, we show how high-Z' structures can also be analysed energetically and understood via archetypes: high-Z' structures occur when an energy gain from combining different rather than overall alike conformations in a crystal significantly exceeds R·T, and this finding is discussed in the context of earlier explanations in the literature. Twinning is not related to archetype structures since it involves macroscopic domains of the same crystal structure. Archetype crystal structures are distinguished from crystal structure prediction trial structures in that an experimental reference structure is required for them. Categorization into archetype structures also has practical relevance, leading to a new practice of disorder modelling in experimental least-squares refinement alluded to in the above-mentioned publication.




li

From X-ray crystallographic structure to intrinsic thermodynamics of protein–ligand binding using carbonic anhydrase isozymes as a model system

Carbonic anhydrase (CA) was among the first proteins whose X-ray crystal structure was solved to atomic resolution. CA proteins have essentially the same fold and similar active centers that differ in only several amino acids. Primary sulfonamides are well defined, strong and specific binders of CA. However, minor variations in chemical structure can significantly alter their binding properties. Over 1000 sulfonamides have been designed, synthesized and evaluated to understand the correlations between the structure and thermodynamics of their binding to the human CA isozyme family. Compound binding was determined by several binding assays: fluorescence-based thermal shift assay, stopped-flow enzyme activity inhibition assay, isothermal titration calorimetry and competition assay for enzyme expressed on cancer cell surfaces. All assays have advantages and limitations but are necessary for deeper characterization of these protein–ligand interactions. Here, the concept and importance of intrinsic binding thermodynamics is emphasized and the role of structure–thermodynamics correlations for the novel inhibitors of CA IX is discussed – an isozyme that is overexpressed in solid hypoxic tumors, and thus these inhibitors may serve as anticancer drugs. The abundant structural and thermodynamic data are assembled into the Protein–Ligand Binding Database to understand general protein–ligand recognition principles that could be used in drug discovery.




li

Statistical optimization of guest uptake in crystalline sponges: grading structural outcomes

Investigation of the analyte soaking conditions on the crystalline sponge {[(ZnI2)3(tpt)2·x(solvent)]n} method using a statistical design of experiments model has provided fundamental insights into the influence of experimental variables. This approach focuses on a single analyte tested via 60 experiments (20 unique conditions) to identify the main effects for success and overall guest structure quality. This is employed as a basis for the development of a novel molecular structure grading system that enables the quantification of guest exchange quality.




li

A modified phase-retrieval algorithm to facilitate automatic de novo macromolecular structure determination in single-wavelength anomalous diffraction

The success of experimental phasing in macromolecular crystallography relies primarily on the accurate locations of heavy atoms bound to the target crystal. To improve the process of substructure determination, a modified phase-retrieval algorithm built on the framework of the relaxed alternating averaged reflection (RAAR) algorithm has been developed. Importantly, the proposed algorithm features a combination of the π-half phase perturbation for weak reflections and enforces the direct-method-based tangent formula for strong reflections in reciprocal space. The proposed algorithm is extensively demonstrated on a total of 100 single-wavelength anomalous diffraction (SAD) experimental datasets, comprising both protein and nucleic acid structures of different qualities. Compared with the standard RAAR algorithm, the modified phase-retrieval algorithm exhibits significantly improved effectiveness and accuracy in SAD substructure determination, highlighting the importance of additional constraints for algorithmic performance. Furthermore, the proposed algorithm can be performed without human intervention under most conditions owing to the self-adaptive property of the input parameters, thus making it convenient to be integrated into the structural determination pipeline. In conjunction with the IPCAS software suite, we demonstrated experimentally that automatic de novo structure determination is possible on the basis of our proposed algorithm.




li

Benchmarking predictive methods for small-angle X-ray scattering from atomic coordinates of proteins using maximum likelihood consensus data

Stimulated by informal conversations at the XVII International Small Angle Scattering (SAS) conference (Traverse City, 2017), an international team of experts undertook a round-robin exercise to produce a large dataset from proteins under standard solution conditions. These data were used to generate consensus SAS profiles for xylose isomerase, urate oxidase, xylanase, lysozyme and ribonuclease A. Here, we apply a new protocol using maximum likelihood with a larger number of the contributed datasets to generate improved consensus profiles. We investigate the fits of these profiles to predicted profiles from atomic coordinates that incorporate different models to account for the contribution to the scattering of water molecules of hydration surrounding proteins in solution. Programs using an implicit, shell-type hydration layer generally optimize fits to experimental data with the aid of two parameters that adjust the volume of the bulk solvent excluded by the protein and the contrast of the hydration layer. For these models, we found the error-weighted residual differences between the model and the experiment generally reflected the subsidiary maxima and minima in the consensus profiles that are determined by the size of the protein plus the hydration layer. By comparison, all-atom solute and solvent molecular dynamics (MD) simulations are without the benefit of adjustable parameters and, nonetheless, they yielded at least equally good fits with residual differences that are less reflective of the structure in the consensus profile. Further, where MD simulations accounted for the precise solvent composition of the experiment, specifically the inclusion of ions, the modelled radius of gyration values were significantly closer to the experiment. The power of adjustable parameters to mask real differences between a model and the structure present in solution is demonstrated by the results for the conformationally dynamic ribonuclease A and calculations with pseudo-experimental data. This study shows that, while methods invoking an implicit hydration layer have the unequivocal advantage of speed, care is needed to understand the influence of the adjustable parameters. All-atom solute and solvent MD simulations are slower but are less susceptible to false positives, and can account for thermal fluctuations in atomic positions, and more accurately represent the water molecules of hydration that contribute to the scattering profile.