Biochimica et Biophysica Acta (BBA) - Protein Structure
Comparison of the predicted and observed secondary structure of T4 phage lysozyme
Abstract
Predictions of the secondary structure of T4 phage lysozyme, made by a number of investigators on the basis of the amino acid sequence, are compared with the structure of the protein determined experimentally by X-ray crystallography.
Within the amino terminal half of the molecule the locations of helices predicted by a number of methods agree moderately well with the observed structure, however within the carboxyl half of the molecule the overall agreement is poor. For eleven different helix predictions, the coefficients giving the correlation between prediction and observation range from 0.14 to 0.42. The accuracy of the predictions for both β-sheet regions and for turns are generally lower than for the helices, and in a number of instances the agreement between prediction and observation is no better than would be expected for a random selection of residues. The structural predictions for T4 phage lysozyme are much less successful than was the case for adenylate kinase (Schulz et al. (1974) Nature 250, 140–142). No one method of prediction is clearly superior to all others, and although empirical predictions based on larger numbers of known protein structure tend to be more accurate than those based on a limited sample, the improvement in accuracy is not dramatic, suggesting that the accuracy of current empirical predictive methods will not be substantially increased simply by the inclusion of more data from additional protein structure determinations.
References (37)
- A.V. Guzzo
Biophys. J.
(1965) - J.W. Prothero
Biophys. J.
(1966) - D.A. Cook
J. Mol. Biol.
(1967) - P.F. Periti et al.
J. Mol. Biol.
(1967) - M. Schiffer et al.
Biophys. J.
(1968) - P. Dunnill
Biophys. J.
(1968) - R. Leberman
J. Mol. Biol.
(1971) - A.V. Finkelstein et al.
J. Mol. Biol.
(1971) - K. Nagano
J. Mol. Biol.
(1974) - V.I. Lim
J. Mol. Biol.
(1974)
J. Mol. Biol.
J. Mol. Biol.
Arch. Biochem. Biophys.
J. Biol. Chem.
Dokl. Akad. Nauk. S.S.S.R.
Cited by (4359)
Introducing the overall risk scoring as an early warning system
2024, Expert Systems with ApplicationsBusiness performance is a critical field of study, which should be assessed in more than just credit risk in the banking context. However, much previous research takes business performance in the context of credit risk in banking. In this study, risks will be analyzed to measure business performance in an integrated manner within the framework of non-financial sector dynamics. To this end, brainstorming sessions, risk workshops, surveys, and face-to-face interviews were held with representatives of small medium enterprises in 11 different sectors. Through field studies, risks have been identified and assessed in terms of impact and probability, Key Risk Indicators have been determined, and risks have been scored based on financial and non-financial metrics to estimate the Overall Risk Scoring. Moreover, the Overall Risk Scoring model has been tested using Logit Regression and Artificial Neural Networks (ANN), the Naïve Bayes Algorithm, and the C4.5 decision tree model. All methods have statistically verified that the predictive power of the structure of the Overall Risk Scoring is high. Our findings reveal that when business performance is analyzed with non-financial and financial metrics, dynamic and static data, market expectations, and banking needs, the predictive power of the calculated Overall Risk Scoring increases. The created Overall Risk Scoring Model can be used as an early warning system for many objectives by business executives, suppliers, consumers, investors, financial institutions, public bodies, credit rating agencies, and entities like the Credit Guarantee Fund.
Complex matrices such as soil have a range of measurable characteristics, and thus data to describe them can be considered multidimensional. These characteristics can be strongly influenced by factors that introduce confounding effects that hinder analyses. Traditional statistical approaches lack the flexibility and granularity required to adequately evaluate such matrices, particularly those with large dataset of varying data types (i.e. quantitative non-compositional, quantitative compositional). We present a statistical workflow designed to effectively analyse complex, multidimensional systems, even in the presence of confounding variables. The developed methodology involves exploratory analysis to identify the presence of confounding variables, followed by data decomposition (including strategies for both compositional and non-compositional quantitative data) to minimise the influence of these confounding factors such as sampling site/location. These data processing methods then allow for common patterns to be highlighted in the data, including the identification of biomarkers and determination of non-trivial associations between variables. We demonstrate the utility of this statistical workflow by jointly analysing the chemical composition and fungal biodiversity of New Zealand vineyard soils that have been managed with either organic low-input or conventional input approaches. By applying this pipeline, we were able to identify biomarkers that distinguish viticultural soil from both approaches and also unearth links and associations between the chemical and metagenomic profiles. While soil is an example of a system that can require this type of statistical methodology, there are a range of biological and ecological systems that are challenging to analyse due to the complex interplay of global and local effects. Utilising our developed pipeline will greatly enhance the way that these systems can be studied and the quality and impact of insight gained from their analysis.
Identifying data-driven subtypes of major depressive disorder with electronic health records
2024, Journal of Affective DisordersEfforts to reduce the heterogeneity of major depressive disorder (MDD) by identifying subtypes have not yet facilitated treatment personalization or investigation of biology, so novel approaches merit consideration.
We utilized electronic health records drawn from 2 academic medical centers and affiliated health systems in Massachusetts to identify data-driven subtypes of MDD, characterizing sociodemographic features, comorbid diagnoses, and treatment patterns. We applied Latent Dirichlet Allocation (LDA) to summarize diagnostic codes followed by agglomerative clustering to define patient subgroups.
Among 136,371 patients (95,034 women [70 %]; 41,337 men [30 %]; mean [SD] age, 47.0 [14.0] years), the 15 putative MDD subtypes were characterized by comorbidities and distinct patterns in medication use. There was substantial variation in rates of selective serotonin reuptake inhibitor (SSRI) use (from a low of 62 % to a high of 78 %) and selective norepinephrine reuptake inhibitor (SNRI) use (from 4 % to 21 %).
Electronic health records lack reliable symptom-level data, so we cannot examine the extent to which subtypes might differ in clinical presentation or symptom dimensions.
These data-driven subtypes, drawing on representative clinical cohorts, merit further investigation for their utility in identifying more homogeneous patient populations for basic as well as clinical investigation.
Novel lossy compression method of noisy time series data with anomalies: Application to partial discharge monitoring in overhead power lines
2024, Engineering Applications of Artificial IntelligenceIn overhead power transmission lines, particularly in regions like natural parks where establishing a safe zone is difficult, the adoption of cross-linked polyethylene insulated covered conductors (CCs) helps prevent outages due to vegetation contact. However, these CCs are susceptible to partial discharge (PD) activity, which can degrade insulation and lead to system failures. Detecting and analyzing PD are essential for maintaining power system reliability and safety. A key challenge in PD monitoring is transmitting the large volumes of PD signal data over unreliable 2G networks, as existing compression methods either compromise on data integrity or are ineffective. This paper introduces a novel lossy compression technique utilizing an autoencoder with skip connections and correction data to address this issue. Unlike previous algorithms that struggle with noisy time series data and fail to preserve crucial anomaly information, our method reconstructs the signal without anomalies, which are subsequently restored using correction data. Achieving a compression factor of about 25 (reducing data to 4.1% of its original size), this approach maintains essential PD signal features for analysis. The effectiveness of our method is validated by three classification algorithms, showing promise for future fault detection, diagnosis, and memory space reduction. This innovative compression solution marks a significant advancement in PD data processing, offering a balanced trade-off between compression efficiency and data fidelity, and paving the way for enhanced remote monitoring in power transmission systems.
Annotator bias and its effect on deep learning segmentation of uncured composite micrographs
2024, NDT and E InternationalThis study demonstrated substantial agreement between groups of experts and non-experts in labelling image regions required for semantic segmentation of X-ray micrographs of uncured composite prepregs. Twelve participants, six experts and six non-experts, were given a 1-h training session covering the three different image regions that are present in an uncured composite micrograph: voids, dry fibre areas, and filled fibre areas. High consensus was observed in the centre of objects, but disagreement in labelling between the groups was observed at the interphase regions where the grey level intensity becomes ambiguous in these low-contrast images, such as at the edges of the dry fibre areas and voids. Also, labelling small interlaminar voids caused disagreement. The participants highlighted the role of software by reporting a preference to defining the vertices of a polygon over colouring-in regions of image segments. The resulting Deep Learning segmentation with expert and non-expert group labels were in almost perfect agreement, as measured by the Fleiss Kappa coefficient, and was able to segment voids and dry fibre areas better than thresholding.
Multitarget anti-parasitic activities of isoquinoline alkaloids isolated from Hippeastrum aulicum (Amaryllidaceae)
2024, PhytomedicineChagas disease and leishmaniasis affect a significant portion of the Latin American population and still lack efficient treatments. In this context, natural products emerge as promising compounds for developing more effective therapies, aiming to mitigate side effects and drug resistance. Notably, species from the Amaryllidaceae family emerge as potential reservoirs of antiparasitic agents due to the presence of diverse biologically active alkaloids.
To assess the anti-Trypanosoma cruzi and anti-Leishmania infantum activity of five isolated alkaloids from Hippeastrum aulicum Herb. (Amaryllidaceae) against different life stages of the parasites using in silico and in vitro assays. Furthermore, molecular docking was employed to evaluate the interaction of the most active alkaloids.
Five natural isoquinoline alkaloids isolated in suitable quantities for in vitro testing underwent preliminary in silico analysis to predict their potential efficacy against Trypanosoma cruzi (amastigote and trypomastigote forms) and Leishmania infantum (amastigote and promastigote forms). The in vitro antiparasitic activity and mammalian cytotoxicity were investigated with a subsequent comparison of both analysis (in silico and in vitro) findings. Additionally, this study employed the molecular docking technique, utilizing cruzain (T. cruzi) and sterol 14α-demethylase (CYP51, L. infantum) as crucial biological targets for parasite survival, specifically focusing on compounds that exhibited promising activities against both parasites.
Through computational techniques, it was identified that the alkaloids haemanthamine (1) and lycorine (8) were the most active against T. cruzi (amastigote and trypomastigote) and L. infantum (amastigote and promastigote), while also revealing unprecedented activity of alkaloid 7‑methoxy-O-methyllycorenine (6). The in vitro analysis confirmed the in silico tests, in which compound 1 presented the best activities against the promastigote and amastigote forms of L. infantum with half-maximal inhibitory concentration (IC50) 0.6 µM and 1.78 µM, respectively. Compound 8 exhibited significant activity against the amastigote form of T. cruzi (IC50 7.70 µM), and compound 6 demonstrated activity against the trypomastigote forms of T. cruzi and amastigote of L. infantum, with IC50 values of 89.55 and 86.12 µM, respectively. Molecular docking analyses indicated that alkaloids 1 and 8 exhibited superior interaction energies compared to the inhibitors.
The hitherto unreported potential of compound 6 against T. cruzi trypomastigotes and L. infantum amastigotes is now brought to the forefront. Furthermore, the acquired dataset signifies that the isolated alkaloids 1 and 8 from H. aulicum might serve as prototypes for subsequent structural refinements aimed at the exploration of novel leads against both T. cruzi and L. infantum parasites.