Shared posts

15 Jul 04:43

Rapid and sensitive detection of genome contamination at scale with FCS-GX [NEW RESULTS]

by Astashyn, A.
Yi

NCBI出品

Assembled genome sequences are being generated at an exponential rate. Here we present FCS-GX, part of NCBIs Foreign Contamination Screen (FCS) tool suite, optimized to identify and remove contaminant sequences in new genomes. FCS-GX screens most genomes in 0.1-10 minutes. Testing FCS-GX on artificially fragmented genomes demonstrates sensitivity >95% for diverse contaminant species and specificity >99.93%. We used FCS-GX to screen 1.6 million GenBank assemblies and identified 36.8 Gbp of contamination (0.16% of total bases), with half from 161 assemblies. We updated assemblies in NCBI RefSeq to reduce detected contamination to 0.01% of bases. FCS-GX is available at https://github.com/ncbi/fcs/.
23 Oct 08:48

Contamination in Reference Sequence Databases: Time for Divide-and-Rule Tactics

by Valérian Lupo

Contaminating sequences in public genome databases is a pervasive issue with potentially far-reaching consequences. This problem has attracted much attention in the recent literature and many different tools are now available to detect contaminants. Although these methods are based on diverse algorithms that can sometimes produce widely different estimates of the contamination level, the majority of genomic studies rely on a single method of detection, which represents a risk of systematic error. In this work, we used two orthogonal methods to assess the level of contamination among National Center for Biotechnological Information Reference Sequence Database (RefSeq) bacterial genomes. First, we applied the most popular solution, CheckM, which is based on gene markers. We then complemented this approach by a genome-wide method, termed Physeter, which now implements a k-folds algorithm to avoid inaccurate detection due to potential contamination of the reference database. We demonstrate that CheckM cannot currently be applied to all available genomes and bacterial groups. While it performed well on the majority of RefSeq genomes, it produced dubious results for 12,326 organisms. Among those, Physeter identified 239 contaminated genomes that had been missed by CheckM. In conclusion, we emphasize the importance of using multiple methods of detection while providing an upgrade of our own detection tool, Physeter, which minimizes incorrect contamination estimates in the context of unavoidably contaminated reference databases.

05 Aug 01:17

The structural basis of odorant recognition in insect olfactory receptors

by Josefina del Mármol

Nature, Published online: 04 August 2021; doi:10.1038/s41586-021-03794-8

Structural and functional analysis of an insect olfactory receptor shed light on how receptors can be activated by diverse odorants.
28 Jul 10:28

[Articles] Safety, tolerability, and immunogenicity of an aerosolised adenovirus type-5 vector-based COVID-19 vaccine (Ad5-nCoV) in adults: preliminary report of an open-label and randomised phase 1 clinical trial

by Shipo Wu, Jianying Huang, Zhe Zhang, Jianyuan Wu, Jinlong Zhang, Hanning Hu, Tao Zhu, Jun Zhang, Lin Luo, Pengfei Fan, Busen Wang, Chang Chen, Yi Chen, Xiaohong Song, Yudong Wang, Weixue Si, Tianjian Sun, Xinghuan Wang, Lihua Hou, Wei Chen
Yi

雾化疫苗

Aerosolised Ad5-nCoV is well tolerated, and two doses of aerosolised Ad5-nCoV elicited neutralising antibody responses, similar to one dose of intramuscular injection. An aerosolised booster vaccination at 28 days after first intramuscular injection induced strong IgG and neutralising antibody responses. The efficacy and cost-effectiveness of aerosol vaccination should be evaluated in future studies.
10 Nov 01:42

Trends in Antibiotic Consumption and Resistance in France Over 20 Years: Large and Continuous Efforts but Contrasting Results

by Carlet J, Jarlier V, Acar J, et al.
Abstract
Background
Antimicrobial resistance (AMR) is a serious threat to humanity. This paper describes the French efforts made since 2001 and presents data on antimicrobial consumption (AC) and AMR.
Methods
We gathered all data on AC and AMR recorded since 2001 from different national agencies, transferred on a regular basis to standardized European data on AC and resistance in both humans and animals.
Results
After a large information campaign implemented in France from 2001 to 2005 in humans, AC in the community decreased significantly (18% to 34% according to the calculation method used). It remained at the same level from 2005 to 2010 and increased again from 2010 to 2018 (8%). Contrasting results were observed for AMR. The resistance of Staphylococcus aureus decreased significantly. For gram-negative bacilli, the results were variable according to the microorganism. The resistance of Enterobacteriaceae to third-generation cephalosporins increased, remaining moderate for Escherichia coli (12% in 2017) but reaching 35% in the same year for Klebsiella pneumoniae. Resistance to carbapenems in those 2 microorganisms remained below 1%. Both global AC and resistance to most antibiotics decreased significantly in animals.
Conclusions
Antibiotic consumption decreased significantly in France after a large public campaign from 2001 to 2005, but this positive effect was temporary. The effect on AMR varied according to the specific microorganism: The effect was very impressive for gram-positive cocci, variable for gram-negative bacilli, and moderate for E. coli, but that for K. pneumoniae was of concern. The consumption of and resistance to antibiotics decreased significantly in animals.
26 Jul 00:00

NanoReviser: An Error-correction Tool for Nanopore Sequencing Based on a Deep Learning Algorithm [NEW RESULTS]

by Wang, L.
Nanopore sequencing is regarded as one of the most promising third-generation sequencing (TGS) technologies. Since 2014, Oxford Nanopore Technologies (ONT) has developed a series of devices based on nanopore sequencing to produce very long reads, with an expected impact on genomics. However, the nanopore sequencing reads are susceptible to a fairly high error rate owing to the difficulty in identifying the DNA bases from the complex electrical signals. Although several basecalling tools have been developed for nanopore sequencing over the past years, it is still challenging to correct the sequences after applying the basecalling procedure. In this study, we developed an open-source DNA basecalling reviser, NanoReviser, based on a deep learning algorithm to correct the basecalling errors introduced by current basecallers provided by default. In our module, we re-segmented the raw electrical signals based on the basecalled sequences provided by the default basecallers. By employing convolution neural networks (CNNs) and bidirectional long short-term memory (Bi-LSTM) networks, we took advantage of the information from the raw electrical signals and the basecalled sequences from the basecallers. Our results showed NanoReviser, as a post-basecalling reviser, significantly improving the basecalling quality. After being trained on standard ONT sequencing reads from public E. coli and human NA12878 datasets, NanoReviser reduced the sequencing error rate by over 5% for both the E. coli dataset and the human dataset. The performance of NanoReviser was found to be better than those of all current basecalling tools. Furthermore, we analyzed the modified bases of the E. coli dataset and added the methylation information to train our module. With the methylation annotation, NanoReviser reduced the error rate by 7% for the E. coli dataset and specifically reduced the error rate by over 10% for the regions of the sequence rich in methylated bases. To the best of our knowledge, NanoReviser is the first post-processing tool after basecalling to accurately correct the nanopore sequences without the time-consuming procedure of building the consensus sequence. The NanoReviser package is freely available at https://github.com/pkubioinformatics/NanoReviser.