|
Cedric Chauve Person1 #685333 Professor in the Department of Mathematics at Simon Fraser University. | - My research focuses on the development of mathematical and computational methods for the analysis of biological sequence data. I am particularly interested in genotyping algorithms both for infectious diseases and cancer, and in combinatorial algorithms for genome evolution.
|
+Citations (4) - CitationsAdd new citationList by: CiterankMapLink[2] plASgraph2: using graph neural networks to detect plasmid contigs from an assembly graph
Author: Janik Sielemann, Katharina Sielemann, Broňa Brejová, Tomáš Vinař, Cedric Chauve Publication date: 6 October 2023 Publication info: Frontiers in Microbiology, 6 October 2023 Cited by: David Price 4:16 PM 10 December 2023 GMT Citerank: (3) 701020CANMOD – PublicationsPublications by CANMOD Members144B5ACA0, 708734Genomics859FDEF6, 715930plASgraph2Classifying Plasmid Contigs From Bacterial Assembly Graphs Using Graph Neural Networks.122C78CB7 URL: DOI: 2023/10/06
| Excerpt / Summary [Frontiers in Microbiology, 6 October 2023]
Identification of plasmids from sequencing data is an important and challenging problem related to antimicrobial resistance spread and other One-Health issues. We provide a new architecture for identifying plasmid contigs in fragmented genome assemblies built from short-read data. We employ graph neural networks (GNNs) and the assembly graph to propagate the information from nearby nodes, which leads to more accurate classification, especially for short contigs that are difficult to classify based on sequence features or database searches alone. We trained plASgraph2 on a data set of samples from the ESKAPEE group of pathogens. plASgraph2 either outperforms or performs on par with a wide range of state-of-the-art methods on testing sets of independent ESKAPEE samples and samples from related pathogens. On one hand, our study provides a new accurate and easy to use tool for contig classification in bacterial isolates; on the other hand, it serves as a proof-of-concept for the use of GNNs in genomics.
Our software is available at:
https://github.com/cch...
…and the training and testing data sets are available at:
https://github.com/fmf... |
Link[4] TKSM: highly modular, user-customizable, and scalable transcriptomic sequencing long-read simulator
Author: Fatih Karaoğlanoğlu, Baraa Orabi, Ryan Flannigan, Cedric Chauve, Faraz Hach Publication date: 25 January 2024 Publication info: Bioinformatics, Volume 40, Issue 2, February 2024, btae051, Cited by: David Price 4:15 PM 1 March 2024 GMT Citerank: (4) 701020CANMOD – PublicationsPublications by CANMOD Members144B5ACA0, 704022Surveillance859FDEF6, 708734Genomics859FDEF6, 715831Diagnostic testing859FDEF6 URL: DOI: https://doi.org/10.1093/bioinformatics/btae051
| Excerpt / Summary [Bioinformatics, 25 January 2024]
Motivation: Transcriptomic long-read (LR) sequencing is an increasingly cost-effective technology for probing various RNA features. Numerous tools have been developed to tackle various transcriptomic sequencing tasks (e.g. isoform and gene fusion detection). However, the lack of abundant gold-standard datasets hinders the benchmarking of such tools. Therefore, the simulation of LR sequencing is an important and practical alternative. While the existing LR simulators aim to imitate the sequencing machine noise and to target specific library protocols, they lack some important library preparation steps (e.g. PCR) and are difficult to modify to new and changing library preparation techniques (e.g. single-cell LRs).
Results: We present TKSM, a modular and scalable LR simulator, designed so that each RNA modification step is targeted explicitly by a specific module. This allows the user to assemble a simulation pipeline as a combination of TKSM modules to emulate a specific sequencing design. Additionally, the input/output of all the core modules of TKSM follows the same simple format (Molecule Description Format) allowing the user to easily extend TKSM with new modules targeting new library preparation steps.
Availability and implementation: TKSM is available as an open source software at:
https://github.com/vpc... |
|
|