Baudisgroup Publications¶

A list of publication can also be retrieved through EuropePMC.

Multicancer analyses of short tandem repeat variations reveal shared gene regulatory mechanisms

Repeat lengths association with the expression of nearby genes (eSTRs) in CRC, STAD and UCEC tumors

Feifei Xia, Max Verbiest, Oxana Lundström, Tugce Bilgin Sonay, Michael Baudis and Maria Anisimova¶

doi: https://doi.org/10.1101/2025.01.06.629343 ¶

biorXiv logo Background Short tandem repeats (STRs) have been reported to influence gene expression across various human tissues. While STR variations are enriched in colorectal (CRC), stomach (STAD) and endometrial (UCEC) cancers, particularly in microsatellite instable (MSI) tumors, their functional effects and regulatory mechanisms on gene expression remain poorly understood across these cancer types.

Results Here, we leverage whole-exome sequencing and gene expression data to identify STRs for which repeat lengths are associated with the expression of nearby genes (eSTRs) Continue reading

A systematic benchmark of copy number variation detection tools for high density SNP genotyping arrays

M.N. van Baardwijk, L.S.E.M. Heijnen, H. Zhao, M. Baudis and A.P. Stubbs¶

Genomics (Elsevier). 2024 Nov 14.¶

doi: 10.1016/j.ygeno.2024.110962
PMID: 39547585

Abstract Copy Number Variations (CNVs) are crucial in various diseases, especially cancer, but detecting them accurately from SNP genotyping arrays remains challenging. Therefore, this study benchmarked five CNV detection tools-PennCNV, QuantiSNP, iPattern, EnsembleCNV, and R-GADA-using SNP array and WGS data from 2002 individuals of the DRAGEN re-analysis of the 1000 Genomes project. Continue reading

Copy number variation heterogeneity reveals biological inconsistency in hierarchical cancer classifications

Research Article

Ziying Yang, Paula Carrio-Cordo and Michael Baudis¶

Molecular Cytogenetics (Spring Nature). doi: 10.1186/s13039-024-00692-2 ¶

Abstract: Cancers are heterogeneous diseases with unifying features of abnormal and consuming cell growth, where the deregulation of normal cellular functions is initiated by the accumulation of genomic mutations in cells of - potentially - any organ. At diagnosis malignancies typically present with patterns of somatic genome variants on diverse levels of heterogeneity. Among the different types of genomic alterations, copy number variants (CNV) represent a distinct, near-ubiquitous class of structural variants. Cancer classifications are foundational for patient care and oncology research. Terminologies such as the National Cancer Institute Thesaurus provide large sets of hierarchical cancer classification vocabularies and promote data interoperability and ontology-driven computational analysis. To find out how categorical classifications correspond to genomic observations, we conducted a meta-analysis of inter-sample genomic heterogeneity for classification hierarchies on CNV profiles from 97,142 individual samples across 512 cancer entities, and evaluated recurring CNV signatures across diagnostic subsets. Our results highlight specific biological mechanisms across cancer entities with the potential for improvement of patient stratification and future enhancement of cancer classification systems and provide some indications for cooperative genomic events across distinct clinical entities.

Baudisgroup Publications¶

Multicancer analyses of short tandem repeat variations reveal shared gene regulatory mechanisms

Repeat lengths association with the expression of nearby genes (eSTRs) in CRC, STAD and UCEC tumors

Feifei Xia, Max Verbiest, Oxana Lundström, Tugce Bilgin Sonay, Michael Baudis and Maria Anisimova¶

doi: https://doi.org/10.1101/2025.01.06.629343¶

A systematic benchmark of copy number variation detection tools for high density SNP genotyping arrays

M.N. van Baardwijk, L.S.E.M. Heijnen, H. Zhao, M. Baudis and A.P. Stubbs¶

Genomics (Elsevier). 2024 Nov 14.¶

Copy number variation heterogeneity reveals biological inconsistency in hierarchical cancer classifications

Research Article

Ziying Yang, Paula Carrio-Cordo and Michael Baudis¶

Molecular Cytogenetics (Spring Nature). doi: 10.1186/s13039-024-00692-2¶

cancercelllines.org - a Novel Resource for Genomic Variants in Cancer Cell Lines

DATABASE Article

Rahel Paloots and Michael Baudis¶

Database (Oxford). 2024 Apr 30:2024:baae030. doi: 10.1093/database/baae030¶

bioarXiv preprint (2023-12-13): https://doi.org/10.1101/2023.12.12.571281¶

Data-Driven Information Extraction and Enrichment of Molecular Profiling Data for Cancer Cell Lines

Literature-derived annotations as entry point for data exploration

Ellery Smith, Rahel Paloots, Dimitris Giagkos, Michael Baudis and Kurt Stockinger¶

Bioinformatics Advances, vbae045, doi.org/10.1093/bioadv/vbae045¶

Previous arXiv preprint (2023-07-03): https://doi.org/10.48550/arXiv.2307.00933¶

Twelve quick tips for deploying a Beacon

Some hints for Beacon developers & implementers

Lauren A Fromont, Mauricio Moldes, Michael Baudis, Anthony J Brookes, Arcadi Navarro and Jordi Rambla¶

PLoS Comput Biol. 2024 Mar 1;20(3):e1011817.¶

labelSeg: segment annotation for tumor copy number alteration profiles

A tool to assign relative SCNA levels to segments

Hangjia Zhao and Michael Baudis¶

Briefings in Bioinformatics (Oxford). 2024 Jan 31;2024:bbad541.¶

Short tandem repeat mutations regulate gene expression in colorectal cancer

Exploring STR patterns and their relation to expression changes in cancer

Max A Verbiest, Oxana Lundström, Feifei Xia, Michael Baudis, Tugce Bilgin Sonay, Maria Anisimova¶

doi: https://doi.org/10.1101/2023.11.29.569189¶

Phenopacket-tools: Building and validating GA4GH Phenopackets

Bioinformatics tools and examples for working with the Phenopackets standard

Danis D, Jacobsen JOB, Wagner AH, Groza T, Beckwith MA, Rekerle L, Carmody LC, Reese J, Hegde H, Ladewig MS, Seitz B, Munoz-Torres M, Harris NL, Rambla J, Baudis M, Mungall CJ, Haendel MA, Robinson PN. (2023) Phenopacket-tools: Building and validating GA4GH Phenopackets. PLoS One. 18:e0285433.¶

Candidate targets of copy number deletion events across 17 cancer types

Identifying cancer related genes against the background of somatic CNV events

Huang Q and Baudis M¶

doi: 10.3389/fgene.2022.1017657¶

previous bioRxiv (first )2022-06-29), doi.org/10.1101/2022.06.29.498080¶

GA4GH Phenopackets: A Practical Introduction

Phenopackets v2 introduction with practical examples

Ladewig MS, Jacobsen JO, Wagner AH, Danis D, Kassaby BE, Gargano M, Groza T, Baudis M, Steinhaus R, Seelow D, Bechrakis NE, Mungall CJ, Schofield PN, Elemento O, Smith L, McMurry JA, Munoz-Torres M, Haendel MA and Robinson PN¶

Advanced Genetics 2022, 2200016. LINK¶

The GA4GH Phenopacket schema defines a computable representation of clinical data

Phenopackets v2 publication

Jacobsen JOB, Baudis M, Baynam GS, Beckmann JS, Beltran S, Buske OJ, Callahan TJ, Chute CG, Courtot M, Danis D, Elemento O, Essenwanger A, Freimuth RR, ... , Haendel MA, Robinson PN, The GAGHPMC.¶

Nature Biotechnology. 2022;40:817-820. LINK | PMID:35705716¶

Beacon v2 and Beacon networks: A "lingua franca" for federated data discovery in biomedical genomics, and beyond

Beacon v2 publication

Rambla J, Baudis M, Ariosa R, Beck T, Fromont LA, Navarro A, Paloots R, Rueda M, Saunders G, Singh B, Spalding JD.¶

Human Mutation. 2022 Mar 17. PMID:35297548¶

The GA4GH Phenopacket schema: A computable representation of clinical data for precision medicine

Phenopackets v2 preprint

Jacobsen JOB, Baudis M, Baynam GS, Beckmann JS, Beltran S, Callahan TJ, Chute CG, Courtot M, Danis D, Elemento O, Freimuth RR, ..., Haendel MA, Robinson PN.¶

medRxiv, 2021.11.27.21266944. doi:10.1101/2021.11.27.21266944¶

The GA4GH Variation Representation Specification (VRS): a Computational Framework for the Precise Representation and Federated Identification of Molecular Variation.

Alex H. Wagner, Lawrence Babb, Gil Alterovitz, Michael Baudis, Matthew Brush, Daniel L. Cameron, Melissa Cline , Malachi Griffith, Obi L. Griffith, ..., Melissa Konopko, Heidi L. Rehm, Andrew D. Yates, Robert R. Freimuth, Reece K. Hart¶

Wagner, Alex H. et al. Cell Genomics, Volume 1, Issue 2, 100027 doi:10.1016/j.xgen.2021.100027¶

bioRxiv. version 20212021.01.15.426843. (2021-01-15)¶

Note¶

International federation of genomic medicine databases using GA4GH standards

Adrian Thorogood, Heidi L. Rehm, Peter Goodhand, Angela J.H. Page, Yann Joly, Michael Baudis, Jordi Rambla, Arcadi Navarro, Tommi H. Nyronen, Mikael Linden, Edward S. Dove, Marc Fiume, Michael Brudno, Melissa S. Cline, Ewan Birney¶

Thorogood, Adrian et al. Cell Genomics, Volume 1, Issue 2, 100032 doi:10.1016/j.xgen.2021.100032¶

Note¶

GA4GH: International policies and standards for data sharing across genomic research and healthcare

Rehm, Heidi L. et al. Cell Genomics, Volume 1, Issue 2, 100029 doi:10.1016/j.xgen.2021.100029¶

Note¶

The Progenetix oncogenomic resource in 2021

Article describing the current content & technical status of progenetix.org

Qingyao Huang, Paula Carrio Cordo, Bo Gao, Rahel Paloots, Michael Baudis¶

Database (Oxford). 2021 Jul 17;2021:baab043.¶

Signatures of Discriminative CNA in 31 Cancer Subtypes

Bo Gao and Michael Baudis (2021)¶

Published at Frontiers in Genetics, 2021-05-13¶

Abstract¶

Copy number variant heterogeneity among cancer types reflects inconsistent concordance with diagnostic classifications

Paula Carrio Cordo and Michael Baudis¶

doi: https://doi.org/10.1101/2025.01.06.629343 ¶

Molecular Cytogenetics (Spring Nature). doi: 10.1186/s13039-024-00692-2 ¶

Database (Oxford). 2024 Apr 30:2024:baae030. doi: 10.1093/database/baae030 ¶

bioarXiv preprint (2023-12-13): https://doi.org/10.1101/2023.12.12.571281 ¶

Bioinformatics Advances, vbae045, doi.org/10.1093/bioadv/vbae045 ¶

Previous arXiv preprint (2023-07-03): https://doi.org/10.48550/arXiv.2307.00933 ¶

doi: https://doi.org/10.1101/2023.11.29.569189 ¶

doi: 10.3389/fgene.2022.1017657 ¶

previous bioRxiv (first )2022-06-29), doi.org/10.1101/2022.06.29.498080 ¶

Advanced Genetics 2022, 2200016. LINK ¶

Nature Biotechnology. 2022;40:817-820. LINK | PMID:35705716 ¶

Human Mutation. 2022 Mar 17. PMID:35297548 ¶

medRxiv, 2021.11.27.21266944. doi:10.1101/2021.11.27.21266944 ¶

Wagner, Alex H. et al. Cell Genomics, Volume 1, Issue 2, 100027 doi:10.1016/j.xgen.2021.100027 ¶

Thorogood, Adrian et al. Cell Genomics, Volume 1, Issue 2, 100032 doi:10.1016/j.xgen.2021.100032 ¶

Rehm, Heidi L. et al. Cell Genomics, Volume 1, Issue 2, 100029 doi:10.1016/j.xgen.2021.100029 ¶

bioRxiv. doi: doi.org/10.1101/2021.03.01.433348 ¶

Cell Rep. 2020 Aug 4 DOI: 10.1016/j.celrep.2020.107985 ¶

bioRxiv, 2019-07-31. DOI 10.1101/720854 ¶

DATABASE, Volume 2020, 2020, baaa009, doi.org/10.1093/database/baaa009 ¶

bioRxiv preprint, 2020-01-11. DOI 10.1101/827683 ¶

Sci Rep 10, 4846 (2020). doi.org/10.1038/s41598-020-61854-x ¶

bioRxiv, 2020-11-01. DOI 10.1101/827683 ¶

bioRxiv, 2019-07-31. DOI 10.1101/720854 ¶

JEADV, 2019-01-19. doi.org/10.1111/jdv.15442 ¶