Skip to content

Welcome to the baudisgroup Pages

The baudisgroup website represents projects and information by the Computational Oncogenomics Group of the University of Zurich (UZH) and the Swiss Institute of Bioinformatics (SIB). For visitors more interested in Particle Astrophysics, we strongly recommend the website of another, although related, Professor Baudis.

The Computational Oncogenomics Group's research focus lies in the exploration of structural genome variations in cancer. Our work centres around our Progenetix resource of curated molecular-cytogenetic and sequencing data. Specific projects explore computational methods, genomics of selected tumour entities and genomic variant patterns across malignancies. As members of the Global Alliance for Genomics and Health, the group is developing standards in biocuration and data sharing for genomic variants and phenotypic data, for instance in driving development of the ELIXIR Beacon project. Other research is related to genome data epistemology, e.g. geographic and diagnostic sampling biases in cancer studies.

Some Projects

Ongoing software and service projects can be visited at our Github organizations (progenetix and baudisgroup) and when looking at individual contributions to e.g. GA4GH and ELIXIR.

Support or Contact

For requests & support, please use standard Github procedures such as pull requests or issues on our project repositories, or contact us.

Address

Winterthurerstrasse 190 | Y-13F-01 | CH-8057 Zurich | Switzerland

How to find the Baudisgroup

Latest News & Publications

A cancer genomics resource built on GA4GH standards

Rahel Paloots, Michael Baudis

CGC St Louis 2022

Progenetix is a cancer genomics resource that includes genomic profiling data as well as biomedical annotations and provenance data for cancer studies. The main goal of the Progenetix database is to provide easy, open access for research studies and clinical diagnostics. To facilitate sharing of genomic data, Progenetix complies with and contributes to GA4GH and Beacon data standards. Beacon, developed with the support from ELXIR (the European bioinformatics infrastructure organization), started out as protocol to share genomic variants over federated queries.

Continue reading

Candidate targets of copy number deletion events across 17 cancer types

Identifying cancer related genes against the background of somatic CNV events

Huang Q and Baudis M

bioRxiv, doi.org/10.1101/2022.06.29.498080

Abstract Genome variation is the direct cause of cancer and driver of its clonal evolution. While the impact of many point mutations can be evaluated through their modification of individual genomic elements, even single copy number aberrations (CNAs) may encompass hundreds of genes and therefore pose challenges to untangle potentially complex functional effects. However, consistent, recurring and disease-specific patterns in the genome-wide CNA landscape imply that particular CNA may promote cancer type-specific characteristics. Discerning essential cancer-promoting alterations from the inherent co-dependency in CNA would improve the understanding of mechanisms of CNA and provide new insights into cancer biology and potential therapeutic targets. Continue reading

Progenetix, BeaconPlus & GA4GH

A cancer genomics reference resource powered by GA4GH standards

Michael Baudis

Roche Data Science Seminar

The presentation reports about the Progenetix cancer genomics resource and its role in the GA4GH ecosystem & the Beacon genomics API development process.

Continue reading

The GA4GH Phenopacket schema defines a computable representation of clinical data

Phenopackets v2 publication

Cell Genomics logo

Jacobsen JOB, Baudis M, Baynam GS, Beckmann JS, Beltran S, Buske OJ, Callahan TJ, Chute CG, Courtot M, Danis D, Elemento O, Essenwanger A, Freimuth RR, ... , Haendel MA, Robinson PN, The GAGHPMC.

Abstract B Despite great strides made in the development and wide acceptance of standards for exchanging structured information about genomic variants, progress in standards for computational phenotype analysis for translational genomics has lagged behind. Phenotypic features (signs, symptoms, laboratory and imaging findings, results of physiological tests, etc.) are of high clinical importance, yet exchanging them in conjunction with genomic variation information is often overlooked or even neglected. Continue reading

Implementation of the GA4GH Beacon protocol for discovery and sharing of genomic copy number variation data

Michael Baudis

ESHG Vienna 2022

Background & Objectives Genomic copy number variations (CNV) are a major contributor to inter-individual genomic variation, can be causative events in rare diseases, but especially represent the majority of the mutational landscape in the most malignancies. While specific CNV events and some recurring patterns have contributed to the identification of individual cancer drivers and the recognition of cancer subtypes, the complexity of genomic CNV patterns requires large amounts of well-defined genomic profiles for statistically meaningful analyses. At the other end of the spectrum, in the area of rare disease genomics the potential pathogenicity of individual CNV events requires validation against a vast set of disease-related and reference genomic profiles and annotations.

Continue reading

Progenetix & BeaconPlus

An open cancer genomics resource on a stack of Beacon code...

Michael Baudis

ELIXIR All Hands Amsterdam 2022

Here Michael provides some overview of the multi-year trajectory of the Beacon API development, and how BeaconPlus & Progenetix have been utilized for "implementation driven design".

Continue reading

Beacon v2 and Beacon networks: A "lingua franca" for federated data discovery in biomedical genomics, and beyond

Beacon v2 publication

Rambla J, Baudis M, Ariosa R, Beck T, Fromont LA, Navarro A, Paloots R, Rueda M, Saunders G, Singh B, Spalding JD.

Human Mutation. 2022 Mar 17. PMID:35297548

Abstract Beacon is a basic data discovery protocol issued by the Global Alliance for Genomics and Health (GA4GH). The main goal addressed by version 1 of the Beacon protocol was to test the feasibility of broadly sharing human genomic data, through providing simple "yes" or "no" responses to queries about the presence of a given variant in datasets hosted by Beacon providers.

Continue reading

hCNV Implementation Studies Old and New

Michael Baudis

ELIXIR Human Data Communities

This presentation provided an overview about the hCNV community, implementation studies and ongoing work, e.g. interaction with the GA4GH VRS standard group and Beacon development.

Continue reading

The GA4GH Phenopacket schema: A computable representation of clinical data for precision medicine

Phenopackets v2 preprint

Jacobsen JOB, Baudis M, Baynam GS, Beckmann JS, Beltran S, Callahan TJ, Chute CG, Courtot M, Danis D, Elemento O, Freimuth RR, ..., Haendel MA, Robinson PN.

medRxiv, 2021.11.27.21266944. doi:10.1101/2021.11.27.21266944

Abstract Despite great strides in the development and wide acceptance of standards for exchanging structured information about genomic variants, there is no corresponding standard for exchanging phenotypic data, and this has impeded the sharing of phenotypic information for computational analysis. Here, we introduce the Global Alliance for Genomics and Health (GA4GH) Phenopacket schema, which supports exchange of computable longitudinal case-level phenotypic information for diagnosis and research of all types of disease including Mendelian and complex genetic diseases, cancer, and infectious diseases. Continue reading

The GA4GH Variation Representation Specification (VRS): a Computational Framework for the Precise Representation and Federated Identification of Molecular Variation.

Alex H. Wagner, Lawrence Babb, Gil Alterovitz, Michael Baudis, Matthew Brush, Daniel L. Cameron, Melissa Cline , Malachi Griffith, Obi L. Griffith, ..., Melissa Konopko, Heidi L. Rehm, Andrew D. Yates, Robert R. Freimuth, Reece K. Hart

Wagner, Alex H. et al. Cell Genomics, Volume 1, Issue 2, 100027 doi:10.1016/j.xgen.2021.100027
bioRxiv. version 20212021.01.15.426843. (2021-01-15)

Note

Cell Genomics logoThis article was published as part of a special GA4GH edition of Cell Genomics.

Abstract Maximizing the personal, public, research, and clinical value of genomic information will require the reliable exchange of genetic variation data. We report here the Variation Representation Specification (VRS, pronounced “verse”), an extensible framework for the computable representation of variation that complements contemporary human-readable and flat file standards for genomic variation representation. VRS provides semantically precise representations of variation and leverages this design to enable federated identification of biomolecular variation with globally consistent and unique computed identifiers. Continue reading

International federation of genomic medicine databases using GA4GH standards

Adrian Thorogood, Heidi L. Rehm, Peter Goodhand, Angela J.H. Page, Yann Joly, Michael Baudis, Jordi Rambla, Arcadi Navarro, Tommi H. Nyronen, Mikael Linden, Edward S. Dove, Marc Fiume, Michael Brudno, Melissa S. Cline, Ewan Birney

Thorogood, Adrian et al. Cell Genomics, Volume 1, Issue 2, 100032 doi:10.1016/j.xgen.2021.100032

Note

Cell Genomics logoThis article was published as part of a special GA4GH edition of Cell Genomics.

Abstract We promote a shared vision and guide for how and when to federate genomic and health-related data sharing, enabling connections and insights across independent, secure databases. The GA4GH encourages a federated approach wherein data providers have the mandate and resources to share, but where data cannot move for legal or technical reasons. We recommend a federated approach to connect national genomics initiatives into a global network and precision medicine resource.

Continue reading

GA4GH: International policies and standards for data sharing across genomic research and healthcare

Heidi L. Rehm, Angela J.H. Page, Lindsay Smith, Jeremy B. Adams, Gil Alterovitz, Lawrence J. Babb, Maxmillian P. Barkley, Michael Baudis, Michael J.S. Beauvais, Tim Beck, Jacques S. Beckmann, Sergi Beltran, David Bernick, Alexander Bernier, James K. Bonfield, Tiffany F. Boughtwood, Guillaume Bourque, Sarion R. Bowers, Anthony J. Brookes, Michael Brudno, Matthew H. Brush, David Bujold, Tony Burdett, Orion J. Buske, Moran N. Cabili , Daniel L. Cameron, Robert J. Carroll, Esmeralda Casas-Silva, Debyani Chakravarty, Bimal P. Chaudhari, Shu Hui Chen, J. Michael Cherry, Justina Chung, Melissa Cline, Hayley L. Clissold, Robert M. Cook-Deegan, Mélanie Courtot, ..., Peter Goodhand, Kathryn North, Ewan Birney

Rehm, Heidi L. et al. Cell Genomics, Volume 1, Issue 2, 100029 doi:10.1016/j.xgen.2021.100029

Note

This article was published as part of a special GA4GH edition of Cell Genomics.

Abstract The Global Alliance for Genomics and Health (GA4GH) aims to accelerate biomedical advances by enabling the responsible sharing of clinical and genomic data through both harmonized data aggregation and federated approaches. The decreasing cost of genomic sequencing (along with other genome-wide molecular assays) and increasing evidence of its clinical utility will soon drive the generation of sequence data from tens of millions of humans, with increasing levels of diversity. In this perspective, we present the GA4GH strategies for addressing the major challenges of this data revolution. Continue reading

Progenetix & GA4GH Beacon

GRIC sponsored workshop with the Swiss Institute of Bioinformatics

A cancer genomics resource built around and driving GA4GH standards

Michael Baudis

The Progenetix oncogenomics resource provides sample-specific cancer genome profiling data and biomedical annotations as well as provenance data for cancer studies. With more than 100k genomic copy number number (CNV) profiles from over 700 cancer types, Progenetix empowers comparative analyses beyond individual studies and diagnostic concepts.

Continue reading

A Standardized Format for Federated Genomic Data Exchange

The GA4GH Beacon Protocol Presented at BC2

Michael Baudis

BC2 Basel 2021

Session "Federating computational analyses with GA4GH standards"

BC2 logoDuring the "Federating computational analyses with GA4GH standards" workshop at BC2 2021 Michael presented history and the current status of the Beacon project, as well as its integration with specific data resources and analysis initiatives.

Continue reading

Cancer genomics reference resource and toolkit around GA4GH standards

Q. Huang, B. Gao, R. Paloots, P. Carrio-Cordo, Z. Yang, M. Baudis

ESHG 2021

ESHG LogoThis poster presentation at the European Society of Human Genetics meeting 2021 discusses the integration and development of GA4GH standards by the Progenetix oncogenomics resource.

Continue reading

Progenetix - An open reference resource for copy number vatiation data in cancer

Qingyao Huang

Continue reading

The Progenetix oncogenomic resource in 2021

Article describing the current content & technical status of progenetix.org

Qingyao Huang, Paula Carrio Cordo, Bo Gao, Rahel Paloots, Michael Baudis

Database (Oxford). 2021 Jul 17;2021:baab043.

DATABASE logo

Abstract In cancer, copy number aberrations (CNAs) represent a type of nearly ubiquitous and frequently extensive structural genome variations. To disentangle the molecular mechanisms underlying tumorigenesis as well as identify and characterize molecular subtypes, the comparative and meta-analysis of large genomic variant collections can be of immense importance. Over the last decades, cancer genomic profiling projects have resulted in a large amount of somatic genome variation profiles, however segregated in a multitude of individual studies and datasets. The Progenetix project, initiated in 2001, curates individual cancer CNA profiles and associated metadata from published oncogenomic studies and data repositories with the aim to empower integrative analyses spanning all different cancer biologies. Continue reading

hCNV Community and Implementation Studies

Michael Baudis

ELIXIR All Hands 2021 Human Data Day

At the Human Data Day Michael presents a very brief overview of the ending and upcoming ELIXIR hCNV implementation studies.

Continue reading

hCNV data and the Progenetix Beacon

Presentation at ELIXIR All Hands 2021

Michael Baudis

ELIXIR All Hands 2021

This presentation gives a brief overview of the use of the Progenetix resource to test and implement a genomics reference resource using the emerging Beacon v2 protocol.

Continue reading

Signatures of Discriminative CNA in 31 Cancer Subtypes

Bo Gao and Michael Baudis (2021)

Published at Frontiers in Genetics, 2021-05-13

Abstract

Copy number aberrations (CNA) are one of the most important classes of genomic mutations relatedto oncogenetic effects. In the past three decades, a vast amount of CNA data has been generated bymolecular-cytogenetic and genome sequencing based methods. While this data has been instrumentalin the identification of cancer-related genes and promoted research into the relation between CNA andhisto-pathologically defined cancer types, the heterogeneity of source data and derived CNV profilespose great challenges for data integration and comparative analysis. Furthermore, a majority of exist-ing studies have been focused on the association of CNA to pre-selected ”driver” genes with limitedapplication to rare drivers and other genomic elements.

Continue reading

Progenetix, Beacon and GA4GH at RDA

Research Data Alliance - RDA Virtual Plenary 17

Concepts | Status | History | Outlook

Michael Baudis

Research Data Alliance - RDA Virtual Plenary 17

This seminar gives an overview of current state of the Progenetix Beacon project and the overall connection to the Global Alliance for Genomics and Health (GA4GH).

Continue reading

Implementing GA4GH Standards to Drive an Open Oncogenomics Resource

Research Seminar Kinderspital Zürich - Neuroonkologie

Michael Baudis

Seminar Neurooncology Childrens Hospital Zürich

This seminar gives an overview of the history & current state of the Progenetix resource, it's role in Beacon API development and the overall connection to the Global Alliance for Genomics and Health (GA4GH).

Continue reading

Discovering copy number variation across multiple cancer types

Qingyao Huang

Abstract

Genomic variations are direct cause of tumor formation and accomplice in its continuous evolution. While point mutations can be pinpointed to a targeted genetic element, copy number variations (CNVs) involve copy number gain or loss of a large DNA segment which often covers hundreds of genetic elements in one event. Continue reading

EACR conference - The Progenetix Oncogenomic Resource

Continue reading

GA4GH Connect - Beacon v2 and SchemaBlocks

GA4GH Connect 2020

Michael Baudis

Beacon v2 Structural Variants [slides]
SchemaBlocks {S}[B] [slides]
Continue reading

Copy number variant heterogeneity among cancer types reflects inconsistent concordance with diagnostic classifications

Paula Carrio Cordo and Michael Baudis

bioRxiv. doi: doi.org/10.1101/2021.03.01.433348

This article explores the correlation between subsets of cancer entities, grouped by their somatic CNV patterns, and levels of diagnostic classification systems.

Continue reading

Genomic data and Privacy

Michael Baudis

ETHZ Lecture

The understanding of the impact of individual inherited and somatic genome variants on phenotypes and diseases requires a thorough understanding of the occurrence of such variants amongst populations in general and carriers of the phenotypes and diseases in particular. This information can only be provided through the inclusion of data from a multitude of genome resources in variant evaluation efforts, including such from outside (international) jurisdictions. However, opening such resources carries the inherent risk of breaching privacy, particularly through re-identification of individuals or their relatives and potentially through the exposure of individual genome-related personal information including phenotypic and "performance" prediction and relative disease risk.

Continue reading

Beacon v2 – Towards flexible use and clinical applications for a reference genomic data protocol

SPHN Webinar

Michael Baudis

Genomic “Beacons” provide discovery services for genomic data using the Beacon API developed as a key driver project of the Global Alliance for Genomics and Health (GA4GH). The Beacon protocol itself defines an open standard for genomics data discovery and provides a framework for web services responding to queries against genomic data collections, for instance from population based or disease specific genome repositories. Continue reading

Update of Progenetix Oncogenomics Resource

Research Progress Report, DMLS, University of Zurich

Qingyao Huang

Copy number aberration (CNA) is frequently observed in cancer genomes. Meta-analysis of genomic variations helps to disentangle the multiplex molecular mechanism underlying tumorigenesis as well as identify and characterize molecular subtypes. Over the years, cancer genomic research have resulted in a considerable amount of data segregated by studies. The Progenetix project (www.progenetix.org), initiated in 2001, aims to systematize the published cancer genomic profiles and provide accurate annotation to facilitate integrative analysis. Continue reading

Welcome to Ziying

Today Ziying Yang arrived as a new member of the baudisgroup.

Welcome Ziying!

Continue reading

GA4GH Beacon v2 at GA4GH Plenary

GA4GH Beacon v2 - Evolving Reference Standard for Genomic Data Exchange

GA4GH 8th Plenary

Gary Saunders, Jordi Rambla de Argila, Anthony Brookes, Juha Törnroos and Michael Baudis

For the ELIXIR Beacon project, GA4GH Discovery work stream and the international network of Beacon API developers

The Beacon driver project was one of the earliest initiatives of the Global Alliance for Genomics and Health with the Beacon v1.0 API as first approved GA4GH standard. Version 2 of the protocol is slated to provide fundamental changes, towards a Internet of Genomics foundational standard: * requests beyond genomic variants ("filters") * payload responses, secured through open AAI * aligning w/ GA4GH standards (Phenopackets, VRS, DUO...) through SchemaBlocks {S}[B] * Working with international partners on deployment of advanced implementations

Continue reading

Progenetix - A cancer genomics reference resource around GA4GH standards

GA4GH 8th Plenary

Michael Baudis

The Progenetix oncogenomics resource provides sample-specific cancer genome profiling data and biomedical annotations as well as provenance data from cancer studies. Especially through currently 113322 curated genomic copy number number (CNV) profiles from 1600 individual studies representing over 500 cancer types (NCIt), Progenetix empowers aggregate and comparative analyses which vastly exceed individual studies or single diagnostic concepts. Continue reading

Cancer Data - ELIXIR::GA4GH: Advancing genomics resources through standards and ontologies

ECCB2020

Michael Baudis

Continue reading

The Ubiquitin Ligase TRIP12 Limits PARP1 Trapping and Constrains PARP Inhibitor Efficiency

Marco Gatti, Ralph Imhof, Qingyao Huang, Michael Baudis, Matthias Altmeyer

Cell Rep. 2020 Aug 4 DOI: 10.1016/j.celrep.2020.107985

Abstract PARP inhibitors (PARPi) cause synthetic lethality in BRCA-deficient tumors. Whether specific vulnerabilities to PARPi exist beyond BRCA mutations and related defects in homology-directed repair (HDR) is not well understood. Here, we identify the ubiquitin E3 ligase TRIP12 as negative regulator of PARPi sensitivity. Continue reading

Beacon v2 - Towards Flexible Use and Clinical Applications for a Reference Genomic Data Sharing Protocol

Personalized Health Technologies 2020

Michael Baudis

Beacons provide discovery services for genomic data using the Beacon API developed under the leadership of ELIXIR, as a key driver project of the Global Alliance for Genomics and Health (GA4GH). The Beacon protocol itself defines an open standard for genomics data discovery. It provides a framework for public web services responding to queries against genomic data collections, for instance from population based or disease specific genome repositories. Sites offering beacons can scale through aggregation in "Beacon Networks", which distribute single genome queries among a potentially large number of international beacons and assemble their responses. Continue reading

ELIXIR All Hands - Beacon Evolution

ELIXIR All Hands 2020 - Beacon Workshop

Michael Baudis

This presentation covers some of Beacon's origins, features and directions.

Continue reading

Oncology Informatics: Status Quo and Outlook - Review

Paul Martin Putora, Michael Baudis, Beth M. Beadle, Issam El Naqa, Frank A. Giordano and Nils H. Nicolay

Oncology, 2020-05-14. DOI 10.1159/000507586 (Review)

Abstract Oncology has undergone rapid progress, with emerging developments in areas including cancer stem cells, molecularly targeted therapies, genomic analyses, and individually tai- lored immunotherapy. These advances have expanded the tools available in the fight against cancer. Some of these have seen broad media coverage resulting in justified public attention. However, these achievements have only been possible due to rapid developments in the expanding field of biomedical informatics and information technology (IT). Continue reading

Swissnex SF: Laura & Michael Baudis - Life & Family

Originally planned for their stays at UCB and LBNL, Laura & Michael were interviewed by Tabea Stoeckel from swissnex San Francisco about their stay in the Bay Area and their research & family life as internationally active scientists.

Continue reading

Minimum Error Calibration and Normalization for Genomic Copy Number Analysis

Bo Gao and Michael Baudis (2020)

bioRxiv, 2019-07-31. DOI 10.1101/720854
Genomics, Volume 112, Issue 5, September 2020, Pages 3331-3341, accepted 2020-05-06 doi.org/10.1016/j.ygeno.2020.05.008.
Background

Copy number variations (CNV) are regional deviations from the normal autosomal bi-allelic DNA content. While germline CNVs are a major contributor to genomic syndromes and inherited diseases, the majority of cancers accumulate extensive "somatic" CNV (sCNV or CNA) during the process of oncogenetic transformation and progression. While specific sCNV have closely been associated with tumorigenesis, intriguingly many neoplasias exhibit recurrent sCNV patterns beyond the involvement of a few cancer driver genes. Continue reading

SchemaBlocks and GA4GH TASC

A brief presentation about SchemaBlocks concepts and its possible integration into the new GA4GH TASC effort.

Continue reading

A harmonized meta-knowledgebase of clinical interpretations of somatic genomic variants in cancer

Alex H. Wagner, Brian Walsh, Georgia Mayfield, David Tamborero, Dmitriy Sonkin, Kilannin Krysiak, Jordi Deu-Pons, Ryan P. Duren, Jianjiong Gao, Julie McMurry, Sara Patterson, Catherine del Vecchio Fitz, Beth A. Pitel, ..., Nuria Lopez-Bigas, Mark Lawler, Jeremy Goecks, Malachi Griffith, Obi L. Griffith, Adam A. Margolin & Variant Interpretation for Cancer Consortium

Nature Genetics volume 52, pages 448–457 (2020)

Precision oncology relies on accurate discovery and interpretation of genomic variants, enabling individualized diagnosis, prognosis and therapy selection. We found that six prominent somatic cancer variant knowledgebases were highly disparate in content, structure and supporting primary literature, impeding consensus when evaluating variants and their relevance in a clinical setting. We developed a framework for harmonizing variant interpretations to produce a meta-knowledgebase of 12,856 aggregate interpretations. Continue reading

Geographic assessment of cancer genome profiling studies

Paula Carrio Cordo, Elise Acheson, Qingyao Huang and Michael Baudis (2020)

DATABASE, Volume 2020, 2020, baaa009, doi.org/10.1093/database/baaa009
bioRxiv preprint, 2020-01-11. DOI 10.1101/827683

Abstract Cancers arise from the accumulation of somatic genome mutations, which can be influenced by inherited genomic variants and external factors such as environmental or lifestyle-related exposure. Due to the heterogeneity of cancers, precise information about the genomic composition of germline and malignant tissues has to be correlated with morphological, clinical and extrinsic features to advance medical knowledge and treatment options. With global differences in cancer frequencies and disease types, geographic data is of importance to understand the interplay between genetic ancestry and environmental influence in cancer incidence, progression and treatment outcome. Continue reading

SWISSNEX SF Lunch Seminar - Data Mining in Genomics

Genomic Research and Personalised Health Strategies

Resources | Standards | Protocols | Tools | Discourse

These are the slides of a short presentation, given (virtually, since COVID-19) for a SWISSNEX San Francisco lunch meeting.

Continue reading

Enabling population assignment from cancer genomes with SNP2pop

Huang Q and Baudis M. (2020)

Sci Rep 10, 4846 (2020). doi.org/10.1038/s41598-020-61854-x

Abstract In many cancers, incidence, treatment efficacy and overall prognosis vary between geographic populations. Studies disentangling the contributing factors may help in both understanding cancer biology and tailoring therapeutic interventions. Ancestry estimation in such studies should preferably be driven by genomic data, due to frequently missing or erroneous self-reported or inferred metadata. While respective algorithms have been demonstrated for baseline genomes, such a strategy has not been shown for cancer genomes carrying a substantial somatic mutation load. We have developed a bioinformatics tool for the assignment of population groups from genome profiling data for both unaltered and cancer genomes. Continue reading

BBOP Presentation - Baudisgroup Projects & Interests

Continue reading

Michael @ BBOP

Between 2020-02-08 and 2020-04-10, Michael has switched his office for a desk at the Berkeley Bioinformatics Open-source Projects (BBOP) where he will work with members of Chris Mungall's team on projects related to biomedical ontologies, data standards and the analysis of cancer genome data.

Continue reading

ELIXIR Beacon Project - Networking Resources Across and Beyond ELIXIR Human Data Communities

ELIXIR Open Day - Wellcome Trust Genome Campus Hinxton

Michael Baudis

In this presentation I introduce the Beacon project and provide my opinions about its future trajectory, and especially its role in driving the alignment of ELIXIR and GA4GH projects in related to (human) genome data sharing.

Continue reading

GA4GH SchemaBlocks for Human Cell Atlas

This is a presentation of the SchemaBlocks initiative and the overall GA4GH context, for the Human Cell Atlas project, given by Michael at one of their teleconferences.

Continue reading

Geographic assessment of cancer genome profiling studies

Paula Carrio Cordo, Elise Acheson, Qingyao Huang and Michael Baudis (2020)

bioRxiv, 2020-11-01. DOI 10.1101/827683

Abstract Cancers arise from the accumulation of somatic genome mutations, which can be influenced by inherited genomic variants and external factors such as environmental or lifestyle-related exposure. Due to the heterogeneity of cancers, precise information about the genomic composition of germline and malignant tissues has to be correlated with morphological, clinical and extrinsic features to advance medical knowledge and treatment options. With global differences in cancer frequencies and disease types, geographic data is of importance to understand the interplay between genetic ancestry and environmental influence in cancer incidence, progression and treatment outcome. Continue reading

Talk at St. Gallen Radiation Oncology - Bioinformatics and Data Exchange

3rd St. Gallen Radiation Oncology Informatics Meeting

Bioinformatics and data exchange for genomics in an international context

Michael Baudis

The presentation at the St. Gallen meeting introduced the audience to the group's research and resources, and how this is connected to the different national & international data standards and sharing initiatives.

Continue reading

Talk at AMED Tokyo - Cancer Genomics and Standards

Mini-Symposium about CNV and Data Standards at AMED Japan, Tokyo

Cancer Genomics and Implementation of Data Driven Standards for Genomic Data Exchange

Michael Baudis

At this meeting, several Japanese participants presented their research and results, with a focus on Copy Number Variants and other structural genome variations. Continue reading

Minimum Error Calibration and Normalization for Genomic Copy Number Analysis

BC2 2019, Basel

Bo Gao

Abstract

Background:
Copy number variations (CNV) are regional deviations from thenormal autosomal bi-allelic DNA content. While germline CNVs are a majorcontributor to genomic syndromes and inherited diseases, the majority of cancersaccumulate extensive ”somatic” CNV (sCNV or CNA) during the process ofoncogenetic transformation and progression. While specific sCNV have closelybeen associated with tumorigenesis, intriguingly many neoplasias exhibitrecurrent sCNV patterns beyond the involvement of a few cancer driver genes.Currently, CNV profiles of tumor samples are generated using genomicmicro-arrays or high-throughput DNA sequencing. Regardless of the underlyingtechnology, genomic copy number data is derived from the relative assessmentand integration of multiple signals, with the data generation process being proneto contamination from several sources. Estimated copy number values have noabsolute and linear correlation to their corresponding DNA levels, and the extentof deviation differs between sample profiles which poses a great challenge fordata integration and comparison in large scale genome analysis. Continue reading

Cancer cell lines in focus: somatic copy number & germline variation

BC2 2019, Basel

Qingyao Huang

Abstract

Background:
Human cell lines are convenient model systems in cancer research, for validation of proposed molecular mechanisms as well as to evaluate potential therapeutic approaches, e.g. through high- throughput screening of potential anti-tumour compounds against cancer cell line panels. However, conclusions about biological pathways or pharmacological potential depend on a close molecular relation between the cancer type represented and the cell line model used for analyses. Continue reading

Structural Genome Variants in Cancer: Research, resources standards

Seminar at the University of Florence

Seminar Universita degli Studi Firenze - Dipartimento di Biologia

Structural Genome Variants in Cancer: Research, resources standards

Michael Baudis

Abstract

Genomic copy number variations are major contributors to malignant transformation and progression and constitute - at least in their quantitative extension - the largest contributors to genomic mutation landscapes, in the majority of cancer types. Such mutations occur in the vast majority of tumors as somatic genome alterations (sCNV) during clonal development and expansion and are promoted by a variety of mechanisms leading to extended or focal changes in the number of genomic segments. Continue reading

Leveraging European infrastructures to access 1 million human genomes by 2022

Gary Saunders, Michael Baudis, Regina Becker, Sergi Beltran, Christophe Béroud, Ewan Birney, Cath Brooksbank, Søren Brunak, Marc Van den Bulcke, Rachel Drysdale, Salvador Capella-Gutierrez, Paul Flicek, ..., Niklas Blomberg, and Serena Scollen

Nature Reviews Genetics volume 20, pages693–701 (2019)

Abstract Human genomics is undergoing a step change from being a predominantly research-driven activity to one driven through health care as many countries in Europe now have nascent precision medicine programmes. To maximize the value of the genomic data generated, these data will need to be shared between institutions and across countries. In recognition of this challenge, 21 European countries recently signed a declaration to transnationally share data on at least 1 million human genomes by 2022. In this Roadmap, we identify the challenges of data sharing across borders and demonstrate that European research infrastructures are well-positioned to support the rapid implementation of widespread genomic data access.

Continue reading

Minimum Error Calibration and Normalization for Genomic Copy Number Analysis

Bo Gao and Michael Baudis (2019)

bioRxiv, 2019-07-31. DOI 10.1101/720854

Abstract Copy number variations (CNV) are regional deviations from the normal autosomal bi-allelic DNA content. While germline CNVs are a major contributor to genomic syndromes and inherited diseases, the majority of cancers accumulate extensive “somatic” CNV (sCNV or CNA) during the process of oncogenetic transformation and progression. While specific sCNV have closely been associated with tumorigenesis, intriguingly many neoplasias exhibit recurrent sCNV patterns beyond the involvement of a few cancer driver genes. Continue reading

ELIXIR All Hands - Beacon Introduction

Michael Baudis

This presentation was the opener for the ELIXIR Beacon session, and introduces to current developments and especially the interactions between GA4GH :: Discovery and ELIXIR Beacon.

Continue reading

HGVS 2019 - Development of Standards for Genomic Data Exchange

Human Genome Variation Society - Gothenburg 2019

Implementation Driven Development of Standards for Genomic Data Exchange from Cancer Genome Data Collections

Michael Baudis

Abstract

Cancers are genomic diseases, arising from the clonal propagation of somatic mutation events, with a limited contribution from inherited genomic variants. Genomic copy number variations are major contributors to malignant transformation and progression and constitute - at least in their quantitative extension - the largest contributors to genomic mutation landscapes, in the majority of cancer types. Continue reading

Connecting the silos - Genomic Data Standards, Resources and the Global Alliance for Genomics and Health

R&D Data Intelligence Leaders Forum Basel

Michael Baudis

Abstract

This presentation discusses the need for data sharing in genomics, provides information about the Global Alliance for Genomics and Health (GA4GH), and shows some of our group's contributions, especially regarding Beacon development.

Continue reading

Federated discovery and sharing of genomic data using Beacons

Miroslav Cupak , Stephen Keenan , Jordi Rambla , Sabela de la Torre , Stephanie Dyke , Anthony Brookes , Knox Carey , David Lloyd , Peter Goodhand , Maximilian Haeussler , Michael Baudis , Heinz Stockinger , Lena Dolman , Ilkka Lappalainen , Juha Törnroos , Mikael Linden , John Spalding , Saif Ur-Rehman , Angela Page , Paul Flicek , Susheel Varma , Gary Saunders , Serena Scollen , Stephen Sherry , David Haussler , Beacon Project Team

Nat Biotechnol (2019), accepted 2019-01-23

Abstract The Beacon Project (github.com/ga4gh-beacon/) is a GA4GH initiative that is developing an open specification for genetic variation discovery and sharing. The project is demonstrating the willingness of international organizations to work together to define standards for, and actively engage in, genomic data sharing. In the two years since the project’s inception, over 90 Beacons have been lit by 35 organizations serving over 200 datasets. Continue reading

DNA copy number imbalances in primary cutaneous lymphomas (PCL)

Gug G, Huang Q, Chiticariu E, Solovan C and Baudis M (2019)

JEADV, 2019-01-19. doi.org/10.1111/jdv.15442

The article has been published with the Journal of the European Academy of Dermatology and Venereology on January 19, 2019. A corresponding preprint can be accessed through [bioRxiv].

Background

Cutaneous lymphomas (CL) represent a clinically defined group of extran‐ odal non‐Hodgkin lymphomas harbouring heterogeneous and incompletely delineated molecular aberrations. Over the past decades, molecular stud‐ ies have identified several chromosomal aberrations, but the interpreta‐ tion of individual genomic studies can be challenging.

Objective

With a comprehensive meta‐analysis, we aim to delineate genomic alter‐ ations for different types of CL and propose a more accurate classifica‐ tion in line with their various pathogenicity. Continue reading

Enabling population assignment from cancer genomes with SNP2pop

Huang Q and Baudis M. (2019)

bioRxiv, 2019-01-14. doi.org/10.1101/368647 (first version 2018-07-14)

Abstract For a variety of human malignancies, incidence, treatment efficacy and overall prognosis show considerable variation between different populations and ethnic groups. Disentangling the effects related to particular population backgrounds can help in both understanding cancer biology and in tailoring therapeutic interventions. Because self-reported or inferred patient data can be incomplete or misleading due to migration and genomic admixture, a data-driven ancestry estimation should be preferred. While algorithms to analyze ancestry structure from healthy individuals have been developed, an easy-to-use tool to assign population groups based on genotyping data from SNP profiles is still missing and benchmarking for the validity of population assignment strategy for aberrant cancer genomes was not tested. Continue reading

2018 09 18 BIO390 Michael Baudis Introduction to Bioinformatics

UZH BIO390 "Introduction to Bioinformatics"

Bioinformatics - Introduction

Michael Baudis

Abstract

First lecture in the UZH BIO390 "Introduction to Bioinformatics" series, introducing concepts and scope of bioinformatics as a field - 2018 version.

Continue reading

ECCB 2018 - Beacon

Abstract: ECCB 2018

ELIXIR Beacon - A Driver Project for the Global Alliance for Genomics and Health

Michael Baudis for the ELIXIR Beacon Project

The Global Alliance for Genomics and Health (GA4GH) develops standards and guidelines to facilitate the international sharing of genomic and health related metadata. The creation of GA4GH work stream products is moved forward through driver projects, which address particular scientific, technical, regulatory or security related aspects of data access and sharing. Continue reading

Registered access: authorizing data access

Dyke SOM, Linden M, Lappalainen I, De Argila JR, Carey K, Lloyd D, Spalding JD, Cabili MN, Kerry G, Foreman J, Cutts T, Shabani M, Rodriguez LL, Haeussler M, Walsh B, Jiang X, Wang S, Perrett D, Boughtwood T, ..., Rehm HL, Baudis M, Sherry ST, Kato K, Knoppers BM, Baker D, and Flicek P

European Journal of Human Genetics (2018)

Abstract The Global Alliance for Genomics and Health (GA4GH) proposes a data access policy model—“registered access”—to increase and improve access to data requiring an agreement to basic terms and conditions, such as the use of DNA sequence and health data in research. A registered access policy would enable a range of categories of users to gain access, starting with researchers and clinical care professionals. It would also facilitate general use and reuse of data but within the bounds of consent restrictions and other ethical obligations. In piloting registered access with the Scientific Demonstration data sharing projects of GA4GH, we provide additional ethics, policy and technical guidance to facilitate the implementation of this access model in an international setting.

Continue reading

Mountains and Chasms - Surveying the Oncogenomic Publication Landscape

Carrio Cordo P and Baudis M. (2018)

Preprints 2018, 2018070618 (doi: 10.20944/preprints201807.0618.v1).
Oncology (2018; online Oct 26)

Abstract Cancers arise from the accumulation of somatic genome mutations, with varying contributions of intrinsic (i.e. genetic predisposition) and extrinsic (i.e. environmental) factors. For the understanding of malignant clones, precise information about their genomic composition has to be correlated with morphological, clinical and individual features, in the context of the available medical knowledge. Continue reading

Population assignment from cancer genome profiling data

Huang Q and Baudis M. (2018)

bioRxiv, 2018-07-14. doi:10.1101/368647

Abstract For a variety of human malignancies, incidence, treatment efficacy and overall prognosis show considerable variation between different populations and ethnic groups. Disentangling the effects related to particular population backgrounds can help in both understanding cancer biology and in tailoring therapeutic interventions. Because self-reported or inferred patient data can be incomplete or misleading due to migration and genomic admixture, a data-driven ancestry estimation should be preferred. While tools to map and utilize ancestry information from healthy individuals have been introduced, a population assignment based on genotyping data from somatic variation profiling of cancer samples is still missing. Continue reading

A harmonized meta-knowledgebase of clinical interpretations of cancer genomic variants

Wagner AH, Walsh B, Mayfield G, Tamborero D, Sonkin D, Krysiak K, Deu Pons J, Duren R, Gao J, McMurry J, Patterson S, Del Vecchio Fitz C, Sezerman OU, Warner J, Rieke DT, Aittokallio T, Cerami E, Ritter D, Schriml LM, Haendel M, Raca G, Madhavan S, Baudis M, ..., Griffith M, Griffith OL, and Margolin A

bioRxiv. doi:10.1101/366856

Precision oncology relies on the accurate discovery and interpretation of genomic variants to enable individualized therapy selection, diagnosis, or prognosis. However, knowledgebases containing clinical interpretations of somatic cancer variants are highly disparate in interpretation content, structure, and supporting primary literature, reducing consistency and impeding consensus when evaluating variants and their relevance in a clinical settin Continue reading

Qingyao - Institute Progress Report

IMLS Progress Report

Towards understanding population effect on cancer

Qingyao Huang

Abstract

With a combination of ~50,000 curated oncogenomic array data from the arrayMap database and ~20,000 profiles from TCGA project depository, we perform a meta- analysis to investigate influence of genetic background on the CNV patterns in cancer. From sequencing data of 26 world-wide populations from 1000 Genomes project, we extract the SNP markers and use them for subsequent sample analysis. Continue reading

The ELIXIR Beacon in 2018: A driver project of GA4GH

ELIXIR All Hands, Berlin

Michael Baudis

Abstract

The core mission of the Global Alliance for Genomics and Health is to "...enable genomic data sharing for the benefit of human health". One of the instruments to enact this mission is the selection and support of driver projects, which address particular scientific, technical, regulatory or security related aspects of federated access to human genomes and related metadata. Continue reading

segment_liftover : a Python tool to convert segments between genome assemblies

Abstract: ELIXIR All Hands 2018

Bo Gao, Qingyao Huang and Michael Baudis

The process of assembling a species’ reference genome may be performed in a number of iterations, with subsequent genome assemblies differing in the coordinates of mapped elements. The conversion of genome coordinates between different assemblies is required for many integrative and comparative studies. While currently a number of bioinformatics tools are available to accomplish this task, most of them are tailored towards the conversion of single genome coordinates. When converting the boundary positions of segments spanning larger genome regions, segments may be mapped into smaller sub-segments if the original segment’s continuity is disrupted in the target assembly. Continue reading

ELIXIR Beacon - A Driver Project for the Global Alliance for Genomics and Health

Abstract: ELIXIR All Hands 2018

Michael Baudis for the ELIXIR Beacon Project

The core mission of the Global Alliance for Genomics and Health is to "...enable genomic data sharing for the benefit of human health". One of the instruments to enact this mission is the selection and support of driver projects, which address particular scientific, technical, regulatory or security related aspects of federated access to human genomes and related metadata.

The "Beacon" project had been initiated at the first GA4GH plenary in 2014, to test the willingness of genome resource providers to allow web-based queries in the most simple format, against aggregated genome variant data. While this particular model had limited practical utility, it established the foundation for automated queries against world-wide data resources, using a unified protocol. A secondary aspect of the Beacon project was the, intentional, challenge of existing notions of data security and privacy protection, including possible re-identification attacks and risk mitigation scenarios.

Now continued through the ELIXIR-Beacon GA4GH driver project, extensions to the Beacon protocol will provide real-world utility for genome resource access under a number of different usage scenarios. With its support for developing the Beacon protocol and distributing it throughout its members, ELIXIR is providing one of the first, widespread implementations of a GA4GH related products. Additionally, the upcoming version of the Beacon protocol is expected to be one of the first official standards which underwent the GA4GH product approval process, facilitating widespread international adoption.

Continue reading

Biocuration for cancer genome databases: arrayMap and Progenetix

Abstract: ELIXIR All Hands 2018

Paula Carrio Cordo and Michael Baudis

Screening for somatic mutations in cancer has become integral to diagnostic and target evaluation for personalized therapeutic approaches. arrayMap is a curated oncogenomic resource, focusing on copy number aberration (CNA) profiles derived from genomic arrays. The information has been processed from data accessed through NCBI’s Gene Expression Omnibus (GEO), EBI’s ArrayExpress, and, importantly, through targeted mining of publication data. Whereas this database is based on raw probe data sets, the parental project, Progenetix, allows for genome variant analysis from additional sources and serves as metadata reference. Continue reading

ELIXIR All Hands 2018 - arrayMap

Abstract: ELIXIR All Hands 2018

Update on arrayMap Cancer Genome Resource

Qingyao Huang and Michael Baudis

arrayMap is a cancer-related genome profile database, curated from public data repositories. The resources’s data processing pipelines handle the homogeneous conversion of available raw data from various genomic array platforms (e.g. cCGH, SNP) and represents pre-computed copy number variation (CNV) profiles, allowing the evaluation of target gene involvement as well as the comparative analysis of whole-genome CNV patterns, e.g. for assessing between-sample CNV heterogeneity and associate sets of similar patterns with metadata qualifiers such as diagnostic classifications. In addition to the representation of cancer-related samples, we also plan to generate copy number profiles from non-cancer samples, in an effort to represent population-related CNV patterns. Continue reading

Genomic profiles of cancer cell lines - A systematic review

Abstract: ELIXIR All Hands 2018

Rahel Paloots, Qingyao Huang, Paula Carrio Cordo and Michael Baudis

Human cell lines are convenient model systems in cancer research, for validation of proposed molecular mechanisms as well as to evaluate potential therapeutic approaches, e.g. through high-throughput screening of potential antitumor compounds against cancer cell line panels. Thousands of established cell lines are available from commercial or academic providers, covering a wide selection of cancer types. However, conclusions about biological pathways or pharmacological potential depend on a close molecular relation between the cancer type represented and the cell line model used for analyses. Continue reading

Krüppel-Like Factor 10 participates in cervical cancer immunoediting through transcriptional regulation of Pregnancy-Specific Beta-1 Glycoproteins

Marrero-Rodríguez D, Taniguchi-Ponciano K, Subramaniam M, Hawse JR, Pitel KS, Arreola-De la Cruz H, Huerta-Padilla V, Ponce-Navarrete G, Figueroa-Corona MDP, Gomez-Virgilio L, Martinez-Cuevas TI, Mendoza-Rodriguez M, Rodriguez-Esquivel M, Romero-Morelos P, Ramirez-Salcedo J, Baudis M, Meraz-Rios M, Jimenez-Vega F, Salcedo M.

Abstract Cervical cancer (CC) is associated with alterations in immune system balance, which is primarily due to a shift from Th1 to Th2 and the unbalance of Th17/Treg cells. Using in silico DNA copy number analysis, we have demonstrated that ~20% of CC samples exhibit gain of 8q22.3 and 19q13.31; the regions of the genome that encodes the KLF10 and PSG genes, respectively. Continue reading

segment_liftover...

segment_liftover : a Python tool to convert segments between genome assemblies.

Gao B, Huang Q and Baudis M

Abstract The process of assembling a species’ reference genome may be performed in a number of iterations, with subsequent genome assemblies differing in the coordinates of mapped elements. The conversion of genome coordinates between different assemblies is required for many integrative and comparative studies. While currently a number of bioinformatics tools are available to accomplish this task, most of them are tailored towards the conversion of single genome coordinates. When converting the boundary positions of segments spanning larger genome regions, segments may be mapped into smaller sub-segments if the original segment’s continuity is disrupted in the target assembly. Such a conversion may lead to a relevant degree of data loss in some circumstances such as copy number variation (CNV) analysis, where the quantitative representation of a genomic region takes precedence over base-specific accuracy. segment_liftover aims at continuity-preserving remapping of genome segments between assemblies and provides features such as approximate locus conversion, automated batch processing and comprehensive logging to facilitate processing of datasets containing large numbers of structural genome variation data.

Online
Continue reading

2018 02 15 talk DPPH michael

DPPH - Data Protection in Personalized Health 2018, Lausanne

Genome Beacons for Data Discovery - Technical advances in a challenging environment

Michael Baudis

Abstract

The core mission of the Global Alliance for Genomics and Health is to "...enable genomic data sharing for the benefit of human health". One of the instruments to enact this mission is the selection and support of driver projects, which address particular scientific, technical, regulatory or security related aspects of federated access to human genomes and related metadata. The "Beacon" project had been initiated at the first GA4GH plenary in 2014, to test the willingness of genome resource providers to allow web-based queries in the most simple format, against aggregated genome variant data. While this particular model had limited practical utility, it established the foundation for automated queries against world-wide data resources, using a unified protocol. A secondary aspect of the Beacon project was the, intentional, challenge of existing notions of data security and privacy protection, including possible re-identification attacks and risk mitigation scenarios. Now continued through the ELIXIR-Beacon GA4GH driver project, extensions to the Beacon protocol will provide real-world utility for genome resource access under a number of different usage scenarios, some of which will be addressed in this presentation.

Continue reading

2017 12 07 talk timisoara michael

Special seminar at University of Timisoara Dermatology

Cancer Genome (Data) Analysis in Personalised Medicine

Michael Baudis

Abstract

General presentation about challenges and possibilities of cancer genome screening for medical applications.

Continue reading

2017 12 01 talk lszgs michael

UZH Cancer Biology course "Research with clinical samples

Personalised Medicine - Genome Variation / Data Formats / Resources / Sharing / Privacy

Michael Baudis

Abstract

Personalised Medicine - Genome Variation / Data Formats / Resources / Sharing / Privacy, presented for PhD program in Cancer Biology

Continue reading

Developing Beacons for Data Discovery

GA4GH 2017 Plenary - Workstream Day

Michael Baudis

  • presenting concepts on how to advance the Beacon protcol for practical use cases
Continue reading

Integrated Molecular...

Integrated Molecular Meta-Analysis of 1,000 Pediatric High-Grade and Diffuse Intrinsic Pontine Glioma.

Mackay A, Burford A, Carvalho D, Izquierdo E, Fazal-Salom J, Taylor KR, Bjerke L, Clarke M, Vinci M, Nandhabalan M, Temelso S, Popov S, Molinari V, Raman P, Waanders AJ, Han HJ, Gupta S, Marshall L, Zacharoulis S, Vaidya S, Mandeville HC, Bridges LR, Martin AJ, Al-Sarraj S, Chandler C, Ng HK, Li X, Mu K, Trabelsi S, Brahim DH, Kisljakov AN, Konovalov DM, Moore AS, Carcaboso AM, Sunol M, de Torres C, Cruz O, Mora J, Shats LI, Stavale JN, Bidinotto LT, Reis RM, Entz-Werle N, Farrell M, Cryan J, Crimmins D, Caird J, Pears J, Monje M, Debily MA, Castel D, Grill J, Hawkins C, Nikbakht H, Jabado N, Baker SJ, Pfister SM, Jones DTW, Fouladi M, von Bueren AO, Baudis M, Resnick A, Jones C.

Abstract We collated data from 157 unpublished cases of pediatric high-grade glioma and diffuse intrinsic pontine glioma and 20 publicly available datasets in an integrated analysis of >1,000 cases. We identified co-segregating mutations in histone-mutant subgroups including loss of FBXW7 in H3.3G34R/V, TOP3A rearrangements in H3.3K27M, and BCOR mutations in H3.1K27M. Histone wild-type subgroups are refined by the presence of key oncogenic events or methylation profiles more closely resembling lower-grade tumors. Genomic aberrations increase with age, highlighting the infant population as biologically and clinically distinct. Uncommon pathway dysregulation is seen in small subsets of tumors, further defining the molecular diversity of the disease, opening up avenues for biological study and providing a basis for functionally defined future treatment stratification.

Continue reading

UZH BIO390 - Introduction to Bioinformatics

Michael Baudis

Abstract

First lecture in the UZH BIO390 "Introduction to Bioinformatics" series, introducing concepts and scope of bioinformatics as a field.

Continue reading

Advancing the Global Alliance for Genomics and Health data schemas through data-driven implementations

[BC]2 2017 - Basel Computational Biology Conference

###$ Michael Baudis

Abstract

The core mission of the Global Alliance for Genomics and Health (GA4GH) is to advance the utility of biomedical data through facilitating of federated analyses, across geographic, regulatory and technical boundaries. A central component is the development of standardised data schemas and application programming interfaces (APIs), for molecular screening data as well as biological and technical metadata. Continue reading

Somatic CNV databases as resources for cancer research && GA4GH schema development

Seminar at SDU Odense

Michael Baudis

  • Presentation about pProgenetix and the group's research projects
Continue reading

GA4GH Metadata & Beyond

ELIXIR All Hands, Rome

Michael Baudis

Presentation about GA4GH metadata standards work

Continue reading

Personalized medicine in cancer

Seminar - Stanford Center for Genomics & Personalized Medicine

Michael Baudis

Abstract

Some reasoning for the need of genome profiling data in cancer and reference samples, and our own work on structural aberration data in cancer as well as contributions to the projects and standard definitions of the Global Alliance for Genomics and Health

Continue reading

Implementing Data Models for the Global Alliance for Genomics and Health

Brupbacher Meeting 2017

Bo Gao

Abstract

Poster about our implementation studies for testing and developing data schemas and resources for the Global Alliance for Genomics and Health, based on data from our arrayMap resource

Continue reading

Genome data access - Application to personalized health and cancer research

R&D Data Intelligence Leaders Forum Basel

Michael Baudis

Abstract
  • Discussing genome data access, with focus ion cancer, GA4GH and own contributions
Continue reading

Cancer Genome (Data) Analysis

Antrittsvorlesung Universität Zürich - Inaugural Lecture at University of Zurich

Michael Baudis

Abstract
  • General lecture introducing the "Why"s of cancer genome analysis and some specific problems, as well as some of our own contributions
Continue reading

Personalized medicine in cancer

LSZGS Cancer Biology course 2016

Michael Baudis

Abstract

Lecture as part of the LSZGS Cancer Biology PhD program, touching some aspects of cancer genome data analysis (why, amount and cost of genome sequencing, genome variation, privacy & data sharing ...

Continue reading

CNARA: reliability assessment for genomic copy number profiles

Ai N, Cai H, Solovan C, Baudis M.

Abstract DNA copy number profiles from microarray and sequencing experiments sometimes contain wave artefacts which may be introduced during sample preparation and cannot be removed completely by existing preprocessing methods. Besides, large derivative log ratio spread (DLRS) of the probes correlating with poor DNA quality is sometimes observed in genome screening experiments and may lead to unreliable copy number profiles. Depending on the extent of these artefacts and the resulting misidentification of copy number alterations/variations (CNA/CNV), it may be desirable to exclude such samples from analyses or to adapt the downstream data analysis strategy accordingly.Here, we propose a method to distinguish reliable genomic copy number profiles from those containing heavy wave artefacts and/or large DLRS. We define four features that adequately summarize the copy number profiles for reliability assessment, and train a classifier on a dataset of 1522 copy number profiles from various microarray platforms. The method can be applied to predict the reliability of copy number profiles irrespective of the underlying microarray platform and may be adapted for those sequencing platforms from which copy number estimates could be computed as a piecewise constant signal. Further details can be found at github.com/baudisgroup/CNARA .We have developed a method for the assessment of genomic copy number profiling data, and suggest to apply the method in addition to and after other state-of-the-art noise correction and quality control procedures. CNARA could be instrumental in improving the assessment of data used for genomic data mining experiments and support the reliable functional attribution of copy number aberrations especially in cancer research.

Continue reading

PKCα and HMGB1 antagonistically control hydrogen peroxide-induced poly-ADP-ribose formation

PKCα and HMGB1 antagonistically control hydrogen peroxide-induced poly-ADP-ribose formation.

Andersson A, Bluwstein A, Kumar N, Teloni F, Traenkle J, Baudis M, Altmeyer M, Hottiger MO.

Abstract Harmful oxidation of proteins, lipids and nucleic acids is observed when reactive oxygen species (ROS) are produced excessively and/or the antioxidant capacity is reduced, causing 'oxidative stress'. Nuclear poly-ADP-ribose (PAR) formation is thought to be induced in response to oxidative DNA damage and to promote cell death under sustained oxidative stress conditions. However, what exactly triggers PAR induction in response to oxidative stress is incompletely understood. Using reverse phase protein array (RPPA) and in-depth analysis of key stress signaling components, we observed that PAR formation induced by H2O2 was mediated by the PLC/IP3R/Ca(2+)/PKCα signaling axis. Mechanistically, H2O2-induced PAR formation correlated with Ca(2+)-dependent DNA damage, which, however, was PKCα-independent. In contrast, PAR formation was completely lost upon knockdown of PKCα, suggesting that DNA damage alone was not sufficient for inducing PAR formation, but required a PKCα-dependent process. Intriguingly, the loss of PAR formation observed upon PKCα depletion was overcome when the chromatin structure-modifying protein HMGB1 was co-depleted with PKCα, suggesting that activation and nuclear translocation of PKCα releases the inhibitory effect of HMGB1 on PAR formation. Together, these results identify PKCα and HMGB1 as important co-regulators involved in H2O2-induced PAR formation, a finding that may have important relevance for oxidative stress-associated pathophysiological conditions.

Continue reading

Harvesting Cancer Genome Data

Biocuration 2016, Geneva

Michael Baudis

Abstract

While the analysis of cancer genomes using high-throughput technologies has generated tens of thousands of oncogenomic profiles, meta-analyses of datasets is greatly inhibited through limited data access, technical fragmentation and a multitude of raw data and annotation formats.

For the arrayMap cancer genome resource, our group collects, re-processes and annotates cancer and associated reference data from genomic array experiments, retrieved from public repositories (e.g. NCBI GEO, EBI ArrayExpress) as well as from publication supplements and through direct requests to the primary producers. Continue reading

Gene-based analysis of focal copy number aberration patterns in cancer genomes

VIB Conference "Applied Bioinformatics in Life Sciences"

Saumya Gupta

Poster about the project to evaluate focal copy number variants in cancer, based on data from arraymap.org

Continue reading

The SIB Swiss Institute of Bioinformatics' resources: focus on curated databases

SIB Swiss Institute of Bioinformatics Members.

Abstract The SIB Swiss Institute of Bioinformatics (www.isb-sib.ch) provides world-class bioinformatics databases, software tools, services and training to the international life science community in academia and industry. These solutions allow life scientists to turn the exponentially growing amount of data into knowledge. Here, we provide an overview of SIB's resources and competence areas, with a strong focus on curated databases and SIB's most popular and widely used resources. In particular, SIB's Bioinformatics resource portal ExPASy features over 150 resources, including UniProtKB/Swiss-Prot, ENZYME, PROSITE, neXtProt, STRING, UniCarbKB, SugarBindDB, SwissRegulon, EPD, arrayMap, Bgee, SWISS-MODEL Repository, OMA, OrthoDB and other databases, which are briefly described in this article.

Continue reading

A Biobank Supporting Rare Disease Research In Dermatopathology. Our Experience In Establishing A Biobank.

Beleut M, Seclaman E, Baudis M, Nicula A, and Solovan C

RoJCED 2, 202-206 (2015)

Abstract Biobanks of human patient sample tissues and blood fractions are increasingly recognized as major assets in disease research. We aim to identify DNA copy number and gene expression aberrations typical of different cutaneous pathologies. Another goal is the identification of circulating biomarkers both as prognostic, therapy- responsive and/or therapy-monitoring factors and as disease classifiers and subclassifiers.

Continue reading

Genomic instability of osteosarcoma cell lines in culture: impact on the prediction of metastasis relevant genes

Muff R, Rath P, Ram Kumar RM, Husmann K, Born W, Baudis M, Fuchs B.

Abstract Osteosarcoma is a rare but highly malignant cancer of the bone. As a consequence, the number of established cell lines used for experimental in vitro and in vivo osteosarcoma research is limited and the value of these cell lines relies on their stability during culture. Here we investigated the stability in gene expression by microarray analysis and array genomic hybridization of three low metastatic cell lines and derivatives thereof with increased metastatic potential using cells of different passages.The osteosarcoma cell lines showed altered gene expression during in vitro culture, and it was more pronounced in two metastatic cell lines compared to the respective parental cells. Chromosomal instability contributed in part to the altered gene expression in SAOS and LM5 cells with low and high metastatic potential. To identify metastasis-relevant genes in a background of passage-dependent altered gene expression, genes involved in "Pathways in cancer" that were consistently regulated under all passage comparisons were evaluated. Genes belonging to "Hedgehog signaling pathway" and "Wnt signaling pathway" were significantly up-regulated, and IHH, WNT10B and TCF7 were found up-regulated in all three metastatic compared to the parental cell lines.Considerable instability during culture in terms of gene expression and chromosomal aberrations was observed in osteosarcoma cell lines. The use of cells from different passages and a search for genes consistently regulated in early and late passages allows the analysis of metastasis-relevant genes despite the observed instability in gene expression in osteosarcoma cell lines during culture.

Continue reading

arrayMap 2014: an updated cancer genome resource

Cai H, Gupta S, Rath P, Ai N, Baudis M.

Abstract Somatic copy number aberrations (CNA) represent a mutation type encountered in the majority of cancer genomes. Here, we present the 2014 edition of arrayMap (www.arraymap.org), a publicly accessible collection of pre-processed oncogenomic array data sets and CNA profiles, representing a vast range of human malignancies. Since the initial release, we have enhanced this resource both in content and especially with regard to data mining support. The 2014 release of arrayMap contains more than 64,000 genomic array data sets, representing about 250 tumor diagnoses. Continue reading

Biopsying parapsoriasis: quo vadis? Are morphological stains enough or are ancillary tests needed?

Baderca F, Chiticariu E, Baudis M, Solovan C.

Abstract BACKGROUND: Parapsoriasis represents a group of cutaneous disorders that shows variable clinical aspects somehow resembling to psoriasis, how is reflecting by its name. It was first named by Brocq, in 1902, as an entity with three components: pityriasis lichenoides, small plaque parapsoriasis and large plaque parapsoriasis. Nowadays, under the name of parapsoriasis are included only the last two categories, that are considered disorders characterized by the presence of a mononuclear infiltrate in the dermis, composed of T-cells. Until now, there were not established pathognomonic histopathological features to diagnose parapsoriasis. Continue reading