The Next Webinar on the 8th of May at 2 pm
Speakers:
Dr. Jing Tang from the University of Helsinki
Topic will be announced soon!

Dr. Jing Tang from the University of Helsinki
Title will be announced soon.
Abstract: Will be announced soon.
Past webinars

Dr. Jing Tang from the University of Helsinki
Title will be announced soon.
Abstract:
Will be announced soon.
.
Alexandru Tomescu from the University of Helsinki
Decomposing a weighted graph into few paths for solving multi-assembly problems
Abstract: A multi-assembly problem usually asks to decompose a weighted graph into a small number of weighted paths. For example, in the RNA transcript assembly problem, the graph nodes are gene exons, the weights are read coverages, and the edges indicate connections between exons observed in the RNA-seq reads. The weighted paths are then the RNA transcripts (together with their expression levels), that need to be identified from the RNA-seq reads. Such problems easily become NP-hard, and thus many RNA assembly tools solve them heuristically. In this talk I will give an overview of our recent results on extremely fast and exact solvers for a range of such decomposition problems. These are based on Mixed Integer Linear Programming, and benefit from additional “safety optimizations” based on insight into the “simple” parts of the graph structure. These results are currently being incorporated into a Python package, so that developers of future multi-assembly tools can easily use exact and accurate decomposition methods.
Chengbo Fu from Aalto University
KMAP: Kmer Manifold Approximation and Projection for visualizing DNA sequences
Abstract:
Abstract: Identifying and illustrating patterns in DNA sequences is a crucial task in various biological data analyses. In this task, patterns are often represented by sets of kmers, the fundamental building blocks of DNA sequences. To visually unveil these patterns, we could project each kmer onto a point in two-dimensional (2D) space. However, this projection poses challenges due to the high-dimensional nature of kmers and their unique mathematical properties. Here, we established a mathematical system to address the peculiarities of the kmer manifold. Leveraging this kmer manifold theory, we developed a statistical method named KMAP for detecting kmer patterns and visualizing them in 2D space. We applied KMAP to three distinct datasets to showcase its utility. KMAP achieved a comparable performance to the classical method MEME, with approximately 90% similarity in motif discovery from HT-SELEX data. In the analysis of H3K27ac ChIP-seq data from Ewing Sarcoma (EWS), we found that BACH1, OTX2 and ERG1 might affect EWS prognosis by binding to promoter and enhancer regions across the genome. We also found that FLI1 bound to the enhancer regions after ETV6 degradation, which showed the competitive binding between ETV6 and FLI1. Moreover, KMAP identified four prevalent patterns in gene editing data of the AAVS1 locus, aligning with findings reported in the literature. These applications underscore that KMAP could be a valuable tool across various biological contexts.
Thaddeus Wu: James Cook University, Australia
Behind the Transcriptomic Complexity
Abstract:
Recent advances in single-cell long-read sequencing have revolutionised our understanding of transcriptomic complexity. However, traditional analysis pipelines, designed for short-read sequencing, fail to capture the full complexity of isoform-level information. In this talk, I will introduce our ongoing project – a novel computational framework – that addresses these challenges. Using early blood development as a model system, we demonstrate the interesting findings and insights to understand the transcriptomic complexity

Luca Musella: University Hospital of Erlangen at Julio Vera’s lab, Germany
ENQUIRE reconstructs and expands context-specific co-occurrence networks from biomedical literature
Abstract: The accelerating growth of scientific literature overwhelms our capacity to manually distill complex phenomena like molecular networks linked to diseases. Moreover, biases in biomedical research and database annotation limit our interpretation of facts and generation of hypotheses. ENQUIRE (Expanding Networks by Querying Unexpectedly Inter-Related Entities) offers a time- and resource-efficient alternative to manual literature curation and database mining. ENQUIRE reconstructs and expands co-occurrence networks of genes and biomedical ontologies from user-selected input corpora and network-inferred PubMed queries. The integration of text mining, automatic querying, and network-based statistics mitigating literature biases makes ENQUIRE unique in its broad-scope applications. For example, ENQUIRE can generate co-occurrence gene networks that reflect high-confidence, functional networks. When tested on case studies spanning cancer, cell differentiation, and immunity, ENQUIRE identified interlinked genes and enriched pathways unique to each topic, thereby preserving their underlying diversity. ENQUIRE supports biomedical researchers by easing literature annotation, boosting hypothesis formulation, and facilitating the identification of molecular targets for subsequent experimentation.