miRBench: novel benchmark datasets for microRNA binding site prediction that mitigate against prevalent microRNA frequency class bias

Warning

This publication doesn't include Institute of Computer Science. It includes Central European Institute of Technology. Official publication website can be found on muni.cz.
Authors

SAMMUT Stephanie GREŠOVÁ Katarína TZIMOTOUDIS Dimosthenis MARŠÁLKOVÁ Eva ČECHÁK David ALEXIOU Panagiotis

Year of publication 2025
Type Article in Periodical
Magazine / Source Computational systems bioinformatics / Life Sciences Society. Computational Systems Bioinformatics Conference
MU Faculty or unit

Central European Institute of Technology

Citation
web https://academic.oup.com/bioinformatics/article/41/Supplement_1/i542/8199406?login=true
Doi https://doi.org/10.1093/bioinformatics/btaf233
Keywords miRNA; THERAPEUTICS; RNAS
Description MicroRNAs (miRNAs) are crucial regulators of gene expression, but the precise mechanisms governing their binding to target sites remain unclear. A major contributing factor to this is the lack of unbiased experimental datasets for training accurate prediction models. While recent experimental advances have provided numerous miRNA–target interactions, these are solely positive interactions. Generating negative examples in silico is challenging and prone to introducing biases, such as the miRNA frequency class bias identified in this work. Biases within datasets can compromise model generalization, leading models to learn dataset-specific artifacts rather than true biological patterns.
Related projects:

You are running an old browser version. We recommend updating your browser to its latest version.

More info