miRBench: novel benchmark datasets for microRNA binding site prediction that mitigate against prevalent microRNA frequency class bias
| Authors | |
|---|---|
| Year of publication | 2025 |
| Type | Article in Periodical |
| Magazine / Source | Computational systems bioinformatics / Life Sciences Society. Computational Systems Bioinformatics Conference |
| MU Faculty or unit | |
| Citation | |
| web | https://academic.oup.com/bioinformatics/article/41/Supplement_1/i542/8199406?login=true |
| Doi | https://doi.org/10.1093/bioinformatics/btaf233 |
| Keywords | miRNA; THERAPEUTICS; RNAS |
| Description | MicroRNAs (miRNAs) are crucial regulators of gene expression, but the precise mechanisms governing their binding to target sites remain unclear. A major contributing factor to this is the lack of unbiased experimental datasets for training accurate prediction models. While recent experimental advances have provided numerous miRNA–target interactions, these are solely positive interactions. Generating negative examples in silico is challenging and prone to introducing biases, such as the miRNA frequency class bias identified in this work. Biases within datasets can compromise model generalization, leading models to learn dataset-specific artifacts rather than true biological patterns. |
| Related projects: |