The vast majority of human genes are suppressed post-transcriptionally by miRNAs, a small non-coding RNA that plays a crucial role in human gene regulation1. Multiple studies have demonstrated a strong correlation between miRNA dysregulation and a broad spectrum of diseases, including cancers, neurological disorders, and cardiovascular diseases2-3. A comprehensive map of miRNA functions, miRNA-target repression, is necessary to advance our knowledge of miRNA biological function and their clinical applications4-5.
Computational models have been developed to facilitate the identification of miRNA-target mRNA suppression activities6-8. However, the existing models have exhibited limited predictive capability and applicability. In this study, we developed PRIMITI, a PRedictive model for identifying novel MIRNA-Target mRNA Interactions. Machine learning was implemented to predict miRNA-mRNA suppression activities by modelling molecular interactions between miRNAs and target sites in 3’-untranslated regions (3’-UTR). We introduced a negative sample selection to mitigate a publication bias, improving the reliability of negative samples. PRIMITI additionally combines information on single polymorphisms (SNPs) and physicochemical properties to improve target site characterisation.
The model achieved robust predictive performance, with Matthews correlation coefficients of up to 0.81 for miRNA-target site identification and up to 0.77 for mRNA repression activity prediction on cross-validation and independent blind test sets. PRIMITI was evaluated further using an external blind microarray dataset9 and experimentally validated lists of miRNA-mRNA interactions10-11, outperforming existing state-of-the-art approaches. PRIMITI is publicly available through a user-friendly web server at https://biosig.lab.uq.edu.au/primiti, making it an invaluable resource for preliminary screening of miRNA-mediated repression.