| Step | Tool / Method | Output Folder |
|---|---|---|
| Adapter trimming | miRDeep2 (mapper.pl) | 00.TrimData/ |
| Read QC | FastQC + MultiQC | 01.FastqQualityCheck/ |
| Read collapsing & genome mapping | mapper.pl (miRDeep2) + Bowtie | 02.mapping/ |
| miRNA quantification | miRDeep2.pl (miRBase v22, GRCm39) | 03.mirdeep2/ |
| Count & CPM matrix generation | Custom R script | 04.Counts/ |
| Differential expression + GO & KEGG enrichment on target genes | DESeq2 + clusterProfiler (R) | 05.DE-GO-KEGG/ |
This project comprises 27 mouse (Mus musculus) samples divided into six treatment groups (A–F): Group A (n=4, Sample-01–04), Group B (n=4, Sample-05–08), Group C (n=4, Sample-09–12), Group D (n=6, Sample-13–18), Group E (n=5, Sample-19–23), and Group F (n=4, Sample-24–27). Libraries were prepared using the Qiagen Small RNA Kit and sequenced on an Illumina platform.
mapping.statistics.
| Sample | Group | Total Reads (M) | Filtered Reads (M) | Filter Rate | Mapped Reads (M) | Overall Mapping Rate | Filtered Mapping Rate | Detected miRNAs |
|---|---|---|---|---|---|---|---|---|
| Sample-01 | Group A | 9.52 | 8.69 | 6.99 | 80.53% | 622 | ||
| Sample-02 | Group A | 10.09 | 9.25 | 7.02 | 75.88% | 626 | ||
| Sample-03 | Group A | 7.61 | 6.98 | 5.49 | 78.77% | 632 | ||
| Sample-04 | Group A | 8.40 | 7.95 | 6.62 | 83.22% | 579 | ||
| Sample-05 | Group B | 9.05 | 8.14 | 6.65 | 81.70% | 627 | ||
| Sample-06 | Group B | 9.63 | 8.99 | 7.39 | 82.20% | 613 | ||
| Sample-07 | Group B | 6.72 | 6.14 | 4.70 | 76.56% | 570 | ||
| Sample-08 | Group B | 7.15 | 6.66 | 5.50 | 82.60% | 606 | ||
| Sample-09 | Group C | 10.05 | 9.14 | 7.58 | 82.95% | 607 | ||
| Sample-10 | Group C | 10.46 | 9.63 | 7.59 | 78.78% | 595 | ||
| Sample-11 | Group C | 8.75 | 8.11 | 6.41 | 79.00% | 607 | ||
| Sample-12 | Group C | 11.42 | 10.52 | 8.34 | 79.29% | 597 | ||
| Sample-13 | Group D | 8.48 | 8.01 | 6.74 | 84.15% | 601 | ||
| Sample-14 | Group D | 9.72 | 9.16 | 7.68 | 83.81% | 614 | ||
| Sample-15 | Group D | 8.55 | 8.08 | 6.73 | 83.27% | 611 | ||
| Sample-16 | Group D | 9.67 | 9.17 | 7.77 | 84.79% | 606 | ||
| Sample-17 | Group D | 9.31 | 8.70 | 7.35 | 84.55% | 630 | ||
| Sample-18 | Group D | 7.53 | 6.84 | 5.82 | 85.04% | 580 | ||
| Sample-19 | Group E | 8.87 | 7.17 | 5.30 | 73.88% | 589 | ||
| Sample-20 | Group E | 7.53 | 6.24 | 4.68 | 74.91% | 598 | ||
| Sample-21 | Group E | 10.07 | 8.77 | 6.41 | 73.13% | 618 | ||
| Sample-22 | Group E | 7.48 | 6.68 | 5.41 | 80.98% | 596 | ||
| Sample-23 | Group E | 9.57 | 8.94 | 7.29 | 81.53% | 634 | ||
| Sample-24 | Group F | 8.07 | 7.58 | 6.31 | 83.25% | 628 | ||
| Sample-25 | Group F | 8.36 | 7.74 | 6.50 | 83.94% | 588 | ||
| Sample-26 | Group F | 6.34 | 5.60 | 4.64 | 82.78% | 561 | ||
| Sample-27 | Group F | 8.22 | 7.74 | 6.45 | 83.36% | 563 |
03.mirdeep2/.
03.mirdeep2/pdfs_*/mmu-let-7b.pdf
| File / Folder | Description |
|---|---|
expression_*.html | Per-sample miRDeep2 expression visualisation (interactive HTML) |
result_*.html | Per-sample miRDeep2 result summary including novel miRNA candidates |
pdfs_*/ | PDF hairpin structure plots for detected miRNAs |
04.Counts/.
| Matrix | File | Description |
|---|---|---|
| Raw counts | 04.Counts/rawCount/rawCount--matrix_with_all_samples.txt | Raw miRNA read counts from miRDeep2 |
| Normalised CPM | 04.Counts/normalizedCount/normalizedCount--matrix_with_all_samples.txt | RPM-normalised values from miRDeep2 |
| Per-sample raw | 04.Counts/rawCount/*_count.txt | Individual sample raw count files |
| Per-sample CPM | 04.Counts/normalizedCount/*_normalizedCount.txt | Individual sample normalised count files |
Across all 27 samples, the number of detected miRNAs (raw count > 0) was highly consistent, reflecting uniform library quality. Summary statistics are shown below.
| Statistic | Value |
|---|---|
| Number of samples | 27 |
| Mean | 603.6 |
| Median | 606.0 |
| Standard deviation | 20.9 |
| Minimum | 561 |
| Q1 (25th percentile) | 589 |
| Q3 (75th percentile) | 622 |
| Maximum | 634 |
| IQR | 33 |
05.DE-GO-KEGG/.
| Comparison | Total Significant miRNAs | Up-regulated | Down-regulated |
|---|---|---|---|
| Group A vs D | 96 | ↑ 47 | ↓ 49 |
| Group B vs A | 201 | ↑ 113 | ↓ 88 |
| Group B vs E | 151 | ↑ 96 | ↓ 55 |
| Group C vs A | 275 | ↑ 141 | ↓ 134 |
| Group C vs F | 112 | ↑ 67 | ↓ 45 |
| Group E vs D | 178 | ↑ 62 | ↓ 116 |
| Group F vs D | 208 | ↑ 106 | ↓ 102 |
A_vs_D_DE/output-enhancedVolcanoPlot.pdf
Volcano-Plot--A_vs_D--show-gene-names.html), hovering over each point displays the miRNA name, log₂ fold-change, and p-value. Source: A_vs_D_DE/Volcano-Plot--A_vs_D--show-gene-names.html
A_vs_D_DE/output-PCA.pdf
A_vs_D_DE/output-BetweenSampleDis.pdf
A_vs_D_DE/output-heatmap-50-top-genes--only-check-expression-variance-no-pvalue-filter-applied.pdf
A_vs_D_DE/GO-ALL-TargetGenes/output-BiologicalProcess-barplot.pdf
A_vs_D_DE/GO-ALL-TargetGenes/output-kegg-barplot.pdf
| File | Description |
|---|---|
output-AnalysisResult.csv | Full DE table, all miRNAs, sorted by adjusted p-value |
output-AnalysisResult-sig.csv | Significant DE miRNAs (p < 0.05, |log₂FC| > 0) |
output-AnalysisResult-sig-upregulated.csv | Significant up-regulated miRNAs |
output-AnalysisResult-sig-downregulated.csv | Significant down-regulated miRNAs |
output-normalized-count.csv | DESeq2 size-factor normalised counts |
output-PCA.pdf / output-PCA-data.csv | PCA on variance-stabilised data |
output-heatmap*.pdf | Sample distance, gene-sample, and top-gene heatmaps |
output-enhancedVolcanoPlot.pdf / Volcano-Plot-*.html | Static and interactive volcano plots |
output-Pearson-correlation-of-top-2000-genes.pdf | Pearson correlation heatmap |
output-BetweenSampleDis.pdf | Between-sample distance heatmap |
GO-ALL-TargetGenes/ GO-UP-TargetGenes/ GO-DOWN-TargetGenes/ | GO & KEGG enrichment results (all, up-regulated, down-regulated) |
miRNA-seq libraries were prepared using the Qiagen Small RNA Library Kit. Single-end sequencing was performed on an Illumina platform. Only trimmed R1 reads were used for downstream miRNA analysis.
Adapter trimming was performed using miRDeep2 (mapper.pl). Read quality was assessed using FastQC and aggregated with MultiQC.
Trimmed R1 reads were processed with mapper.pl (miRDeep2), which removes reads with non-canonical nucleotides, collapses identical reads, and aligns them to the mouse genome (GRCm39) using Bowtie. Key parameters:
| Parameter | Value | Description |
|---|---|---|
-e | — | Input is a FASTQ file |
-j | — | Remove reads with non-canonical nucleotides |
-l | 18 | Minimum read length after collapsing: 18 bp |
-m | — | Collapse reads before mapping |
-o | 4 | Maximum number of genomic mapping loci per read |
-n | — | Retain read name information |
-p | GRCm39 Bowtie index | Reference genome Bowtie index |
miRNA expression was quantified with miRDeep2.pl using mouse miRBase reference files (GRCm39 coordinates):
| Reference | Description |
|---|---|
| Mature miRNAs | Mus musculus mature miRNA sequences (miRBase) |
| Close-species miRNAs | Mature sequences from related rodent species for homology-based detection |
| Hairpin precursors | Mus musculus hairpin precursor sequences (miRBase) |
| Genome FASTA | GRCm39 primary assembly |
Raw read counts (column 2) and RPM-normalised values (column 6) were extracted from the miRDeep2 expression file for each sample and merged into cross-sample matrices using a custom R script via full outer join on miRNA ID, with missing values set to 0.
DE analysis was performed using DESeq2 (R/Bioconductor) with raw counts, design = ~condition. Size-factor normalisation and negative-binomial model fitting applied with DESeq().
Significance thresholds:
Variance-stabilising transformation (VST) was applied for all visualisations (PCA, heatmaps, sample correlation). Plots include volcano plots, heatmaps, PCA, sample distance maps, and p-value histograms.
Significant DE miRNAs were converted to predicted target genes. GO and KEGG enrichment analysis was performed on target genes (full set, up-regulated, and down-regulated subsets) using clusterProfiler (R/Bioconductor). Results are in:
GO-ALL-TargetGenes/ — GO & KEGG enrichment on all significant DE miRNA target genesGO-UP-TargetGenes/ — GO & KEGG enrichment on up-regulated miRNA target genesGO-DOWN-TargetGenes/ — GO & KEGG enrichment on down-regulated miRNA target genes| Tool | Version | Purpose |
|---|---|---|
| miRDeep2 (mapper.pl) | — | Adapter trimming |
| FastQC | v0.12.1 | Read quality control |
| MultiQC | v1.28 | FastQC report aggregation |
| Bowtie | v1.3.1 | Read alignment to GRCm39 |
| mapper.pl (miRDeep2) | — | Read collapsing and genome mapping |
| miRDeep2.pl | — | miRNA quantification (known + novel) |
| R / DESeq2 | v1.46.0 | Differential expression analysis |
| R / clusterProfiler | v4.14.4 | GO & KEGG enrichment analysis |