Admera Health
Admera Health 126 Corporate Boulevard, South Plainfield, NJ 07080
+1-908-222-0533  ·  custom-services@admerahealth.com  ·  www.admerahealth.com
CLIA ID: 31D2038676

Small RNA-seq Quality Control Report

Project: Demo-Project  |  Differentiation Time Course
Project IDDemo-Project
SpeciesHomo sapiens (hg38)
Samples9
TimepointsGroup A · Group B · Group C · Group D
RNA Types AnalysedmiRNA · piRNA · snoRNA · snRNA · tRNA · circRNA
Report Generated2026-03-04

1. Project Overview

Total Samples
9
biological replicates
Total Raw Reads
~210M
across all samples (R1)
Read Length
151 bp
paired-end
Avg Mapping Rate
88.3%
STAR alignment
RNA Types
6
miRNA/piRNA/snoRNA/snRNA/tRNA/circRNA
DE Comparisons
6
per RNA type (36 total)
Study Design: Human Group A differentiation time course across four timepoints (Group A/Day0, Group B, Group C, Group D) with 2–3 biological replicates per group. Small RNA-seq libraries were sequenced on an Illumina platform (151 bp paired-end). Six ncRNA classes were quantified using featureCounts v2.0.6 and STAR 2.7.10b alignment.

Analysis Pipeline

StepToolOutput
Read QCFastQC 0.12.1 + MultiQC01.fastqc/
Adapter trimmingTrimming (pre-processed)00.TrimmedFastq/
AlignmentSTAR 2.7.10b (hg38)02.bam/
Mapping QCSTAR logs + MultiQC03.mappingQC/
Feature countingfeatureCounts v2.0.604.smallRNAcounts/
Differential expressionDESeq2 (R)05.DEanalysis/

2. Sample Information

#Sample IDLibrary IDGroupTimepointNote
1SampleA1SampleA1_S1_L002Group AGroup AReplicate I
2SampleA2SampleA2_S2_L002Group AGroup AReplicate II
3SampleB1SampleB1_S6_L002Group BGroup BBackup replicate
4SampleB2SampleB2_S7_L002Group BGroup BReplicate I
5SampleC1SampleC1_S8_L002Group CGroup CReplicate I
6SampleC2SampleC2_S0_L001Group CGroup CReplicate II
7SampleD1SampleD1_S1_L004Group DGroup DBackup replicate
8SampleD2SampleD2_S0_L001Group DGroup DReplicate I
9SampleD3SampleD3_S0_L001Group DGroup DReplicate II

3. Read Quality (FastQC)

Metrics below are from R1 reads. High duplication rates (75–88%) are expected for small RNA-seq libraries due to the limited diversity of small RNA species. All samples pass basic quality thresholds.

Full interactive report: 01.fastqc/multiqc_report.html
SampleGroupTotal Reads (M) % Duplicates% GCRead Length
SampleA1 Group A 22.38
84.4%
55%151 bp
SampleA2 Group A 20.10
80.8%
55%151 bp
SampleB1 Group B 25.49
88.0%
53%151 bp
SampleB2 Group B 23.15
86.9%
55%151 bp
SampleC1 Group C 31.98
84.4%
53%151 bp
SampleC2 Group C 22.46
81.7%
58%151 bp
SampleD1 Group D 22.37
86.0%
56%151 bp
SampleD2 Group D 20.55
83.3%
58%151 bp
SampleD3 Group D 20.92
83.9%
59%151 bp

4. Alignment Statistics (STAR)

Alignment performed against the human genome (hg38) using STAR 2.7.10b. High multi-mapping rates are characteristic of small RNA-seq because many small RNA sequences share sequence identity across loci.

Full interactive report: 03.mappingQC/multiqc_report.html
SampleGroup Total Reads (M) Mapped % Uniquely Mapped % Multi-mapped % Unmapped % Avg Read Length Mismatch Rate
SampleA1Group A 15.83
96.0%
7.3%88.7%4.0%33 bp0.50%
SampleA2Group A 11.25
91.3%
8.2%83.1%8.8%30 bp0.59%
SampleB1Group B 17.95
82.6%
12.6%70.0%17.4%38 bp0.57%
SampleB2Group B 15.95
88.3%
12.1%76.2%11.7%33 bp0.36%
SampleC1Group C 10.81
84.1%
27.7%56.3%15.9%34 bp0.27%
SampleC2Group C 9.04
88.5%
14.5%74.0%11.5%28 bp0.51%
SampleD1Group D 5.06
84.2%
25.0%59.2%15.8%39 bp0.31%
SampleD2Group D 6.26
88.1%
14.8%73.3%11.9%29 bp0.37%
SampleD3Group D 4.45
87.9%
14.5%73.4%12.1%31 bp0.38%

5. Feature Counting (featureCounts v2.0.6)

Reads were counted against six ncRNA annotation sets using featureCounts v2.0.6. Percentages shown are fraction of total alignments assigned to each RNA class. snRNA captures the largest proportion of reads (~34–58%), followed by circRNA (~8–13%) and piRNA (~3–9%). miRNA, snoRNA, and tRNA account for <3% each. Aggregated count and CPM matrices for all samples are available in 04.smallRNAcounts/Data_matrixes_with_all_samples/.
SampleGroup miRNA % piRNA % snoRNA % snRNA % tRNA % circRNA %
SampleA1Group A 0.08% 5.22% 0.04% 43.08% 0.47% 6.97%
SampleA2Group A 0.15% 4.72% 0.11% 54.78% 0.35% 8.33%
SampleB1Group B 0.02% 9.21% 0.14% 5.31% 0.16% 12.60%
SampleB2Group B 0.18% 5.65% 0.05% 43.58% 0.15% 8.55%
SampleC1Group C 0.17% 2.78% 0.04% 49.90% 0.09% 10.60%
SampleC2Group C 0.21% 5.48% 0.04% 57.90% 0.60% 8.43%
SampleD1Group D 0.17% 6.56% 0.09% 34.17% 1.34% 7.76%
SampleD2Group D 0.20% 5.33% 0.03% 56.34% 2.93% 8.38%
SampleD3Group D 0.20% 5.10% 0.05% 52.05% 2.97% 7.93%

Reference Feature Counts

RNA TypeFeatures in ReferenceCount Files Location
miRNA1,87804.smallRNAcounts/miRNA/
piRNA27,70004.smallRNAcounts/piRNA/
snoRNA94104.smallRNAcounts/snoRNA/
snRNA1,90904.smallRNAcounts/snRNA/
tRNA61904.smallRNAcounts/tRNA/
circRNA768,98604.smallRNAcounts/circRNA/

6. Differential Expression Analysis (DESeq2)

DE analysis was performed using DESeq2. Significance thresholds: p-value < 0.05 AND |log₂FC| > 0. Six comparisons were run per RNA type against the differentiation time course. Results, volcano plots, heatmaps, PCA, and sample correlation matrices are available under 05.DEanalysis/.

miRNA — Significant DE Genes

ComparisonTotal Sig.Up-regulatedDown-regulated
Group B vs Group A31↑ 8↓ 23
Group C vs Group B18↑ 12↓ 6
Group C vs Group A19↑ 8↓ 11
Group D vs Group B30↑ 18↓ 12
Group D vs Group C8↑ 3↓ 5
Group D vs Group A49↑ 16↓ 33

piRNA — Significant DE Genes

ComparisonTotal Sig.Up-regulatedDown-regulated
Group B vs Group A46↑ 9↓ 37
Group C vs Group B35↑ 15↓ 20
Group C vs Group A68↑ 13↓ 55
Group D vs Group B81↑ 40↓ 41
Group D vs Group C34↑ 20↓ 14
Group D vs Group A78↑ 24↓ 54

snoRNA — Significant DE Genes

ComparisonTotal Sig.Up-regulatedDown-regulated
Group B vs Group A49↑ 9↓ 40
Group C vs Group B11↑ 7↓ 4
Group C vs Group A34↑ 16↓ 18
Group D vs Group B21↑ 15↓ 6
Group D vs Group C5↑ 1↓ 4
Group D vs Group A29↑ 9↓ 20

snRNA — Significant DE Genes

ComparisonTotal Sig.Up-regulatedDown-regulated
Group B vs Group A49↑ 36↓ 13
Group C vs Group B137↑ 2↓ 135
Group C vs Group A227↑ 112↓ 115
Group D vs Group B167↑ 0↓ 167
Group D vs Group C12↑ 6↓ 6
Group D vs Group A246↑ 98↓ 148

tRNA — Significant DE Genes

ComparisonTotal Sig.Up-regulatedDown-regulated
Group B vs Group A70↑ 35↓ 35
Group C vs Group B24↑ 4↓ 20
Group C vs Group A50↑ 17↓ 33
Group D vs Group B114↑ 39↓ 75
Group D vs Group C42↑ 3↓ 39
Group D vs Group A125↑ 37↓ 88

circRNA — Significant DE Genes

ComparisonTotal Sig.Up-regulatedDown-regulated
Group B vs Group A1,368↑ 480↓ 888
Group C vs Group B1,761↑ 334↓ 1,427
Group C vs Group A1,862↑ 154↓ 1,708
Group D vs Group B1,462↑ 607↓ 855
Group D vs Group C606↑ 536↓ 70
Group D vs Group A1,381↑ 350↓ 1,031

7. Methods

7.1 Library Preparation

Small RNA-seq libraries were prepared using the Takara Small RNA library preparation kit. Paired-end sequencing (2 × 151 bp) was performed on an Illumina platform. Only R1 reads were used for downstream analysis; R2 reads were discarded as they could not be trimmed correctly for small RNA analysis.

7.2 Read Quality Control and Adapter Trimming

Raw reads were assessed for quality using FastQC v0.12.1 on both R1 and R2. Quality reports were aggregated with MultiQC.

Adapter trimming was performed on R1 reads using cutadapt with the following parameters:

ParameterValueDescription
-m15Discard reads shorter than 15 bp after trimming
-u3Remove 3 bases from the 5′ end of each read
-aAAAAAAAAAATrim poly-A 3′ adapter sequence

FastQC was re-run on trimmed R1 reads to confirm adapter removal. High duplication rates (75–88%) are expected for small RNA libraries due to the limited sequence diversity of small RNA species.

7.3 Reference Genome and STAR Index

Reads were aligned to the Homo sapiens GRCh38 (hg38) genome using STAR 2.7.10b. The STAR genome index was built without a splice junction database (appropriate for small RNA) using:

ParameterValueDescription
--genomeSAindexNbases14Optimised suffix array index for small genomes / small RNA
--sjdbOverhang0No splice junction database (intron-free small RNA mode)

7.4 Alignment (STAR 2.7.10b)

Trimmed R1 reads were aligned to hg38 using STAR with parameters optimised for small RNA multi-mapping:

ParameterValueDescription
--outSAMtypeBAM SortedByCoordinateOutput coordinate-sorted BAM
--alignEndsTypeLocalSoft-clipping allowed; suited for short small RNA reads
--outFilterMismatchNmax1Maximum 1 mismatch per alignment
--outFilterMismatchNoverLmax0.05Mismatch fraction ≤ 5% of read length
--outFilterMatchNmin16Minimum matched bases = 16
--outFilterMultimapNmax1,000,000Allow up to 1 M multi-mapping loci (captures multi-copy small RNAs)
--outFilterMultimapScoreRange1Report alignments within score range 1 of best
--outFilterScoreMinOverLread0No minimum score-over-length filter
--outFilterMatchNminOverLread0No minimum match-over-length filter
--alignIntronMax1Effectively disables spliced alignments
--outSAMunmappedWithinWrite unmapped reads to the output BAM
--outReadsUnmappedNoneDo not write separate unmapped FASTQ

BAM files were indexed using samtools. Alignment statistics were summarised with MultiQC. The high multi-mapping rate observed (56–89%) is characteristic of small RNA-seq, as many small RNA species share identical or near-identical sequences across genomic loci.

7.5 Feature Counting (featureCounts v2.0.6)

Read counts for six ncRNA classes were quantified using featureCounts v2.0.6 (Subread package) against annotation files in SAF format. Multi-mapping reads were included (-M) and strand specificity was set to unstranded (-s 0). Junction reads were also reported (-J).

RNA TypeAnnotation DatabaseReference / Version
miRNAmiRBaseHomo sapiens GRCh38, Ensembl release 113
piRNApiRNAdbv1.7.6, hg38
snoRNAEnsembl GTFHomo sapiens GRCh38.113
snRNAEnsembl GTFHomo sapiens GRCh38.113
tRNAGtRNAdbhg38
circRNACircAtlasv3.0, hg38

CPM (counts per million) normalisation was applied to each sample's raw counts using a custom R script: CPM = round(count / sum(all counts) × 106). Per-sample count and CPM files were merged into cross-sample matrices using a custom R script (generate_matrixes_with_all_samples.R).

7.6 Differential Expression Analysis (DESeq2)

Differential expression (DE) analysis was performed independently for each of the six ncRNA types using DESeq2 (R/Bioconductor, v1.46.0) via a custom pipeline (prepare_for_DGE_analysis_DESeq2.sh + DGE_analysis_DESeq2.R). Six pairwise comparisons were tested across the differentiation time course:

ComparisonTest GroupControl / Reference Group
Group B vs Group AGroup BGroup A
Group C vs Group BGroup CGroup B
Group C vs Group AGroup CGroup A
Group D vs Group BGroup DGroup B
Group D vs Group CGroup DGroup C
Group D vs Group AGroup DGroup A

DESeq2 model and normalisation: Raw per-sample count files were loaded with DESeqDataSetFromHTSeqCount() using the design formula ~ condition. The control group was set as the reference level using relevel(). Size-factor normalisation and negative-binomial model fitting were performed with DESeq(). Normalised counts were extracted with counts(dds, normalized=TRUE) and saved as output-normalized-count.csv. Per-group normalised mean expression (baseMean_<group>) was calculated using rowMeans(counts(dds, normalized=TRUE)) for each condition level.

DE results: Statistics (log₂FoldChange, lfcSE, Wald statistic, p-value, adjusted p-value) were extracted with results(dds) and merged with per-group baseMeans. The full results table was sorted by adjusted p-value (Benjamini–Hochberg) and saved as output-AnalysisResult.csv.

Significance thresholds: Features were called differentially expressed if they met both:

Significant features were further split into up-regulated (log₂FC > 0) and down-regulated (log₂FC < 0) subsets and saved separately.

Variance-stabilising transformation (VST): Raw counts were transformed using varianceStabilizingTransformation() for all visualisations. Plots were generated using the top 2,000 most variable genes (ranked by row variance of the VST matrix).

Visualisation outputs per comparison:

Output FileDescription
output-PCA.pdfPCA plot on VST-transformed data (plotPCA()); PC1 vs PC2 coloured by group
output-PCA-data.csvUnderlying PCA coordinates and variance explained per PC
output-heatmap.pdfSample-to-sample distance heatmap: Pearson correlation on top 2,000 variable genes (VST), distance = √(1 − r²), clustered by pheatmap
output-Pearson-correlation-of-top-2000-genes.pdfPearson correlation matrix heatmap for top 2,000 variable genes (VST)
output-sample-correlation.csvPearson correlation matrix (numeric, top 2,000 genes)
output-heatmap-gene.pdfGene × sample heatmap (row-scaled VST, top 2,000 variable genes, samples in original order, rows clustered by correlation distance)
output-heatmap-gene---clustering-samples.pdfSame as above but with hierarchical clustering of samples
output-BetweenSampleDis.pdfBoxplot of log₂(count + 1) per sample showing raw count distributions
output-VolcanoPlot.pdfVolcano plot: log₂FC vs −log₁₀(p-value); significant features highlighted in red
output-VolcanoPlot-data.csvData table underlying the volcano plot
Volcano-Plot--*--show-gene-names.htmlInteractive HTML volcano plot with gene labels (EnhancedVolcano / Plotly)
output-MAplot.pdfMA plot: log₂(baseMean) vs log₂FC; significant features in red
output-pval.pdfHistogram of raw p-values (bin width = 0.05)
heatmap_of_top_100_smallest_pvalue_genes--*.pdfHeatmap of top 100 smallest-p-value genes for all-sig, up-regulated, and down-regulated sets
output-AnalysisResult.csvFull DE table (all features, sorted by adjusted p-value)
output-AnalysisResult-sig.csvSignificant DE features (both directions)
output-AnalysisResult-sig-upregulated.csvSignificant up-regulated features
output-AnalysisResult-sig-downregulated.csvSignificant down-regulated features
output-normalized-count.csvDESeq2 size-factor normalised counts for all samples

7.7 Software Summary

ToolVersionPurpose
FastQC0.12.1Raw and trimmed read quality control
cutadaptAdapter trimming (poly-A, 5′ trimming)
MultiQCAggregation of FastQC and STAR QC reports
STAR2.7.10bRead alignment to hg38
samtoolsBAM indexing
featureCountsv2.0.6 (Subread)Read counting against ncRNA annotations
R / DESeq2DESeq2 1.46.0Differential expression analysis

8. Deliverable File Structure

Demo-Project-smallRNA-analysis/
├── 00.TrimmedFastq/ ~1.9 GB
│ └── 9 × *.trimmed.fastq.gz
├── 01.fastqc/ ~34 MB
│ ├── 01.fastqc-multiqc_report.html ← FastQC MultiQC report
│ ├── multiqc_report.html
│ └── multiqc_data/
├── 02.bam/ ~12 GB
│ └── 9 × *_Aligned.sortedByCoord.out.bam (.bai)
├── 03.mappingQC/ ~2.9 MB
│ ├── 03.mappingQC-multiqc_report.html ← STAR alignment MultiQC report
│ ├── multiqc_report.html
│ └── multiqc_data/
├── 04.smallRNAcounts/ ~500 MB
│ ├── miRNA/ — 9 samples × {.count.txt, .count_CPM.txt, .count_CountOnly.txt, .summary, .jcounts}
│ ├── piRNA/ — 9 samples × 5 file types
│ ├── snoRNA/ — 9 samples × 5 file types
│ ├── snRNA/ — 9 samples × 5 file types
│ ├── tRNA/ — 9 samples × 5 file types
│ ├── circRNA/ — 9 samples × 5 file types
│ └── Data_matrixes_with_all_samples/ ← merged count & CPM matrices (12 files)
├── 05.DEanalysis/ ~459 MB
│ ├── Demo-Project-miRNA-analysis/06.DE-GO-KEGG/
│ │ ├── sampleInfo.csv
│ │ └── {GroupB_vs_GroupA, GroupC_vs_GroupB, GroupC_vs_GroupA, GroupD_vs_GroupB, GroupD_vs_GroupC, GroupD_vs_GroupA}_DE/
│ │ ├── output-AnalysisResult.csv / -sig.csv / -sig-upregulated.csv / -sig-downregulated.csv
│ │ ├── output-normalized-count.csv, output-PCA-data.csv, output-VolcanoPlot-data.csv
│ │ └── output-*.pdf (heatmap, PCA, volcano, MA, correlation plots)
│ ├── Demo-Project-piRNA-analysis/ — same structure
│ ├── Demo-Project-snoRNA-analysis/ — same structure
│ ├── Demo-Project-snRNA-analysis/ — same structure
│ ├── Demo-Project-tRNA-analysis/ — same structure
│ └── Demo-Project-circRNA-analysis/— same structure
└── 05.smallRNAreport/ ~1.4 MB
├── logs/ — 54 featureCounts log files (9 samples × 6 RNA types)
└── summary/ — 54 featureCounts summary files
Demo-Project · Small RNA-seq QC Report · Generated 2026-03-04