Basic Statistics
Measure | Value |
---|---|
Filename | C2-MGI-1_parsed.fq.gz |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 9014164 |
Sequences flagged as poor quality | 0 |
Sequence length | 100 |
%GC | 41 |
Per base sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
CGACATGGCTACGATGGGGTTAGTTTTTATTTATTAATTTTTATTATTTT | 53823 | 0.5970936406304567 | No Hit |
CGATGGGGTTAGTTTTTATTTATTAATTTTTATTATTTTTTAAAAAATTA | 16105 | 0.17866326816330388 | No Hit |
CGACATGGCTACGATGTGAACCACAGGAAGCGCAAGTCCGGCACTGTCGG | 11295 | 0.1253028012359216 | No Hit |
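
The Percentage column above appears to be each sequence's count divided by the Total Sequences value from Basic Statistics, times 100. The following is a minimal sketch that checks this reading against the three rows. The function name `percent_of_total` and the tolerance are illustrative choices, not part of the report.

```python
import math

# "Total Sequences" from the Basic Statistics table
TOTAL_SEQUENCES = 9014164

# (count, reported percentage) pairs copied from the Overrepresented sequences table
ROWS = [
    (53823, 0.5970936406304567),
    (16105, 0.17866326816330388),
    (11295, 0.1253028012359216),
]


def percent_of_total(count, total=TOTAL_SEQUENCES):
    """Return count as a percentage of total (assumed definition of the column)."""
    return 100.0 * count / total


for count, reported in ROWS:
    computed = percent_of_total(count)
    # Compare with a small absolute tolerance rather than exact float equality
    consistent = math.isclose(computed, reported, abs_tol=1e-6)
    print(f"{count:>6}  computed={computed:.6f}  reported={reported:.6f}  consistent={consistent}")
```
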
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
ACATGGC | 137480 | 0.0 | 85.904564 | 3 |
CGACATG | 137655 | 0.0 | 85.63829 | 1 |
CATGGCT | 139175 | 0.0 | 85.08461 | 4 |
GACATGG | 139675 | 0.0 | 84.57813 | 2 |
ATGGCTA | 144420 | 0.0 | 81.91968 | 5 |
TGGCTAC | 147575 | 0.0 | 80.18105 | 6 |
GGCTACG | 149795 | 0.0 | 78.760574 | 7 |
GCTACGA | 151375 | 0.0 | 76.76175 | 8 |
CTACGAT | 152375 | 0.0 | 74.543 | 9 |
TACGATG | 145290 | 0.0 | 37.79345 | 10-11 |
ACGATGG | 142785 | 0.0 | 37.416325 | 10-11 |
CGATCCG | 10220 | 0.0 | 33.525436 | 1 |
CGATGGG | 171775 | 0.0 | 30.424479 | 12-13 |
GATGGGA | 58935 | 0.0 | 30.093239 | 12-13 |
GATGGGG | 80125 | 0.0 | 29.94802 | 12-13 |
ATGGGGG | 14850 | 0.0 | 29.260271 | 14-15 |
ATGGGGT | 27410 | 0.0 | 27.006567 | 14-15 |
GATCCGA | 12430 | 0.0 | 26.468222 | 2 |
ATGGGGA | 36770 | 0.0 | 26.39516 | 14-15 |
GGGGTTA | 11180 | 0.0 | 26.148478 | 16-17 |
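
One way to read the table above: nine of the 7-mers peak at consecutive single-base positions 1 through 9, and each overlaps the next by six bases. The sketch below stitches those nine into one string and compares it with the starts of the overrepresented sequences listed earlier. Which rows to include is an assumption made for illustration (`CGATCCG` and `GATCCGA` also peak at positions 1 and 2 and are left out), and `stitch` is a hypothetical helper, not a FastQC function.

```python
# 7-mers from the Kmer Content table, keyed by their "Max Obs/Exp Position".
# Only the chain at positions 1-9 is included (an illustrative selection).
KMERS_BY_POSITION = {
    1: "CGACATG",
    2: "GACATGG",
    3: "ACATGGC",
    4: "CATGGCT",
    5: "ATGGCTA",
    6: "TGGCTAC",
    7: "GGCTACG",
    8: "GCTACGA",
    9: "CTACGAT",
}

# Sequences copied from the Overrepresented sequences table
OVERREPRESENTED = [
    "CGACATGGCTACGATGGGGTTAGTTTTTATTTATTAATTTTTATTATTTT",
    "CGATGGGGTTAGTTTTTATTTATTAATTTTTATTATTTTTTAAAAAATTA",
    "CGACATGGCTACGATGTGAACCACAGGAAGCGCAAGTCCGGCACTGTCGG",
]


def stitch(kmers_by_position):
    """Merge k-mers at consecutive positions, requiring a (k-1)-base overlap."""
    positions = sorted(kmers_by_position)
    merged = kmers_by_position[positions[0]]
    for prev, cur in zip(positions, positions[1:]):
        kmer = kmers_by_position[cur]
        overlap = len(kmer) - 1
        if cur != prev + 1 or merged[-overlap:] != kmer[:-1]:
            raise ValueError(f"k-mer at position {cur} does not extend the chain")
        merged += kmer[-1]  # each step contributes exactly one new base
    return merged


motif = stitch(KMERS_BY_POSITION)
print("stitched motif:", motif)
for seq in OVERREPRESENTED:
    print(seq[:20] + "...", "starts with motif:", seq.startswith(motif))
```

This only shows that the enriched k-mers and the overrepresented sequences are consistent with a shared read prefix. It does not identify what that prefix is.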