References

This page lists the scientific and technical references discussed in the BfxPM documentation and development plan, including international standards for reproducible research and data management.

International Guidelines & Standards

CLI & Software Design Philosophy

Sequence Data Compression

  1. Academic OUP (Lossless Genomic Compression) - General overview of essential lossless strategies.
  2. PMC6662292 (Specialized Compressors) - Performance comparisons of specialized genomic compressors vs. Gzip.
  3. IEEE Spectrum (The Desperate Quest for Genomic Compression) - Rationale for domain-specific algorithms.
  4. Nature Scientific Reports (Lossless Compression of FASTQ Files)
  5. NanoSpring (Long-read FASTQ Compression) - Specialized tool for Nanopore and PacBio data.
  6. EBI Sequence File Formats
  7. Biostars Thread (FASTQ vs CRAM vs Genozip)
  8. Illumina FASTQ ORA Format

Aligned Data (BAM/CRAM)

  1. Bioinformatics Stack Exchange (Best Archive Practices)
  2. CRAM Specification (GA4GH Standard)
  3. HTSeq Read Mapping Documentation

Raw Signal Data (FAST5/POD5/SLOW5)

  1. Oxford Nanopore Beginner's Guide to Formats
  2. SLOW5 & BLOW5 Documentation - Open-source lossless alternative for raw signal recording.
  3. VBZ Compression Plugin - Lossless squiggle signal compression within HDF5.
© Jyotirmoy Das 2026