SNPick

SNPick is a fast and memory-efficient tool designed to extract SNP (Single Nucleotide Polymorphism) sites from large-scale FASTA alignments, taking into consideration IUPAC ambiguous nucleotides. SNP analysis is critical for understanding genetic variation, phylogenetic relationships, and evolutionary biology. SNPick was developed to provide a highly scalable and optimized approach to SNP extraction. Unlike snp-sites, SNPick employs parallel processing and an efficient memory management strategy, making it suitable for very large datasets. This scalability allows users to work with massive alignments, extracting SNPs in a fraction of the time and with significantly lower memory requirements compared to traditional tools.

Info

Members (researchers): Paula Ruiz Rodríguez, Mireia Coscolla

Research Groups: PathoGenOmics

Contact Email: paula.ruiz.rodriguez@csic.es

Tool Repository: https://github.com/PathoGenOmics-Lab/snpick

Documentation: https://github.com/PathoGenOmics-Lab/snpick/blob/main/README.md

Application domain:

Evolutionary rates, Parallel Computing, Phylogenetic Analysis, Population genomics, Sequence analysis, Variant calling

Technical details
Type of application
  • Command line pipeline
  • Conda package
Software compatibility
  • Linux
  • Mac
Hardware requirements
Programming language
  • Rust
Type of containerization
  • None
Wrapper type
  • None
Input file formats
  • fasta
Output file formats
  • fasta
  • tsv
Compatibility with other tools