Download files

📥 Available Datasets for Download

Here are the datasets available for download. Please ensure to cite our work when using these datasets.

Group Link Species Counts Source Size Release
Physiological Sequences ST-P1 Homo sapiens 53,519 Uniprot, NCBI_RefSeq 46.7M 2025.10.31
Frameshift Sequences ST-S1 Homo sapiens 1,240,190 ClinVar, 1000GP, DepMap, GDC, dbSNP 885.6M 2025.11.11
FS-control Sequences ST-SC1 Homo sapiens 44,519 ClinVar, 1000GP, DepMap, GDC, dbSNP 24.7M 2025.11.11