
Benchmark dataset

Reference protein data set

  • Description: ASTRAL 2.06 subset with less than 40% sequence identity
  • Number of sequences: 6,569
  • Average sequence length: 185 aa
  • Format: FASTA
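
The summary figures above (number of sequences, average length) can be recomputed from any FASTA file. The following is a minimal sketch, not the procedure used to build this data set; the function names `read_fasta` and `summarize` and the toy records are hypothetical and only illustrate the calculation.

```python
from io import StringIO


def read_fasta(handle):
    """Yield (header, sequence) pairs from a FASTA-formatted text stream."""
    header, chunks = None, []
    for line in handle:
        line = line.strip()
        if not line:
            continue  # skip blank lines
        if line.startswith(">"):
            # A new header closes the previous record, if any.
            if header is not None:
                yield header, "".join(chunks)
            header, chunks = line[1:], []
        elif header is not None:
            # Sequence lines may be wrapped; collect them all.
            chunks.append(line)
    if header is not None:
        yield header, "".join(chunks)


def summarize(records):
    """Return (number of sequences, average sequence length)."""
    lengths = [len(seq) for _, seq in records]
    n = len(lengths)
    return n, (sum(lengths) / n if n else 0.0)


# Toy input (hypothetical records, not taken from the data set).
toy = StringIO(">seq1 a.1.1.1\nMKV\nLA\n>seq2 b.1.1.1\nGGG\n")
count, avg_len = summarize(read_fasta(toy))
print(count, avg_len)
```

To apply it to a real file, pass an open file handle instead of the `StringIO` object.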

Groups of structurally or functionally related proteins at different SCOP levels:

Classes

  • Groups: 4
  • Avg # of proteins: 1649
  • Min # of proteins: 1072
  • Max # of proteins: 2478

Folds

  • Groups: 219
  • Avg # of proteins: 30
  • Min # of proteins: 5
  • Max # of proteins: 404

Superfamilies

  • Groups: 282
  • Avg # of proteins: 23
  • Min # of proteins: 5
  • Max # of proteins: 23

Families

  • Groups: 513
  • Avg # of proteins: 12
  • Min # of proteins: 5
  • Max # of proteins: 133
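
The per-level statistics above (groups, average, minimum, maximum) could be derived along the following lines. This is a sketch under stated assumptions, not the method used for this benchmark: it assumes each protein carries a SCOP concise classification string of the form `class.fold.superfamily.family` (for example `a.1.1.1`), and that groups below a minimum size are discarded. The `min_size=5` default is inferred from the reported minimums, and `group_stats` and `LEVELS` are hypothetical names.

```python
from collections import Counter

# Number of dot-separated fields that identify a group at each SCOP level.
LEVELS = {"class": 1, "fold": 2, "superfamily": 3, "family": 4}


def group_stats(sccs_ids, level, min_size=5):
    """Summarize group sizes at one SCOP level.

    sccs_ids: iterable of strings such as "a.1.1.1" (assumed format).
    min_size: groups with fewer members are dropped (assumed threshold).
    """
    depth = LEVELS[level]
    # Truncate each identifier to the requested level and count members.
    counts = Counter(".".join(s.split(".")[:depth]) for s in sccs_ids)
    sizes = [n for n in counts.values() if n >= min_size]
    if not sizes:
        return {"groups": 0, "avg": 0.0, "min": 0, "max": 0}
    return {
        "groups": len(sizes),
        "avg": sum(sizes) / len(sizes),
        "min": min(sizes),
        "max": max(sizes),
    }


# Toy identifiers (hypothetical, not taken from the data set).
toy_ids = ["a.1.1.1"] * 3 + ["a.1.1.2"] * 2 + ["b.1.1.1"] * 2
stats = group_stats(toy_ids, "superfamily", min_size=2)
print(stats)
```

Because a size threshold removes small groups, the proteins counted at a given level need not add up to the full data set.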