| Name | Uploaded | Size |
|---|---|---|
| foldseek/teddb.tar.gz | Wed, 26 Feb 2025 19:19:27 GMT | 320.4 GB |
| foldseek/teddb_afdb50.tar.gz | Wed, 26 Feb 2025 21:16:51 GMT | 59.5 GB |
| foldseek/teddymer.tar.gz | Mon, 02 Mar 2026 22:28:26 GMT | 20.5 GB |
This dataset presents 0.5 million non-singleton structural clusters generated by our Foldseek-Multimer-Clustering algorithm.
The clusters are derived from the TED database, where domains are segmented across the entire AlphaFold Database. We generated a dimer database from the entire TED database, treating each domain as a chain.
From 10 million dimers whose chains are fully annotated by CATH and are entries in afdb50 (a MMseqs-clustered version of afdb), we obtained 3.5 million clusters—500,000 non-singleton and 3 million singleton.
Parameter used: coverage 0.6, chain tm threshold 0.7, interface lddt threshold 0.3, cluster mode 0, cov-mode 0.
teddb.tar.gz: Foldseek database for TED
teddb_afdb50.tar.gz: Foldseek database for TED, containing only entries that belong to afdb50
teddymer.tar.gz - dir_ted_afdb50_cath_dimerdb: Input foldseek database
teddymer.tar.gz - teddymer_repdb: Representative database
teddymer.tar.gz - cluster.tsv: Cluster results
teddymer.tar.gz - nonsingletonrep_metadata.tsv: Metadata for the non-singleton clusters.
How to generate c-alpha only .pdb from database: .pdb files will be created in the directory, resultPDBdir
How to generate dimer .pdb files
All files are available under a Creative Commons Attribution-NonCommercial 4.0 International (CC BY-NC 4.0).