You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I expect the Foldseek server (https://search.foldseek.com/api) searching the pdb100 database for a monomer will not return duplicate hits for a deposited PDB chain with identical sequence alignments.
Current Behavior
Submitting PDB 1hxz chain D to the Foldseek server returns 4 PDB chains that are reported twice in the results. They are being reported for equivalent chains in the PDB assemblies with the exact same sequence alignments: 1hxz-assembly1.cif.gz_D and 1hxz-assembly1.cif.gz_D-2, 1hxz-assembly1.cif.gz_C and 1hxz-assembly1.cif.gz_C-2, 1hxl-assembly1.cif.gz_C and 1hxl-assembly1.cif.gz_C-2, 1hxl-assembly1.cif.gz_D and 1hxl-assembly1.cif.gz_D-2.
Steps to Reproduce (for bugs)
Go to web site https://search.foldseek.com/search and use Load Accession 1hxz choose only database PDB100, press Search and look at the results for job_D.
Foldssek Output (for bugs)
I've attached the .m8 output file with the duplicate entries and a screen-shot of the Foldseek server web page showing the duplicate results.
Context
I found the duplicate results using the ChimeraX Foldseek search capability but thought it would be easier for you to reproduce using the official foldseek web server.
Your Environment
Using the Foldseek web server https://search.foldseek.com/search. I don't see the Foldseek version anywhere on the server web site. It lists the PDB database as PDB100 20240101.
Expected Behavior
I expect the Foldseek server (https://search.foldseek.com/api) searching the pdb100 database for a monomer will not return duplicate hits for a deposited PDB chain with identical sequence alignments.
Current Behavior
Submitting PDB 1hxz chain D to the Foldseek server returns 4 PDB chains that are reported twice in the results. They are being reported for equivalent chains in the PDB assemblies with the exact same sequence alignments: 1hxz-assembly1.cif.gz_D and 1hxz-assembly1.cif.gz_D-2, 1hxz-assembly1.cif.gz_C and 1hxz-assembly1.cif.gz_C-2, 1hxl-assembly1.cif.gz_C and 1hxl-assembly1.cif.gz_C-2, 1hxl-assembly1.cif.gz_D and 1hxl-assembly1.cif.gz_D-2.
Steps to Reproduce (for bugs)
Go to web site https://search.foldseek.com/search and use Load Accession 1hxz choose only database PDB100, press Search and look at the results for job_D.
Foldssek Output (for bugs)
I've attached the .m8 output file with the duplicate entries and a screen-shot of the Foldseek server web page showing the duplicate results.
Context
I found the duplicate results using the ChimeraX Foldseek search capability but thought it would be easier for you to reproduce using the official foldseek web server.
Your Environment
Using the Foldseek web server https://search.foldseek.com/search. I don't see the Foldseek version anywhere on the server web site. It lists the PDB database as PDB100 20240101.
1hxz_duplicates.zip
The text was updated successfully, but these errors were encountered: