why easy-cluster alignment results are slightly different from DaliLite.v5 z scores? [not bug] #365

Open
Huilin-Li opened this issue Oct 14, 2024 · 1 comment

Comments

@Huilin-Li

Expected Behavior

After running foldseek easy-cluster, I also calculated DaliLite.v5 Z-scores for the members of each cluster against a query protein. My understanding is that the proteins within a cluster are already highly similar to one another, so their Z-scores against the same query protein should also be very close.

However, when I plot the Z-scores for each group, I find outliers.

Current Behavior

For example, take three clusters A, B, and C, generated by foldseek easy-cluster with default settings. Group C contains many outliers. These outliers suggest that, although the proteins are in the same cluster and are already highly similar to one another at the structural level, they still score differently when aligned against the same query protein.

image
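To make the observation above concrete, here is a minimal Python sketch of how the outliers in each group could be listed rather than only read off a plot. It assumes the clustering is available as (representative, member) pairs, as in the two-column cluster TSV written by easy-cluster, and that the DaliLite.v5 Z-scores against the query have already been collected into a dictionary. All identifiers and scores below are hypothetical, and the 1.5 × IQR box-plot rule is just one possible choice of outlier criterion.

```python
from collections import defaultdict
from statistics import quantiles


def group_by_cluster(cluster_rows):
    """Group member IDs by their cluster representative.

    cluster_rows: iterable of (representative, member) pairs.
    """
    groups = defaultdict(list)
    for rep, member in cluster_rows:
        groups[rep].append(member)
    return dict(groups)


def iqr_outliers(scores, k=1.5):
    """Return the IDs whose score lies outside [Q1 - k*IQR, Q3 + k*IQR].

    scores: dict mapping member ID -> Z-score.
    Groups with fewer than 4 members are skipped (returns an empty list).
    """
    if len(scores) < 4:
        return []
    q1, _, q3 = quantiles(list(scores.values()), n=4, method="inclusive")
    iqr = q3 - q1
    low, high = q1 - k * iqr, q3 + k * iqr
    return sorted(m for m, z in scores.items() if z < low or z > high)


def flag_outliers(cluster_rows, zscores):
    """Map each cluster representative to its list of outlier members."""
    result = {}
    for rep, members in group_by_cluster(cluster_rows).items():
        # Members without a Z-score are ignored.
        scored = {m: zscores[m] for m in members if m in zscores}
        result[rep] = iqr_outliers(scored)
    return result


# Hypothetical example data: two clusters with representatives "A" and "C".
cluster_rows = [
    ("A", "A"), ("A", "a2"), ("A", "a3"), ("A", "a4"),
    ("C", "C"), ("C", "c2"), ("C", "c3"), ("C", "c4"), ("C", "c5"), ("C", "c6"),
]
# Hypothetical Z-scores of each member against one query protein.
zscores = {
    "A": 15.0, "a2": 15.2, "a3": 14.9, "a4": 15.1,
    "C": 20.1, "c2": 19.8, "c3": 20.5, "c4": 19.9, "c5": 20.3, "c6": 8.2,
}

outliers = flag_outliers(cluster_rows, zscores)
print(outliers)
```

With these made-up numbers, only member c6 of group C falls outside the fences. Listing the flagged members this way makes it easier to inspect the individual structures behind each outlier.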

Steps to Reproduce (for bugs)

Please make sure to execute the reproduction steps with newly recreated and empty tmp folders.

Foldseek Output (for bugs)

Please make sure to also post the complete output of Foldseek. You can use gist.github.com for large output.

Context

Providing context helps us come up with a solution and improve our documentation for the future.

Your Environment

Include as many relevant details about the environment you experienced the bug in.

  • Git commit used (The string after "MMseqs Version:" when you execute foldseek without any parameters):
  • Which foldseek version was used (Statically-compiled, self-compiled, Conda, etc.):
  • For self-compiled and Homebrew: Compiler and Cmake versions used and their invocation:
  • Server specifications (especially CPU support for AVX2/SSE and amount of system memory):
  • Operating system and version:
@Huilin-Li changed the title from "easy-cluster alignment results are different from DaliLite.v5 z scores" to "why easy-cluster alignment results are different from DaliLite.v5 z scores? [not bug]" on Oct 14, 2024
@Huilin-Li changed the title from "why easy-cluster alignment results are different from DaliLite.v5 z scores? [not bug]" to "why easy-cluster alignment results are slightly different from DaliLite.v5 z scores? [not bug]" on Oct 14, 2024
@martin-steinegger
Collaborator

Please see answer here: #364
