Remove the FILTER EXISTS from organization_page-production.sparql #2208
Comments
Duplicate of #2176
The problem with the new query is that a researcher might reach the target through multiple paths, so that researcher might be counted twice.
Can that be solved with a
I took a look at this:
A close inspection of these images does indeed reveal a change in the numbers (most visible for 2002 and 2014). Perhaps the researchers should be identified in a subquery that uses DISTINCT.
Here is a version with a dedicated subquery for the researchers: https://w.wiki/6Cc2 .
I now see that this is essentially what Finn had proposed in #2176 .
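For readers following the thread, the DISTINCT subquery idea discussed in these comments might look roughly like the sketch below. It is not the query behind the short link above, nor the actual content of organization_page-production.sparql. Everything outside the researcher triple pattern is an assumption for illustration: the `target:` prefix bound to the example organization Q3530296, the use of wdt:P50 (author), wdt:P1104 (number of pages) and wdt:P577 (publication date), and the variable names. The `wdt:` prefix is assumed to be predefined, as on the Wikidata Query Service.

```sparql
# Assumption: Scholia substitutes the organization for `target:`;
# here it is bound to the example organization from this issue.
PREFIX target: <http://www.wikidata.org/entity/Q3530296>

SELECT ?year (SUM(?pages) AS ?number_of_pages) WHERE {
  {
    # Each researcher is returned once, even if several property
    # paths (employer, member of, affiliation) lead to the target.
    SELECT DISTINCT ?researcher WHERE {
      ?researcher wdt:P108 | wdt:P463 | (wdt:P1416 / wdt:P361*) target: .
    }
  }
  # Hypothetical outer part: works by those researchers, by year.
  ?work wdt:P50 ?researcher ;
        wdt:P1104 ?pages ;
        wdt:P577 ?date .
  BIND(YEAR(?date) AS ?year)
}
GROUP BY ?year
ORDER BY ?year
```

Without the DISTINCT subquery, a researcher who matches more than one alternative in the path would produce several rows and inflate the sums. Note that this sketch still counts a work once per matching author, and once per publication date if a work has several.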
What query is this about
The query in organization_page-production.sparql uses a clause
FILTER EXISTS { ?researcher wdt:P108 | wdt:P463 | (wdt:P1416 / wdt:P361*) target: . }
wherein the FILTER EXISTS part is not really necessary but slows things down considerably.
What change do you propose, and why?
Just remove it:
?researcher wdt:P108 | wdt:P463 | (wdt:P1416 / wdt:P361*) target: .
Any other considerations?
This can be tested with any organization profile, e.g. https://scholia.toolforge.org/organization/Q3530296#page-production .