-
Notifications
You must be signed in to change notification settings - Fork 0
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
DO NOT MERGE - Test converting a slice of Reactome BioPAX #4
base: master
Are you sure you want to change the base?
Conversation
@dustine32 Looks great. |
@kltm By "list of sources," do you mean something like source/15869.owl, source/15870.owl, ... being processed through |
@dustine32 The former. For example, in this repo, if we had a file (in sources?) that was like - name: foo
url: http://location1
- name: bar
url: http://location2 A script could get all of them, process them, and give them separate reports. That way, we could use skyhook as it stands now to have different pipelines for different sets of these if necessary, but mainly work with a single one. |
Ah, ok! Yeah, that metadata file should work. The most "main" pipeline I would assume would only be on the Reactome release BioPAX file
Or we could arrange Reactome to provide a direct URL to this file? Or I could be overcomplicating it? |
A direct URL would be one that pointed only to the human annotated content in the BioPAX and not any content, computationally inferred or manually annotated, for model organisms? I can ask Guanming Wu whether that is hard or dangerous, if you'd like. Actually, thinking for a minute, now I'm confused. That computational inference is performed on the database of material that has been approved for release, after the release process has started. The gk_central database, the one that we edit for curation purposes and that drives the curator.reactome.org web site and the BioPAX3 generation process that yielded your file, does not contain any of the computationally inferred material. It does contain some manually curated non-human pathways. For example, a long time ago we annotated parts of chicken nucleotide metabolism, in pathway R-GGA-419470, and a bit of rice nucleotide metabolism, R-OSA-5655149. Is that what you're seeing? (In which case I'm forgetful - I forgot that gk_central has manual nucleotide annotations for non-human species - and not so confused.) Continuing to think, maybe this is exactly your point: you would like to go to Reactome and specify a pathway (nucleotide metabolism, or signal transduction) and a species, and get a BioPAX3 file with exactly that material? |
@deustp01 |
But something to also think about is making this all generic in the future. If there is work on the Reactome side and they are the responsible party, is this going to be the standard for all the groups we support? If so, we need some type of commitment that they will live up to their end of the bargain, don't we? |
The contract I think we can commit to now is that:
The first step there is obviously the big one, so we're still trying to get a feel for what makes sense. My druthers at the moment would be to have the upstream push (leaving the responsibility with them for updates), but, practically, it may be easier to do it some other way. |
For issue #3. But this PR is not at all meant to be merged (you'll notice all but 13 models have been deleted).
This branch contains the 13 result models that were produced using @deustp01's URL. The saved source BioPAX is in
source/
, the ShEx reports are inproducts/
.To generate the models:
To generate ShEx reports with Minerva (using geneontology/minerva@fbcfbf9):
@kltm Maybe this PR setup is a model for some future "experiments?"