Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

covert hgvs g. notation to c. notation with Ensembl transcripts #433

Closed
yujiang02 opened this issue May 3, 2017 · 2 comments
Closed

covert hgvs g. notation to c. notation with Ensembl transcripts #433

yujiang02 opened this issue May 3, 2017 · 2 comments

Comments

@yujiang02
Copy link

Hi,

I just tried to use this hgvs tool to perform some conversion between hgvs g. notation and c. notation. I am using the ensembl transcripts instead of refSeq. I tried with the example provided in https://github.com/biocommons/hgvs. However, this one seems only gave RefSeq transcripts c. notation. Is there any example or document for using Ensembl transcripts?

Thanks,
Yu

@reece
Copy link
Member

reece commented May 4, 2017

Hi Yu-

Only some functions of hgvs + UTA will work with Ensembl transcripts. The limitation is really with UTA and not hgvs itself.

There are several issues. The first is that it's fairly hard to get the data I need for UTA from Ensembl, and therefore I don't do it very often. UTA is currently based on Ensembl 79. The second historical reason is that their fasta files used to have unversioned sequences. That means that the same (unversioned) accession was often associated with multiple distinct sequences over time, which violates an assumption in the code that accessions are associated with exactly one sequence. (Ensembl now versions accessions in their fasta files.) Finally, c_to_p requires a table of ENST-ENSP pairs, which is currently unpopulated for Ensembl. So, although hgvs works with Ensembl in principle (and has in the past), the combination is currently unsupported.

For the record, parsing and normalization work. And projecting variants between c and g works but only for unversioned transcripts, because that's all that was available at the time.

I'll close this issue in favor of biocommons/uta#207.

-Reece

@reece reece closed this as completed May 4, 2017
@davmlaw
Copy link
Contributor

davmlaw commented Mar 21, 2023

Hi, I recently commented on another HGVS Ensembl issue you may find it useful

#621 (comment)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants