You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I'm attempting to follow along with this part of the tutorial/wiki, to get a better understanding of how to create my own custom DB. Things are okay until I get to the database creation step:
# flextaxd -db 16S_database.db -tf GTDB_arc_bact_taxo_tree_unique.txt -tt CanSNPer --genomeid2taxid g2id.txt --dump --dbprogram kraken2 -o taxonomy --verbose --logs logs/zenodo
2024-02-07 18:08:45,291 custom_taxonomy_databases [INFO ] FlexTaxD logging initiated!
Warning: 16S_database.db already exists, overwrite? (y/n): y
2024-02-07 18:08:49,303 custom_taxonomy_databases [INFO ] Loading module: ReadTaxonomyCanSNPer
2024-02-07 18:08:49,352 DatabaseConnection [INFO ] 16S_database.db opened successfully.
2024-02-07 18:08:49,353 ReadTaxonomyCanSNPer [INFO ] GTDB_arc_bact_taxo_tree_unique.txt
2024-02-07 18:08:49,353 ReadTaxonomyCanSNPer [INFO ] Fetching root name from file
2024-02-07 18:08:49,353 ReadTaxonomyCanSNPer [INFO ] Adding, cellular organism node
2024-02-07 18:08:49,354 ReadTaxonomyCanSNPer [INFO ] Adding root node root!
2024-02-07 18:08:49,355 custom_taxonomy_databases [INFO ] Parse taxonomy
2024-02-07 18:08:49,355 ReadTaxonomyCanSNPer [INFO ] Parse CanSNP tree file
2024-02-07 18:08:49,902 ReadTaxonomyCanSNPer [INFO ] New taxonomy ids assigned 12929
Traceback (most recent call last):
File "/home/nnnnnn/mambaforge/lib/python3.9/site-packages/flextaxd/modules/ReadTaxonomy.py", line 153, in parse_genomeid2taxid
genomeid,taxid = row.strip().split("\t")
ValueError: not enough values to unpack (expected 2, got 1)
During handling of the above exception, another exception occurred:
Traceback (most recent call last):
File "/home/nnnnnn/mambaforge/bin/flextaxd", line 8, in <module>
sys.exit(main())
File "/home/nnnnnn/mambaforge/lib/python3.9/site-packages/flextaxd/custom_taxonomy_databases.py", line 330, in main
read_obj.parse_genomeid2taxid(args.genomeid2taxid)
File "/home/nnnnnn/mambaforge/lib/python3.9/site-packages/flextaxd/modules/ReadTaxonomy.py", line 156, in parse_genomeid2taxid
genomeid,taxid,reference = row.strip().split("\t")
ValueError: not enough values to unpack (expected 3, got 1)
It looks like the header may be the problem (if they are included in the files). If not I think the best option is if you could supply the head of your files as a text files, then we can replicate the error locally. The error itself tells says that the program finds too few columns separated by . What do you use for separation in your files? the default separator is \t
g2id.txt.gz GTDB_arc_bact_taxo_tree_unique.txt.gz
Okay great. Yes, the default separator is \t and that's what I see reflected in my input files. Should it be . instead? Here's my input files (entire files, gzipped).
I'm attempting to follow along with this part of the tutorial/wiki, to get a better understanding of how to create my own custom DB. Things are okay until I get to the database creation step:
Here's the first few lines of my two input files:
I'd like to use this tool so any help is greatly appreciated
The text was updated successfully, but these errors were encountered: