[Feature Request] Manual select #4

rohan-gt · 2020-09-09T06:09:40Z

Hi, is it possible to add an option to manually select an issue if it's not scraped automatically? I'd rather have Comixology info on all my books rather than ComicVine

SenorSmartyPants · 2020-09-09T18:02:26Z

There's no UI to manually select an issue. But you can edit the notes in ComicRack (or directly in ComicInfo.xml) and add the issue ID from comixology.com

https://www.comixology.com/Fantastic-Four-2018-1/digital-comic/687096

Copy the numbers at the end of the URL as shown above, and add it to notes in this format

[CMXDB687096]

Run the scraper again and it will use the id from the notes.

If you run comicrack with this option
"C:\Program Files\ComicRack\ComicRack.exe" -ssc

It will display the script console. I'd be curious to see the output from a couple comics that aren't getting matched.

rohan-gt · 2020-09-10T08:19:45Z

@SenorSmartyPants how do I contact you? I have a few ideas and would like to contribute to this project if I can

SenorSmartyPants · 2020-09-10T17:18:57Z

You're doing it.

rohan-gt · 2020-09-10T20:11:11Z

Okay @SenorSmartyPants some suggestions:

Is would be useful to download the entire Comixology metadata into a local database with some filters like publisher to limit the data and then simply fetch the data from it to populate the comic info
I don't know how the fuzzy matching is done at the moment but I believe it is possible to improve the logic and match comics to a very high degree since we are only matching digital releases and they usually have clean names as opposed to scans
It would be useful to have some kind of matching between collected editions and single issues. I believe this info is already available in Comixology. This info can be then used to detect duplicates, missing issues etc.

SenorSmartyPants · 2020-09-11T01:42:27Z

Each one of these items should have been a separate issue. But here goes:

I'm not going to scrape the entire comixology site. There is not API provided by comixology to download all the metadata for everything. Select your issues and download for each of them.
If you have specific examples of issues not being found (with filenames provided) I would be interested to see them. Getting good search results is something I am having issues with, but mostly because of google's bot detection.
This is not a library management tool. Try ComicRack for finding duplicates. If collected edition information is ever scraped, where would it be stored?

rohan-gt · 2020-09-11T12:06:10Z

Ah, okay I thought there was an API similar to ComicVine
I'll try to get some examples out
By duplicates I meant if you have both the trade paper back as well as the single issues within them separately, it would be useful to point those out. Comixology actually has the single issue links under the TPB page so if it's possible to store the IDs of the single issues within the TPB XML, you can reference it easily

rohan-gt · 2020-09-15T13:42:49Z

@SenorSmartyPants So I have files named:
The Books of Magic (1993) (Digital).cbr
Aquaman (2011-2016) Vol. 1 The Trench.cbr
which aren't scraped

SenorSmartyPants · 2020-09-15T15:02:11Z

These look like graphic novels or trade paperbacks. The search is currently pretty specific to single issues. I'll see what I can do (assuming I'm not blocked by Google).

Are you scraping in comicrack or with the mylar version?

What's in the comicinfo.xml? If you are in ComicRack you can select a book and right click 'copy data' to get that info.

rohan-gt · 2020-09-16T07:37:47Z

@SenorSmartyPants yes they are TPBs. I'm using ComicRack. There's no info generated since I get a message saying 0 comics scraped, 1 skipped. It seems easy to implement. You just need to fuzzy match the name with a high percentage score (95%) along with the year if it is provided

SenorSmartyPants · 2020-09-16T22:24:33Z

Comicrack will parse the file name, so there's probably proposed values at least of these books. So I'd still like a copy data output. And console output, which you can get if you run CR with the a shortcut like this
"C:\Program Files\ComicRack\ComicRack.exe" -ssc

You're welcome to submit a pull request as well. But I won't merge it until I can test it not working (which is why I want the data I'm asking for).

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Feature Request] Manual select #4

[Feature Request] Manual select #4

rohan-gt commented Sep 9, 2020

SenorSmartyPants commented Sep 9, 2020

rohan-gt commented Sep 10, 2020

SenorSmartyPants commented Sep 10, 2020

rohan-gt commented Sep 10, 2020 •

edited

Loading

SenorSmartyPants commented Sep 11, 2020

rohan-gt commented Sep 11, 2020

rohan-gt commented Sep 15, 2020 •

edited

Loading

SenorSmartyPants commented Sep 15, 2020

rohan-gt commented Sep 16, 2020 •

edited

Loading

SenorSmartyPants commented Sep 16, 2020 •

edited

Loading

[Feature Request] Manual select #4

[Feature Request] Manual select #4

Comments

rohan-gt commented Sep 9, 2020

SenorSmartyPants commented Sep 9, 2020

rohan-gt commented Sep 10, 2020

SenorSmartyPants commented Sep 10, 2020

rohan-gt commented Sep 10, 2020 • edited Loading

SenorSmartyPants commented Sep 11, 2020

rohan-gt commented Sep 11, 2020

rohan-gt commented Sep 15, 2020 • edited Loading

SenorSmartyPants commented Sep 15, 2020

rohan-gt commented Sep 16, 2020 • edited Loading

SenorSmartyPants commented Sep 16, 2020 • edited Loading

rohan-gt commented Sep 10, 2020 •

edited

Loading

rohan-gt commented Sep 15, 2020 •

edited

Loading

rohan-gt commented Sep 16, 2020 •

edited

Loading

SenorSmartyPants commented Sep 16, 2020 •

edited

Loading