Versatile C++ game scraper: Skyscraper
-
@mitu Thank you! This is exactly what I was looking for. I have had good results with screenscraper for console scraping, so will probably stick with it for arcade then. I havent gotten very accurate results with the gamesdb (probably because it searches based on rom name).
-
i was just scraping my gba games and i noticed i got a lot of errors like this:
ScreenScraper APIv2 returned invalid XML for the following query: romnom=Dragon%20Ball%20Z%20-%20Buu_s%20Fury%20%28USA%29.gba&crc=1C1707F&md5=3A74FCE97F1EA2B28C2A50EC3DF0ACEE&sha1=F1C4B07554D2A3B1AD2F325307051E775CE68087&romtaille=8388608
it used to scrape all my gba games perfectly, so i tried to update to latest version, but it still persists resulting in a fail rate of about 6-7% of the games, where it used to find basicly every game in earlier versions of skyscraper
-
@Halvhjearne You may be running into https://retropie.org.uk/forum/topic/11826/versatile-c-game-scraper-skyscraper/1123. It's not a Skyscraper problem.
-
@mitu said in Versatile C++ game scraper: Skyscraper:
@Halvhjearne You may be running into https://retropie.org.uk/forum/topic/11826/versatile-c-game-scraper-skyscraper/1123. It's not a Skyscraper problem.
im a registered donor and skyscraper recognizes i have 6 threads available ...
-
@Halvhjearne The screenscraper site dashboard shows their CPU at 294% right now, so I wouldn't be surprised if their system overload would result in the errors you're getting.
-
@mitu
scraping again without the --refresh seems to work fine ... -
@Halvhjearne Why are you using
--refresh
by default ? It forces Skyscraper to re-fetch the data when you already have it in cache. -
@mitu
im not using it by default, but i do run skyscraper sometimes with the --refresh to see if there is any missing artwork available ... -
That's not what
--refresh
should be used for. If you don't have the corresponding artwork in the cache (marquee, cover, screenshot, video), Skyscraper would try and get it, so you don't have to add--refresh
to get missing artwork. @muldjord can correct me if I'm wrong, but forcing--refresh
just negates any advantage of having a cache. -
@mitu said in Versatile C++ game scraper: Skyscraper:
That's not what
--refresh
should be used for. If you don't have the corresponding artwork in the cache (marquee, cover, screenshot, video), Skyscraper would try and get it, so you don't have to add--refresh
to get missing artwork. @muldjord can correct me if I'm wrong, but forcing--refresh
just negates any advantage of having a cache.This is true, but there is one issue where it helps. When the screenscraper servers are overloaded, some of the media might not be scraped for a certain game, even though the textual data is (each artwork resource is a separate request that might fail due to the high load). So in those (hopefully rare cases, especially for registered users) they will need to use
--refresh
to grab them.With that said, I would always recommend only scraping a few roms with
--refresh
enabled. Instead, just add those few files to command line to grab the missing data. That is much better than scraping everything again. -
@muldjord Is there a way to use the
--refresh
function only on roms that are detected as missing some media? I.e. screenshot, box art or description are missing? Im not sure if what I am asking makes sense. Unfortunately for me I scraped hundreds of roms before realizing that credentials for screenscraper made such a difference. Now there are tons of roms that are missing some media. Going through and checking them one by one will be a huge task. -
@quicksilver said in Versatile C++ game scraper: Skyscraper:
@muldjord Is there a way to use the
--refresh
function only on roms that are detected as missing some media? I.e. screenshot, box art or description are missing? Im not sure if what I am asking makes sense. Unfortunately for me I scraped hundreds of roms before realizing that credentials for screenscraper made such a difference. Now there are tons of roms that are missing some media. Going through and checking them one by one will be a huge task.What you are asking makes perfect sense, but it is not possible to do this at the moment I'm afraid. I had not anticipated so many requests to be rejected, so this is actually quite a big problem at the moment. And it's a bit of a bad circle to get into. People will notice media being missing, and then they will start using refresh, which again puts even more load on the source servers and so on...
I am wondering if screenscraper has any sort of server caching installed on their service. This could potentially alleviate the problem. I've asked on their forum.
-
@muldjord So I'm slowly but surely getting all of ROMs scraped using skyscraper and the results are simply amazing. I should have done this long ago!
I just wanted to thank you for such a great scraper! I have read back through a lot of the posts on this thread and your commitment to creating a great program clearly shows. I hope you take great pride in this! The next time I'm at my computer I will be sure to send a tip your way. Thanks again for all your hard work!
-
@quicksilver said in Versatile C++ game scraper: Skyscraper:
@muldjord So I'm slowly but surely getting all of ROMs scraped using skyscraper and the results are simply amazing. I should have done this long ago!
I just wanted to thank you for such a great scraper! I have read back through a lot of the posts on this thread and your commitment to creating a great program clearly shows. I hope you take great pride in this! The next time I'm at my computer I will be sure to send a tip your way. Thanks again for all your hard work!
You are very welcome and thanks! Skyscraper is sort of a passion project of mine and I probably spend more time on it than I should. :D
-
@muldjord sorry to be a nuisance again so soon, but I'm having issues generating Gameslist for "ports". Using the GUI I was unable to scrape any games because the platform is set as "ports" which the sources don't recognize as a valid platform. So I scraped them all from the commandline using pc as the platform. However, when I go to generate the Gameslist it just says the games cannot be found.
Edit: I see scummvm set to look for PC and Amiga matches. Perhaps ports can be set to look at pc?
-
@quicksilver said in Versatile C++ game scraper: Skyscraper:
Using the GUI I was unable to scrape any games because the platform is set as "ports" which the sources don't recognize as a valid platform.
The
ports
platform already has several platform aliases. Among them ispc
, so it should work just fine with the file name based modules provided your scripts have a good naming convention (Such asGame Name.sh
). It will scrape.sh
file when using ports.screenscraper
might give you issues though, as I don't send a proper system id with the query since I don't know what platform the games will be for from the get-go as I do for other platforms. So there you will have to work around it as you already do.EDIT: I actually just noticed that I can make this work with screenscraper aswell. The
PC Dos
platform they have is the parent ID also for the Windows platform. So I can set that and it will work.If you want to test it out you can simply set line 571 in
screenscraper.cpp
to readreturn "135";
. Then compile Skyscraper withmake
and thensudo make install
. But this will only work if you've compiled it yourself using the approach from the github documentation. -
@muldjord the strange thing is thegamesdb will find the games if I search manually from the commandline with the platform set as pc. But if I search using the GUI it will return all games as not found.
I installed skyscraper through the retropie setup script. In order to test your fix would I have to uninstall and build it manually?
-
@quicksilver Hm, I'll see if I can reproduce this. Can you give me the details - name of the file, folder (I guess it's
ports
?) ? -
@mitu correct the game folders+the .sh scripts to run the games are in my /roms/ports/ directory.
If I attempt to scrape the .sh scripts on the command line using "pc" as the platform I will get successful scrapes for doom, doom 2, quake, hexen etc. But it won't work if the platform is set as "ports". Screenscraper shows a message that it's not a supported platform and the gamesdb can't find anything (unless I change the platform to PC like I said). Oddly enough, if I trick it by scraping my ports using PC as the platform the Gameslist will not generate anything.
Edit: and speaking of PC, it looks like skyscraper doesn't recognize .conf files as a valid extension in my PC folder.
-
@quicksilver said in Versatile C++ game scraper: Skyscraper:
if I trick it by scraping my ports using PC as the platform the Gameslist will not generate anything.
How exactly are you 'tricking it' ? Are you copying the
.sh
files to thepc
folder and scraping it from there ?
Contributions to the project are always appreciated, so if you would like to support us with a donation you can do so here.
Hosting provided by Mythic-Beasts. See the Hosting Information page for more information.