Versatile C++ game scraper: Skyscraper
-
@muldjord I'm having an issue with a few of my roms scraping incorrectly in simple mode.
They are showing incorrect names in EmulationStation.
I can give specific examples if that helps.
-
@maroonout09 It is not uncommon for a few roms to scrape incorrectly. I assume the name they are scraped as are quite close to the the one you expect. Skyscraper is based on filename searches for some modules, and checksum searches for others and use several different tricks to try and be as precise as possible. But there will be false positives, it cannot be avoided.
But yes, please give examples and also what version of Skyscraper you are running (important). I would like to make sure it is the expected behaviour and not something else entirely.
Quick note: If you want to avoid false positives completely, set '-m 100' on command line or 'minMatch' in '[homedir]/.skyscraper/config.ini'. Then it will only allow 100% correct results. But keep in mind that you will also loose a lot of the correct results if you do so. It's a bit of a balancing act.
-
@muldjord Here are the games that I found that were scraped incorrectly:
Filename: Pokemon_-_Yellow_Version.gbc
Scraped Name: Robopon: Sun Version
Comments: The scrape also included the description for Robopon: Sun Version, and for some reason, the images for Pokemon: Gold Version.Filename: Super_Mario_Advance.gba
Scraped Name: Chaoji Maliou Shijie
Comments: The scrape had the correct description and images.Filename: Super_Mario_Advance_3_-_Yoshi's_Island.gba
Scraped Name: Yaoxi Dao
Comments: The scrape had the correct description and images.Filename: Wario_Land_4.gba
Scraped Name: Waliou Xunbao Ji
Comments: The scrape had the correct description and images.I think those may have been the only ones that scraped incorrectly.
I'm using Skyscraper v2.4.3.
-
@maroonout09
Just tested all of them, these are the reasons and what you can do about it:Pokemon_-_Yellow_Version.gbc:
It returns a match for Robopon: Sun Version because of the "-" in the filename (it will include this in the search which messes with it, I will consider removing these dashes automatically in 2.4.4). And since that name matches 83%, it accepts it. You can make it work by changing the name of that file to "Pokemon_Yellow.gbc"Super_Mario_Advance.gba / Super_Mario_Advance_3_-_Yoshi's_Island.gba / Wario_Land_4.gba:
These titles are actually correct, they are just the 'wor' region titles for them and are the titles ScreenScraper returns for them. I was not aware that the 'wor' titles were sometimes to the japanese titles, so I'll prioritize the 'eu' and 'us' titles higher for the next release (2.4.4). In the meantime, please set 'region' manually with '--region us' or '--region eu' to prevent this from happening.Thank you for reporting this, I appreciate it.
-
@muldjord For Amiga: Deluxe Pacman is scraped as Deluxe Pac Man, and not found. Rock n Roll is not found, too.
With .lha files it doesnt add [AGA] anymore. Not really important though.Edit: Shame that we cant use LemonAmiga or HOL.
-
@analoghero [AGA]'s will be back in 2.4.4. :) And so will [CD32], [CDTV] and [Demo].
You can change the filenames of your lha's if you want better results. Try changing "DeluxePacManxxx.lha" to "DeluxePacmanxxx.lha" for instance, that might fix it. But for now many Amiga games with .lha suffix will scrape wrongfully since I have to convert the filenames on the fly to add spaces, and that is just bound to be a problem.
I'm working with Dom from the Amiberry team for a better solution in the future. But for now, this will have to do. I also would like to point out that Skyscraper is the only scraper to even support the .lha's at this point, so I guess anything is better than nothing. Skyscraper scrapes about 75% of the lha's at the moment.
EDIT: Agreed, I actually supported LemonAmiga and HOL half a year ago, but had to remove support since I couldn't get official permission to scrape from their sites... :S I never got a reply to my emails if I recall correctly. And without permission I won't use them of course.
-
@muldjord Yes i know that they were once supported, but removed. I think they assume a scraper for a well known platform such as retropie will cause a lot of traffic. Good idea just to rename files. Will try that. :)
-
@muldjord Thank you very much for your help!
-
@maroonout09 You're welcome. Good luck with it! :)
-
Just for reference we have also been testing this on our RetroPie base image for the Odroid XU4 and it does work well. The only item of note that we have found is that with that board a lot more folks use small EMMC or microSD cards for the base and then an ext drive for their games/media. With how the db is storing what we can tell are duplicates in the cache for quicker results when performing a rescrape it is easy to chew up the remaining space on the OS "drive" and filling it very quickly. Excellent work tho with how great the metadata that is returned for the gamelists and also the media itself.
-
@fnkngrv Thank you, glad you like it. You can change the dbFolder with '-d' and I will make sure it can be set in the config.ini file for the next release aswell. Then you can create a config.ini and add the 'dbFolder="[db base folder]"' in the main section of it, and it will put the cache there for all platforms in subfolders. That should give you the dynamic you are looking for. Will be in 2.4.4.
-
@muldjord Can you control in the sourcecode which image a scraper module downloads? When scraping amiga with openretro it sometimes gives strange results. For example RickDangerous_v1.3_2294.lha returns a screenshot from the trainermenu. ! It looks Like this but i cant find it on openretro.
Since screenscraper isnt an option with lha files, i manually imported some to replace the strange ones.
-
@analoghero I found the flaw. It seems that it returns all screenshots, including those from the cracked versions, most of them are just hidden but still exist in the source. So my function to return the screenshot even looks through the hidden ones. I'll fix this in 2.4.4 so it only chooses between the main ones. Thank you for reporting this. :) It's really helpful!
-
@muldjord Feeling bad for disturbing your break from Skyscraper. Dont know how but if i can help you with development in any other form then reporting minor bugs dont hesitate to ask. Maybe i can do something.
-
@analoghero Reporting those bugs are really important, so it is immensely appreciated on my end. I wanna focus on the stability for a while, to make sure as many people as possible using Skyscraper have a problem-free experience with it.
I've cut down from about 3 hours daily to about 1 hour daily working on Skyscraper now. It seems to work pretty well for me. And with the help of you and others reporting problems, it has gotten really stable it seems, which makes me happy and motivates me to do further work. :)
Currently working on the MobyGames module. As mentioned earlier, the module won't be included in the Simple Mode scripts, simply because their limits would make it unusable. But it will be perfect for filling in the gaps for a few roms at a time.
EDIT: Oh, and I got my hands on a B+, so really looking forward to "starting over" and getting everything set up again. :)
-
How can I enable to show just the cover then after that the video?
-
@edwardthehuman You can do this by editing the artwork.xml file in /home/pi/.skyscraper
Im no expert with editing this xml files. Please read: https://github.com/muldjord/skyscraper/blob/master/ARTWORK.md for more information.
-
Happy easter to you and your families. Enjoy your time. Hope you dont find any empty easter presents, as its also aprils fools day.
-
Thanks @AnalogHero
Happy Easter to everyone!
-
Skyscraper version 2.4.4 released: https://github.com/muldjord/skyscraper
- Changed openretro 'wheel' type to 'marquee' type. It wasn't really a wheel
- 'openretro' now filters out variants to avoid bad screenshots (Thank you AnalogHero)
- Now allows you to set custom dbFolder in config.ini both under main and platform sections
- For Amiga it now searches for "Demo", "aga", "cdtv" and "cd32" in filenames and converts to [] bracket names (Thanks Dom)
- Region 'wor' has been moved below 'eu' and 'us' for Screenscraper since it often returns Japanese titles (Thanks maroonout09)
- Now turns '-' into ' ' in filenames before doing searches (Thanks maroonout09)
- Simplified thread result communication with main thread from two functions to just one
- Now again properly tests for a scraping run with 30 consecutive misses, then exits
Just wanted to get this out there while I am working on the mobygames module. In other words, the mobygames module is NOT implemented in this release.
Instead there's some user requested stuff and also some minor fixes. It's all there in the release notes. :)
Contributions to the project are always appreciated, so if you would like to support us with a donation you can do so here.
Hosting provided by Mythic-Beasts. See the Hosting Information page for more information.