RetroPie forum home
    • Recent
    • Tags
    • Popular
    • Home
    • Docs
    • Register
    • Login

    Versatile C++ game scraper: Skyscraper

    Scheduled Pinned Locked Moved Ideas and Development
    skyscraperscrapergamelist.xmlscrapinggithub
    1.6k Posts 113 Posters 1.5m Views
    Loading More Posts
    • Oldest to Newest
    • Newest to Oldest
    • Most Votes
    Reply
    • Reply as topic
    Log in to reply
    This topic has been deleted. Only users with topic management privileges can see it.
    • BladeHunterB
      BladeHunter
      last edited by

      @screech, I just noticed that V2 doesn't list all the roms in a game:

      https://www.screenscraper.fr/api/jeuInfos.php?devid=<removed>&devpassword=<removed>&devsoftname=RMS01output=xml&ssid=<removed>&sspassword=<removed>&md5=45e12558e1b9eda1b66b86f68f995f2a&sha1=533c497761cc802624708108bc9ef6e7c4f40296

      Look at the bottom, there are 3 roms listed, 2 zip and 1 j64

      https://www.screenscraper.fr/api2/jeuInfos.php?devid=<removed>&devpassword=<removed>&devsoftname=RMS01output=xml&ssid=<removed>&sspassword=<removed>&md5=45e12558e1b9eda1b66b86f68f995f2a&sha1=533c497761cc802624708108bc9ef6e7c4f40296

      Only the j64 is listed.

      Whilst this is the ROM I have specified in the url from the MD5 and SHA-1 shouldn't it still list the whole collection because it's the gameinfo part of the API not a ROMinfo part.

      1 Reply Last reply Reply Quote 0
      • AnalogHeroA
        AnalogHero
        last edited by AnalogHero

        Soooooo...... i fried my 2 month old 64 gb sd card (stupid), which gave me the "opportunity" to rescrape my systems with the latest version.

        And i must say that the old versions were better due to more scrape-sources. I know what the issues were, but i wish that i had a backup of either the old Skyscraper or the localdb.

        Other than that it works flawless. :)

        One thing what could be added is that if it doesnt find a match in the current module, try another one, as an option.

        Edit: Scraping psx with screenscraper aborts with std::bad_alloc

        1 Reply Last reply Reply Quote 0
        • muldjordM
          muldjord
          last edited by muldjord

          Skyscraper 2.0.2 released: https://github.com/muldjord/skyscraper

          • Updated 'arcadedb' result parsing to fit new format
          • Now scrapes 'msx' platform families correctly with the 'screenscraper' module
          • Changed limit for iso checksumming to 20 megs to avoid running out of memory.

          Mostly minor stuff and a bug fix. :)

          1 Reply Last reply Reply Quote 1
          • muldjordM
            muldjord
            last edited by muldjord

            @AnalogHero, The crash comes from running out of memory when trying to do a sha1 checksum of the iso files. I've changed the limit in 2.0.2 so please try that and see if it works. Either way, the psx platform won't work with the screenscraper module, simply because the raspberry pi doesn't have enough memory to checksum the iso files. So you'll have to use thegamesdb for those. :)

            Concerning trying another module while scraping if it doesn't find one with the current one: This won't be implemented, simply because it's already possible by scraping with another scraper afterwards. Just choose to "skip existing". :) And always remember to finish off by scraping with 'localdb' to get the best result from all cached data. :) Have fun!

            mituM 1 Reply Last reply Reply Quote 0
            • muldjordM
              muldjord
              last edited by

              Skyscraper 2.0.3 released: https://github.com/muldjord/skyscraper

              • Added '--lang' and '--region' command line options. Defaults to 'us' and 'wor' if none is provided.
              • Language and region support now implemented for 'screenscraper' module.
              • Removed 'stats' option as it didn't make much sense anymore.

              You can now set language and region which will be used currently by the 'screenscraper' module only. In a later release I will implement the option to get a list of supported languages and regions. But for now I just wanted to enable it. It defaults to language 'us' and region 'wor' if none is provided.

              1 Reply Last reply Reply Quote 0
              • mituM
                mitu Global Moderator @muldjord
                last edited by

                @muldjord said in Versatile C++ game scraper: Skyscraper:

                Either way, the psx platform won't work with the screenscraper module, simply because the raspberry pi doesn't have enough memory to checksum the iso files.

                That's not necessarily true - the md5sum standard utility can operate in a constant memory space, no matter how large is the input stream/file. The checksumming does not require the whole file contents to be in memory, MD5 requires only reading sequentially 512 bits from the input.

                muldjordM 1 Reply Last reply Reply Quote 0
                • muldjordM
                  muldjord @mitu
                  last edited by

                  @mitu Thanks for pointing this out. I actually started doing sha1 and md5 summing differently in 2.0.2. Instead of feeding the functions the entire file data I feed it bits of 1 Meg until it's done. So that would actually mean that it has already been fixed. :)

                  1 Reply Last reply Reply Quote 0
                  • AnalogHeroA
                    AnalogHero
                    last edited by AnalogHero

                    @muldjord Works with psx now. A little slow, but no problem if you dont have a big psx collection like me.

                    Thanks for the update. :)

                    muldjordM 1 Reply Last reply Reply Quote 1
                    • muldjordM
                      muldjord @AnalogHero
                      last edited by

                      @analoghero Glad it works :)

                      I'm gonna start work on a gui now. This will be a whopper so it'll be a while before the next release.

                      1 Reply Last reply Reply Quote 0
                      • BenMcLeanB
                        BenMcLean
                        last edited by BenMcLean

                        EmulationStation needs a way to quickly delete incorrect scraped info on a game in the GUI from the gamepad without deleting the game. That's more important than the actual scraper being improved IMO. But this is still great though. Thanks for all your hard work.

                        ? 1 Reply Last reply Reply Quote 0
                        • ?
                          A Former User @BenMcLean
                          last edited by

                          @benmclean Yr barking up the wrong tree as @muldjord doesn't develop for ES.

                          @muldjord Is there a feature, or are you planning on a feature to scrape a single element and append the gamelist? Say, I wanted the title and genre from GamesDB, published and developed info from Screenscraper, etc.

                          muldjordM 1 Reply Last reply Reply Quote 0
                          • muldjordM
                            muldjord @A Former User
                            last edited by

                            @livefastcyyoung There's not a feature to just scrape one title per se. But you can always just copy just 1 rom to a different folder, set the input folder to that folder with '-i'. Then scrape that rom using the different modules 1 by 1, then prioritize the resource type in '~/.skyscraper/dbs/[platform]/priorities.xml' and finish by scraping using '-s localdb'. That would give you exactly the result you are asking for, except it would have created a new gamelist.xml file with just that entry in it, so you'd have to copy the files over manually afterwards.

                            If you are in doubt about the features I mention, please check the readme's on github. It's all very well explained in there (although I am aware that it can seem confusing since Skyscraper can do so many things! Hence why I am working on a gui).

                            I do not have plans to enable scraping just 1 rom from a folder in a simple way. It doesn't work well with the current design of the code and how it creates the gamelist.xml, sorry.

                            ? 1 Reply Last reply Reply Quote 0
                            • ?
                              A Former User @muldjord
                              last edited by

                              @muldjord I think I was misunderstood. I wasn’t referring to a single rom. I was referring to scraping a set of roms as we can now but setting a flag to just scrape the genre, release date, etc.

                              I’ve found no one database has all elements in great shape so putting together a gamelist by scraping the elements that database does best seems like the next best alternative.

                              Hopefully that made sense now.

                              muldjordM 1 Reply Last reply Reply Quote 0
                              • AnalogHeroA
                                AnalogHero
                                last edited by AnalogHero

                                @LiveFastCyYoung Im not sure that this is what youre looking for but you can set priorities in the priorites.xml, but its only for a whole system. You cant say use info A on rom A and info B on rom B afaik.

                                A gui will really help here imo.

                                muldjordM 1 Reply Last reply Reply Quote 1
                                • muldjordM
                                  muldjord @A Former User
                                  last edited by

                                  @livefastcyyoung Makes perfect sense now. That is not currently supported by Skyscraper but it can do just what you want in a similar way. Have you read up on the '-s localdb' scraping module contained in Skyscraper? You can basically do what you are referring to simply prioritizing the resources correctly. It's all in the readme's. :)

                                  1 Reply Last reply Reply Quote 0
                                  • muldjordM
                                    muldjord @AnalogHero
                                    last edited by

                                    @AnalogHero Exactly, beat me to the punch :)

                                    1 Reply Last reply Reply Quote 0
                                    • AnalogHeroA
                                      AnalogHero
                                      last edited by

                                      @muldjord Tested c64 with screenscraper module. Doesnt produce any good output, infact none (aborts after 30 fails) although there are c64 games in the screenscraper database online. Any idea?

                                      muldjordM 1 Reply Last reply Reply Quote 0
                                      • muldjordM
                                        muldjord @AnalogHero
                                        last edited by

                                        @AnalogHero Basically c64 has a shitstorm of different versions of every single game available (I've literally seen 1 game with 130 different d64 images available, and that's just for the d64 format!). It is not very useful to scrape it with screescraper in my experience. But I'll look into it later and see if something is in fact buggy. :)

                                        1 Reply Last reply Reply Quote 0
                                        • AnalogHeroA
                                          AnalogHero
                                          last edited by

                                          @muldjord Yeah, true. Scraping with crc doesnt make sense here. Each crack would have a different checksum. Maybe going by name is better. Is this possible with screenscraper?

                                          muldjordM 1 Reply Last reply Reply Quote 0
                                          • muldjordM
                                            muldjord @AnalogHero
                                            last edited by muldjord

                                            @analoghero Unfortunately not. They only support checksum search and EXACT filename match (which is also one of the parses I currently do). So you'd have to use thegamesdb.

                                            1 Reply Last reply Reply Quote 0
                                            • First post
                                              Last post

                                            Contributions to the project are always appreciated, so if you would like to support us with a donation you can do so here.

                                            Hosting provided by Mythic-Beasts. See the Hosting Information page for more information.