RetroPie forum home
    • Recent
    • Tags
    • Popular
    • Home
    • Docs
    • Register
    • Login

    Versatile C++ game scraper: Skyscraper

    Scheduled Pinned Locked Moved Ideas and Development
    skyscraperscrapergamelist.xmlscrapinggithub
    1.6k Posts 113 Posters 1.5m Views
    Loading More Posts
    • Oldest to Newest
    • Newest to Oldest
    • Most Votes
    Reply
    • Reply as topic
    Log in to reply
    This topic has been deleted. Only users with topic management privileges can see it.
    • o0alucard0oO
      o0alucard0o
      last edited by

      I scraped NES roms for Attract-Mode with the latest Skyscraper version.
      The process correctly fills standard cache folders (covers, marquees, screenshots, wheels) but exports only snaps to Attract-Mode.

      NES.cfg

      romext               .zip
      system               Nintendo Entertainment System
      artwork    flyer           /mnt/c/Games/Attract-Mode/scraper/NES/flyer
      artwork    marquee         /mnt/c/Games/Attract-Mode/scraper/NES/marquee
      artwork    snap            /mnt/c/Games/Attract-Mode/scraper/NES/snap
      artwork    wheel           /mnt/c/Games/Attract-Mode/scraper/NES/wheel
      artwork    boxart          /mnt/c/Games/Attract-Mode/scraper/NES/boxart
      

      Skyscraper command(s):

      Skyscraper -p nes -i /mnt/c/Games/roms/nes/ -g /mnt/c/Games/Attract-Mode/romlists/ -f attractmode -e /mnt/c/Games/Skyscraper/NES.cfg -s screenscraper -u username:password --flags unattend,skipped
      Skyscraper -p nes -i /mnt/c/Games/roms/nes/ -g /mnt/c/Games/Attract-Mode/romlists/ -f attractmode -e /mnt/c/Games/Skyscraper/NES.cfg --flags unattend,skipped,forcefilename
      
      
      muldjordM 1 Reply Last reply Reply Quote 0
      • muldjordM
        muldjord @o0alucard0o
        last edited by muldjord

        @o0alucard0o Yeah, the default Skyscraper /home/pi/.skyscraper/artwork.xml is set up to look good with EmulationStation. For Attract-Mode you can go ahead and adjust it to fit your needs. For instance:

        <?xml version="1.0" encoding="UTF-8"?>
        <artwork>
          <output type="screenshot" width="640"/>
          <output type="cover" height="640"/>
          <output type="wheel" width="640"/>
          <output type="marquee" width="640"/>
        </artwork>
        

        When you've saved that, just regenerate the game list for Attract-Mode.

        This will export all of the resources Skyscraper supports and scale them according to aspect ratio to 640 pixels height or width. This is the most basic example I can give you. But the compositor supports a lot more.

        For the full documentation, go here.

        o0alucard0oO 1 Reply Last reply Reply Quote 1
        • o0alucard0oO
          o0alucard0o @muldjord
          last edited by

          @muldjord Thanks for the explanation. 👍

          muldjordM 1 Reply Last reply Reply Quote 1
          • muldjordM
            muldjord @o0alucard0o
            last edited by

            @o0alucard0o You're welcome, please see my edit. I just added width and height to some of them.

            1 Reply Last reply Reply Quote 1
            • o0alucard0oO
              o0alucard0o
              last edited by

              ...
              artwork    snap            /mnt/c/Games/Attract-Mode/scraper/Sega 32X/snap
              ...
              

              Hi,
              are white spaces supported in path extraction from Attract Mode emulators config?
              It seems that quotes, double quotes and backslash doesn't work.

              muldjordM 1 Reply Last reply Reply Quote 0
              • muldjordM
                muldjord @o0alucard0o
                last edited by muldjord

                @o0alucard0o I just fixed this. Please update to 3.5.8. You can now use spaces in paths. Do not use quotes, it's not necessary and will actually break it. So just enter it like this:

                artwork    snap            /mnt/c/Games/Attract-Mode/scraper/Sega 32X/snap
                
                o0alucard0oO 1 Reply Last reply Reply Quote 4
                • o0alucard0oO
                  o0alucard0o @muldjord
                  last edited by

                  @muldjord Thank you

                  1 Reply Last reply Reply Quote 1
                  • o0alucard0oO
                    o0alucard0o
                    last edited by o0alucard0o

                    Hi,
                    I recently scraped PC platform with about thirty titles.
                    Unfortunately, the generation step assigns wrong resources to games.
                    Ten games with same snap and/or description.

                    My setup for Windows PC platform, consists of Autoit (.au3) files. This files differ only in name, but the content is the same. Could it depend on this?

                    For example

                    #2/30 (T4) ---- Game 'BlazeRush' found! :) ---- 
                    Scraper:        cache 
                    Title:          'Yoku's Island Express' (thegamesdb) 
                    Platform:       'PC' (thegamesdb) 
                    
                    #3/30 (T2) ---- Game 'Cuphead' found! :) ----
                    Scraper:        cache
                    Title:          'Yoku's Island Express' (thegamesdb)
                    Platform:       'PC' (thegamesdb)
                    
                    muldjordM 1 Reply Last reply Reply Quote 0
                    • muldjordM
                      muldjord @o0alucard0o
                      last edited by

                      @o0alucard0o Yes, that is exactly the reason. For most files Skyscraper does a sha1 checksum to id them. But for some script files I do the sha1 checksum on the filename instead, since they either often change, of have the same contents. .au3 files are not currently on this list, so if your files have the same contents, it will have the same id in Skyscrapers cache.

                      I will add .au3 files to this list, so in future releases this won't be a problem. Thanks for reporting it.

                      1 Reply Last reply Reply Quote 1
                      • A
                        AlCzervik
                        last edited by

                        @muldjord Any reason when I rescraped my NES many of them now are showing "load screens" (e.g., Select 1 or 2 player screen) vs. gameplay screens?

                        muldjordM 1 Reply Last reply Reply Quote 0
                        • muldjordM
                          muldjord @AlCzervik
                          last edited by

                          @AlCzervik What source is it using for the screenshots? (you can check it with Skyscraper -p nes --cache edit FILENAME. The reason is that sometimes the source has the title screens in their databases as screenshots. It's quite normal, but is a bit of a challenge when scraping from them, since I don't know one screenshot from another.

                          A 1 Reply Last reply Reply Quote 0
                          • A
                            AlCzervik @muldjord
                            last edited by

                            @muldjord I'm using screenscraper. Is it random? As in, if I scrape again would it pull different images they've stored? Some examples I know changed were Super Mario 3, Metroid, Zelda 2.

                            muldjordM 1 Reply Last reply Reply Quote 0
                            • muldjordM
                              muldjord @AlCzervik
                              last edited by

                              @AlCzervik When scraping with -s screenscraper it will first look for a screenshot of the type ss which is usually a normal screenshot. If it doesn't find that, it will look for one called sstitlewhich, obviously, is the game title screen screenshot. So in this case, my guess is that it doesn't find any of type ss and therefore uses the sstitle screenshots.

                              A 1 Reply Last reply Reply Quote 0
                              • A
                                AlCzervik @muldjord
                                last edited by

                                @muldjord That must be it for those titles but it is strange that they changed recently. These are older NES titles so perhaps screenscraper changed for some of those titles. Not a big deal, just wanted to point it out. Thanks!

                                1 Reply Last reply Reply Quote 0
                                • T
                                  theshadowzero
                                  last edited by theshadowzero

                                  @muldjord i've got a quick question about skyscraper, i've tried messing with the --cache settings but haven't been able to quite hit my use case. i scraped all my roms over a couple days with skyscraper from screenscraper, but i had videos disabled. now, i'd like to add videos, but i haven't been able to figure out how to get skyscraper to just scrape videos. because the roms already have metadata in the cache, skyscraper seems to want to either skip them entirely, or with a --cache refresh completely scrape them fresh. is there a way to tell skyscraper "the metadata is fine, i just want you to scrape the videos that are missing and not rescrape all the other fields?"

                                  ive tried running it with just --flags videos and it asks me if i want to skip already existing game list entries. saying yes obviously skips all the roms, and saying no will show that it finds the rom, show all the metadata and media i have (with a big red NO() for video), and then do nothing and proceed to the next rom. if i scrape with --cache refresh it finds the rom and the video and will download and include it. i was under the impression skyscraper would automatically add media that is missing but that doesn't seem to be the case, or more likely i am doing something wrong. guidance would be appreciated! i don't want to spend another 2-3 days hammering screenscraper.fr to get videos when i already have 90% of the scrapes i need

                                  edit: i also tried with --cache edit:new=videos but it errors saying it only supports certain resources. not sure that would have solved it for me anyway since i fear it would have expected manual input rather than scraping but i tried it regardless

                                  muldjordM 1 Reply Last reply Reply Quote 0
                                  • muldjordM
                                    muldjord @theshadowzero
                                    last edited by muldjord

                                    @theshadowzero said in Versatile C++ game scraper: Skyscraper:

                                    is there a way to tell skyscraper "the metadata is fine, i just want you to scrape the videos that are missing and not rescrape all the other fields?"

                                    No, it will always scrape the metadata since it's "free". When a game is looked up, the metadata is already there in the data received from the server. So it would be pretty silly to not use it.
                                    It's the media that might take some time to grab - especially the videos of course.

                                    I would simply rescrape all of the games with videos enabled. That's the easiest for this use case. You will need --cache refresh for that to work.

                                    i also tried with --cache edit:new=videos but it errors saying it only supports certain resources. not sure that would have solved it for me anyway since i fear it would have expected manual input rather than scraping but i tried it regardless

                                    Correct, you cannot use --cache edit for this use case. It can only edit metadata and add metadata. Not media.

                                    1 Reply Last reply Reply Quote 0
                                    • muldjordM
                                      muldjord
                                      last edited by muldjord

                                      Skyscraper 3.5.9 released: https://github.com/muldjord/skyscraper

                                      • Implemented the new IGDB v4 authentication method. IGDB will now work again, and requires free credentials. Read more about that here
                                      • Improved memory consumption when handing entries back to main thread
                                      • Added '.au3' file extension to id script exception list (Thank you to 'o0alucard0o' for reporting this)

                                      Most prominent feature is the updated IGDB v4 API authentication method. This was broken due to IGDB moving away from their old v3 API. This is now fixed and requires free personal credentials. Read more about that here.

                                      Let me know if you run into issues.

                                      ClydeC 1 Reply Last reply Reply Quote 2
                                      • ClydeC
                                        Clyde @muldjord
                                        last edited by Clyde

                                        @muldjord Perfect timing as ScreenScraper is currently down! 😉

                                        Seriously, thanks for still working on this awsome tool. 👍

                                        muldjordM 1 Reply Last reply Reply Quote 1
                                        • muldjordM
                                          muldjord @Clyde
                                          last edited by muldjord

                                          @Clyde Yeah, they seem to be hit hard this time. I hope they aren't about to shut down... :S

                                          EDIT: Ok, just visited your link. So they are working on it. Good to know. :) Thanks.

                                          1 Reply Last reply Reply Quote 0
                                          • ClydeC
                                            Clyde
                                            last edited by Clyde

                                            Hi,
                                            I just noticed that Skyscraper doesn't seem to generate a valid gamelist.xml for Daphne, at least not from the file structure for Daphne that's described in the Docs.

                                            Since that structure requires directories instead of archive files, Skyscraper writes a <folder> element instead of <game>:

                                            <?xml version="1.0"?>
                                            <gameList>
                                              <folder>
                                                <path>./ace.daphne</path>
                                            ...
                                            

                                            This makes Emulation Station ignore the game's entry:

                                            Nov 19 23:26:24 lvl2: 	Parsing XML file "/home/pi/RetroPie/roms/daphne/gamelist.xml"...
                                            Nov 19 23:26:24 lvl1: 	gameList: folder doesn't already exist, won't create
                                            Nov 19 23:26:24 lvl0: 	Error finding/creating FileData for "/home/pi/RetroPie/roms/daphne/ace.daphne", skipping.
                                            

                                            Can you confirm this? Should I open an issue on Github?

                                            Thanks
                                            Clyde

                                            1 Reply Last reply Reply Quote 0
                                            • First post
                                              Last post

                                            Contributions to the project are always appreciated, so if you would like to support us with a donation you can do so here.

                                            Hosting provided by Mythic-Beasts. See the Hosting Information page for more information.