RetroPie forum home
    • Recent
    • Tags
    • Popular
    • Home
    • Docs
    • Register
    • Login

    Versatile C++ game scraper: Skyscraper

    Scheduled Pinned Locked Moved Ideas and Development
    skyscraperscrapergamelist.xmlscrapinggithub
    1.6k Posts 113 Posters 1.6m Views
    Loading More Posts
    • Oldest to Newest
    • Newest to Oldest
    • Most Votes
    Reply
    • Reply as topic
    Log in to reply
    This topic has been deleted. Only users with topic management privileges can see it.
    • ohmycommodoreO
      ohmycommodore
      last edited by

      Using it a week or so ago, it completely missed on some major systems like Sega and SNES and more. I'll have to try it again. Odd that I could see it scraping for those systems but in the end presented me with nothing.

      muldjordM 1 Reply Last reply Reply Quote 0
      • ohmycommodoreO
        ohmycommodore
        last edited by

        I recall I was going to try ScreenScraper as a source this next time.

        1 Reply Last reply Reply Quote 0
        • muldjordM
          muldjord @ohmycommodore
          last edited by muldjord

          @ohmycommodore So it showed the data while scraping? Then you probably forgot to generate the gamelists afterwards, or you ran it while EmulationStation was running. Please check the F.A.Q.

          1 Reply Last reply Reply Quote 1
          • muldjordM
            muldjord
            last edited by muldjord

            Interesting... While I was at work today, Github has decided to rewind my repository by several commits, without me touching it. Anyone tried this before? My code is literally just gone!

            How do I know this is a Github error? My 3.5.5 release is still there. The code is not. The repository seems to have reverted to 3.5.4. The 3.5.5 commits are nowhere to be found, except for the release, which can't exist without the commits.

            If I download the 3.5.5 release, all of the missing code is in the release. Even the version number is 3.5.5 in the release files. But not in the repository.

            I can't come up with a way I could have made this happen myself. All I ever do is git commit -m "MESSAGE" FILES, git tag TAG and git push origin TAG for releases. I never use branches or rebasing or any of that stuff. I never move the head manually either.

            This is pretty frustrating to be honest. Code gone missing is the last thing a developer wants.

            ClydeC 1 Reply Last reply Reply Quote 0
            • ClydeC
              Clyde @muldjord
              last edited by

              @muldjord My sincerest sympathies! I don't know if I understand you correctly, though. Is the code really gone or do you have a backup somewhere?

              muldjordM 1 Reply Last reply Reply Quote 0
              • mituM
                mitu Global Moderator
                last edited by

                @muldjord said in Versatile C++ game scraper: Skyscraper:

                If I download the 3.5.5 release, all of the missing code is in the release. Even the version number is 3.5.5 in the release files. But not in the repository.

                The commits still show up on the 3.5.5 tag, but don't seem associated with the master branch. Do they still show locally on you master branch ?

                1 Reply Last reply Reply Quote 0
                • muldjordM
                  muldjord @Clyde
                  last edited by muldjord

                  @Clyde Luckily I have the code in the release so I didn't loose anything. But it makes my eyes twitch to think about this even being possible. How can I trust Github to not do this again? If it is in fact their fault of course. Maybe I did something in my sleep on one of my other developer machines that would result in this... I just can't think of how.

                  @mitu Yeah, good catch, I hadn't seen that. I have no idea how it would get detached like that. I don't have the code on my current developer machine and git pull doesn't give it to me either. I'll have to check the other developer machines in a couple of days when I can get to them.

                  mituM 1 Reply Last reply Reply Quote 0
                  • mituM
                    mitu Global Moderator @muldjord
                    last edited by mitu

                    @muldjord I cloned the repo and the 'missing' commits show up (screenshot from gitk):

                    ffe4f9fb-1455-481f-8ef1-52df814f34ad-image.png

                    My guess you checked out 4.3.4 and then added the subsequent commits without a branch ? My git-fu is not so strong.

                    muldjordM 1 Reply Last reply Reply Quote 1
                    • muldjordM
                      muldjord @mitu
                      last edited by

                      @mitu I'll have to check the bash history on those other machines. It might shed some light on this. I really hope this is my own fault.

                      1 Reply Last reply Reply Quote 0
                      • muldjordM
                        muldjord
                        last edited by muldjord

                        @mitu , @Clyde Just a quick update on my little problem. I think I know what has happened. I probably made the 3.5.5 release on a different machine and forgot to push the local commits to master before tagging and pushing the tag to origin. So the code change from 3.5.4 to 3.5.5 was never pushed to master, but the tag was. I had no idea you could even do that, and I guess it's a testament to how well I actually remember to commit the changes since this has never happened to me before. I must've been tired.

                        I am really happy this was an error on my part. I was worried this was some weird mirror-caching Github error. It is not. Phew. :)

                        Guess who'll never forget to push local commits to master before tagging again. Ah, who am I kidding, I'll probably forget it again next week. :D

                        ClydeC 1 Reply Last reply Reply Quote 2
                        • ClydeC
                          Clyde @muldjord
                          last edited by

                          @muldjord Maybe there's a way to automate the push the commits before pushing the tags, or any other method to keep both machines on-par? (I don't know enough bash-fu for more than this suggestion.)

                          1 Reply Last reply Reply Quote 1
                          • muldjordM
                            muldjord
                            last edited by muldjord

                            I'll get to the other machine tomorrow and push the commits to master. I actually think that will fix it as the tag seems to be bound to a commit, which just happens to not be glued to master yet. So if that's the case it's not a big deal if I forget in the future.

                            1 Reply Last reply Reply Quote 0
                            • R
                              raspnoobie
                              last edited by

                              Hi,

                              Thanks for making this scraper; it is really nice to use it and looks better than the one that is in emulationstation.

                              I have a question regarding the script used in retropie: I don't see a parameter to use the scraper only for the roms that hasn't been parsed yet. This result in downloading the same games over and over , even if a different source scraped that game.
                              Is this a limitation of the existing script for retropie or it is not even an option, and require you to run the scraper by hand via terminal?

                              I wish there was an aggregate function that pars all sources in sequence, only for roms that were not identified; so you can leave it churning overninght :) Thanks !

                              muldjordM 1 Reply Last reply Reply Quote 0
                              • muldjordM
                                muldjord @raspnoobie
                                last edited by muldjord

                                @raspnoobie There is, but it's not implemented in the script yet. It's a command-line flag --flags onlymissing. It will make sure only games that have no data cached for them (from any source) are scraped with any source you choose. Be aware that if a game has even a single resource cached by any source, it will then be skipped. If instead you want to scrape only games that are missing a certain resource, you can use the --cache reportoption as documented here

                                R 1 Reply Last reply Reply Quote 0
                                • ClydeC
                                  Clyde
                                  last edited by Clyde

                                  Hello @muldjord, I recently ran into a problem on my new Pi 4 while starting to include Playstation 1 games into my Retropie setup.

                                  While Skyscraper does scrape covers, e.g. from Screenscraper & TheGamesDB, and they are saved in the covers cache directory, they won't be copied to ../psx/media/covers and thus, they're not used as Runcommand loading screens.

                                  Do I miss something? Any help is appreciated.

                                  muldjordM 1 Reply Last reply Reply Quote 0
                                  • muldjordM
                                    muldjord @Clyde
                                    last edited by

                                    @Clyde You need to edit your artwork.xml file and include an <output>node that outputs the cover. See the artwork docs here.

                                    1 Reply Last reply Reply Quote 0
                                    • ClydeC
                                      Clyde
                                      last edited by

                                      Thanks for the quick answer, though I'll admit that I'm a little over my head here. As far as I understand the artwork docs, a single Output node of the type "cover" placed inside of the <artwork> node should suffice:

                                        <output type="cover">
                                        <output>
                                      

                                      All other attributes of the Output node are marked as optional in the docs, but I still get no covers in the media directory.

                                      Is my node wrong? Do I have to re-scrape? … ?

                                      muldjordM 1 Reply Last reply Reply Quote 0
                                      • muldjordM
                                        muldjord @Clyde
                                        last edited by muldjord

                                        @Clyde Your output nodes are mismatched. It has to be:

                                        <output type="cover"/>
                                        

                                        Place it just before the </artwork> node. Xml nodes always come with a begin and an end. If there's no data nested in the node it can be escaped with a / at the end as seen above. It is equivalent to:

                                        <output type="cover"></output>
                                        

                                        You do not have to rescrape, the data is already cached (as long as you didn't disable the cover scraping when you scraped the files to begin with). Just regenerate the gamelist after making the change. That's the power of Skyscraper. Make a change, regenerate, and you're done!

                                        1 Reply Last reply Reply Quote 1
                                        • ClydeC
                                          Clyde
                                          last edited by

                                          Thank you very much. It's working now. 😀👍

                                          muldjordM 1 Reply Last reply Reply Quote 1
                                          • muldjordM
                                            muldjord @Clyde
                                            last edited by

                                            @Clyde Awesome! :)

                                            ClydeC 1 Reply Last reply Reply Quote 0
                                            • First post
                                              Last post

                                            Contributions to the project are always appreciated, so if you would like to support us with a donation you can do so here.

                                            Hosting provided by Mythic-Beasts. See the Hosting Information page for more information.