RetroPie forum home
    • Recent
    • Tags
    • Popular
    • Home
    • Docs
    • Register
    • Login

    [SOFT] New Scraper in the works

    Scheduled Pinned Locked Moved Projects and Themes
    scrapingscrappersoftware
    253 Posts 7 Posters 59.8k Views
    Loading More Posts
    • Oldest to Newest
    • Newest to Oldest
    • Most Votes
    Reply
    • Reply as topic
    Log in to reply
    This topic has been deleted. Only users with topic management privileges can see it.
    • kiroK
      kiro @Folly
      last edited by

      @Folly hi folly...I've quickly checked the backend and it returns format MP4 for the videos of all the games you mentioned, it's surprising to say the least. I will have a deeper check tomorrow.

      F 1 Reply Last reply Reply Quote 0
      • F
        Folly @kiro
        last edited by Folly

        @kiro

        I installed retroscraper-rpie on my debian buster vm 32bit x86 VM.
        I was thinking perhaps it will work on this VM.

        First I did a total update/upgrade/reboot.

        This is my log installing it all :
        https://pastebin.com/raw/gzJpdz1b

        I scraped the atari7800 roms and had the the same results as earlier. (same results as described earlier)
        (used : python3 retroscraper.py --systems atari7800 )

        Perhaps you can find something in my log.

        I'am not quite sure which dependancies are loaded by the script.
        If somehow the same dependancies are already installed on the OS in different paths with sudo and apt, for example.
        Could be that the wrong depandent programs are used from other paths (with an other version).
        I checked this theory by forcing only "$HOME/.local/bin" in the PATH like this :

        pi@VM-32bit:~/retroscraper-rpie$ PATH="$HOME/.local/bin"
        pi@VM-32bit:~/retroscraper-rpie$ /usr/bin/python3 retroscraper.py --systems atari7800 --cleanmedia
        

        Same result, so this doesn't seem the be the problem either.

        kiroK 1 Reply Last reply Reply Quote 0
        • kiroK
          kiro @Folly
          last edited by

          @Folly Thank you so much for all the testing.

          I've uploaded new fixes into git (for both retroscraper versions) that should solve the missing games in the gamelists plus the bezels configuration issue.

          I'm struggling to reproduce the .png issue (I tried exactly with atari7800) and nothing. Just to clear out possible issues, do you have some sort of proxy at your location/machine that could be caching the API requests?

          I'll keep investigating in the meantime, and will start working on the relative path thing today.

          Take care!

          F 1 Reply Last reply Reply Quote 0
          • F
            Folly @kiro
            last edited by Folly

            @kiro said in [SOFT] New Scraper in the works:

            @Folly Thank you so much for all the testing.

            I've uploaded new fixes into git (for both retroscraper versions) that should solve the missing games in the gamelists plus the bezels configuration issue.

            Great !
            I did some tests.

            python3 retroscraper.py --systems atari7800 --cleanmedia
            

            👍 - seems to be consistent now in detected games
            👍 - seems to add all the detected games now in gamelist.xml
            👍 - no game-specific configs are generated anymore

            python3 retroscraper.py --systems atari7800 --bezel --cleanmedia
            

            👍 - seems also to be consistent now in detected games
            👍 - seems also to add all the detected games now in gamelist.xml
            👍 - game-specific configs are generated as it should
            👍 - bezel folder is created and bezel files are downloaded

            I suspected that the bezel files would be in /home/pi/RetroPie/roms/atari7800/bezels but the were in /home/pi/RetroPie/overlays/atari7800/bezels.
            Is it an idea to use this folder instead for the future ? :
            /home/pi/RetroPie/roms/atari7800/media/retroarch/overlays/bezels
            Using the same structure as I proposed for the emulationstation media :
            /home/pi/RetroPie/roms/atari7800/media/emulationstation/<folders>
            Then you can backup all in one time and with your option --cleanmedia you only have to delete 1 folder (media) which would make it much simpler.
            Good idea ?

            I'm struggling to reproduce the .png issue (I tried exactly with atari7800) and nothing. Just to clear out possible issues, do you have some sort of proxy at your location/machine that could be caching the API requests?

            No I haven't.

            I'll keep investigating in the meantime, and will start working on the relative path thing today.

            I think that somehow string variables aren't updated correctly.
            Though I have too little knowledge of your script.

            Take care!

            Same !

            kiroK 1 Reply Last reply Reply Quote 0
            • kiroK
              kiro @Folly
              last edited by

              @Folly I've updated a new version of the 'light' retroscraper with a new flag '--debug' which will create a more complete log, maybe can you run it on the system tat still creates the png so we might understand what the problem is??

              I've scanned my whole romset and no issues thousands of roms to be honest)

              Thx!

              F 1 Reply Last reply Reply Quote 0
              • F
                Folly @kiro
                last edited by Folly

                @kiro

                OK, I will do.

                Found the log, but it's too big for pastebin.
                Probably paste it on the google-drive.

                kiroK 1 Reply Last reply Reply Quote 0
                • kiroK
                  kiro @Folly
                  last edited by

                  @Folly thanks!

                  F 1 Reply Last reply Reply Quote 0
                  • F
                    Folly @kiro
                    last edited by Folly

                    @kiro

                    retroscraper.log removed from google-drive.

                    Looks like the format variable is not updated correctly.

                    kiroK 1 Reply Last reply Reply Quote 0
                    • kiroK
                      kiro @Folly
                      last edited by

                      @Folly Thanks, will have a look right now.

                      On the other hand, I've updated both versions to cater for systems with 'non-standard names' such as 'shooters' :-) The only drawback is that the scraper will not be able to do a name search, due to the fact that it does not understand which is the actual system, but if checksums are in the DB they'll show up properly.

                      F 2 Replies Last reply Reply Quote 0
                      • F
                        Folly @kiro
                        last edited by Folly

                        @kiro

                        Cool will give it a go.

                        Yea, it works really nice !

                        Seems a bit slower.
                        Does it have to search longer in the database with an unknown system ?

                        1 Reply Last reply Reply Quote 0
                        • F
                          Folly @kiro
                          last edited by

                          @kiro

                          I am getting this error, nothing works anymore :

                          pi@raspberrypi:~/retroscraper-rpie $ python3 retroscraper.py --systems arcade
                          Loading RetroScraper config File
                          Traceback (most recent call last):
                            File "/home/pi/retroscraper-rpie/retroscraper.py", line 150, in <module>
                              trans = complete['en']
                          TypeError: list indices must be integers or slices, not str
                          
                          kiroK 1 Reply Last reply Reply Quote 0
                          • kiroK
                            kiro @Folly
                            last edited by

                            @Folly Hi yes, I'm correcting the issue with the PNG, which means I had to stop the backend for a while and resync DBs (that was the issue hopefully) should be up and running anytime soon. Need to make that error more clear :-) Thanks!

                            F 1 Reply Last reply Reply Quote 0
                            • F
                              Folly @kiro
                              last edited by Folly

                              @kiro

                              Aha, that's good ;-)
                              Pfff, I was a bit afraid everything was broken.

                              Perfect, now showing this :

                              pi@raspberrypi:~/retroscraper-rpie $ python3 retroscraper.py --systems megaplay
                              Loading RetroScraper config File
                              CANNOT CONNECT TO THE BACKEND, PLEASE TRY AGAIN LATER
                              
                              kiroK 1 Reply Last reply Reply Quote 0
                              • kiroK
                                kiro @Folly
                                last edited by

                                @Folly I've uploaded new versions with the friendly error message... DB is in sync again, png issue should be solved hopefully :-) now the system is up and running again, let me know how it goes when you can test. Thanks!

                                I'll be having a look at relative paths this week. I was thinking of adding a --realtivepaths flag and a --mediadir flag, first will add the './' instead of the full path, and second would add the possibility to choose the media down dir (starting by './' would make it relative to the current system path). Was thinking also of a '--singlemediadir' to put everything under the same folder, which would work only together with the '--mediadir' flag. What do you think?

                                F 2 Replies Last reply Reply Quote 0
                                • F
                                  Folly @kiro
                                  last edited by Folly

                                  @kiro

                                  Really cool if you can add all that.
                                  Seems to be what I am looking for.

                                  Will do a clone now and report.
                                  Looks very good !!!
                                  👍 - videos are now correctly named *.mp4
                                  👍 - now also similar speed between known systems and unknown systems

                                  Was the *.png issue in the database or back-end ?

                                  kiroK 1 Reply Last reply Reply Quote 0
                                  • kiroK
                                    kiro @Folly
                                    last edited by

                                    @Folly it was a backend issue... in order to gain speed the backend is just a collection of files created from the DB (so no actual engine running at the server)... the CRC's are symbolic links to the actual games objects, and for some reason, these links were not updated properly, so yes, I had to recreate 2.4 million links :-) luckily not manually!

                                    1 Reply Last reply Reply Quote 1
                                    • F
                                      Folly @kiro
                                      last edited by Folly

                                      @kiro

                                      I must say it' now very much improved.
                                      I tested the newly scraped systems in emulationstattion and off-course they work.
                                      However, still quite some videos only play sound.
                                      An example is mamerom pc_mario from playchoice10.

                                      kiroK 1 Reply Last reply Reply Quote 0
                                      • kiroK
                                        kiro @Folly
                                        last edited by

                                        @Folly will have a look at those videos..thanks!

                                        kiroK 1 Reply Last reply Reply Quote 0
                                        • kiroK
                                          kiro @kiro
                                          last edited by

                                          @Folly ... This is the 'frontend' to the backend:

                                          http://77.68.23.83/

                                          It may work perfectly as it is more of an experiment for me.

                                          There you may want to check if the videos you see (they are the same as the one grabbed by the scraper) are working as the ones downloaded.

                                          Thx

                                          F 1 Reply Last reply Reply Quote 0
                                          • F
                                            Folly @kiro
                                            last edited by Folly

                                            @kiro
                                            Yes, they are the same :

                                            pi@raspberrypi:~/Downloads $ sha1sum download.mp4 
                                            5370f9a66196ae8a3ab82fe81347617c45a82f8a  download.mp4
                                            pi@raspberrypi:~/Downloads $ cd ..
                                            pi@raspberrypi:~ $ cd RetroPie/roms/playchoice10/videos
                                            pi@raspberrypi:~/RetroPie/roms/playchoice10/videos $ sha1sum pc_mario.mp4 
                                            5370f9a66196ae8a3ab82fe81347617c45a82f8a  pc_mario.mp4
                                            

                                            The video gives also black screen in vlc.
                                            I think it could be somehow a codec problem.
                                            Perhaps I am missing a codec dependency .

                                            kiroK 1 Reply Last reply Reply Quote 0
                                            • F Folly referenced this topic on
                                            • First post
                                              Last post

                                            Contributions to the project are always appreciated, so if you would like to support us with a donation you can do so here.

                                            Hosting provided by Mythic-Beasts. See the Hosting Information page for more information.