RetroPie forum home
    • Recent
    • Tags
    • Popular
    • Home
    • Docs
    • Register
    • Login

    [SOFT] New Scraper in the works

    Scheduled Pinned Locked Moved Projects and Themes
    scrapingscrappersoftware
    253 Posts 7 Posters 59.7k Views
    Loading More Posts
    • Oldest to Newest
    • Newest to Oldest
    • Most Votes
    Reply
    • Reply as topic
    Log in to reply
    This topic has been deleted. Only users with topic management privileges can see it.
    • kiroK
      kiro
      last edited by

      If anyone is interested, I've released the source code here: https://github.com/zayamatias/retroscraper

      Enjoy :-)

      kiroK 1 Reply Last reply Reply Quote 1
      • kiroK
        kiro @kiro
        last edited by

        A new version is out, it allows you to keep your favorites and play count after scraping (among other bug fixes).

        https://github.com/zayamatias/retroscraper

        kiroK 1 Reply Last reply Reply Quote 0
        • kiroK kiro referenced this topic on
        • kiroK
          kiro @kiro
          last edited by

          A new slimmed-down version of the scraper is available and should be easier to install and run, check it out here:

          https://github.com/zayamatias/retroscraper-rpie

          Thanks for your feedback!

          F 1 Reply Last reply Reply Quote 0
          • F
            Folly @kiro
            last edited by Folly

            @kiro

            Hi,
            Perhaps you knew already but I am the developer of this script.

            So for me it would be nice to find a good solution for generating gamelists that can be shared from within the script or perhaps could even be scraped from within the script.

            We already have some predefined gamelists with media that can be downloaded from within the script for the categories :
            konamih, tigerh , etc. (many done by @DTEAM)

            The script can install arcade categories like "shooter" or "pinball" too.
            However, for these categories there aren't predefined gamelists with media yet.
            Sadly these "categories" are not recognised by your scraper.
            Basically not a big problem so I renamed them to arcade and scraped them and renamed them back.
            Now I should have a proper gamelist+media, right.
            Well It doesn't work that way because the gamelist.xml contains full paths to files so I have to rename the roms directory from /arcade/ to /shooter/ to get it working again.

            Well, we had a different approach with our predefined gamelists.
            For our predefined gamelists, have a look here :
            https://drive.google.com/drive/folders/1f_jXMG0XMBdyOOBpz8CHM6AFj9vC1R6m
            You will see that we use relative paths rather than full paths.
            This solution makes it easier to copy to an other named folder or to a computer with an other username without editing the gamelist.xml.
            So my question is, could you incorporate that solution ?

            We also set the images/videos in the directory media/emulationstation/ .
            Which emphases that the media is used by emulationstation.
            So when running emulationstation only 1 media folder is seen instead of images/marquees/videos.
            I would like you to think about this too.
            For your folder it would mean :
            media/emulationstation/images
            media/emulationstation/marquees
            media/emulationstation/videos

            If you see something in both suggestions then we apply somewhat the same "standard" to the gamelists.
            With this we could somehow join some forces.

            What do you think ?
            Let me know.

            kiroK 1 Reply Last reply Reply Quote 0
            • kiroK
              kiro @Folly
              last edited by

              @Folly Hi, Sure it makes sense! I'm away this weekend but will put this as new features in the upcoming versions. I just need some time to understand exactly what's done.

              I believe that we could actually have the same gamelists created from my scraper, without the need to actually have 'predefined' gamelists.

              I'll definitely have a look.

              By the way, my scraper takes into accounts arcade systems such as 'Capcom classics', 'konami classics' and so forth, this is an example of my es_system for some of these classics:

                <system>
                  <name>atariclassics</name>
                  <extension> .7z .cue .fba .iso .zip .7Z .CUE .FBA .ISO .ZIP</extension>
                  <platform>arcade</platform>
                  <theme>arcadeatariclassics</theme>
                  <command>/opt/retropie/supplementary/runcommand/runcommand.sh 0 _SYS_ arcade %ROM%</command>
                  <path>../roms/atariclassics</path>
                  <fullname>Atari Classics</fullname>
                </system>
              

              The scraper will match the name tag 'atariclassics' in this example.

              F 1 Reply Last reply Reply Quote 0
              • F
                Folly @kiro
                last edited by

                @kiro

                Ok, nice to hear you agree.
                Would really be great if you could accomplish that.

                So if I can somehow add <platform>arcade</platform> to a category then it should be recognised right ?

                kiroK 1 Reply Last reply Reply Quote 0
                • kiroK
                  kiro @Folly
                  last edited by

                  @Folly You do not need to add anything like 'platform'.... my scraper recognizes the roms based on their checksum, not their names, directories or anything. In principle, you could put all roms into a single directory and it would recognize them as long as their checksum is in the database.
                  If the checksum is not in the database then the <name> tag will help the scraper try to figure out the game, but it is not mandatory.

                  To answer your question about the <platform>, I'm not sure why this tag is there, it is not taken into consideration by the scraper :-).

                  F 1 Reply Last reply Reply Quote 0
                  • F
                    Folly @kiro
                    last edited by Folly

                    @kiro

                    Somehow my categories are not recognised by your script so I can't select them in the systems menu.
                    This is an example of such category from the es_systems.cfg :

                    <system>
                      <name>shooter</name>
                      <fullname>shooter</fullname>
                      <path>/home/pi/RetroPie/roms/shooter</path>
                      <extension>.cmd .zip .7z  .CMD .ZIP .7Z </extension>
                      <command>/opt/retropie/supplementary/runcommand/runcommand.sh 0 _SYS_ shooter %ROM%</command>
                      <platform>shooter</platform>
                      <theme>shooter</theme>
                    </system>
                    

                    Indeed, changing it to <platform>arcade</platform> doesn't help also.

                    kiroK 1 Reply Last reply Reply Quote 0
                    • kiroK
                      kiro @Folly
                      last edited by

                      @Folly yes, it will not recognize such 'custom' categories...maybe I should think on way to imrpove this. As I said this should not matter if the checksums are found in the DB. Maybe I should think of an alternative here... Let me have a think this weekend

                      F 2 Replies Last reply Reply Quote 0
                      • F
                        Folly @kiro
                        last edited by Folly

                        @kiro

                        If you can do all of that it would definitely be an improvement ;-)

                        1 Reply Last reply Reply Quote 0
                        • F
                          Folly @kiro
                          last edited by Folly

                          @kiro

                          Hi,

                          Scanning on the arcade directory still gives me videos with the .png extension in the video directory, though some are good with .mp4.
                          I have .png files in the folders images and marquees, so that seems to be OK.

                          After this discovery I tried archimedes again.
                          This is good, no videos with .png extension.

                          So there still seems to be an issue with videos called .png in the videos directory for other systems.

                          kiroK 1 Reply Last reply Reply Quote 0
                          • kiroK
                            kiro @Folly
                            last edited by

                            @Folly thanks! The fact that there are still .PNG files in the video directory might be due to them still being there from previous scraps. Try using the "cleanmedia" option or --cleanmedia if running from command line. Let me know how it goes.

                            F 1 Reply Last reply Reply Quote 0
                            • F
                              Folly @kiro
                              last edited by Folly

                              @kiro

                              I removed all files manually.
                              Now I am also using your retroscraper-rpie which does the same.
                              I am seeing the files being added in the file-explorer.
                              So definitely an issue.

                              I also noticed that there are many videos that will not display, sound on the other hand is played though.
                              Could it be that some codecs are missing ?
                              I tried some files in VLC and gave me black screen but sound is there.

                              kiroK 1 Reply Last reply Reply Quote 0
                              • kiroK
                                kiro @Folly
                                last edited by

                                @Folly thanks, I'll have a look when I'm back home on Monday, I might need to revalidate the videos. Will also check about the PNG thing, definitely strange.

                                F 1 Reply Last reply Reply Quote 1
                                • F
                                  Folly @kiro
                                  last edited by

                                  @kiro

                                  If I can implement the setup somehow in my script then your retroscraper-rpie would probably be easy enough to add in my script.
                                  I will have to think about it some time.

                                  kiroK 1 Reply Last reply Reply Quote 0
                                  • kiroK
                                    kiro @Folly
                                    last edited by

                                    @Folly thanks! I actually do have a script which is running (installation goes fine as well as execution) based on scraper.sh script. My only concern right now is that output is buffered, so it seems (only seems) as if the scraper freezes.
                                    But thanks for the offer, definitely we can talk about doing something together.

                                    F 2 Replies Last reply Reply Quote 0
                                    • F
                                      Folly @kiro
                                      last edited by Folly

                                      @kiro

                                      I just did atari7800, although it also has the .png video issue, all videos and pictures are good.
                                      I must say if something is recognised then recognition seems to be perfect.
                                      I do not see that with other scraper methods.
                                      I also notice that the scraping is very fast, much faster that other scrape methods.
                                      Really nice !!!

                                      1 Reply Last reply Reply Quote 0
                                      • F
                                        Folly @kiro
                                        last edited by Folly

                                        @kiro

                                        For your information.
                                        I am testing your retroscraper-rpie now in my Debian11-x86_64 in VMWARE.
                                        Looks like the .png problem for videos doesn't occur there.
                                        Also the gamelist.xml contain the correct naming for the videos (.mp4)
                                        So this problem could be specific for the RPI/raspbian/raspbian version.

                                        ALL video seem to be playing correctly in emulationstation in my VM, although, for some videos, emulationstation seems to give errors like this :

                                        [src/libmpg123/layer3.c:2184] error: part2_3_length (1408) too large for available bit count (1368)
                                        

                                        Could it be that some videos are not that compatible or have some corruption ?

                                        kiroK 1 Reply Last reply Reply Quote 0
                                        • kiroK
                                          kiro @Folly
                                          last edited by

                                          @Folly hi, videos are being fixed as we speak, it will need some time. PNG issue should be independent from platform (after all is running the same code) let me know if you encounter this problem again.

                                          F 1 Reply Last reply Reply Quote 1
                                          • F
                                            Folly @kiro
                                            last edited by Folly

                                            @kiro

                                            I just scraped some on my RPI4 with both scrapers (normal and rpie)

                                            For archimedes all videos are named *.mp4, so for this system that's good and all videos seem to play now.
                                            However I discovered a new problem.
                                            The last 2 games that are recognised and for which media is downloaded are not added to the gamelist.xml.
                                            I checked older gamelists and they also miss these.

                                            For arcade(shooter category),arcade(pinball category) and atari7800 the .png problem still remains.
                                            Here I also checked If something was missing in atari7800.
                                            The last 4 games that are recognised and for which media is downloaded are not added to the gamelist.xml.
                                            For arcade(shooter category) the last 3 games were missing.
                                            For arcade(pinball category) the last 3 games were missing.

                                            Edit :
                                            For the system with videos named as .png for some videos only sound is available.

                                            kiroK 1 Reply Last reply Reply Quote 0
                                            • First post
                                              Last post

                                            Contributions to the project are always appreciated, so if you would like to support us with a donation you can do so here.

                                            Hosting provided by Mythic-Beasts. See the Hosting Information page for more information.