RetroPie forum home
    • Recent
    • Tags
    • Popular
    • Home
    • Docs
    • Register
    • Login

    [SOFT] New Scraper in the works

    Scheduled Pinned Locked Moved Projects and Themes
    scrapingscrappersoftware
    253 Posts 7 Posters 63.6k Views
    Loading More Posts
    • Oldest to Newest
    • Newest to Oldest
    • Most Votes
    Reply
    • Reply as topic
    Log in to reply
    This topic has been deleted. Only users with topic management privileges can see it.
    • F
      Folly @kiro
      last edited by

      @kiro

      I had a good look at where tigerrz belongs to.
      Turns out these are MESS drivers so theoretically the do not actually belong to the arcade section.
      They should belong to the arcadia section.
      I that something you can change ?

      kiroK 1 Reply Last reply Reply Quote 0
      • kiroK
        kiro @Folly
        last edited by kiro

        @Folly Hmm I could... but I'm not sure it will have an impact, namely because I have no system 'arcadia' in the db :-) I could create new systems which parent as 'arcade' and then it should be solved... I guess this need some thinking...

        Correction, arcadia belongs to the system 'Emerson Radio Arcadia 2001' not entirely sure if this is correct or not...

        F 1 Reply Last reply Reply Quote 0
        • F
          Folly @kiro
          last edited by

          @kiro

          Mmmm, Yes difficult.
          Indeed, my fault, arcadia is not "non-arcade" but emerson arcadia 2001.
          Indeed, it needs some thinking.

          This is my "database" with all mame drivers (mame(arcade)/mess(non-arcade) :
          https://github.com/FollyMaddy/RetroPie-Share/blob/main/00-databases-00/mame/mame0249_systems_sorted_info
          Could be something to look up things.

          I tried now at home putting all in arcade roms directory.

          python3 /opt/retropie/supplementary/retroscraper/retroscraper.py --mediadir media/emulationstation --systems arcade
          Starting scraping
          Scanning Files
          Doing System arcade
          Scanning Directory
          Found 3 roms for the system
          This is an arcade system
          --it waits a long time here--
          System: arcade | Game : rzindy500.zip
          System: arcade | Game : rzbatfor.zip
          System: arcade | Game : rztoshden.zip
          Writing Gamelist
          Scan Done!
          SCRAPPING ENDED --- Thank you for using retroscraper!!
          

          Sadly still no media and info in gamelist.

          kiroK 1 Reply Last reply Reply Quote 0
          • kiroK
            kiro @Folly
            last edited by

            @Folly There's a nightly batch that will run and create the linking this evening (at midnight)

            I see that retroscraper has detected the rom though (i see SHA for rzindy as 22ADC5B5AE9A26FA11697984293BBEEFF3DE69DD67), strange that the scraper has not added anything to the gamelist...

            Let's see tomorrow how it goes, and if the .zip files are embedded into the same game as the 7z files..

            F 1 Reply Last reply Reply Quote 0
            • F
              Folly @kiro
              last edited by Folly

              @kiro

              Good idea.
              Curious if that will fix it.

              Just did a small test with galaga only in arcade and it works perfectly.
              So overall there seems to be no issue with other things.

              Have a good evening.

              kiroK 1 Reply Last reply Reply Quote 0
              • kiroK
                kiro @Folly
                last edited by

                @Folly cool! There are plus 800k roms in the DB and still a lot missing probably, that's why the more the scraper is used the better it will become, hopefully!

                F 1 Reply Last reply Reply Quote 0
                • F
                  Folly @kiro
                  last edited by Folly

                  @kiro

                  Just tried tigerrz on multiple computers, sadly no recognition at all.
                  Before you added the media the game names were detected so then there was recognition to some degree, only without media.

                  I also tried an all scrape from your module-script.
                  9 retropie files are recognised faulty as "World Champion Cards"
                  system : retropie | game : World Champion Cards
                  Seems to be an issue.

                  Should I make a debug log for you ?

                  kiroK 1 Reply Last reply Reply Quote 0
                  • kiroK
                    kiro @Folly
                    last edited by kiro

                    @Folly Hi, the auto script didn't insert the roms as expected, I'm looking at it right now (I saw this first thing after waking up :-)) Should be ok now --edit-- the autoscript will look for at least 3 scrappings of the same rom by different people before actually inserting it into the DB, to avoid false positives (someone putting a wrong file in their rom directories). In this case, as I know it was you, I've forced the insertion.

                    Now, there is an issue with very small files, where the checksums give always strange behavior, in principle when there was a check that the system existed, it would skip retropie, since now it has been disabled to accommodate for any type of system names, it scrapes 'retropie too', I'll adapt the script so it does not, at the same time, you can remove the gamelist.xml created in that folder. Updated version in Git.

                    F 1 Reply Last reply Reply Quote 1
                    • F
                      Folly @kiro
                      last edited by

                      @kiro

                      Updated, and its working ;-)
                      Great.

                      Is it possible to add the release date info and genre too in the future ?

                      kiroK 1 Reply Last reply Reply Quote 0
                      • kiroK
                        kiro @Folly
                        last edited by

                        @Folly great! If I have the info it should be present, although I'm. Although I'm not taking it from your gamelists for the moment it should be easy.

                        DTEAMD 1 Reply Last reply Reply Quote 0
                        • DTEAMD
                          DTEAM @kiro
                          last edited by

                          @kiro @Folly

                          Sometime, site like screenscraper recognize roms from "No Intro " romset but not from the MAME romset as ColecoVision or Videopac. Are you working on a solution for that?

                          kiroK 1 Reply Last reply Reply Quote 0
                          • kiroK
                            kiro @DTEAM
                            last edited by

                            @DTEAM retrosctaper recognizes the ROMs based on a checksum, not in names which in my opinion is a nonsense. So even if the names are whatever, your rom, as long as the checksum is in the DB, should be recognized.

                            F 1 Reply Last reply Reply Quote 1
                            • F
                              Folly @kiro
                              last edited by Folly

                              @kiro

                              I did try to scrape all_in1 but the images are "empty".
                              Or do we have to wait until others try it too ?
                              Any news about the videos ?

                              kiroK 1 Reply Last reply Reply Quote 0
                              • kiroK
                                kiro @Folly
                                last edited by

                                @Folly hey sorry, I'm away from home at the moment, I'll be back on Wednesday and check by then! Sorry!

                                F 1 Reply Last reply Reply Quote 0
                                • F
                                  Folly @kiro
                                  last edited by

                                  @kiro

                                  Aha, no problem.
                                  Have good days, see you later ;-)

                                  kiroK 1 Reply Last reply Reply Quote 0
                                  • kiroK
                                    kiro @Folly
                                    last edited by

                                    @Folly I've just tried again the rztiger and it does work for me, the only difference is that I've got the .7z roms, can you point me (to my private email to avoid issues) to the .zip version of these roms??

                                    c52c2949-f38e-4e43-be47-1fb905d65e7c-image.png

                                    and the gamelist:

                                    <?xml version='1.0' encoding='utf-8'?><gameList>
                                    	<game>
                                    		<rating></rating>
                                    		<name>R-Zone: Batman Forever</name>
                                    		<marquee>/home/pi/RetroPie/roms/tigerrz/marquees/rzbatfor-marquee.png</marquee>
                                    		<image>/home/pi/RetroPie/roms/tigerrz/images/rzbatfor-image.png</image>
                                    		<publisher>Unknown</publisher>
                                    		<releasedate></releasedate>
                                    		<players></players>
                                    		<video>/home/pi/RetroPie/roms/tigerrz/videos/rzbatfor-video.mp4</video>
                                    		<genre></genre>
                                    		<path>/home/pi/RetroPie/roms/tigerrz/rzbatfor.7z</path>
                                    		<developer>Unknown</developer>
                                    		<thumbnail/>
                                    		<desc>Batman Forever (handheld)</desc>
                                    		<playcount>0</playcount>
                                    		<lastplayed>0</lastplayed>
                                    	</game>
                                    	<game>
                                    		<rating></rating>
                                    		<name>R-Zone: Indy 500</name>
                                    		<marquee>/home/pi/RetroPie/roms/tigerrz/marquees/rzindy500-marquee.png</marquee>
                                    		<image>/home/pi/RetroPie/roms/tigerrz/images/rzindy500-image.png</image>
                                    		<publisher>Unknown</publisher>
                                    		<releasedate></releasedate>
                                    		<players></players>
                                    		<video>/home/pi/RetroPie/roms/tigerrz/videos/rzindy500-video.mp4</video>
                                    		<genre></genre>
                                    		<path>/home/pi/RetroPie/roms/tigerrz/rzindy500.7z</path>
                                    		<developer>Unknown</developer>
                                    		<thumbnail/>
                                    		<desc>Indy 500 (handheld)</desc>
                                    		<playcount>0</playcount>
                                    		<lastplayed>0</lastplayed>
                                    	</game>
                                    	<game>
                                    		<rating></rating>
                                    		<name>R-Zone: Battle Arena Toshinden</name>
                                    		<marquee>/home/pi/RetroPie/roms/tigerrz/marquees/rztoshden-marquee.png</marquee>
                                    		<image>/home/pi/RetroPie/roms/tigerrz/images/rztoshden-image.png</image>
                                    		<publisher>Unknown</publisher>
                                    		<releasedate></releasedate>
                                    		<players></players>
                                    		<video>/home/pi/RetroPie/roms/tigerrz/videos/rztoshden-video.mp4</video>
                                    		<genre></genre>
                                    		<path>/home/pi/RetroPie/roms/tigerrz/rztoshden.7z</path>
                                    		<developer>Unknown</developer>
                                    		<thumbnail/>
                                    		<desc>Battle Arena (handheld)</desc>
                                    		<playcount>0</playcount>
                                    		<lastplayed>0</lastplayed>
                                    	</game>
                                    </gameList>
                                    
                                    F 1 Reply Last reply Reply Quote 0
                                    • F
                                      Folly @kiro
                                      last edited by Folly

                                      @kiro

                                      Hi again.

                                      I tested .7z and .zip files and both worked now.
                                      I think the .7z files came from the mame-0.231-merged pack but this is not available anymore.

                                      You've got mail

                                      kiroK 1 Reply Last reply Reply Quote 0
                                      • kiroK
                                        kiro @Folly
                                        last edited by

                                        @Folly Cool, I'll test with the romset you mentioned.

                                        1 Reply Last reply Reply Quote 0
                                        • kiroK
                                          kiro
                                          last edited by

                                          I've uploaded the latest version, adding the --sort flag, it works like this:

                                          --sort ORIGDIR DESTDIR
                                          

                                          Where origdir is the place where all your roms are stored (they can all be mixed and in one single directoryt) and it will identify them and copy (not move) them over to your destdir/SYSTEM folder, where SYSTEM is the name of the system the roms belong to.

                                          Have fun!

                                          F 1 Reply Last reply Reply Quote 0
                                          • F
                                            Folly @kiro
                                            last edited by

                                            @kiro

                                            I am experimenting with running retroscraper "online".

                                            The advantage would be that the script is always up to date and that the basic script doesn't need to be stored anywhere.

                                            I want to look if this is a good way of adding it to my script.
                                            That way I only have to install the regular depends.

                                            I already use a python script called gdrivedl in my module-script.
                                            This is an example of running gdrivedl.py online using only the help option :

                                            curl https://raw.githubusercontent.com/matthuisman/gdrivedl/master/gdrivedl.py | python3 - -h
                                            

                                            So I wanted to see if I could do the same with your "retroscraper-rpie".
                                            Basically, with the same help option and it actually works as expected :

                                            curl https://raw.githubusercontent.com/zayamatias/retroscraper-rpie/main/retroscraper.py|python3 - -h
                                            

                                            However your script uses more dependant scripts.
                                            So when it comes to the actual scraping, it will miss these dependant scripts. (apicalls, checksums, scrapfunctions)

                                            So I have a question :
                                            Is there a way to make one script including these dependant scripts so it can be run online ?

                                            kiroK 1 Reply Last reply Reply Quote 0
                                            • First post
                                              Last post

                                            Contributions to the project are always appreciated, so if you would like to support us with a donation you can do so here.

                                            Hosting provided by Mythic-Beasts. See the Hosting Information page for more information.