• Recent
  • Tags
  • Popular
  • Home
  • Docs
  • Register
  • Login
RetroPie forum home
  • Recent
  • Tags
  • Popular
  • Home
  • Docs
  • Register
  • Login

Versatile C++ game scraper: Skyscraper

Scheduled Pinned Locked Moved Ideas and Development
skyscraperscrapergamelist.xmlscrapinggithub
1.6k Posts 113 Posters 2.0m Views
Loading More Posts
  • Oldest to Newest
  • Newest to Oldest
  • Most Votes
Reply
  • Reply as topic
Log in to reply
This topic has been deleted. Only users with topic management privileges can see it.
  • M
    muldjord @paradadf
    last edited by 2 Sept 2017, 06:31

    @paradadf I would like to do that, but I must admit that it's a lot of work for something I don't need myself. So unless someone else implements it in a patch and sends it to me, it won't happen I'm afraid.

    When using 'screenscraper' Skyscraper always looks for the 'wor' or 'us' or whatever they are called. If it doesn't find those, it picks the next one in line as I recall.

    P 1 Reply Last reply 2 Sept 2017, 11:15 Reply Quote 0
    • P
      paradadf @muldjord
      last edited by 2 Sept 2017, 11:15

      @muldjord understood, thanks!

      1 Reply Last reply Reply Quote 0
      • M
        muldjord
        last edited by 2 Sept 2017, 21:30

        Hi guys, I am sad to inform that Skyscraper has been discontinued effective immediately. I have been contacted by sources about the nature of the scrapings themselves. For that reason I no longer wish to pursue this project as I have no intention of being an inconvenience to the websites or authors of the information collected by Skyscraper.

        Thank you for all of your feedback and support.

        1 Reply Last reply Reply Quote 0
        • A
          AnalogHero
          last edited by 3 Sept 2017, 05:29

          Sad to hear. I understand that some websites fear the traffic that your tool might produce, but isnt that the case with any other scraper? Why collecting data if not using it?

          Anyway, its your program and your decision ofcourse.

          1 Reply Last reply Reply Quote 0
          • L
            lilbud
            last edited by 3 Sept 2017, 05:45

            alt text

            Creator of the Radiocade: https://retropie.org.uk/forum/topic/6077/radiocade

            Backlog: http://backloggery.com/lilbud

            1 Reply Last reply Reply Quote 0
            • C
              chipsnblip
              last edited by 3 Sept 2017, 07:11

              that's really a bummer. the time and energy you must have invested in making this :(

              all metadata and cover art..there has to be a better way to store, manage, combine, distribute it.. i wonder if it's considered public domain, maybe the internet archive would host such a project

              1 Reply Last reply Reply Quote 0
              • A
                AnalogHero
                last edited by 3 Sept 2017, 11:14

                Maybe im wrong about the traffic. It could be a copyright thing aswell.

                I found this the best scraper around cause it works on pi and it can combine data from various sources. And the thing that it saves data local saves traffic if you need to rescrape.

                It seems @muldjord has enough and strong reasons to pull the plug. Anyway thanks for this great tool. (Would have been a nice addition for retropie-setup with a small gui like sselphs scraper). :(

                1 Reply Last reply Reply Quote 0
                • M
                  muldjord
                  last edited by 3 Sept 2017, 18:51

                  Stay tuned for news. Skyscraper might (MIGHT!) be online again soon'ish. But in a bit of a cut-down state I am afraid... More info when I get through all of the paperwork.

                  1 Reply Last reply Reply Quote 1
                  • J
                    jdrassa
                    last edited by 3 Sept 2017, 19:29

                    Great news. Regardless of if it comes back, hopefully you can share some details as to what exactly happened that cause you to pull it. At a minimum it would be useful information for the developers of other scrapers like @sselph so that they don't run into the same issues.

                    Get latest build of EmulationStation for Windows here

                    M 1 Reply Last reply 3 Sept 2017, 19:56 Reply Quote 0
                    • M
                      muldjord @jdrassa
                      last edited by muldjord 9 Mar 2017, 22:05 3 Sept 2017, 19:56

                      @jdrassa Over the course of the past few weeks, I've felt like I was walking around a minefield. Sources started contacting me with not so friendly mails to take out support for their sites and I just didn't want to deal with that sort of negativity in a project that's supposed to be fun and helpful. Hence the take-down. I'd like to point out that I completely understand why sites won't allow scraping! It can hit a database hard if overused.

                      Skyscraper won't be available again until I have some official permission to use each module. And unfortunately that also means that Skyscraper will be back in a very cut-down version... I completely understand this! But it also is pretty demotivating when all I wanted to do was to help people out.

                      I even implemented the local cache to try and make people reuse the data. But I still can't control how people use it! And I created the local importer so you could get data from your own source text files and image files and so on...

                      Bottom line: Skyscraper might be back with ONLY the sources I have official permission from. And even then, I need to just trust the users not to overdo the scrapings.

                      J 1 Reply Last reply 4 Sept 2017, 04:46 Reply Quote 0
                      • U
                        Used2BeRX
                        last edited by 4 Sept 2017, 04:17

                        That sucks man. I can understand why people want it blocked though, considering the effort put into making the websites and also the bandwidth. I'd imagine "scraping" emumovies would be immediately shut down if all of the movies were being pulled from a source outside of their own site where they can try to entice the user into a paid membership for the higher quality videos.

                        Like I said, the NES collection I'm working on is looking great, and I'd like to start making videos of my own that conform to a standard length with a title sequence at the end and all have the same volume one day. I'd also like to do this for all of the other major console systems out there.

                        If there was a way to do this when I've got everything put together that would be easy for users to get access to, I'm open to any suggestions.

                        I don't know how long it would ever take me to do this though. I've been working odd jobs to make ends meet the last few weeks to buy myself some more time, but this is an insane amount of work and I'm going to need a real job soon if I can't figure out some way of crowd funding the job.

                        Hopefully I can put out a full NES release that can be distributed to the public by the end of the year though.

                        1 Reply Last reply Reply Quote 0
                        • J
                          jdrassa @muldjord
                          last edited by 4 Sept 2017, 04:46

                          @muldjord Thanks for sharing. Hopefully you will be able to relaunch it in some form. I can understand concerns about load, but I feel like scraping is the whole purpose of many of these sites.

                          Get latest build of EmulationStation for Windows here

                          1 Reply Last reply Reply Quote 0
                          • S
                            screech
                            last edited by 4 Sept 2017, 12:24

                            As a https://www.screenscraper.fr administrator, and after discussion with the big boss ;) we grant you a second time an official authorisation to use our DB ^^

                            As we already say, till it's free and open source, you completely are in our philosophy, and you can use all you need from the API ^^

                            (And more, if you need help, don't hesitate to ask ^^)

                            M U 2 Replies Last reply 4 Sept 2017, 12:37 Reply Quote 7
                            • M
                              muldjord @screech
                              last edited by 4 Sept 2017, 12:37

                              @screech said in Versatile C++ game scraper: Skyscraper:

                              As a https://www.screenscraper.fr administrator, and after discussion with the big boss ;) we grant you a second time an official authorisation to use our DB ^^

                              As we already say, till it's free and open source, you completely are in our philosophy, and you can use all you need from the API ^^

                              (And more, if you need help, don't hesitate to ask ^^)

                              Dude, this is awesome! Thank you!!!

                              P 1 Reply Last reply 4 Sept 2017, 12:41 Reply Quote 0
                              • P
                                paradadf @muldjord
                                last edited by 4 Sept 2017, 12:41

                                @muldjord ScreenScraper is the best database anyway!

                                1 Reply Last reply Reply Quote 2
                                • M
                                  muldjord
                                  last edited by 4 Sept 2017, 19:30

                                  Currently awaiting reply from some of the other sources. I will keep you updated on the progress.

                                  1 Reply Last reply Reply Quote 0
                                  • U
                                    Used2BeRX @screech
                                    last edited by 4 Sept 2017, 21:06

                                    @screech said in Versatile C++ game scraper: Skyscraper:

                                    As a https://www.screenscraper.fr administrator, and after discussion with the big boss ;) we grant you a second time an official authorisation to use our DB ^^

                                    As we already say, till it's free and open source, you completely are in our philosophy, and you can use all you need from the API ^^

                                    (And more, if you need help, don't hesitate to ask ^^)

                                    That's pretty awesome of you guys to do that. Don't know why you'd do it, and I can't imagine that many other places are going to give it their blessing. :)

                                    1 Reply Last reply Reply Quote 0
                                    • S
                                      screech
                                      last edited by 4 Sept 2017, 22:41

                                      @Used2BeRX : The reason why screenscraper exist is simple ^^ we want to share our work on collecting media and data in every langage and country we can ;) and with the help of the community we have now a great DB open to everyone who want to scrape ;) (again till it's free and open sources).

                                      That's the power of open community project ;) great work by and for everyone thanks to all of you ;)
                                      And muldjord work is a part of this ;) so no reason to refuse his software, even more, it's a good reason to help him ;)

                                      If that's simples words can help other "DB owner" to share their works for this project it's great. I haven't connection with other Scraping DB (I know it's a shame) But if some of you have some, don't hesitate to send them a small message (with kindness ;) no aggressivity).

                                      (for your info, I just check some Stats, we got about 4 millions request a day (with an average loads of 20/30% of the server ressources). And Skyscraper generate only 1 or 2K... so don't worry about the overuse ;) )

                                      1 Reply Last reply Reply Quote 3
                                      • M
                                        muldjord
                                        last edited by 5 Sept 2017, 21:05

                                        Things are going rather well. I just got permission to use arcadedb aswell, and even got a full description of their API. Only downside is that I am forced to only using 1 thread when using it, but that is of course completely ok! It's better than not having it at all.

                                        So the list of permissions are growing and it feels good to do it the right way now. Looking forward to getting Skyscraper back up online once all permissions are settled and I have coded the new API connections.

                                        1 Reply Last reply Reply Quote 3
                                        • B
                                          BladeHunter
                                          last edited by BladeHunter 9 Jul 2017, 05:31 7 Sept 2017, 04:29

                                          From reading this thread, I can't believe how far you got by scraping and not using API's (Hat off to you for perseverance though :)).

                                          I use the Screenscraper API a lot and it's amazing (I use it to read against MD5 and CRC and SHA-1 then write back to the DB to help update missing info), you will have a much simpler time building your app when you are using all the API's from the sites. Most of them are really cool about giving out dev access too :).

                                          Most of the Screenscraper API doco is in French so drop me a line if you have any questions, I might be able to help answer them for you, it took me a while to work it out ;).

                                          If you parse in SS login details (The end user, not yours), the SS API will allow you to have as many threads as the user is entitled to. If the user makes a one off donation to SS they get something like 5 threads.

                                          M 1 Reply Last reply 7 Sept 2017, 09:57 Reply Quote 0
                                          154 out of 1594
                                          • First post
                                            154/1594
                                            Last post

                                          Contributions to the project are always appreciated, so if you would like to support us with a donation you can do so here.

                                          Hosting provided by Mythic-Beasts. See the Hosting Information page for more information.

                                            This community forum collects and processes your personal information.
                                            consent.not_received