So there are multiple sites&groups that pirate video games especially on PC. I was wondering if there are places on the internet where you find source code for games especially the highly modifiable ones like Half Life 2/Portal and Skyrim. Or groups that crack into the source code of games (or even software in general), not only for PC maybe PS, XBox or mobile too, and share it. I just wanted to see some code samples of games or their engines, maybe I get hooked into video game design. Shout out to Valve for sharing a lot about the creation of Half Life 1

  • X3I@lemmy.x3i.tech
    link
    fedilink
    English
    arrow-up
    77
    ·
    edit-2
    1 year ago

    Well, source code is not sth that you “crack”, you can only reverse engineer it (I think it was done with Doom, also OpenRA) or steal it from the company’s servers. The use for it is also rather niche, so the risk vs gains ratio is not attractive enough to feed dedicated websites. You can also look at fully open source games like 0AD and check out what they did!

    Edit: I stand corrected (thrice); Doom was indeed open-sourced, not reverse engineered. Thanks for pointing out!

    • CaptainBasculin@lemmy.ml
      link
      fedilink
      English
      arrow-up
      41
      ·
      1 year ago

      Doom was open sourced later. An example of a game that got reverse engineered just fine is Super Mario 64

      • datavoid@lemmy.ml
        link
        fedilink
        English
        arrow-up
        13
        arrow-down
        1
        ·
        1 year ago

        Or the time the people who made doom reverse engineered super mario 3 for PC, then turned it into commander keen after Nintendo was unimpressed.

        • scutiger@lemmy.world
          link
          fedilink
          English
          arrow-up
          1
          ·
          1 year ago

          It wasn’t quite reverse engineered. They found a hacky way to bypass hardware limitations and basically duplicated the game.

          • Goku@lemmy.world
            link
            fedilink
            English
            arrow-up
            1
            ·
            1 year ago

            Yeah but that was more of a “if we can’t beat 'em, join 'em” thing and I believe some, if not most of the modders don’t even use the official mappings and prefer the cracked version

      • Excrubulent@slrpnk.net
        link
        fedilink
        English
        arrow-up
        16
        ·
        edit-2
        1 year ago

        I was once at a talk by someone in that company and he straight up said that open sourcing it was a mistake. I assume because that meant they couldn’t sell us a thousand versions of it like Skyrim.

        No word of whether its ongoing popularity was at all caused by open sourcing it.

        • gaylord_fartmaster@lemmy.world
          link
          fedilink
          English
          arrow-up
          21
          ·
          1 year ago

          There’s no way going open source has done anything but help Doom. I guarantee they’ve made more money from people buying their old games for the WADs to play with source ports and mods than they’ve lost money to things like Freedoom.

          • gaylord_fartmaster@lemmy.world
            link
            fedilink
            English
            arrow-up
            4
            ·
            1 year ago

            The game and the engine are both open source. The game’s assets just aren’t freely available, so you still need an official WAD or an asset replacement pack like Freedoom.

      • Draconic NEO@lemmy.dbzer0.com
        link
        fedilink
        English
        arrow-up
        4
        ·
        1 year ago

        Only if they wrote why in it though, plenty of people (unfortunately myself included) fail or forget to add meaningful comments or they let their comments go stale when making changes by forgetting to update them (I do it a lot too), and some people also use horrible function names that don’t make any sense.

        So it only really applies to source code intended to be released where care was made to ensure it would be readable, it might not apply for source code never intended to be public, such as stolen, leaked, or posthumously released. In this case the only real benefit is that it can be recompiled on different architectures provided there isn’t a dependency issue preventing that.

        • lud@lemm.ee
          link
          fedilink
          English
          arrow-up
          1
          ·
          1 year ago

          Source code without comments is still way easier to read than machine code.

          And there are very likely comments anyways.

          Multiple Devs will work on the same code for years and of course they need or at least appreciate comments.

    • Draconic NEO@lemmy.dbzer0.com
      link
      fedilink
      English
      arrow-up
      2
      ·
      1 year ago

      But even if you can intuitively understand what the machine code is you’ll still need to convert it by hand back into something more portable, or to the machine code of another platform you might want to run it on. There’s not really an easy automated way to do that, even when playing dirty.

      That’s the hard part, getting it into a different form, such as from x86 to ARM or from 6502 RISC to x86.

  • Sylvartas@lemmy.world
    link
    fedilink
    English
    arrow-up
    16
    arrow-down
    2
    ·
    edit-2
    1 year ago

    I mean, if you want to see some games’ source code you don’t have to rely on piracy. As other people have already said, there are open source games, some developers of older games have officially released the source code (notably VVVVVV, doom, and also quake iirc), some devs have released important part of their source code (e g the entire inputs handling code of Celeste).

    Additionally, the vast majority of all Unreal Engine games’ engine code, including huge AAAs like Fortnite, is in Unreal Engine (duh), which is open source source-available.

        • PeachMan@lemmy.world
          link
          fedilink
          English
          arrow-up
          8
          arrow-down
          1
          ·
          1 year ago

          Right, but it’s best to call that “source available” so the pedants don’t crucify you 🙄

        • Draconic NEO@lemmy.dbzer0.com
          link
          fedilink
          English
          arrow-up
          2
          ·
          1 year ago

          Yes there is indeed a difference, but for us it makes little to no difference. In the end what matters is that we have it without having to reverse engineer it, which is a slow and laborious process (even when you do it with dirty methods).

      • Sylvartas@lemmy.world
        link
        fedilink
        English
        arrow-up
        6
        arrow-down
        1
        ·
        edit-2
        1 year ago

        I knew I was gonna get this answer but still couldn’t be bothered to check the correct term so that’s on me.

        I think you’re technically right because the EULA specifies that you basically can’t use that code (or a modified version) outside of a licensed UE project, but outside of that it basically is. All the code can be read, the engine and/or its editor and all related tools can be compiled from the source, and you can make pull requests on the official repo.

        IIRC it is not actually open source because you can’t modify and/or repackage it without epic having their say in it (I think one of the licenses tiers is basically you agreeing to pay upfront + royalties for the authorization to modify the engine’s code and ship the packaged version with the project)

  • BotCheese@beehaw.org
    link
    fedilink
    English
    arrow-up
    7
    arrow-down
    1
    ·
    1 year ago

    From what I understand: During the compilation of the code it becomes unreadable to humans, needing to be reverse engineered, which is entails insane amounts of work. So, unless there is a leak or the game isn’t fully compiled like (I think) Unity games, it will be unlikely to find source code.

    • NuXCOM_90Percent@lemmy.zip
      link
      fedilink
      English
      arrow-up
      9
      arrow-down
      1
      ·
      1 year ago

      Yes and no.

      VERY oversimplifying but:

      The basic flow of compilation is (human readable-ish) source code becomes symbols which the compiler then applies optimizations to before converting into machine code.

      With no optimizations, it is a pretty trivial process to revert back. You lose the names of variables and functions (unless they were added as annotations by the compiler, which they often are for many reasons) but you get the logic.

      With optimizations, things get messier. An example that is pretty popular and useful to understand this is https://en.wikipedia.org/wiki/Fast_inverse_square_root The coder likely wrote invsqrt(foo), expecting this to map to a series of hardware functions in the floating point unit. Instead, they see that it became a series of integer operations and bit shifts and get very very confused. Its like that, except imagine an entire 5000 line file becomes two instructions.

      That said, compilers are stupid and most people (rightfully) use libraries to a large portion. So the above example would actually not happen because they would just see the instructions to do a function call labeled “fast_invsqrt” and not care. Similarly, basic heuristics (or, if you are trying to get funding, machine learning and neural nets) can be applied to figure out what was done in heavily optimized blocks of code. So the resulting “source code” might be something like

      int foo(int bar)
      {
        int a = 4;
        // Loop to do some fancy ass math shenanigans to a
        return a + 5;
      }
      

      Which results in more or less human readable code again. Also, a lot of compilers will actually embed basically comments in the assembly/binary unless someone remembers to actively disable that. And nobody ever does.

      Because source code really doesn’t matter all that much. If anyone ever did a deep dive of the Source Engine and Unreal Engine (of the day. I think that was 3?)… they were actually a lot more similar than not. Yeah, there was a lot of infrastructure and even some fundamental differences (been a minute, but I want to say Source is still additive because of its quake origins whereas UE3 was still subtractive geometry?). But… function influences form and they have most of the same functionality.

      The issue is more, like with DRM, those “week one sales” as it were. Data is sparse for a lot of reasons, but Securom for Mass Effect 1 PC is generally treated as a massive win because warez groups could not properly crack it for a week or so which led to a lot of pc players just buying the game because we needed hot blue lady action NOW. Same with the reason so many games still use denuvo.

      And same with engine and game features. Unity and Unreal basically were in an arms race for years (and Cryengine basically sucked on the barrel of their own gun) where one would add one feature and the other would add the same one a few months later. Having access to the source code potentially means someone can accelerate that a lot…

      Except they won’t. Because this shit is cancer. Any dev who does anything even remotely similar to how Wolverine jacks off to his friend’s wife in a game is opening themselves up to a LOT of lawsuits and investigations. Its why almost every emulator group will go scorched earth on anyone who even acknowledges looking at the nintendo leaks. And it is why someone quitting their job at Microsoft to go work at Google and offer to provide a Teams feature gets blacklisted and reported immediately. Companies don’t fuck around with that.

      Which leads to the real reason these are so detrimental to gaming and software. People… put questionable comments in their code all the time. Sometimes it is ///TODO: Implement this in Q2 2024 for release 1.5. Sometimes it is //Hey, Fuck you Fred. I finally did this. Stop your fucking bitching and stop whining to John that you couldn't do anything until I did this for you. And sometimes you get bullshit like Bungie (?) using racial slurs as codenames for a lot of the skins and the like…

      And it also makes it a lot easier to make a cracked binary if you know what the code was before the DRM was applied.

      So decompiling code is actually a lot more viable than you would expect. But also entirely pointless.

    • X3I@lemmy.x3i.tech
      link
      fedilink
      English
      arrow-up
      3
      ·
      1 year ago

      Correct. However, this looks like a direction that language processing and GPTs would excel in and to my knowledge, there already have been some ML addons made for certain RE tools, so this might become more easy in the future. If that is necessarily a good thing is a different story…