• 0 Posts
  • 691 Comments
Joined 1 year ago
cake
Cake day: June 20th, 2023

help-circle


  • One of the first things they teach you in Experimental Physics is that you can’t derive a curve from just 2 data points.

    You can just as easilly fit an exponential growth curve to 2 points like that one 20% above the other, as you can a a sinusoidal curve, a linear one, an inverse square curve (that actually grows to a peak and then eventually goes down again) and any of the many curves were growth has ever diminishing returns and can’t go beyond a certain point (literally “with a limit”)

    I think the point that many are making is that LLM growth in precision is the latter kind of curve: growing but ever slower and tending to a limit which is much less than 100%. It might even be like more like the inverse square one (in that it might actually go down) if the output of LLM models ends up poluting the training sets of the models, which is a real risk.

    You showing that there was some growth between two versions of GPT (so, 2 data points, a before and an after) doesn’t disprove this hypotesis. I doesn’t prove it either: as I said, 2 data points aren’t enough to derive a curve.

    If you do look at the past growth of precision for LLMs, whilst improvement is still happening, the rate of improvement has been going down, which does support the idea that there is a limit to how good they can get.




  • Aceticon@lemmy.worldtoGreentext@sh.itjust.worksAnon starts asking questions
    link
    fedilink
    arrow-up
    28
    arrow-down
    3
    ·
    edit-2
    12 hours ago

    Same principle as a gyroscope: a turning wheel will tend to stay perpendicular or parallel to the direction of the gravity vector because if it starts tilting away from such orientation there’s a force that pushes it back.

    Also works better with bigger wheels (if I remember it correctly the effect is related to spinning momentum).

    I was pretty surprised when learning Physics and they show us how to derive the formula for that (which I totally forgot since that was over 3 decades ago).

    Edit: Actually the gyroscopic effetc is just a part of it. See this article



  • Just because they share the same religion doesn’t mean they’re the same kind of people.

    That bunch in Israel have even accused a Holocaust Survivor of being an anti-semite for criticizing Israel, and most modern Israelis did not came from Western Europe. Most of them they share no other characteristic with the victims of the Holocaust than religious affiliation, certainly not being against racism or having other modern humanist values.


  • Aceticon@lemmy.worldtoA Boring Dystopia@lemmy.world14 pages of dead babies
    link
    fedilink
    arrow-up
    9
    arrow-down
    3
    ·
    edit-2
    13 hours ago

    “They made me do it” has been the main axis of Israeli Propaganda since the start and that’s just a variant of that.

    How about this alternative explanation: they’re stealing Palestinian land, want to ultimatelly steal all Palestinian land, are led by Sociopaths and Psychopaths and have an extremely racist society anchored on the kind of ethnic superiority ideas that would make Klanners blush, so they were always going to do something like this sooner or later to get rid of the rest of Palestinians and get the rest of their land and do it in the most inhuman ways because a large part, maybe even most, of Israeli society see themselves as “the chosen people”, a superior ethnicity and what the previous famous group of ethno-Fascists would call übermenschen, whilst they see Palestinians as lesser people, “human animals”, untermenschen.

    This is the kind of mass murdering Western nations used to do back in the days of Colonialism. It’s only shocking for us nowadays because we’ve evolved as societies and adopted Humanist values (though by their support of Israel you can see that many politicians in several countries and even a large fraction of people have in fact not evolved). Israel does in fact have Western Values, it’s just they’re the White Colonialist Values that many European nations had back in the 19th Century, not 21st Century Western Values.


  • I see. That does change the idea I had about things a bit.

    It’s been a while since I was last hiring.

    I wasn’t aware that the problem nowadays in the West (or at least the US) was an excess of people who don’t really have a natural skill for it choosing software development as a career.

    That kind of thing was one of the main problems with outsourcing to India maybe a decade ago: the profession was comparatively very well paid for the country so it attracted far too many people without the right skills resulting in a really low average quality of the programmers there - India had really good programmers just like everywhere else but then had a ton of people also working as programmers who should never had gone into it, so the experience of those having to deal with outsourced programming in India usually was pretty bad (I remotelly was a technical lead for a small outsourced team in India from London, and they were really bad whilst, curiously, the good programmers from the Indian Subcontinent I worked with had emigrated from there and were working in London and New York).


  • I think it’s even worse than just the bar for competency going up: even for a coding wizard going into the career, it’s a lot harder to squeeze through the bottleneck which is getting an entry level position nowadays unless they have some public proof out on the Net of how good they’re at coding (say, commits in open source projects, your own public projects, or even Youtube videos about it).

    This is something that will negativelly impact perfectly capable young developers who have an introvert personality type (which are most of them in my experience, even in domains such as Hacking) since some of the upsides of Introversion are a greater capacity for really focusing on on things and for detailed analysis - both things that make for the best programmers - and self publicising isn’t a part of the required skillset for good developers (though sooner or later the best ones will have to learn some “image management” if they end up in the Corporate world)

    I’m a bit torn on this since on one side salesmanship being more of a criteria determining one’s chances of getting a break at the start of one’s career as a developer is bad news (good coding and good salesmanship tend to be inverselly correlated) but on the other side a junior developer with some experience actually working with other people on real projects with real users (because they contributed to existing open source projects) has already started learning what we have to teach fresh-out-of-Uni developers to make them professionals.



  • At some point in my career I’ve actually designed mission critical high performance distributed server systems for a living, so I’m well aware of that.

    You can still pack thousands of users per server and have very low latency as long as you use the right architecture for it (it’s mainly done with in-memory caching and load balancing) when you’re accessing gigantic datasets which far exceed the data space of a game where the actual shared data space is miniscule since all clients share a local copy of most of the dataspace - i.e. the game level they’re playing in - and even with the most insane anti-cheat logic that checks every piece of data coming in from the user side against a server-side copy of the “game level data space” it’s still but a fraction of the shared data space in equivalent situations in the corporate world, plus it tends to be easilly partitionable data (i.e. even in MMORG with a single fully open massive playing space, players only affect limited areas of the entire game space so you don’t really need to check the actions of a player against the data of all other players).

    Also keep in mind that all the static (never changing or slow changing stuff) like achievements or immutable level configuration can still be served with “normal” latencies.

    Further the kind LVL1 ISP that provides network access for companies like Sony servicing millions of users already has more than good enough latency in their normal service and hence Sony needs not pay extra for “low latency”.

    Anyways, you do make a good and valid point, it’s just that IMHO that’s the kind of thing that pushes the running costs per-player-month from one dollar cents or less to, at most (and this is likely quite a large overestimation), a dollar per-player-month unless they only have tens of players per-server (which would be insane and they should fire their systems designers if that’s the case).


  • After over 3 decades as a gamer and tech user this is maybe the single most consistent important benefit for any open platform were you can just install Linux.

    The rest is nice but this one means that 10 or 20 years from now your hardware might have been repurposed for something else and still be useful and in use whilst a closed platform will just be more junk in a junkyard or sitting in a box of those things you’ve kept just because you don’t like to throw expensive stuff away but will in practice never use again.




  • I have an Orange PI Pro 5 16GB on a box that smoothly runs a full blown Ubuntu Desktop version and would fit in a pocket though it’s maybe a little too thick (from memory the box it’s about 3x5x2 cm).

    Total cost was about $170.

    The board itself would fit a thinner box, but you might have to 3D print one.

    Mind you, a N100 Mini-PC that costs the same is even more capable as a Linux Desktop, but it’s significantly larger and will definitely not fit a pocket.

    You can find cheaper SBCs capable of running a Desktop Ubuntu but in my experience (with a $35 Banana Pi P2-Zero) if you go too far down the price scale Desktop Linux performance stops being smooth, even if the board is a tiny thing.

    It was actually quite surprising for me recently when I found out some of these things are perfectly capable Linux Desktops.



  • In my experience working for almost 3 decades in software development, passive-agressive shit from upper management just causes the best people to leave (as they’re the ones who easilly find better jobs) leaving behind mainly a mix of the incompetent and those who never worked anywhere else (who are either already incompetent or will become so, as only ever having worked in just one company is far too narrow professional experience for anything beyond junior/mid level - you need to have seen more than one way of doing things to understand certain higher level concerns and choices in software development).


  • The one time some manager voiced such an idea, I very overtly in front of everybody offered to make “loop unrolling” software working at the source level (compilers already do it at the Assembly level in some cases for performance) for me and my colleagues to really boost that code line count (while totally screwing maintenability).

    Mind you, all devs in that meeting were loudly against measuring performance by code lines, but I like to think that suggestion of mine really hammered down the coup the grace on that “brilliant” idea.