• 0 Posts
  • 24 Comments
Joined 1 year ago
cake
Cake day: June 7th, 2023

help-circle


  • What’s the difference between one technology you don’t understand (AI engine-assisted ) and another you don’t understand (human-staffed radiology laboratory)?

    Regardless of whether you (as a patient hopelessly unskilled in diagnosis of any condition) trust the method, you probably have some level of faith in the provider who has selected it. And, while they most likely will choose what is most beneficial to them (cost of providing accurate diagnoses vs. cost of providing less accurate diagnoses), hopefully regulatory oversight and public influence will force them to use whichever is most effective, AI or not.




  • This would ideally become standardized among web servers with an option to easily block various automated aggregators.

    Regardless, all of us combined are a grain of rice compared to the real meat and potatoes AI trains on - social media, public image storage, copyrighted media, etc. All those sites with extensive privacy policies who are signing contracts to permit their content for training.

    Without laws (and I’m not sure I support anything in this regard yet), I do not see AI progress slowing. Clearly inbreeding AI models has a similar effect as in nature. Fortunately there is enough original digital content out there that this does not need to happen.




  • I want Ars content to be part of whatever training data is provided to the best models. How does that get done without appearing like they are being bought?

    Even if their contract explicitly states that it is a data sharing agreement only and the products of the media organization (articles/investigations) are not grounds for breach or retaliation, it is assumed that there is now some impartiality in future reporting.

    So, for all media companies, the options seem to be:

    1. Contribute to the greater good by openly permitting site scraping (for $0)
    2. Allow data sharing to contracted parties only (for a fee)
    3. Public or privately prohibit use of any data, and then seek damages down the road for theft/copyright infringement when the legal framework has been established.

    Is there a GPL or other license structure that permits data sharing for LLM training in a way that it does not get transformed into something evil?


  • I pay for Nebula and try to watch as much as I can there. The content is more “pleasant department store” and less “Mexican public market”.

    I do watch YouTube regularly when channel-surfing, but if I ever see an ad (which happens only on mobile devices), I close it immediately and do something else. It’s not that I don’t think I should be able to watch everything for $0, but YouTube ads are so jarring, random, irrelevant and just make me sick. They literally ruin whatever I was watching and make me sad to exist.

    It can be exhausting to wade through the absolute meat market of click bait titles and thumbnails to find something that not only looks interesting but won’t abuse me with infomercial-form audio/visuals.

    YouTube enables and promotes the “content creators” who abuse human psychology to accumulate views, likes, subscriptions, etc. The best thing that could happen is they continue to be exposed as the drug dealer they are.



  • I absolutely agree, but I have a sneaking but unfounded suspicion that many decision makers don’t want to prove out this theory.

    WFH during the pandemic already triggered a panic from those whose income depends on the status quo of urban commute. To them, demonstrating we don’t need offices OR personal automobiles is a dangerous experiment to conduct in one of the largest metro areas in the world.

    My god, what if it works? What would we do with all this pavement and gasoline?!



  • Yes, this is totally possible and I did it for a couple of years with OPNsense. I actually had an OPNsense box and a pfSense box both on Hyper-V. I could toggle between them easily and it worked well. There are CPU considerations which depend on your traffic load. Security is not an issue as long as you have the network interface assignments correct and have not accidentally attached the WAN interface to any other guest VM’s.

    Unfortunately, when I upgraded to 1Gb/s (now 2Gb/s) on the WAN, the VM could not keep up. No amount of tuning in the Hyper-V host (dual Xeon 3GHz) or the VM could resolve the poor throughput. I assume it came down to the 10Gb NICs and their drivers, or the Hyper-V virtual switch subsystem. Depending on what hardware offload and other tuning settings I tried, I would get perfect throughput one way, but terrible performance in the other direction, or some compromise in between on either side. There was a lot of iperf3 testing involved. I don’t blame OPNsense/pfSense – these issues impacted any 10Gb links attached to VM’s.

    Ultimately, I eliminated the virtual router and ended up where you are, with a baremetal pfSense on a much less powerful device (Intel Atom-based). I’m still not happy with it – getting a full 2Gb/s up and down is hard.

    Aside from performance, one of the other reasons for moving the firewall back to a dedicated unit was that I wanted to isolate it from any issues that might impact the host. The firewall is such a core component of my network, and I didn’t like it going offline when I needed to reboot the server.



  • This is part of the problem with using terms like “homeless” to describe the occupants of an illegal campsite. There are numerous reasons one may choose to camp in a public space.

    • Some are truly struggling to regain their financial footing and either the assistance programs are not helpful or they are unable to utilize them.
    • Some are sick which causes them to be unable to participate functionally in society, and they have “fallen through the cracks” of services designed to support them.
    • Some reject housing in favor of a lifestyle that demands less effort or accountability – possibly in service of addiction, which ties into #2 above.

    All members of society should have access to shelter (or a safe campsite, if that is our preference) and our basic needs met. As members of society, we shall follow laws which describe, for very good reasons, why we cannot simply erect a camping tent in a city park.

    The problem with ignoring campsites is plummeting hygiene and safety. Waste is generated by day to day life and must be collected or eliminated. As campers accumulate and abandon the implements of a semi-permanent hovel: furniture, bedding, tarps, etc., the surrounding area transforms into a dumping site.

    The technology described in the article already identifies potholes and illegal parking. It does not identify people or their race. Surely it could evolve into something with more potential for abuse, but in its current capacity, it is quite a neutral tool.

    We have collected a lot of data on the “ignore and do nothing” solution – the outcome is a scientific certainty. Using tools like this to measure progress (for better or worse) seems like something that would help generate support for other solutions, such as extensive expansion of low-cost/no-cost housing services.