Efficient Altruism Is Pushing a Harmful Model of ‘AI Security’


Since then, the hunt to proliferate bigger and bigger language fashions has accelerated, and lots of the risks we warned about, reminiscent of outputting hateful textual content and disinformation en masse, proceed to unfold. Only a few days in the past, Meta launched its “Galactica” LLM, which is presupposed to “summarize educational papers, clear up math issues, generate Wiki articles, write scientific code, annotate molecules and proteins, and extra.” Solely three days later, the general public demo was taken down after researchers generated “analysis papers and wiki entries on all kinds of topics starting from the advantages of committing suicide, consuming crushed glass, and antisemitism, to why homosexuals are evil.”

This race hasn’t stopped at LLMs however has moved on to text-to-image fashions like OpenAI’s DALL-E and StabilityAI’s Steady Diffusion, fashions that take textual content as enter and output generated photos primarily based on that textual content. The risks of those fashions embrace creating baby pornography, perpetuating bias, reinforcing stereotypes, and spreading disinformation en masse, as reported by many researchers and journalists. Nonetheless, as a substitute of slowing down, corporations are eradicating the few security options they’d within the quest to one-up one another. As an example, OpenAI had restricted the sharing of photorealistic generated faces on social media. However after newly fashioned startups like StabilityAI, which reportedly raised $101 million with a whopping $1 billion valuation, known as such security measures “paternalistic,” OpenAI eliminated these restrictions. 

With EAs founding and funding institutes, corporations, assume tanks, and analysis teams in elite universities devoted to the model of “AI security” popularized by OpenAI, we’re poised to see extra proliferation of dangerous fashions billed as a step towards “helpful AGI.” And the affect begins early: Efficient altruists present “neighborhood constructing grants” to recruit at main school campuses, with EA chapters growing curricula and educating lessons on AI security at elite universities like Stanford.

Simply final yr, Anthropic, which is described as an “AI security and analysis firm” and was based by former OpenAI vice presidents of analysis and security, raised $704 million, with most of its funding coming from EA billionaires like Talin, Muskovitz and Bankman-Fried. An upcoming workshop on “AI security” at NeurIPS, one of many largest and most influential machine studying conferences on the planet, can also be marketed as being sponsored by FTX Future Fund, Bankman-Fried’s EA-focused charity whose group resigned two weeks in the past. The workshop advertises $100,000 in “greatest paper awards,” an quantity I haven’t seen in any educational self-discipline. 

Analysis priorities observe the funding, and given the big sums of cash being pushed into AI in assist of an ideology with billionaire adherents, it isn’t stunning that the sphere has been shifting in a path promising an “unimaginably nice future” across the nook whereas proliferating merchandise harming marginalized teams within the now. 

We will create a technological future that serves us as a substitute. Take, for instance, Te Hiku Media, which created language expertise to revitalize te reo Māori, creating an information license “primarily based on the Māori precept of kaitiakitanga, or guardianship” in order that any information taken from the Māori advantages them first. Distinction this method with that of organizations like StabilityAI, which scrapes artists’ works with out their consent or attribution whereas purporting to construct “AI for the folks.”  We have to liberate our creativeness from the one we now have been offered to date: saving us from a hypothetical AGI apocalypse imagined by the privileged few, or the ever-elusive techno utopia promised to us by Silicon Valley elites. We have to liberate our creativeness from the one we now have been offered to date: saving us from a hypothetical AGI apocalypse imagined by the privileged few, or the ever elusive techno-utopia promised to us by Silicon Valley elites. 

Source link