This is an AI-enabled summary of an interview with cognitive psychologist and computer scientist Geoffrey Hinton. He’s played a big role in the development of computer neural networks and was the guest of Brook Silva-Braga on the CBS Saturday morning show. The YouTube video can be seen at the end of this summary. I added a couple of salient quotes that touch on the “alignment” problem. The art is by Bing’s Image Creator.
Hinton’s Role in AI History
Hinton discusses the current state of artificial intelligence and machine learning. He explains that his core interest is understanding how the brain works and that the current technique used in big models, backpropagation, is not what the human brain is doing. He also discusses the history of AI and neural nets, which he was a proponent of, and how neural nets have proven to be successful despite skepticism from mainstream AI researchers.
The video describes how ChatGPT has vast knowledge compared to a single person due to its ability to absorb large amounts of data over time. The model was first proposed in 1986 and was later able to surpass traditional speech recognition methods thanks to advancements in deep learning and pre-training techniques. Hinton’s background in psychology originally led him to neural networks, and his students’ research resulted in significant developments in speech recognition and object recognition systems.
The interview touches on various topics related to computer science and AI, such as the potential impact on people’s lives, the power consumption differences between biological and digital computers, and the use of AI technology in areas like Google search. Hinton also discusses the challenges of regulating the use of big language models and the need to ensure that AI is developed and used in a way that is beneficial to society (a need he doesn’t feel is being well met).
Silva-Braga: What do you think the chances are of AI just wiping out humanity? Can we put a number on that?
Hinton: It’s somewhere between 1 and 100 percent (laughs). Okay, I think it’s not inconceivable. That’s all I’ll say. I think if we’re sensible, we’ll try and develop it so that it doesn’t, but what worries me is the political situation we’re where it needs everybody to be sensible. There’s a massive political challenge it seems to me, and there’s a massive economic challenge in that you can have a whole lot of individuals who pursue the right course and yet the profit motive of corporations may not be as cautious as the individuals who work for them.
Hinton addresses the common criticism that large language models like GPT-3 are simply autocomplete models. He argues that these models need to understand what is being said to predict the next word accurately. In addition, they discuss the potential for computers to come up with their own ideas to improve themselves and the need for control. Hinton also addresses concerns about job displacement caused by these models, arguing that while jobs will change, people will still need to do the more creative tasks that these models cannot do.
Silva-Braga: Are we close to the computers coming up with their own ideas for improving themselves?
Hinton: Um, yes we might be
Silva-Braga: And then it could just go fast
Hinton: That’s an issue we have to think hard about, how to control that
Silva-Braga: Yeah, can we?
Hinton: We don’t know. We haven’t been there yet, but we can try.
Silva-Braga: Okay, that seems kind of concerning
Hinton: Um, yes
Overall, the interview provides insights into the current state and future of AI and machine learning, as well as the challenges and opportunities that come with their widespread use. It highlights the need for careful consideration and regulation to ensure that these technologies are developed and used in a way that benefits society.
To read a full transcript of the interview, go to the original YouTube page (click on the three horizontal dots and then select “Show transcript”).
It’s hard for someone who watched the gorgeous clusterfucks of Web 1.0 and 2.0 to get starry-eyed about Web3 (or Web 3.0, or whatever we’re calling it this week).
But a daily dose of cynicism is among the sundry bitter pills that older generations take with their morning coffee. You know, to stay regular.
So, I wanted to use this post as a reason to give the W3 champions the benefit of the doubt and educate myself better about the latest “new and improved” world wide reticulum.
Blockchains: Ledgers for Liberté!
Many of the folks espousing blockchain and cryptocurrency are enthusiastic to the point of mania, seeing the tech as pivotal to forging a brave new Web3 world. Most other people are, however, blockchain agnostic or just plain apathetic. It seems like too much trouble to figure out how the damned thing works. (Then throw NFTs into the mix and you have a whole new level of bafflement.)
So let’s indulge in some obligatory but necessarily incomplete descriptions before we continue.
WTF Is a Blockchain?
A blockchain is a glorified ledger. It records debits, credits, and closing balances. The magic word is “transactions.”
If you’re old enough to remember balancing a checkbook, then it’s a lot like that, except it’s digital. And somehow going to save the world.
So it’s a spreadsheet? Kind of. Maybe database is more accurate. The data are stored in virtual “blocks” that are virtually “chained” together. Thus, of course, the name.
Bored yet? Hang on. That chain thing? In theory, you can’t break or modify it. So, the database can’t be changed. Fraud is, therefore, tough, and you don’t need some trusted third party to vouch that everything is on the up and up. No traditional contracts and middlemen. In that sense, it’s decentralized. It’s all about the network, baby.
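If the "chain" part still feels hand-wavy, here is a minimal sketch of the idea in Python. It is a toy under loud assumptions, not how Bitcoin or any real blockchain is implemented: there is no network, no mining, and no consensus, and the names (`block_hash`, `add_block`, `is_valid`, `ledger`) are made up for illustration. All it shows is that each block stores a hash of the block before it, so quietly editing an old transaction breaks the chain.

```python
import hashlib
import json


def block_hash(index, transactions, prev_hash):
    """Hash a block's contents together with the previous block's hash."""
    payload = json.dumps(
        {"index": index, "transactions": transactions, "prev_hash": prev_hash},
        sort_keys=True,
    )
    return hashlib.sha256(payload.encode("utf-8")).hexdigest()


def add_block(chain, transactions):
    """Append a block that points back at the hash of the block before it."""
    prev_hash = chain[-1]["hash"] if chain else "0" * 64
    index = len(chain)
    block = {
        "index": index,
        "transactions": transactions,
        "prev_hash": prev_hash,
        "hash": block_hash(index, transactions, prev_hash),
    }
    chain.append(block)
    return block


def is_valid(chain):
    """Recompute every hash and check each link to the previous block."""
    for i, block in enumerate(chain):
        expected_prev = chain[i - 1]["hash"] if i > 0 else "0" * 64
        if block["prev_hash"] != expected_prev:
            return False
        recomputed = block_hash(
            block["index"], block["transactions"], block["prev_hash"]
        )
        if block["hash"] != recomputed:
            return False
    return True


ledger = []
add_block(ledger, [{"from": "alice", "to": "bob", "amount": 5}])
add_block(ledger, [{"from": "bob", "to": "carol", "amount": 2}])
print(is_valid(ledger))  # True

ledger[0]["transactions"][0]["amount"] = 500  # tamper with history
print(is_valid(ledger))  # False
```

In this toy, a cheater with access to the whole list could simply recompute every hash after the edit. Real systems make that impractical by spreading copies across many computers and making new blocks expensive to produce, which is where the "it's all about the network" part comes in.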
One common trope is that it’s tech forged by libertarian nerds who hate big government, big business and bureaucracies in all their nefarious forms. Therefore, we wind up with an amalgamation of something that pushes all their hot buttons: software plus finance plus ciphers plus decentralization plus implicit political ideology.
So, no, not a sexy look.
But make no mistake. Blockchain is not just for geeks. Not anymore. In fact, whole industries have bought into it. For example, energy companies use it to build peer-to-peer energy trading platforms so that homeowners with solar panels can sell their excess solar energy to neighbors.
Therefore, blockchain becomes solar chic.
Cryptocurrency Runs on Blockchains
Blockchains and cryptocurrencies aren’t synonymous, but they often go hand in hand. Cryptocurrencies are digital money that’s kept secure via cryptography so there’s no counterfeiting them. Most of these currencies are housed on decentralized systems where financial records are maintained and transactions are verified via blockchains.
Got it? Blockchains are the motors that make cryptocurrencies run.
A Very Short History of Cryptocurrencies
The first and best known cryptocurrency, Bitcoin (or BTC), is part of a longish history with its own mythology. The second most common and well known cryptocurrency is ether, or ETH, which is based on Ethereum technology. But these are just the big guns. Other currencies have been popping up like Mario mushrooms after a virtual rainfall. In fact, there are now more than 12,000 cryptocurrencies.
1991: Stuart Haber and W. Scott Stornetta introduce blockchain technology to time-stamped digital documents, making them “tamper-free”
2000: Stefan Konst publishes his theory of cryptographic secured chains
2004: Hal Finney introduces a digital cash system that keeps the ownership of tokens registered on a “trusted” server
2008: Mystery person Satoshi Nakamoto publishes the famed bitcoin white paper, which describes a “distributed blockchain” built on a peer-to-peer network of time stamping
2009: Nakamoto releases the first bitcoin software, and the network goes live
2014: Various industries start developing blockchain technologies that don’t include cryptocurrencies
2015: Ethereum Frontier Network is launched, and along come smart contracts and dApps (for decentralized applications)
2016: An attacker exploits a bug in the Ethereum DAO code; in a separate incident, the Bitfinex bitcoin exchange is hacked
2019: Amazon announces its Managed Blockchain service on AWS
2021: A study by Cambridge University determines that bitcoin uses more electricity than Argentina or the Netherlands. El Salvador becomes the first country to make bitcoin legal tender, requiring all businesses to accept the cryptocurrency.
2022: The University of Cambridge estimates that the two largest proof-of-work blockchains, bitcoin and ether, together use twice as much electricity in one year as the whole of Sweden. The Central African Republic is the second nation to make bitcoin legal tender.
Raise a Glass to the WWW 3.0
Okay, with all the crypto and blockchain out of the way, let’s get back to Web3.
(Oh, wait, I forgot NFTs, or non-fungible tokens, which are like one-of-a-kind digital objects that can be worth big money as collectables. These seem insane to me, which probably means they’ll play some pivotal economic role in the future).
These are the chief technologies and implied principles of Web3. As with the previous two iterations of the Web, the advocates for Web3 argue that they just want to make the world a better place (even if they happen to make a killing along the way).
The main argument against the status quo is that our current systems are too centralized and corporatized. Financial institutions want to control money, governments want to control legal frameworks, and the biggest tech companies want to control data. Daniel Saito sums it up well here:
The problem with this system is that it leads to inequality and injustice. The rich get richer while the poor get poorer. The powerful get more power while the powerless are left behind. The web 3.0 economy, on the other hand, is based on a decentralized system. This means that there is no central authority or institution that has control over the system. Instead, it is a network of computers that are all connected to each other.
This makes me smile and sigh. Meet the new techno-idealist, same as the old techno-idealist.
Taking a More Skeptical Approach
Does anyone really believe that the venture capitalists are funding this stuff for the good of humanity? Do we really expect, sticking with an example that Saito uses in his article, that the Nimbies are going away and making room for high-speed rail just because someone’s throwing bitcoins at the project?
At the same time, hope springs eternal. I truly want to think that these technologies will make things better in some ways. Maybe we can avoid a certain amount of corruption, fraud, and concentration of power through blockchains. I want to believe.
A …. potential cause for concern is the shift away from centralized exchanges, which are required to conduct identity checks for customers, to decentralized exchanges like dYdX and Uniswap, which is estimated to be the largest such exchange. Decentralized exchanges rely on peer-to-peer systems to operate. This means that several computers serve as nodes in a larger network, in contrast to centralized exchanges that are operated by a single entity. Decentralized exchanges make it easier for traders to anonymously buy and sell coins; most such exchanges do not currently comply with “know your customer” laws, which means that it can be cumbersome for government officials to identify the parties involved in cryptocurrency transactions. Because these exchanges are not run by a single entity, they can be exceedingly difficult to police and lack the sanctions-enforcement mechanism of more centralized exchanges.
Look, people are people. The worst ones want to accrue and maintain power at the expense of others. To the extent that Web3 makes this less likely, good.
To the degree it reduces accountability, however, we could wind up with greater concentrations of power. Power that can’t be changed–even theoretically–at the voting booth. Careful what you wish for.
Stay Hungry and Hopeful…But Also Skeptical
I like webs and networks (and wouldn’t have a blog called The Reticulum otherwise). I think networks are fundamental to the universe whereas hierarchies are only emergent.
So, to the degree we can move in the direction of efficient and effective networks, I’m all in. But don’t ask me to believe that Web3 is going to solve the world’s ills via the mechanics of blockchain and crypto. It won’t. The best we can hope for is movement in the direction of a fairer, more just and saner world free of power-hoarding, dangerous-tech-wielding dictator types. (We’re looking at you, Vladimir)
Free markets absolutely have their place. So do collectives. Ultimately what we want are socioeconomic and technical systems that allow us to find the right balance, one that keeps the network from stumbling into disastrous chaos on one hand or frozen intractability on the other hand. Both spell doom.
When I was a kid, we had this huge book of prints by Leonardo da Vinci. I loved it. Still do. So, just for fun, I used Stable Diffusion AI to get 30 images of 20th and 21st century political and business leaders as they might have been drawn by da Vinci. Check them out and see if you can identify your leaders. (And, by identify your leaders, I don’t mean to imply that these are all people that you personally would consider your leaders.)
I am pretty amazed at how well generative AI handles this task. And there’s the added bonus that we are using an artist who can’t make any copyright complaints. In fact, I wonder what da Vinci would say. I imagine he’d be both intrigued and appalled. Out of all the great artists of his age, I think da Vinci would fit best into the 21st century. I don’t know if he’d be a solitary iconoclast artist or the billionaire owner of a technology firm, but either way he’d make his way.
The other day, I was playing with Stable Diffusion and found myself thinking hard about the ethics of AI-generated images. Indeed, I found myself in an ethical quandary. Or maybe quandaries.
More specifically, I was playing with putting famous haiku poems into the “Generate Image” box and seeing what kinds of images the Stable Diffusion generator would concoct.
It was pretty uninspiring stuff until I started adding the names of specific illustrators in front of the haiku. Things got more interesting artistically but, from my perspective, murkier ethically. And it made me wonder if society has yet formulated a way to approach the ethics of AI-generated images today.
The Old Pond Meets the New AIs
The first famous haiku I used was “The Old Pond” by Matsuo Bashō. Here’s how it goes in the translation I found:
An old silent pond
A frog jumps into the pond—
Splash! Silence again.
At first, I got a bunch of photo-like but highly weird and often grotesque images of frogs. You’ve got to play with Stable Diffusion a while to see what I mean, but here are a few examples:
Okay, so far, so bad. A failed experiment. But that’s when I had the bright idea of adding certain illustrators’ names to the search so the generator would be able to focus on specific portions of the reticulum to find higher quality images. For reasons that will become apparent, I’m not going to mention their names. But here are some of the images I found interesting:
Better, right? I mean, each one appeals to different tastes, but they aren’t demented and inappropriate. There was considerable trial and error, and I was a bit proud of what I eventually kept as the better ones.
“Lighting One Candle” Meets the AI Prometheus
The next haiku I decided to use was “Lighting One Candle” by Yosa Buson. Here’s how that one goes:
The light of a candle
Is transferred to another candle—
This time I got some fairly shmaltzy images that you might find in the more pious sections of the local greeting card aisle. That’s not a dig at religion, by the way, but that aesthetic has never appealed to me. It seems too trite and predictable for something as grand as God. Anyway, the two images of candles below are examples of what I mean:
I like the two trees, though. I think it’s an inspired interpretation of the poem, one that I didn’t expect. It raised my opinion of what’s currently possible for these AIs. It’d make for a fine greeting card in the right section of the store.
But, still not finding much worth preserving, I went back to putting illustrators’ names in with the haiku. I thought the following images were worth keeping.
In each of these cases, I used an illustrator’s name. Some of these illustrators are deceased but some are still creating art. And this is where the ethical concerns arise.
Where Are the New Legal Lines in Generative AI?
I don’t think the legalities relating to generative AI have been completely worked out yet. Still, it does appear that artists are going to have a tough time battling against huge tech firms with deep pockets, even in nations like Japan with strong copyright laws. Here’s one quote from the article “AI-generated Art Sparks Furious Backlash from Japan’s Anime Community”:
[W]ith art generated by AI, legal issues only arise if the output is exactly the same, or very close to, the images on which the model is trained. “If the images generated are identical … then publishing [those images] may infringe on copyright,” Taichi Kakinuma, an AI-focused partner at the law firm Storia and a member of the economy ministry’s committee on contract guidelines for AI and data, told Rest of World….But successful legal cases against AI firms are unlikely, said Kazuyasu Shiraishi, a partner at the Tokyo-headquartered law firm TMI Associates, to Rest of World. In 2018, the National Diet, Japan’s legislative body, amended the national copyright law to allow machine-learning models to scrape copyrighted data from the internet without permission, which offers up a liability shield for services like NovelAI.
How About Generative AI’s Ethical Lines?
Even if the AI generators have relatively solid legal lines defining how they can work, the ethical lines are harder to draw. With the images I generated, I didn’t pay too much attention to whether the illustrators were living or dead. I was, after all, just “playing around.”
But once I had the images, I came to think that asking the generative AI to ape someone’s artistic style is pretty sleazy if that artist is still alive and earning their livelihood through their art. That’s why I don’t want to mention any names in this post. It might encourage others to add the names of those artists into image generators. (Of course, if you’re truly knowledgeable about illustrators, you’ll figure it out anyway, but in that case, you don’t need any help from a knucklehead like me.)
It’s one thing to ask an AI to use a Picasso-esque style for an image. Picasso died back in 1973. His family may get annoyed, but I very much doubt that any of his works will become less valuable due to some (still) crummy imitations.
But it’s a different story with living artists. If a publisher wants the style of a certain artist for a book cover, for example, then the publisher should damn well hire the artist, not ask a free AI to crank out a cheap and inferior imitation. Even if the copyright system ultimately can’t protect those artists legally, we can at least apply social pressure to the AI generator companies as customers.
I think AI generator firms should have policies that allow artists to opt out of having their works used to “train” the algorithms. That is, they can request to be put on the equivalent of a “don’t imitate” list. I don’t even know if that’s doable in the long run, but it might be one step in the direction of establishing proper ethics of AI-generated images.
The Soft Colonialism of Probability and Prediction?
First is the exploitation of cultural capital. These models exploit enormous datasets of images scraped from the web without authors’ consent, and many of those images are original artworks by both dead and living artists….The second concern is the propagation of the idea that creativity can be isolated from embodiment, relations, and socio-cultural contexts so as to be statistically modeled. In fact, far from being “creative,” AI-generated images are probabilistic approximations of features of existing artworks….AI art is, in my view, soft propaganda for the ideology of prediction.
To an extent, his first concern about cultural capital is related to my previous discussion about artists’ legal and moral rights, a topic that will remain salient as these technologies evolve.
His second concern is more abstract and, I think, debatable. Probabilistic and predictive algorithms may have begun in the “Global North,” but probability is leveraged in software wherever it is developed these days. It’s like calling semiconductors part of the “West” even as a nation like Taiwan innovates the tech and dominates the space.
Some of his argument rests on the idea that generative AI is not “creative,” but that term depends entirely on how we define it. Wikipedia, for example, states, “Creativity is a phenomenon whereby something new and valuable is formed.”
Are the images created by these technologies new and valuable? Well, let’s start by asking whether they represent something new. By one definition, they absolutely do, which is why they are not infringing on copyright. On the other hand, for now they are unlikely to create truly new artistic expressions in the larger sense, as the Impressionists did in the 19th century.
As for “valuable,” well, take a look at the millions if not billions of dollars investors are throwing their way. (But, sure, there are other ways to define value as well.)
My Own Rules for Now
As I use and write about these technologies, I’ll continue to leverage the names of deceased artists. But for now I’ll refrain from using images based on the styles of those still living. Maybe that’s too simplistic and binary. Or maybe it’s just stupid of me not to take advantage of current artistic styles and innovations. After all, artists borrow approaches from one another all the time. That’s how art advances.
I don’t know how it’s all going to work out, but it’s certainly going to require more thought from all of us. There will never be a single viewpoint, but in time let’s hope we form some semblance of consensus about what are principled and unprincipled usages of these technologies.
Featured image is from Stable Diffusion. I think I used a phrase like "medieval saint looking at a cellphone." Presto.
To a large extent, you are the culmination of activity in your neocortex. That’s the part of your brain that drives sensory perception, logic, spatial reasoning, and language, among other things. Without it, you’re pretty much an inarticulate lizard person (which I’m afraid is my disposition all too often in the mornings as I read recent newspaper headlines). Your neocortex is a complex, highly networked place. In short, your mind is a matrix.
Or, at least, neuroscientist Jeff Hawkins conceives of the neocortex as a matrix of thousands of smaller brains. Amid this reticulum, each minibrain (my word, not his) stores many different models of the world. Somewhere in there there’s a mental model for your car, your house, your pets, your significant other, whatever politician you love to hate, that sweaty dude who walks that barky dog in the neighborhood every morning, and, well, everything else in your personal universe.
The minibrains are cortical columns, each quite intelligent on its own. Hawkins writes,
A cortical column occupies about one square millimeter. It extends through the entire 2.5 mm thickness, giving it a volume of 2.5 cubic millimeters. By this definition, there are roughly 150,000 cortical columns stacked side by side in a human neocortex. You can imagine a cortical column like a little piece of thin spaghetti. A human neocortex is like 150,000 short pieces of spaghetti stacked vertically next to each other.
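For the numerically curious, here is a back-of-the-napkin check of those figures in Python. It is only a rough sketch: it takes the quoted numbers at face value and assumes the columns tile the neocortex with no gaps, and the variable names are mine, not Hawkins’s.

```python
# Figures taken from the passage quoted above
column_area_mm2 = 1.0        # about one square millimeter per column
cortex_thickness_mm = 2.5    # thickness of the neocortex
column_count = 150_000       # rough number of columns

# Volume of one column: area times thickness
column_volume_mm3 = column_area_mm2 * cortex_thickness_mm

# Totals for the whole sheet, converted to centimeters
total_area_cm2 = column_count * column_area_mm2 / 100        # 100 mm^2 per cm^2
total_volume_cm3 = column_count * column_volume_mm3 / 1000   # 1000 mm^3 per cm^3

print(column_volume_mm3)  # 2.5
print(total_area_cm2)     # 1500.0
print(total_volume_cm3)   # 375.0
```

So the spaghetti, laid out flat, would cover about 1,500 square centimeters, or a square roughly 39 centimeters on a side.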
Have Spaghetti, Will Reference
Okay, so you are largely the sum total of lots of cortical columns. But what does a cortical column actually do?
One of its primary purposes is to store and activate reference frames: oodles and oodles of reference frames.
A reference frame is where we access the information about what an object (or even an abstract concept) is and where it’s located in the world. For example, you have a reference frame for a coffee cup in various cortical columns. You know such a cup when you see it, and feel it, and sip from it. You also know where it is and how it moves. When you turn the cup upside down (hopefully sans coffee), the reference frame in your head also moves.
Reference frames have essential virtues such as:
allowing the brain to learn the structure and components of an object
allowing the brain to mentally manipulate the object as a whole (which is why you can envision an upside down coffee cup)
allowing your brain to plan and create movements, even conceptual ones
Thanks to reference frames, just one cortical column can “learn the three-dimensional shape of objects by sensing and moving and sensing and moving.” As you walk through a strange house, for example, you are mentally building a model of the house using reference frames. This includes your judgments about it. (“Hate that mushy chair in the living room, love that painting in the study, what were they thinking with that creepy bureau in the bedroom!?”)
I Think, Therefore I Predict
You’re a futurist. We all are. Because we’re subconsciously predicting stuff every moment of our conscious day.
Let’s say, for example, that you pick up your cup of coffee without even thinking about it. Your brain predicts the feel of the familiar, smooth, warm ceramic. That’s what you get most mornings. If instead your brain gets something different, it registers surprise and draws your attention to the cup.
Maybe it’s a minor surprise, like a small crack in the cup. Maybe it’s a bigger one, as when one of your fingers unexpectedly brushes a cockroach that then quickly crawls up your arm. Argh!
Either way, you didn’t get what you subconsciously predicted based on your reference frame. These tiny predictions happen all the time. Your whole life is spent predicting what comes next, even if you’re not fully aware of it. If something happens that doesn’t match your mental model, your brain gets busy trying to figure out what went wrong with your expectation/prediction and what to do next.
(“Roach! Need to swat it! Where did I put that crappy news magazine? Come on, cortical-column-based reference frames, help me find it! Fast!”)
You Are Your Reticulum
In short, most of your brain (the neocortex is about 70% of its total volume) is a highly complex reticulum made up of cortical columns, which themselves are made up of dense networks of neurons that are in a constant state of anticipation, even when you’re feeling pretty relaxed.
Your consciousness doesn’t exist in any one place. Your singular identity is, rather, a clever pastiche fabricated by that squishy matrix in your noggin.
So, why does it feel as if you’re you, the real mental “decider” (as George W. Bush’s neocortex once put it)? Hawkins thinks that all your various cortical columns are essentially “voting” about what you should perceive and how you should act. When you can’t make up your mind, it’s because the vote is too close to call.
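To make the voting metaphor concrete, here is a toy sketch in Python. To be clear, this is a plain majority vote that I made up for illustration, not the mechanism Hawkins actually proposes. The function name `column_vote` and the `margin` parameter are hypothetical.

```python
from collections import Counter


def column_vote(guesses, margin=0.1):
    """Return the winning guess, or None if the vote is too close to call.

    guesses: one guess per (hypothetical) cortical column.
    margin: minimum lead, as a fraction of all votes, needed to win.
    """
    tally = Counter(guesses).most_common(2)
    if not tally:
        return None  # nobody voted
    if len(tally) == 1:
        return tally[0][0]  # unanimous
    (top, top_votes), (_, runner_up_votes) = tally
    lead = (top_votes - runner_up_votes) / len(guesses)
    return top if lead >= margin else None


print(column_vote(["cup"] * 70 + ["bowl"] * 30))  # cup
print(column_vote(["cup"] * 51 + ["bowl"] * 49))  # None
```

In the second call the columns split 51 to 49, which falls short of the 10 percent lead the sketch requires, so no winner is declared. That is the code version of not being able to make up your mind.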
So, you’re not just a matrix. You’re a democracy! Which is great. Even if our increasingly shaky U.S. government descends into tyranny, at least our brains will keep voting.
We are about to be awash in AI-generated media, and our society may have a tough time surviving it.
Our feet are already wet, of course. The bots inhabit Twitter like so many virtual lice. And chatbots are helpfully annoying visitors on corporate websites the world over. Meanwhile, algorithms have been honing their scribbler skills on the virtual Grub Street of the Internet for a while now.
But soon, and by soon I mean within months, we will be hip deep in AI-generated content and wondering how high the tide is going to get.
My guess is high, baby. Very high indeed.
What Are We Really Talking About Here?
Techopedia defines generative AI as a “broad label that’s used to describe any type of artificial intelligence that uses unsupervised learning algorithms to create new digital images, video, audio, text or code.” In short, it’s all about AI-generated media.
I think that label will ultimately prove too restrictive, but let’s start there. So far, most of the hype is indeed around media, especially image creation and automated writing, with music and video not being far behind.
But we’ll get to that.
For now it’s enough to say that generative AI works by learning from, and being “inspired by,” the dynamic global reticulum that is the Internet.
But generative AI also applies to things like computer code. And, by and by, it’ll start generating atoms in addition to bits and bytes. For example, why couldn’t generative AI be applied to 3D printing? Why not car and clothing design? Why not, even, the creation of new biological systems?
The Money Generator
First, let’s follow the money. So how much dough is going into generative AI these days?
Answer: how much you got, angels and VCs?
For example, a start-up called Stability AI, which created the increasingly popular Stable Diffusion image-generating algorithm, was recently injected with a whopping $101 million round of investment capital. The company is now valued at a billion bucks.
Meanwhile other image generators such as DALL-E 2 and Midjourney have already acquired millions of users.
But investors are not just hot for image generators. Jasper, a generative writing company that’s just a year old (and one that plagues me with ads on Facebook) recently raised $125 million in venture capital and has a $1.5 billion valuation.
Although image and prose (usually with an eye toward marketing) are the hot tickets in generative AI for now, they are just the proverbial tip of the iceberg. Indeed, it appears that Stability AI, for one, has much grander plans beyond images.
The New York Times reports that the company’s soon-to-be massive investments in AI hardware will “allow the company to expand beyond A.I.-generated images into video, audio and other formats, as well as make it easy for users around the world to operate their own, localized versions of its algorithms.”
Think about that a second. Video. So people will be able to ask generative AI to quickly create a video of anything they can imagine.
Fake Film Flim-Flams
Who knows where this leads? I suppose soon we’ll be seeing “secret” tapes of the Kennedy assassination, purported “spy video” of the Trump/Putin bromance, and conspiracy-supporting flicks “starring” a computer-generated Joe Biden.
We can only imagine the kind of crap that will turn up on YouTube and social media. Seems likely that one of the things that generative AI will generate is a whole new slew of conspiracists who come to the party armed with the latest videos of Biden handing over Hunter’s laptop to the pedophiliac aliens who wiped Hillary’s emails to ensure that Obama’s birthplace couldn’t be traced back to the socialist Venusians who are behind the great global warming scam.
Even leaving political insanity aside, however, what happens to the film and television industries? How long until supercomputers are cranking out new Netflix series at the rate of one per minute?
Maybe movies get personalized. For example, you tell some generative AI to create a brand new Die Hard movie in which a virtual you plays the Bruce Willis role and, presto, out pops your afternoon’s entertainment. Yippee ki yay, motherfucker!
So, AI-generated media on steroids. On an exponential growth curve!
Play that Fakey Music
Then there are the sound tracks to go with those AI-gen movies. The Recording Industry Association of America (RIAA) is already gearing up for these battles. Here’s a snippet of what it submitted to the Office of the U.S. Trade Representative.
There are online services that, purportedly using artificial intelligence (AI), extract, or rather, copy, the vocals, instrumentals, or some portion of the instrumentals (a music stem) from a sound recording, and/or generate, master or remix a recording to be very similar to or almost as good as reference tracks by selected, well known sound recording artists.
To the extent these services, or their partners, are training their AI models using our members’ music, that use is unauthorized and infringes our members’ rights by making unauthorized copies of our members’ works. In any event, the files these services disseminate are either unauthorized copies or unauthorized derivative works of our members’ music.
That’s an interesting argument that will probably be tried by all creative industries. That is, just training your AI on Internet copies of musical works violates copyright even if you have no intention of directly using those works in a commercial project. I imagine the same argument could be applied to any copyrighted work. Who knows what this will mean for “synthetic media,” as some are calling it.
Of course, there are plenty of uncopyrighted works AI can be trained on, but keeping copyrighted stuff from being used for machine learning programs could put a sizeable dent in the quality of generative AI products.
So, it won’t only be media that’s generated. Imagine the blizzard of lawsuits until it’s all worked out.
Revenge of the Code
AI can code these days. Often impressively so. I suppose it’d be ironic if a lot of software developers were put out of work by intelligent software, but that’s the direction we seem headed.
Consider the performance of DeepMind’s AlphaCode, an AI designed to solve challenging coding problems. The team that designed it had it compete with human coders to solve 10 challenges on Codeforces, a platform hosting coding contests.
Prof. John Naughton, writing in The Guardian, describes the contest and summarizes, “The impressive thing about the design of the Codeforces competitions is that it’s not possible to solve problems through shortcuts, such as duplicating solutions seen before or trying out every potentially related algorithm. To do well, you have to be creative.”
On its first try, AlphaCode did pretty well. The folks at DeepMind write, “Overall, AlphaCode placed at approximately the level of the median competitor. Although far from winning competitions, this result represents a substantial leap in AI problem-solving capabilities and we hope that our results will inspire the competitive programming community.”
To me, a very amateurish duffer in Python, this is both impressive and alarming. An AI that can reason out natural language instructions and then code creatively to solve problems? It’s kind of like a Turing test for programming, one that AlphaCode might well be on target to dominate in future iterations.
Naughton tries to reassure his readers, writing that “engineering is about building systems, not just about solving discrete puzzles,” but color me stunned.
What’s next for generative AI once it finds its virtual footing?
Well, atoms are the natural next step.
Ask yourself: if generative AI can easily produce virtual images, why not sculptures via 3D printers? Indeed, why not innovative practical designs?
This is not a new idea. There is already something called generative design. Sculpteo.com describes it this way: “Instead of starting to work on a design from scratch, with a generative design process, you tell the program what you need to accomplish, you set your design goals and mention all the parameters you can. No geometry is needed to start a project. The software will then deliver you hundreds or thousands of design options, the AI can also make an in-depth analysis of the design and establish which one is the most efficient one! This method is perfect to explore design possibilities to get an optimal part.”
How About Bio?
Not long ago, I wrote a tongue-in-cheekish post about the singularity. An acquaintance of mine expressed alarm about the idea. When I asked what scared her most, she said, “If AI can alter DNA, I’d say the planet is doomed.”
That particular scenario had never occurred to me, but it’s easy enough to see her point. DNA is biological code. Why not create a generative AI that can design new life forms almost as easily as new images?
In fact, why stop at design? Why not 3D print the new critters? Again, this is a concept that already exists. As the article “3D Bioprinting with Live Cells” describes it, “Live cell printing, or 3D bioprinting, is an emerging technology that poses a revolutionary development for tissue engineering and regeneration. This bioprinting method involves the creation of a spatial arrangement of living cells and biologics into a functionalized tissue.”
The good news? Probably some fascinating new science, designer replacement organs on demand, and all the strange new machine-generated meat you can eat!
The bad news? Shudder. Let’s not go there today.
Mickey Mouse and the Age of Innovative AI
Although we’re calling this generative AI, the better term might be innovative AI. We are essentially contracting AI writers, artists and coders to do our bidding. Sure, they’re imitating, mixing and matching human-made media, but they are nonetheless “the talent” and will only get better at their jobs. We, on the other hand, are promoted to the positions of supercilious art directors, movie producers and, inevitably (yuck) critics.
If the singularity ever actually happens, this emerging age of innovative AI will be seen as a critical milestone. It feels like a still rough draft of magic, and it may yet all turn out wonderfully.
But I find it hard not to foresee a Sorcerer’s Apprentice scenario. Remember in Fantasia, when Mickey Mouse harnesses the power of generative sorcery and winds up all wet and sucked down a whirlpool?
Unlike Mickey, we’ll have no sorcerer to save our sorry asses if we screw up the wizardry. This means that, in sum, we need to use these powerful technologies wisely. I hope we’re up to it. Forgive me if, given our recent experiences with everything from social media madness to games of nuclear chicken, I remain a bit skeptical on that front.
Feature image generated by Stable Diffusion. The prompt terms used were "Hokusai tsunami beach people," with Hokusai arguably being the greatest artist of tsunamis in human history. In other words, the AI imitated Hokusai's style and came up with this original piece.