What is generative AI and what can it do?

Over the past 12 months or so, conversations around AI have ramped up considerably. Whether it’s an AI-generated image of the Pope wearing a ridiculous jacket or kids cheating on their homework with help from a large language model, AI has been in the news a lot lately.

As you might expect, more and more brands are getting in on the action, too. Google’s annual developer conference this year was almost entirely centred around AI capabilities, and that’s likely just the beginning.


You may have seen the term “generative AI” being used more frequently, but what exactly does it mean? And how does it differ from AI as a whole? We’ve got you covered. In this article we’ll tell you everything you need to know, and more.

What is generative AI?

Let’s start with the term AI, which stands for artificial intelligence. As the name suggests, it refers to a wide range of applications that all have a couple of things in common: they’re man-made and they simulate the ability to think of their own accord.

Some early implementations of AI were things like enemy characters in video games, which are controlled by the computer and seem to make decisions on their own, and predictive text on your phone, which suggests words you might want to use based on common word combinations.

To some extent, all AI systems work using these principles: they have a set of rules to follow (like the video game character) and they recognise and react to patterns (like predictive text).
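
To make the predictive text idea a little more concrete, here is a minimal sketch in Python. It is an illustration only, not how any particular phone keyboard works: the function names and the sample messages are made up for this example, and real keyboards use far larger and more sophisticated models. The sketch simply counts which word tends to follow which, then suggests the most common followers.

```python
from collections import Counter


def build_bigram_model(text):
    """Count how often each word is followed by each other word."""
    words = text.lower().split()
    followers = {}
    for current_word, next_word in zip(words, words[1:]):
        followers.setdefault(current_word, Counter())[next_word] += 1
    return followers


def suggest(model, word, count=3):
    """Return up to `count` words that most often followed `word`."""
    seen = model.get(word.lower(), Counter())
    return [candidate for candidate, _ in seen.most_common(count)]


# Hypothetical message history, invented for this example.
sample_messages = (
    "see you soon "
    "see you later "
    "see you soon "
    "thank you very much"
)

model = build_bigram_model(sample_messages)
print(suggest(model, "you"))
```

In this toy history, “soon” follows “you” more often than anything else, so it is the first suggestion. A word the model has never seen produces no suggestions at all, which hints at why these systems need plenty of example text to be useful.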

The term generative AI refers to an AI system that’s designed to create something. This can be text, images, code, audio or even video clips. Usually the generative AI is given a prompt by the user, and then it tries to create something that matches the description.

A non-generative AI would be something like a self-driving car. Instead of creating an end product, it’s using AI to react to data and make adjustments in real time.

Generative AI for text

AI for text generation has arguably had the largest impact on the world so far, and things are only set to get more interesting. ChatGPT became immensely popular when it was released to the public in late 2022, amassing over a million users in just one week.

We have a dedicated feature that will tell you all about ChatGPT and what it can do, but to summarise, it’s an AI chatbot that you can talk to just as if you were chatting to a person on instant messenger. Where it gets interesting is its ability to generate text, so you can say something like “Write me an essay about gravity in the style of William Shakespeare” and a few seconds later, it magically appears.

It’s very powerful stuff, and this only compounds when you realise it can work with things like coding, formulas and maths problems. With a bit of troubleshooting, you can get ChatGPT to make you an entire website and teach you how to get it online. All you have to do is ask.

Microsoft quickly saw the potential and implemented some of the tech behind ChatGPT into its Bing search engine. So, you can now chat with Bing directly and get some very insightful results.

As we mentioned, Google had a lot to say about AI during Google I/O 2023, and a lot of what it’s bringing to customers is in the field of generative AI for text. Google has its own answer to ChatGPT called Bard, but beyond that, it’s also injecting these capabilities into its most popular software products.

One such feature is Help me write, which is coming to Gmail in the near future and offers the ability to generate emails with a prompt like “Write me a professional email demanding a refund.” We’ll also see similar features baked into Google’s Messages app for Android 14.

Generative AI for images and videos

You can probably guess where this is going, but much in the same way as you can use prompts to create text, you can also create images. Generative AI for images is essentially a text-to-image converter, so you write what you’d like an image of, and the AI makes it. By refining your prompts you can change the way the generated images appear, too, so you can add something like “… in a black and white comic book style” or “… high-resolution photograph” and get drastically different results.

One of the most popular tools for image generation is DALL-E 2, from the same team behind ChatGPT. However, more competitors have been emerging, such as Stable Diffusion and Imagen. Each system has its benefits, and if you want to know which one is best for your needs, check out our roundup.

Image-generating AI is already appearing in consumer products. For example, the Amazon Fire TV Omni QLED TV allows you to create generative AI images to set as your wallpaper, and the same will be true on Android 14 smartphones.

As if that wasn’t enough, AI video generation is in the works, too. After all, a video is just a sequence of images played in quick succession. Google teased the next generation of its Imagen AI video generator at I/O. It’s still in the research stages at the moment, but it’s said to be able to output HD video at 24fps from a simple text input.

Generative AI for audio

Text-to-speech has been around for a long time, but it’s always had that unusual robotic quality about it. That’s all changing thanks to AI. With new machine learning techniques, AI can generate audio that sounds like anyone you please.

Until recently, this has required large amounts of audio data to do accurately. So, emulating the voice of a celebrity would be possible, thanks to the amount of recorded conversations available, but producing an AI version of your own voice would be quite difficult. That’s changing, too, and it’s got to the point where Microsoft claims its VALL-E model can closely replicate a person’s voice with as little as 3 seconds of recorded audio.

This technology is already being used to generate voiceovers for things like YouTube videos, and you may have come across one of the many memes that use this tech, like US presidents playing Roblox.

We can only imagine how natural and realistic voice assistants, like Alexa, are going to sound in the coming years.

What are the downsides of generative AI?

All of this AI tech is very exciting, and with a bit of know-how, it allows you to get a lot done in a very short space of time. The best part is that most of the tools are available for free, meaning there’s no barrier to entry.

On the flip side, giving the whole world access to such powerful tools has some pretty scary implications. We’ve already started to see some of them play out, too. There are numerous stories of students attempting to cheat by getting ChatGPT to write their papers, for example.

There’s also the potential issue of copyright infringement, because image models are trained on millions of existing images before they can create their own. This database of images includes the work of professional artists and photographers, and there’s a lot of discussion about how acceptable this is.

It’s also worth knowing that there are limitations to most of these tools in their current state. Language models, like ChatGPT and Bing, are prone to something called hallucinations, whereby the AI confidently states an answer that’s incorrect. So if you’re using an AI for any serious work, you’d better make sure you’re fact-checking.

The good news is that all of these issues are being actively worked on. Google had a lot to say about its responsible approach to AI at I/O. It plans to implement watermarking and metadata as ways to identify AI-generated imagery, with the goal of reducing potential misinformation and impersonation.

Sam Altman, CEO and co-founder of OpenAI, is taking an active approach, too. He has called for the US government to regulate AI and wants a new agency in place to license AI-focused companies.

“I think if this technology goes wrong, it can go quite wrong… we want to be vocal about that,” Altman said. “We want to work with the government to prevent that from happening.”